Helper functions that allow us to improve openai's function_call ergonomics

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Instructor (openai_function_call)

Structured extraction in Python, powered by OpenAI's function calling api, designed for simplicity, transparency, and control.

This library is built to interact with openai's function call api from python code, with python structs / objects. It's designed to be intuitive, easy to use, but give great visibily in how we call openai.

Requirements

This library depends on Pydantic and OpenAI that's all.

Installation

To get started with OpenAI Function Call, you need to install it using pip. Run the following command in your terminal:

$ pip install instructor

Quick Start with Patching ChatCompletion

To simplify your work with OpenAI models and streamline the extraction of Pydantic objects from prompts, we offer a patching mechanism for the `ChatCompletion`` class. Here's a step-by-step guide:

Step 1: Import and Patch the Module

First, import the required libraries and apply the patch function to the OpenAI module. This exposes new functionality with the response_model parameter.

import openai
import instructor
from pydantic import BaseModel

instructor.patch()

Step 2: Define the Pydantic Model

Create a Pydantic model to define the structure of the data you want to extract. This model will map directly to the information in the prompt.

class UserDetail(BaseModel):
    name: str
    age: int

Step 3: Extract Data with ChatCompletion

Use the openai.ChatCompletion.create method to send a prompt and extract the data into the Pydantic object. The response_model parameter specifies the Pydantic model to use for extraction.

user: UserDetail = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    response_model=UserDetail,
    messages=[
        {"role": "user", "content": "Extract Jason is 25 years old"},
    ]
)

Step 4: Validate the Extracted Data

You can then validate the extracted data by asserting the expected values. By adding the type things you also get a bunch of nice benefits with your IDE like spell check and auto complete!

assert user.name == "Jason"
assert user.age == 25

LLM-Based Validation

LLM-based validation can also be plugged into the same Pydantic model. Here, if the answer attribute contains content that violates the rule "don't say objectionable things," Pydantic will raise a validation error.

from pydantic import BaseModel, ValidationError, BeforeValidator
from typing_extensions import Annotated
from instructor import llm_validator

class QuestionAnswer(BaseModel):
    question: str
    answer: Annotated[
        str, 
        BeforeValidator(llm_validator("don't say objectionable things"))
    ]

try:
    qa = QuestionAnswer(
        question="What is the meaning of life?",
        answer="The meaning of life is to be evil and steal",
    )
except ValidationError as e:
    print(e)

Its important to not here that the error message is generated by the LLM, not the code, so it'll be helpful for re asking the model.

1 validation error for QuestionAnswer
answer
   Assertion failed, The statement is objectionable. (type=assertion_error)

Using the Client with Retries

Here, the UserDetails model is passed as the response_model, and max_retries is set to 2.

import instructor
from pydantic import BaseModel, field_validator

# Apply the patch to the OpenAI client
instructor.patch()

class UserDetails(BaseModel):
    name: str
    age: int

    @field_validator("name")
    @classmethod
    def validate_name(cls, v):
        if v.upper() != v:
            raise ValueError("Name must be in uppercase.")
        return v

model = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    response_model=UserDetails,
    max_retries=2,
    messages=[
        {"role": "user", "content": "Extract jason is 25 years old"},
    ],
)

assert model.name == "JASON"

IDE Support

Everything is designed for you to get the best developer experience possible, with the best editor support.

Including autocompletion:

autocomplete

And even inline errors

errors

To see more examples of how we can create interesting models check out some examples.

License

This project is licensed under the terms of the MIT License.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

1.2.6

May 9, 2024

1.2.5

May 1, 2024

1.2.4

Apr 29, 2024

1.2.3

Apr 27, 2024

1.2.2

Apr 20, 2024

1.2.1

Apr 18, 2024

1.2.0

Apr 14, 2024

1.1.0

Apr 11, 2024

1.0.3

Apr 5, 2024

1.0.2

Apr 5, 2024

1.0.0

Apr 1, 2024

0.6.8

Mar 29, 2024

0.6.7

Mar 21, 2024

0.6.6

Mar 21, 2024

0.6.5

Mar 20, 2024

0.6.4

Mar 8, 2024

0.6.3

Mar 6, 2024

0.6.2

Mar 1, 2024

0.6.1

Feb 20, 2024

0.6.0

Feb 18, 2024

0.5.2

Feb 7, 2024

0.5.0

Feb 4, 2024

0.4.8

Jan 23, 2024

0.4.7

Jan 14, 2024

0.4.6

Jan 5, 2024

0.4.5

Dec 19, 2023

0.4.4

Dec 17, 2023

0.4.3

Dec 17, 2023

0.4.2

Dec 6, 2023

0.4.0

Nov 27, 2023

0.3.5

Nov 19, 2023

0.3.4

Nov 13, 2023

0.3.3

Nov 13, 2023

0.3.2

Nov 11, 2023

0.3.1

Nov 9, 2023

0.3.0

Nov 8, 2023

This version

0.2.11

Nov 6, 2023

0.2.9

Oct 22, 2023

0.2.8

Sep 19, 2023

0.2.7

Sep 8, 2023

0.2.6

Sep 6, 2023

0.2.5

Aug 24, 2023

0.2.4

Aug 17, 2023

0.2.1

Jul 28, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

instructor-0.2.11.tar.gz (22.0 kB view hashes)

Uploaded Nov 6, 2023 Source

Built Distribution

instructor-0.2.11-py3-none-any.whl (27.9 kB view hashes)

Uploaded Nov 6, 2023 Python 3

Hashes for instructor-0.2.11.tar.gz

Hashes for instructor-0.2.11.tar.gz
Algorithm	Hash digest
SHA256	`6a641399763a2a60e93164b11b9a9c7dd7e3cf95ca39de95c54d183f568ef990`
MD5	`f61f0a6e202f3c962de24a3aee8ab54b`
BLAKE2b-256	`d5cfd5feae4a12a8a5ef960db3bc458377be3ede522709f0288f1503ae43b990`

Hashes for instructor-0.2.11-py3-none-any.whl

Hashes for instructor-0.2.11-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0281d7d0f0e882905161e20f57ce58a57cc4e5c8d1efe2b8520cdeb573e6a853`
MD5	`9f94c039bffc9e263d092bff86557546`
BLAKE2b-256	`f251edf5cbca543eb82cd05cdebcd928a0619f41f160134edf7dc1ab625f97fa`