Draft Pydantic Intro #271

jxnl · 2023-12-12T04:40:49Z

No description provided.

coderabbitai · 2023-12-12T04:40:53Z

Important

Auto Review Skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository.

To trigger a single review, invoke the @coderabbitai review command.

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on X ?

Tips

Chat with CodeRabbit Bot (`@coderabbitai`)

You can reply to a review comment made by CodeRabbit.
You can tag CodeRabbit on specific lines of code or files in the PR by tagging @coderabbitai in a comment.
You can tag @coderabbitai in a PR comment and ask one-off questions about the PR and the codebase. Use quoted replies to pass the context for follow-up questions.

CodeRabbit Commands (invoked as PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger a review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai help to get help.

Additionally, you can add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.

CodeRabbit Configration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
The JSON schema for the configuration file is available here.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/coderabbit-overrides.v2.json

sydney-runkle

Nice first draft 👍

sydney-runkle · 2023-12-12T23:27:52Z

docs/blog/posts/pydantic.md

+
+# Steering Large Language Models with Pydantic
+
+In the past year, significant progress has been made in utilizing large language models. Prompt engineering, in particular, has gained attention, and new prompting techniques are being developed to guide language models toward specific tasks. While many are building chat bots, an even more exciting application is the generation of structured outputs, whether its extracting structured data, augmenting your RAG application, or even generating


I'm guessing you meant to finish the "or even generating..." at the end here, but otherwise I like this as an intro paragraph. As someone who hasn't worked with LLMs a bunch, it could be helpful to add an additional sentence going into more detail about what "prompt engineering" is.

docs/blog/posts/pydantic.md

sydney-runkle

Nice, I like the changes you've made!

docs/blog/posts/pydantic.md

dmontagu · 2023-12-14T15:59:14Z

docs/blog/posts/pydantic.md

+
+In the past year, significant progress has been made in utilizing large language models. Prompt engineering, in particular, has gained attention, and new prompting techniques are being developed to guide language models toward specific tasks. While many are building chat bots, an even more exciting application is the generation of structured outputs, whether its extracting structured data, augmenting your RAG application, or even generating synthetic data.
+
+!!! question "What is Prompt Engineering?"


I feel like most people have heard the term "Prompt Engineering" now so kind of are familiar with the concept, but if the goal with this block is really to clarify for people who aren't familiar, I personally would be inclined to try to phrase it in a less technical way. I read this and it almost sounds like some advanced machine learning technique, maybe even involving math, etc. lol. And the linked article also talks about it in highly technical language that I expect would be confusing for someone who assumes that AI/machine learning is "above their pay grade".

Like, I am assuming the audience of this article might find this intimidating (screenshot from the link below):

Whereas I feel like my intuition for prompt engineering is closer to "Creatively rephrasing how you pose your question to the AI until it consistently gives responses you want". I understand how that might seem reductive, but I feel like it gives a better intuition and is more inviting to newcomers. And I think once you get that "prompt engineering" is something that's ultimately very intuitive (anyone who has used ChatGPT for more than a few questions has probably independently "discovered" the idea of prompt engineering) it's still easy to appreciate more sophisticated techniques, like using Pydantic models to validate outputs/guide inputs.

I guess best to say something like "You can think of it very technically like ..., or as just creatively rephrasing how you pose your question to the AI until it consistently gives responses you want".

docs/blog/posts/pydantic.md

Co-authored-by: David Montague <35119617+dmontagu@users.noreply.github.com>

dmontagu

Overall looks great.

These notes are mostly minor stylistic things, I'll share "bigger-picture" thoughts to you directly.

docs/blog/posts/pydantic.md

dmontagu · 2023-12-14T16:47:40Z

docs/blog/posts/pydantic.md

+client = OpenAI()
+
+
+class Package(BaseModel):


Suggested change

class Package(BaseModel):

class PythonPackage(BaseModel):

and elsewhere below, or change where it says PythonPackage above to Package. But I think PythonPackage is clearer

docs/blog/posts/pydantic.md

dmontagu · 2023-12-14T17:14:46Z

docs/blog/posts/pydantic.md

+)
+```
+
+Now, by using the `response_model` argument, inspired by `FastAPI`, you can specify the desired output, and `instructor` will take care of the rest!


Suggested change

Now, by using the `response_model` argument, inspired by `FastAPI`, you can specify the desired output, and `instructor` will take care of the rest!

Now, by using the `response_model` argument (inspired by `FastAPI`) you can specify the desired output and `instructor` will take care of the rest!

docs/blog/posts/pydantic.md

Co-authored-by: David Montague <35119617+dmontagu@users.noreply.github.com>

samuelcolvin

a few tiny things, but overall I think this looks great.

samuelcolvin · 2023-12-17T19:28:21Z

docs/blog/posts/pydantic.md

+
+In the past year, significant progress has been made in utilizing large language models. Prompt engineering, in particular, has gained attention, and new prompting techniques are being developed to guide language models toward specific tasks. While many are building chat bots, an even more exciting application is the generation of structured outputs, whether its extracting structured data, augmenting your RAG application, or even generating synthetic data.
+
+!!! question "What is Prompt Engineering?"


I guess best to say something like "You can think of it very technically like ..., or as just creatively rephrasing how you pose your question to the AI until it consistently gives responses you want".

samuelcolvin · 2023-12-17T19:29:22Z

docs/blog/posts/pydantic.md

+
+## Pydantic
+
+Unlike libraries like `dataclasses`, `Pydantic` goes a step further and defines a schema for your dataclass. This schema is used to validate data, but also to generate documentation and even to generate a JSON schema, which is perfect for our use case of generating structured data with language models!


technically dataclasses isn't a library, but part of the standard library.

I would say pydantic goes further, in that it 1) enfoces the types in your class and 2) can generate JSON schema for your class.

docs/blog/posts/pydantic.md

Co-authored-by: Samuel Colvin <s@muelcolvin.com>

draft

ca1437f

sydney-runkle reviewed Dec 12, 2023

View reviewed changes

jxnl added 2 commits December 12, 2023 22:17

add blurb

c73b5d8

synthetic data

81226fe

jxnl force-pushed the pydantic-blog branch from ae29133 to 81226fe Compare December 13, 2023 03:26

add validation

7099dfc

sydney-runkle reviewed Dec 13, 2023

View reviewed changes

jxnl added 2 commits December 13, 2023 14:47

nits

ac2fa3d

add link

9341c01

jxnl requested a review from sydney-runkle December 13, 2023 19:51

dmontagu reviewed Dec 14, 2023

View reviewed changes

docs/blog/posts/pydantic.md Outdated Show resolved Hide resolved

jxnl and others added 2 commits December 14, 2023 11:38

Update docs/blog/posts/pydantic.md

25e89a0

Co-authored-by: David Montague <35119617+dmontagu@users.noreply.github.com>

simplify text

da0495d

dmontagu reviewed Dec 14, 2023

View reviewed changes

docs/blog/posts/pydantic.md Outdated Show resolved Hide resolved

jxnl and others added 6 commits December 14, 2023 14:15

bunmp

4fa9a0b

Apply suggestions from code review

75efdd2

Co-authored-by: David Montague <35119617+dmontagu@users.noreply.github.com>

typos

5afe566

reword

f0f812b

Merge branch 'main' into pydantic-blog

bae32d9

add tips

b84d3c0

samuelcolvin reviewed Dec 17, 2023

View reviewed changes

Update pydantic.md

ecf0e67

Co-authored-by: Samuel Colvin <s@muelcolvin.com>

jxnl closed this Dec 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Draft Pydantic Intro #271

Draft Pydantic Intro #271

jxnl commented Dec 12, 2023

coderabbitai bot commented Dec 12, 2023 •

edited

Loading

Auto Review Skipped

Chat with CodeRabbit Bot (`@coderabbitai`)

CodeRabbit Commands (invoked as PR comments)

CodeRabbit Configration File (`.coderabbit.yaml`)

sydney-runkle left a comment

sydney-runkle Dec 12, 2023

jxnl Dec 13, 2023

sydney-runkle left a comment

dmontagu Dec 14, 2023 •

edited

Loading

samuelcolvin Dec 17, 2023

dmontagu left a comment

dmontagu Dec 14, 2023

dmontagu Dec 14, 2023

samuelcolvin left a comment

samuelcolvin Dec 17, 2023

samuelcolvin Dec 17, 2023


		# Steering Large Language Models with Pydantic

		In the past year, significant progress has been made in utilizing large language models. Prompt engineering, in particular, has gained attention, and new prompting techniques are being developed to guide language models toward specific tasks. While many are building chat bots, an even more exciting application is the generation of structured outputs, whether its extracting structured data, augmenting your RAG application, or even generating


		In the past year, significant progress has been made in utilizing large language models. Prompt engineering, in particular, has gained attention, and new prompting techniques are being developed to guide language models toward specific tasks. While many are building chat bots, an even more exciting application is the generation of structured outputs, whether its extracting structured data, augmenting your RAG application, or even generating synthetic data.

		!!! question "What is Prompt Engineering?"

	Now, by using the `response_model` argument, inspired by `FastAPI`, you can specify the desired output, and `instructor` will take care of the rest!
	Now, by using the `response_model` argument (inspired by `FastAPI`) you can specify the desired output and `instructor` will take care of the rest!


		## Pydantic

		Unlike libraries like `dataclasses`, `Pydantic` goes a step further and defines a schema for your dataclass. This schema is used to validate data, but also to generate documentation and even to generate a JSON schema, which is perfect for our use case of generating structured data with language models!

Draft Pydantic Intro #271

Draft Pydantic Intro #271

Conversation

jxnl commented Dec 12, 2023

coderabbitai bot commented Dec 12, 2023 • edited Loading

Auto Review Skipped

Chat with CodeRabbit Bot (@coderabbitai)

CodeRabbit Commands (invoked as PR comments)

CodeRabbit Configration File (.coderabbit.yaml)

sydney-runkle left a comment

Choose a reason for hiding this comment

sydney-runkle Dec 12, 2023

Choose a reason for hiding this comment

jxnl Dec 13, 2023

Choose a reason for hiding this comment

sydney-runkle left a comment

Choose a reason for hiding this comment

dmontagu Dec 14, 2023 • edited Loading

Choose a reason for hiding this comment

samuelcolvin Dec 17, 2023

Choose a reason for hiding this comment

dmontagu left a comment

Choose a reason for hiding this comment

dmontagu Dec 14, 2023

Choose a reason for hiding this comment

dmontagu Dec 14, 2023

Choose a reason for hiding this comment

samuelcolvin left a comment

Choose a reason for hiding this comment

samuelcolvin Dec 17, 2023

Choose a reason for hiding this comment

samuelcolvin Dec 17, 2023

Choose a reason for hiding this comment

coderabbitai bot commented Dec 12, 2023 •

edited

Loading

Chat with CodeRabbit Bot (`@coderabbitai`)

CodeRabbit Configration File (`.coderabbit.yaml`)

dmontagu Dec 14, 2023 •

edited

Loading