Issue with reading Resume #38

bdeva1975 · 2024-04-10T04:14:20Z

I am getting the following error:-

RuntimeError: no validator found for <class 'main.Resume'>, see arbitrary_types_allowed in Config

I tried to use the following code:-

"""Information information_extraction from documents.

The example CV is from https://github.com/xitanggg/open-resume.
"""
import os

Replace "<your_api_key>" with your actual key

os.environ["OPENAI_API_KEY"] = "<api_key>"

from typing import Optional

from langchain.chains import create_extraction_chain_pydantic
from langchain.chat_models import ChatOpenAI
from langchain.document_loaders import PyPDFLoader
from pydantic import BaseModel, Field

class Experience(BaseModel):
# the title doesn't seem to help at all.
start_date: Optional[str] = Field(description="When the job or study started.")
end_date: Optional[str] = Field(description="When the job or study ended.")
description: Optional[str] = Field(description="What the job or study entailed.")
country: Optional[str] = Field(description="The country of the institution.")

class Study(Experience):
degree: Optional[str] = Field(description="The degree obtained or expected.")
institution: Optional[str] = Field(
description="The university, college, or educational institution visited."
)
country: Optional[str] = Field(description="The country of the institution.")
grade: Optional[str] = Field(description="The grade achieved or expected.")

class WorkExperience(Experience):
company: str = Field(description="The company name of the work experience.")
job_title: Optional[str] = Field(description="The job title.")

class Resume(BaseModel):
first_name: Optional[str] = Field(description="The first name of the person.")
last_name: Optional[str] = Field(description="The last name of the person.")
linkedin_url: Optional[str] = Field(
description="The url of the linkedin profile of the person."
)
email_address: Optional[str] = Field(description="The email address of the person.")
nationality: Optional[str] = Field(description="The nationality of the person.")
skill: Optional[str] = Field(description="A skill listed or mentioned in a description.")
study: Optional[Study] = Field(
description="A study that the person completed or is in progress of completing."
)
work_experience: Optional[WorkExperience] = Field(
description="A work experience of the person."
)
hobby: Optional[str] = Field(description="A hobby or recreational activity of the person.")

def parse_cv(pdf_file_path: str) -> str:
"""Parse a resume.
Not totally sure about the return type: is it list[Resume]?
"""
pdf_loader = PyPDFLoader(pdf_file_path)
docs = pdf_loader.load_and_split()
# please note that function calling is not enabled for all models!
llm = ChatOpenAI(model_name="gpt-3.5-turbo")
chain = create_extraction_chain_pydantic(pydantic_schema=Resume, llm=llm)
return chain.run(docs)

if name == "main":
print(parse_cv(
pdf_file_path=r"C:\Users\bdevasish\Downloads\openresume-resume.pdf"
))

The text was updated successfully, but these errors were encountered:

benman1 · 2024-04-10T12:37:31Z

Hi @bdeva1975! Have you seen the new issue template? In this case, it'd be useful to know your Pydantic version. Please have a look at this here: https://github.com/benman1/generative_ai_with_langchain/blob/softupdate/chapter4/information_extraction.ipynb

CSalle · 2024-04-11T11:44:08Z

As Ben said, you are probably running on version 2.X of Pydantic whereas you should be using 1.X. This was the only way I could fix the issue

benman1 closed this as completed Apr 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with reading Resume #38

Issue with reading Resume #38

bdeva1975 commented Apr 10, 2024

benman1 commented Apr 10, 2024

CSalle commented Apr 11, 2024

Issue with reading Resume #38

Issue with reading Resume #38

Comments

bdeva1975 commented Apr 10, 2024

Replace "<your_api_key>" with your actual key

benman1 commented Apr 10, 2024

CSalle commented Apr 11, 2024