## Get the Text from the PDF

In [1]:
%pip install pdfplumber pytesseract pdf2image




In [2]:
import pdfplumber
import pytesseract
from pdf2image import convert_from_path

In [3]:
def extract_text_from_pdf(pdf_path):
    text = ""
    try:
        # Try direct text extraction
        with pdfplumber.open(pdf_path) as pdf:
            for page in pdf.pages:
                page_text = page.extract_text()
                if page_text:
                    text += page_text + "\n"  # keep a line break between pages

        if text.strip():
            return text.strip()
    except Exception as e:
        print(f"Direct text extraction failed: {e}")

    # Fall back to OCR for image-based PDFs; discard any partial direct-extraction output
    print("Falling back to OCR for image-based PDF.")
    text = ""
    try:
        images = convert_from_path(pdf_path)
        for image in images:
            page_text = pytesseract.image_to_string(image)
            text += page_text + "\n"
    except Exception as e:
        print(f"OCR failed: {e}")

    return text.strip()
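
The OCR fallback above relies on external programs in addition to the Python packages: `pytesseract` calls the Tesseract executable and `pdf2image` calls Poppler's `pdftoppm`. The following is a minimal sketch of a pre-flight check, assuming those executables go by their default names on `PATH`. The helper `missing_ocr_binaries` and the `OCR_BINARIES` tuple are hypothetical names introduced here for illustration, not part of either library.

```python
import shutil

# Assumed default executable names: "tesseract" (called by pytesseract)
# and "pdftoppm" (part of Poppler, called by pdf2image).
OCR_BINARIES = ("tesseract", "pdftoppm")


def missing_ocr_binaries(which=shutil.which):
    """Return the names of the OCR executables that are not found on PATH."""
    return [name for name in OCR_BINARIES if which(name) is None]


missing = missing_ocr_binaries()
if missing:
    print(f"OCR fallback unavailable, missing: {', '.join(missing)}")
```

Running this check before calling `extract_text_from_pdf` makes a missing system dependency easier to tell apart from a genuinely unreadable PDF, since the function itself only prints `OCR failed: ...` and returns an empty string.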

In [4]:
pdf_path = "uploaded_resume.pdf"
resume_text = extract_text_from_pdf(pdf_path)

print("\nExtracted Text from PDF:")
print(resume_text)


Extracted Text from PDF:
ROSAN S
DATA ANALYST
91+ 7708126918 / roshandharan00@gmail.com / linkedin.com/in/Rosans
SUMMARY
Motivated and detail-oriented Data Analyst with a strong foundation in data visualization, statistical
analysis, and programming. Proficient in Python, Power BI, Machine learning, SQL , R program and Tableau.I am
eager to apply my skills to real-world projects and contribute to organizational growth. My goal is to leverage
data insights to drive strategic initiatives and deliver measurable business outcomes.
SKILLS
Python Excel Data cleaning & wrangling Hypothesis Testing
Data Visualization SQL R program Data-Driven Decision Making
Power bi Webscraping Descriptive statistics Presentation Skills
CERTIFICATION
MongoDB and the Document Model – MongoDB (Aug 24, 2024)
Communication Skills – TCS iON (Sep 26, 2024)
Association Analysis using Python – Wingspan (Jan 5, 2025)
Data Modeling and Visualization – TCS iON (Mar 11, 2025)
EDUCATION
Bachelor of Computer Science with 

## Set the Google Generative AI API Key

In [5]:
%pip install google-generativeai python-dotenv

Note: you may need to restart the kernel to use updated packages.


In [6]:
import google.generativeai as genai
import os
from dotenv import load_dotenv

load_dotenv()
genai.configure(api_key=os.getenv("GOOGLE_API_KEY"))
model = genai.GenerativeModel("gemini-1.5-flash")
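
`os.getenv` returns `None` when the variable is missing, so a missing `GOOGLE_API_KEY` would only surface later as an authentication error. One possible guard is sketched below. `require_env` is a hypothetical helper, not part of `python-dotenv` or the Google SDK, and the `env` parameter exists only so the function can be exercised without touching the real environment.

```python
import os


def require_env(name, env=None):
    """Return a non-empty environment variable value or raise a clear error."""
    env = os.environ if env is None else env
    value = env.get(name, "").strip()
    if not value:
        raise RuntimeError(
            f"{name} is not set; add it to your .env file or shell environment."
        )
    return value


# Possible usage in the cell above (after load_dotenv()):
# genai.configure(api_key=require_env("GOOGLE_API_KEY"))
```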

In [7]:
response = model.generate_content("What is the capital of India?")


In [8]:
print(response)

response:
GenerateContentResponse(
    done=True,
    iterator=None,
    result=protos.GenerateContentResponse({
      "candidates": [
        {
          "content": {
            "parts": [
              {
                "text": "The capital of India is **New Delhi**.\n"
              }
            ],
            "role": "model"
          },
          "finish_reason": "STOP",
          "avg_logprobs": -0.0026347806677222254
        }
      ],
      "usage_metadata": {
        "prompt_token_count": 7,
        "candidates_token_count": 10,
        "total_token_count": 17
      },
      "model_version": "gemini-1.5-flash"
    }),
)


In [9]:
print(response.text)

The capital of India is **New Delhi**.
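
To make the relationship between the full printed response and `response.text` easier to see, the sketch below walks a plain dictionary that mirrors the structure printed in cell 8. It works on a hand-copied dict, not on the SDK's response object, and `collect_text` is a hypothetical helper written for this illustration.

```python
def collect_text(response_dict):
    """Join the text parts of every candidate in a dict-shaped response."""
    pieces = []
    for candidate in response_dict.get("candidates", []):
        for part in candidate.get("content", {}).get("parts", []):
            if "text" in part:
                pieces.append(part["text"])
    return "".join(pieces)


# Plain-dict copy of the relevant fields from the printed response above.
sample = {
    "candidates": [
        {
            "content": {
                "parts": [{"text": "The capital of India is **New Delhi**.\n"}],
                "role": "model",
            },
            "finish_reason": "STOP",
        }
    ],
    "usage_metadata": {
        "prompt_token_count": 7,
        "candidates_token_count": 10,
        "total_token_count": 17,
    },
}

print(collect_text(sample).strip())

# In this response, the total token count is the prompt plus candidate tokens.
usage = sample["usage_metadata"]
print(usage["prompt_token_count"] + usage["candidates_token_count"])
```

In the printed response, the answer sits under `candidates[0].content.parts[0].text`, which is the same string that `response.text` returned in cell 9.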



## Resume Analysis

In [10]:
def analyze_resume(resume_text, job_description=None):
    if not resume_text:
        return {"error": "Resume text is required for analysis."}
    
    model = genai.GenerativeModel("gemini-1.5-flash")
    
    base_prompt = f"""
    You are an experienced HR professional with technical experience in one of the following job roles: Data Science, Data Analyst, DevOps, Machine Learning Engineer, Prompt Engineer, AI Engineer, Full Stack Web Development, Big Data Engineering, Marketing Analyst, Human Resource Manager, or Software Developer. Your task is to review the provided resume.
    Please share your professional evaluation of whether the candidate's profile aligns with the role. Also mention the skills the candidate already has, suggest skills that would improve the resume, and suggest courses the candidate might take to build those skills. Highlight the strengths and weaknesses.

    Resume:
    {resume_text}
    """

    if job_description:
        base_prompt += f"""
        Additionally, compare this resume to the following job description:
        
        Job Description:
        {job_description}
        
        Highlight the strengths and weaknesses of the applicant in relation to the specified job requirements.
        """

    response = model.generate_content(base_prompt)

    analysis = response.text.strip()
    return analysis
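
Because `analyze_resume` builds the prompt and calls the model in one step, the prompt logic cannot be checked without an API call. A minimal sketch of separating the two is shown below. `build_prompt` is a hypothetical helper, and its instruction sentences are shortened placeholders, not the full prompt used above.

```python
def build_prompt(resume_text, job_description=None):
    """Assemble the review prompt as plain text, without calling the model."""
    sections = [
        # Placeholder instruction; the notebook's full prompt would go here.
        "You are an experienced HR reviewer. Evaluate the resume below.",
        f"Resume:\n{resume_text.strip()}",
    ]
    if job_description:
        sections.append(f"Job Description:\n{job_description.strip()}")
        sections.append(
            "Highlight strengths and weaknesses relative to the job requirements."
        )
    return "\n\n".join(sections)


print(build_prompt("Python, SQL, Power BI", "Entry-level Data Analyst"))
```

With this split, `analyze_resume` could pass the result of `build_prompt(...)` to `model.generate_content`, and the optional job-description branch can be inspected by printing the prompt alone.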

In [11]:
print(analyze_resume(resume_text))


## Resume Evaluation: Rosan S - Data Analyst

**Overall Assessment:** Rosan's resume shows promise for an entry-level Data Analyst role, particularly given his recent internship and project experience. However, it needs significant improvement in structure, quantifiable results, and showcasing a broader range of skills.

**Strengths:**

* **Strong foundational skills:** He possesses a solid base in Python, SQL, Power BI, and R, which are essential for data analysis.
* **Relevant Projects:** The Facebook Ad Campaign analysis and Crop Recommendation System projects demonstrate practical application of his skills.  These are good starting points.
* **Certifications:** While some certifications are quite recent, they show a proactive approach to learning and skill development.  They are still in progress, so that timeline is notable.
* **Presentation Skills & Conference Presentation:** Presenting at ICBAC demonstrates communication and presentation abilities, crucial for a data analyst.
* 