##### Copyright 2025 Google LLC.

In [None]:
# @title Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# Gemini API: Basic evaluation

<a target="_blank" href="https://colab.research.google.com/github/google-gemini/cookbook/blob/main/examples/prompting/Basic_Evaluation.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" height=30/></a>

The Gemini API's Python SDK can be used for various forms of evaluation, including:
- Providing feedback based on selected criteria
- Comparing multiple texts
- Assigning grades or confidence scores
- Identifying weak areas

Below is an example of using the LLM to enhance text quality through feedback and grading.

In [1]:
%pip install -U -q "google-genai>=1.0.0"

## Configure your API key

To run the following cell, your API key must be stored in a Colab Secret named `GOOGLE_API_KEY`. If you don't already have an API key, or you're not sure how to create a Colab Secret, see [Authentication](https://github.com/google-gemini/cookbook/blob/main/quickstarts/Authentication.ipynb) for an example.

In [2]:
from google import genai
from google.colab import userdata

# Read the key from the Colab Secret and create the API client.
GOOGLE_API_KEY = userdata.get('GOOGLE_API_KEY')
client = genai.Client(api_key=GOOGLE_API_KEY)
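
The cell above only works inside Colab. If you run the notebook elsewhere, one option is to fall back to an environment variable. The following is a minimal sketch, not part of the original notebook: `get_api_key` is a hypothetical helper, and it assumes the key is exported under the same name, `GOOGLE_API_KEY`.

```python
import os


def get_api_key(name: str = "GOOGLE_API_KEY") -> str:
    """Return the API key from a Colab Secret, else from an environment variable.

    Hypothetical helper for illustration; it is not part of the SDK.
    """
    try:
        # Only importable when running inside Colab.
        from google.colab import userdata
    except ImportError:
        userdata = None

    key = None
    if userdata is not None:
        try:
            key = userdata.get(name)
        except Exception:
            # Assumption: a missing secret or denied notebook access raises here.
            key = None

    # Fall back to the process environment when no Colab Secret was found.
    key = key or os.environ.get(name)
    if not key:
        raise RuntimeError(
            f"No API key found: set a Colab Secret or environment variable named {name}."
        )
    return key
```

With this helper, the client could be created as `client = genai.Client(api_key=get_api_key())`.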

## Example

Start by defining some system instructions for this problem. For demonstration purposes, the use case involves prompting the model to write an essay with some mistakes. For generation tasks like essay writing, you can raise the model's temperature to get more creative answers. This example sets `temperature=0` so that the output is more predictable.

In [3]:
from IPython.display import Markdown
from google.genai import types

# System instruction for the "student" that writes a deliberately flawed essay.
student_system_prompt = """
  You're a college student. Your job is to write an essay riddled with
  common mistakes and a few major ones. The essay should have mistakes
  regarding clarity, grammar, argumentation, and vocabulary. Ensure your
  essay includes a clear thesis statement. You should write only an essay,
  so do not include any notes.
"""

MODEL_ID = "gemini-2.0-flash" # @param ["gemini-2.0-flash-lite","gemini-2.0-flash","gemini-2.5-pro-exp-03-25"] {"allow-input":true, isTemplate: true}

essay = client.models.generate_content(
    model=MODEL_ID,
    contents="Write an essay about benefits of reading.",
    config=types.GenerateContentConfig(
        temperature=0,
        system_instruction=student_system_prompt
    )
).text

Markdown(essay)

Reading: It's Like, Really Good For You

Okay, so like, everyone always says reading is good for you, right? And I guess they're not totally wrong. My thesis is that reading is beneficial because it makes you sound smarter, helps you learn new words, and, like, keeps you from being bored.

First of all, reading makes you sound way more intelligent. I mean, think about it. If you read a lot of books, you'll just naturally start using bigger words and more complicated sentences. Like, instead of saying "that's bad," you could say "that is a deleterious outcome." See? Way smarter sounding. Plus, people will think you're, like, a genius or something, which is always a good thing. My mom always says I should read more so I can get a better job, and I think this is what she means.

Secondly, reading is a great way to learn new vocabulary. I mean, duh. Every time you read a book, you're exposed to all sorts of new words that you've never heard before. And then, like, you can just look them up in the dictionary and boom! You know what they mean. This is especially helpful for writing essays, because you can use all those new words to make your writing sound more sophisticated. I once read a book about, like, quantum physics, and now I know all sorts of crazy words that I can use to impress my friends.

Finally, reading is a great way to avoid boredom. I mean, what else are you going to do when you're stuck on a long car ride or waiting for the bus? Stare at your phone? That's so boring. But if you have a book, you can just escape into another world and forget all about your problems. Plus, reading is way more productive than watching TV or playing video games. My brother plays video games all day, and he's, like, totally wasting his life.

In conclusion, reading is super beneficial for a variety of reasons. It makes you sound smarter, helps you learn new words, and keeps you from being bored. So, like, go read a book already! You'll be glad you did. It's a win-win situation, really. And, like, it's way better than doing homework.


In [6]:
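# System instruction for the "teacher" that grades and corrects the essay.
# A plain string is enough here: the prompt has no placeholders to format.
teacher_system_prompt = """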
  As a teacher, you are tasked with grading students' essays.
  Please follow these instructions for evaluation:

  1. Evaluate the essay on a scale of 1-5 based on the following criteria:
  - Thesis statement,
  - Clarity and precision of language,
  - Grammar and punctuation,
  - Argumentation

  2. Write a corrected version of the essay, addressing any identified issues
  in the original submission. Point out what changes were made.
"""
teacher_response = client.models.generate_content(
    model=MODEL_ID,
    contents=essay,
    config=types.GenerateContentConfig(
        system_instruction=teacher_system_prompt,
        temperature=0
    )
)

Markdown(teacher_response.text)

Okay, here's a breakdown of the essay and a revised version:

**Evaluation:**

*   **Thesis Statement (2/5):** The thesis is present but simplistic and lacks sophistication. While it identifies three benefits, it could be more specific and engaging.
*   **Clarity and Precision of Language (2/5):** The language is very informal and conversational ("like," "duh," "way"). This undermines the essay's credibility. The examples are simplistic.
*   **Grammar and Punctuation (3/5):** Generally correct, but the overuse of "like" and informal phrasing detracts from the overall quality.
*   **Argumentation (2/5):** The arguments are weak and rely on generalizations. There's a lack of evidence or deeper analysis to support the claims. The connection between reading and sounding smarter is superficial.

**Corrected Essay:**

Reading: An Investment in Intellectual Growth

The pervasive notion that reading is beneficial holds considerable merit. Reading fosters intellectual growth by enhancing communication skills, expanding vocabulary, and providing an engaging alternative to passive entertainment.

Firstly, engaging with literature cultivates more effective communication skills. Exposure to diverse writing styles and complex sentence structures refines one's ability to articulate ideas with clarity and precision. For instance, familiarity with sophisticated texts enables one to move beyond simplistic phrasing, replacing a vague statement like "that's bad" with a more nuanced expression such as "that outcome presents significant challenges." This enhanced command of language not only facilitates clearer communication but also projects an image of competence and intellectual acuity.

Secondly, reading serves as a powerful tool for vocabulary acquisition. Encountering unfamiliar words within context encourages active learning and vocabulary expansion. By consulting dictionaries or online resources to decipher the meaning of new terms, readers gradually integrate these words into their own lexicon. This expanded vocabulary proves invaluable in both written and verbal communication, allowing for more precise and nuanced expression. For example, exposure to scientific literature can introduce individuals to specialized terminology, enriching their understanding of complex concepts and enabling them to engage in more informed discussions.

Finally, reading offers a stimulating and productive alternative to passive forms of entertainment. In moments of downtime, such as during commutes or periods of waiting, a book can provide an immersive escape from boredom. Unlike passive activities like watching television or playing video games, reading actively engages the mind, fostering critical thinking and imagination. Furthermore, the knowledge and insights gained from reading can contribute to personal growth and intellectual enrichment, making it a far more rewarding pursuit than mindless entertainment.

In conclusion, reading offers multifaceted benefits that extend beyond mere entertainment. By honing communication skills, expanding vocabulary, and providing a stimulating alternative to passive leisure activities, reading serves as a valuable investment in intellectual growth and personal development. Embracing a habit of reading can empower individuals to become more articulate communicators, lifelong learners, and engaged members of society.

**Changes Made:**

*   **Replaced Informal Language:** Removed "like," "so," "duh," "I mean," and other colloquialisms to create a more formal and academic tone.
*   **Strengthened Thesis Statement:** The thesis is now more specific and sophisticated, focusing on intellectual growth.
*   **Improved Argumentation:** Replaced generalizations with more concrete examples and explanations. For instance, instead of just saying reading makes you sound smarter, the revised essay explains how it improves communication skills.
*   **Enhanced Vocabulary:** Used more precise and varied vocabulary to elevate the writing quality.
*   **Improved Sentence Structure:** Restructured sentences for clarity and flow.
*   **Added Transitions:** Incorporated transitional phrases to improve the coherence of the essay.
*   **Removed Redundancy:** Eliminated repetitive phrases and ideas.
*   **Focused on Credibility:** The overall tone is now more confident and authoritative.
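
The grades in the response above are embedded in free text. If you want them as numbers, for example to average them or track them across essays, you can parse them out. The following is a minimal sketch that assumes the model keeps the `**Criterion (n/5):**` layout seen in this particular response; the model is not guaranteed to do so. `extract_scores` and `sample_feedback` are hypothetical names introduced for illustration.

```python
import re

# Assumed layout, copied from the sample response: "**Criterion (n/5):**".
SCORE_PATTERN = re.compile(
    r"\*\*(?P<criterion>[^*(]+?)\s*\((?P<score>\d)/5\):\*\*"
)


def extract_scores(feedback: str) -> dict[str, int]:
    """Map each criterion name found in the feedback to its score."""
    return {
        match.group("criterion").strip(): int(match.group("score"))
        for match in SCORE_PATTERN.finditer(feedback)
    }


# Shortened copy of the evaluation section shown above.
sample_feedback = """
*   **Thesis Statement (2/5):** The thesis is present but simplistic.
*   **Clarity and Precision of Language (2/5):** The language is very informal.
*   **Grammar and Punctuation (3/5):** Generally correct.
*   **Argumentation (2/5):** The arguments are weak.
"""

scores = extract_scores(sample_feedback)
average = sum(scores.values()) / len(scores)

print(scores)
print(f"Average: {average:.2f}")  # Average: 2.25
```

In the notebook, you would call `extract_scores(teacher_response.text)` instead of using the sample string. An empty result means the response did not follow the assumed layout.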


## Next steps

Be sure to explore other examples of prompting in the repository. Try writing your own prompts for evaluating texts.
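
As one possible starting point, here is a sketch of a prompt for comparing multiple texts, one of the evaluation forms listed at the top of this notebook. `build_comparison_prompt` is a hypothetical helper, and the prompt wording is only an assumption about what might work well; adjust it to your use case.

```python
def build_comparison_prompt(texts: list[str], criteria: list[str]) -> str:
    """Assemble a single prompt asking the model to compare several texts."""
    # One bullet per criterion.
    criteria_block = "\n".join(f"- {criterion}" for criterion in criteria)
    # Number the texts so the model can refer to them unambiguously.
    texts_block = "\n\n".join(
        f"Text {index}:\n{text.strip()}"
        for index, text in enumerate(texts, start=1)
    )
    return (
        "Compare the following texts on each criterion, "
        "then state which text is strongest overall and why.\n\n"
        f"Criteria:\n{criteria_block}\n\n"
        f"{texts_block}"
    )


comparison_prompt = build_comparison_prompt(
    texts=["Reading is, like, really good.", "Reading fosters intellectual growth."],
    criteria=["Clarity and precision of language", "Argumentation"],
)
print(comparison_prompt)
```

The resulting string can be passed as `contents` to `client.models.generate_content`, in the same way as the essay in the example above.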