## Welcome to the Second Lab - Week 1, Day 3

Today we will work with lots of models! This is a way to get comfortable with APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different to other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submit a PR for changes in the community_contributions folder - instructions in the resources. Also, if you have a Github account, use this to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [4]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [5]:
# Always remember to do this!
load_dotenv(override=True)

True

In [6]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')
grok_api_key = os.getenv('GROK_API_KEY')
ollama_api_key = os.getenv('OLLAMA_API_KEY')

openai_llm_model = os.getenv('OPENAI_LLM_MODEL')
anthropic_llm_model = os.getenv('ANTHROPIC_LLM_MODEL')
google_llm_model = os.getenv('GOOGLE_LLM_MODEL')
deepseek_llm_model = os.getenv('DEEPSEEK_LLM_MODEL')
groq_llm_model = os.getenv('GROQ_LLM_MODEL')
grok_llm_model = os.getenv('GROK_LLM_MODEL')
ollama_llm_model = os.getenv('OLLAMA_LLM_MODEL')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")

if grok_api_key:
    print(f"Grok API Key exists and begins {grok_api_key[:4]}")
else:
    print("Grok API Key not set (and this is optional)")

if ollama_api_key:
    print(f"Ollama API Key exists and begins {ollama_api_key[:4]}")
else:
    print("Ollama API Key not set (and this is optional)")

OpenAI API Key exists and begins sk-proj-
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AI
DeepSeek API Key exists and begins sk-
Groq API Key exists and begins gsk_
Grok API Key exists and begins xai-
Ollama API Key exists and begins olla


In [7]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [8]:
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [9]:
openai = OpenAI()
response = openai.chat.completions.create(
    model=openai_llm_model,
    messages=messages,
)
question = response.choices[0].message.content
print(question)

If you could redesign the educational system from scratch to maximize creativity and critical thinking in students, what fundamental principles and approaches would you incorporate, and how would you assess the success of your design?


In [10]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

In [11]:
# The API we know well OPENAI

model_name = openai_llm_model

response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Redesigning an educational system from scratch to maximize creativity and critical thinking involves integrating several fundamental principles and approaches that engage students actively, encourage collaboration, and foster a growth mindset. Here are some key elements to consider:

### Fundamental Principles

1. **Student-Centered Learning:**
   - Shift the focus from standardized curricula to personalized learning experiences that resonate with students' interests, abilities, and learning styles.
   - Encourage student agency by allowing them to choose topics or projects that excite them, promoting ownership of their educational journey.

2. **Interdisciplinary Curriculum:**
   - Break down traditional subject barriers and create an integrated curriculum where fields such as science, art, literature, and technology intersect. This encourages students to see connections, fostering creativity and critical thinking.
   - Implement project-based learning (PBL) where students work on complex questions or problems over extended periods, integrating knowledge from different domains.

3. **Inquiry-Based Learning:**
   - Promote a culture of inquiry by encouraging students to ask questions and explore, rather than simply memorizing facts.
   - Use techniques such as guided discovery, where educators pose challenging problems and students actively explore solutions.

4. **Collaborative Learning Environments:**
   - Foster collaboration through group projects, peer reviews, and team-based problem-solving, helping students learn from one another and develop interpersonal skills.
   - Create dynamic learning spaces that facilitate collaboration, such as flexible seating arrangements and access to technology.

5. **Emphasis on Mental Flexibility:**
   - Teach students to embrace ambiguity, adapt to new information, and learn from failure, encouraging resilience and a growth-oriented mindset.
   - Integrate design thinking and entrepreneurship concepts, where students brainstorm solutions to community issues or create their own products or services.

6. **Real-World Connections:**
   - Partner with local businesses, artists, and community organizations to provide students with authentic learning experiences that connect classroom knowledge to real-world challenges.
   - Incorporate internships, mentorships, and service-learning projects that allow students to apply their skills in meaningful contexts.

### Assessment Approaches

1. **Formative and Summative Assessment:**
   - Use diverse assessment methods, such as portfolios, presentations, peer assessments, and reflective journals, that allow students to demonstrate their learning in various ways.
   - Implement formative assessments that provide feedback throughout the learning process, emphasizing growth and improvement rather than merely final grades.

2. **Holistic Assessment Criteria:**
   - Develop rubrics that emphasize creativity, critical thinking, collaboration, and problem-solving skills alongside content mastery.
   - Assess not just the final product but also the process, including students' ability to articulate their thinking, adapt their ideas, and work with others.

3. **Self and Peer Assessment:**
   - Encourage self-assessment and reflection, allowing students to set personal learning goals and track their progress over time.
   - Involve students in peer assessment processes to foster a sense of community and collective learning.

4. **Longitudinal Tracking:**
   - Implement a system for tracking students’ progress over time on creativity and critical thinking skills through various indicators, ensuring continuous evaluation rather than one-time assessments.

### Success Evaluation

1. **Student Engagement Metrics:**
   - Measure engagement through surveys, participation rates, and feedback to gauge how excited and invested students are in their learning experiences.

2. **Skill Development:**
   - Use assessments of critical thinking skills, creativity (e.g., problem-solving tasks), and collaborative ability to evaluate the development of these competencies over time.

3. **Post-Educational Outcomes:**
   - Assess graduates’ success in terms of their adaptability in higher education and the workforce. This can include surveys on job readiness, innovative thinking, and contributions to creative industries.

4. **Community Feedback:**
   - Gather input from parents, educators, and community partners on the perceived value and effectiveness of the education system in fostering creativity and critical thinking.

5. **Continuous Improvement Process:**
   - Establish a feedback loop where data collected from assessments inform ongoing adjustments and improvements to the curriculum and teaching practices.

In conclusion, this redesigned educational system would prioritize a holistic approach to learning, deeply engaged students, and a culture of inquiry, ultimately equipping them with the skills needed to thrive in an ever-changing world.

In [12]:
# Anthropic has a slightly different API, and Max Tokens is required

model_name = anthropic_llm_model
print(model_name)
claude = Anthropic(api_key=anthropic_api_key)

response = claude.messages.create(model=model_name, messages=messages, max_tokens=1024)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

claude-3-7-sonnet-latest


# Redesigning Education for Creativity and Critical Thinking

## Fundamental Principles

1. **Curiosity-Driven Learning**: Structure education around questions rather than answers, with curriculum that evolves based on student inquiry.

2. **Interdisciplinary Focus**: Dissolve rigid subject boundaries to reflect how real-world problems span multiple domains.

3. **Personalized Pathways**: Accommodate diverse learning styles, interests, and developmental timelines.

4. **Failure as Learning**: Create safe environments where failure is reframed as valuable iteration.

5. **Real-World Relevance**: Connect learning directly to authentic challenges and community needs.

## Key Approaches

- **Project-Based Learning**: Extended immersion in complex problems requiring multiple skills and knowledge areas.

- **Mentorship Networks**: Connect students with practitioners and experts beyond classroom teachers.

- **Flexible Scheduling**: Replace rigid time blocks with fluid scheduling that allows deep engagement.

- **Physical Environments**: Design spaces conducive to collaboration, making, and different modes of thinking.

- **Technology Integration**: Use technology as creation tools rather than just consumption platforms.

## Assessment Framework

- **Portfolio-Based Evaluation**: Assess growth through cumulative work demonstrating development over time.

- **Multi-Dimensional Metrics**: Measure collaboration skills, resilience, adaptability, and questioning quality alongside knowledge acquisition.

- **Longitudinal Success Indicators**: Track graduates' ability to adapt to changing circumstances, generate novel solutions, and find fulfillment in their pursuits.

- **Self-Assessment Capacity**: Develop students' ability to evaluate their own work against meaningful standards.

The ultimate success measure would be graduates who can identify meaningful problems, devise innovative approaches to solving them, and continuously reinvent their skills as the world changes.

In [None]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = google_llm_model

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Redesigning the educational system from scratch to maximize creativity and critical thinking requires a radical shift from content delivery to skill development, from passive reception to active inquiry, and from standardized answers to divergent solutions.

Here are the fundamental principles and approaches I would incorporate:

## Fundamental Principles:

1.  **Inquiry-Based Learning:** Every unit, lesson, and activity starts with a compelling question, a real-world problem, or an intriguing phenomenon, fostering curiosity and the desire to seek answers.
2.  **Interdisciplinary Integration:** Subject silos are dissolved. Learning is organized around themes, big ideas, and complex problems that naturally draw from multiple disciplines (science, history, art, math, ethics, etc.).
3.  **Project-Based & Challenge-Based Learning:** The primary mode of learning is through extended, open-ended projects that require students to define problems, conduct research, collaborate, design solutions, iterate, and present their findings.
4.  **Mastery Over Timelines:** Students progress based on demonstrated understanding and mastery of skills, not on age or arbitrary timelines. Learning is personalized, allowing students to delve deeper into areas of interest and take longer where needed.
5.  **Growth Mindset & Psychological Safety:** Failure is reframed as a crucial learning opportunity. The environment explicitly encourages risk-taking, experimentation, questioning, and constructive feedback without fear of judgment.
6.  **Metacognition & Reflection:** Students are explicitly taught *how* to think, *how* to learn, and *how* to evaluate their own thinking processes. Regular reflection is embedded into every learning experience.
7.  **Empowerment & Agency:** Students have significant voice and choice in what they learn, how they learn it, and how they demonstrate their understanding, fostering intrinsic motivation and ownership.
8.  **Community & Real-World Connection:** Learning extends beyond the classroom walls, connecting students with experts, businesses, community organizations, and real-world issues.

## Approaches & Design Elements:

### Curriculum & Pedagogy:

*   **"Big Questions" Curriculum:** Instead of subjects, the curriculum is structured around enduring "big questions" (e.g., "How do societies create justice?", "What is the nature of intelligence?", "How does innovation change the world?"). These questions are explored through modules that integrate various disciplines.
*   **Design Thinking Framework:** Students are consistently guided through the stages of design thinking (Empathize, Define, Ideate, Prototype, Test) to approach problems creatively and iteratively.
*   **Socratic Seminars & Debates:** Regular forums for students to engage in deep, critical discussions, articulate their reasoning, challenge assumptions, and consider diverse perspectives.
*   **Dedicated "Discovery Time" & "Tinkering Labs":** Scheduled blocks where students pursue self-directed interests, explore new tools, experiment with materials, and engage in unstructured play, fostering divergent thinking and intrinsic motivation.
*   **Philosophy & Ethics Integration:** Explicit instruction and discussion on logical fallacies, ethical dilemmas, philosophical reasoning, and the impact of ideas.
*   **Entrepreneurial & Innovation Hubs:** Spaces and opportunities for students to develop and pitch ideas, create products/services, and understand the process of bringing novel solutions to life.
*   **"Reverse Engineering" Projects:** Students deconstruct complex systems (e.g., a piece of software, a historical event, a natural phenomenon) to understand their components, functions, and underlying principles.

### Learning Environment & Structure:

*   **Flexible Learning Spaces:** Classrooms are replaced by adaptable studios, labs, makerspaces, quiet zones, and collaborative hubs, designed to support various learning modes. Outdoor learning environments are also integrated.
*   **Multi-Age Learning Cohorts:** Students are grouped based on interest, project needs, or skill level rather than strict age, encouraging peer mentorship and diverse perspectives.
*   **Educators as Facilitators & Mentors:** Teachers transform into guides, coaches, and co-learners. Their role is to pose insightful questions, provide resources, offer feedback, model critical thinking, and facilitate collaborative inquiry, rather than delivering content.
*   **Digital Fluency & Critical Media Literacy:** Technology is integrated as a tool for research, creation, collaboration, and critical analysis, not just consumption. Students learn to evaluate sources, understand algorithms, and contribute responsibly to digital spaces.
*   **"Knowledge Navigators":** Dedicated roles for individuals who help students find and critically evaluate information, connect with external experts, and manage project resources.

### Role of Technology:

*   **Adaptive Learning Platforms:** AI-powered tools provide personalized learning pathways, recommend resources, and offer immediate, targeted feedback on skill development (e.g., in logic, argumentation, or specific technical skills).
*   **Virtual & Augmented Reality:** Immersive experiences for exploring complex concepts, historical events, scientific phenomena, or distant cultures, enabling deeper engagement and understanding.
*   **Collaborative Design Tools:** Software for brainstorming, mind mapping, prototyping, and project management, mirroring professional environments.

## Assessment of Success:

Traditional grades and standardized tests are largely eliminated, replaced by a holistic, continuous assessment model focused on demonstrating skills and growth.

1.  **Project-Based Portfolios:**
    *   **Content:** Curated collections of student work showcasing projects, research, creative outputs, design iterations, prototypes, and solutions to challenges.
    *   **Focus:** Evaluation emphasizes the *process* (research depth, problem definition, ideation, collaboration, iteration, reflection) as much as the final product.
    *   **Rubrics:** Specific rubrics are used to assess creativity (originality, divergent thinking, risk-taking), critical thinking (analysis, evidence, logical reasoning, argumentation, questioning assumptions), collaboration, communication, and self-direction.

2.  **Performance-Based Assessments:**
    *   **Examples:** Live presentations, debates, scientific experiments, artistic performances, simulations, design challenges, and community service projects where students demonstrate their abilities in real-time.
    *   **Evaluation:** Assessed by multiple observers (educators, peers, community experts) using detailed rubrics.

3.  **Metacognitive Reflections & Learning Journals:**
    *   **Content:** Students regularly document their thinking processes, learning strategies, challenges encountered, solutions devised, and insights gained.
    *   **Focus:** Demonstrates understanding of *how* they think and learn, their ability to self-assess, and their capacity for growth.

4.  **Peer & Self-Assessment:**
    *   **Approach:** Students are taught to critically evaluate their own work and the work of their peers against clear criteria, fostering self-awareness and evaluative skills.

5.  **"Innovation Challenges" & "Problem-Solving Sprints":**
    *   **Approach:** Periodically, students are given novel, time-bound challenges requiring them to apply their critical thinking and creativity to unfamiliar problems, often in teams.
    *   **Evaluation:** Assesses adaptability, resilience, and the ability to generate solutions under pressure.

6.  **Educator Observations & Anecdotal Records:**
    *   **Approach:** Detailed qualitative feedback from educators on student engagement, inquiry, collaboration, resilience, and specific instances of creative problem-solving or critical insight.

7.  **Longitudinal Tracking & Alumni Success:**
    *   **Metrics:** Track the long-term success of graduates in diverse fields (beyond just academic metrics), focusing on their adaptability, innovation in their chosen careers, civic engagement, entrepreneurial ventures, and continued pursuit of learning.
    *   **Surveys & Interviews:** Gather feedback from alumni, employers, and higher education institutions on the preparedness of students for complex, real-world challenges.

8.  **Student & Community Feedback:**
    *   **Approach:** Regular surveys and forums to gather feedback from students, parents, and community partners on the educational experience, perceived growth in critical thinking and creativity, and overall engagement.

By integrating these principles and approaches, and by focusing assessment on the demonstration of critical thinking and creative skills in authentic contexts, this redesigned system would aim to cultivate generations of adaptable, innovative, and deeply thoughtful individuals.

In [None]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = deepseek_llm_model

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Of course. This is a fascinating and vital thought experiment. Redesigning the educational system to maximize creativity and critical thinking requires a fundamental shift from a model of knowledge transmission to one of cultivation and empowerment.

Here is a blueprint for such a system, built on core principles, implemented through specific approaches, and assessed with meaningful metrics.

---

### **Fundamental Principles**

These principles would form the philosophical bedrock of the new system:

1.  **Curiosity as the Engine, Not Compliance as the Rule:** The primary goal is to ignite and sustain a student's innate curiosity. Learning is framed as an exciting process of discovery, not a chore to be completed for a grade. The question "What do you want to learn?" becomes as important as "What you need to learn."

2.  **Interdisciplinary & Real-World Connection:** Knowledge is not siloed into separate subjects. Math, science, history, art, and language are taught as interconnected tools for understanding and shaping the world. Problems are complex and require integrated solutions.

3.  **The Process Over the Product:** The focus shifts from the "right answer" to the *quality of the thinking* used to get there. Embracing iteration, productive failure, and revision is central to the learning process. A polished final project is less valuable than a messy portfolio showing the evolution of thought.

4.  **Autonomy and Agency:** Students have increasing ownership over their learning journey. This includes choices in what they study (within a framework), how they demonstrate understanding, and the pace at which they progress. They are active participants, not passive recipients.

5.  **The Teacher as a Facilitator and Coach:** The role of the educator transforms from "sage on the stage" to "guide on the side." Teachers are experts in learning processes, asking powerful questions, providing resources, and giving feedback that fosters growth, not just judgment.

---

### **Key Approaches & Structural Changes**

These principles would be brought to life through the following concrete approaches:

**1. Project-Based, Inquiry-Driven Learning (PBL):**
*   **How it works:** The curriculum is organized around large, open-ended, multi-week questions or challenges (e.g., "How can we design a sustainable garden for our community?" or "What caused the fall of the Roman Empire, and what can it teach us about modern societies?").
*   **Why it works:** It forces students to ask questions, research, collaborate, apply knowledge from various disciplines, create solutions, and present their findings. It inherently cultivates both critical thinking (research, analysis) and creativity (design, problem-solving).

**2. Socratic Seminars and Harkness Discussions:**
*   **How it works:** Class time is dedicated to student-led discussions of complex texts, ideas, and problems. The teacher's role is to ask probing questions that deepen the conversation, challenge assumptions, and ensure equitable participation.
*   **Why it works:** It teaches students to analyze arguments, articulate their thoughts, defend positions with evidence, and respectfully critique the ideas of others—the essence of critical thinking.

**3. The Mandatory "Maker" Curriculum:**
*   **How it works:** Every student engages in dedicated time for hands-on creation. This includes traditional arts (painting, music, writing), digital arts (coding, filmmaking, graphic design), and physical fabrication (woodworking, robotics, textiles).
*   **Why it works:** Making is the physical manifestation of creative thought. It teaches design thinking, resilience through trial and error, and the joy of bringing an idea into the world.

**4. Competency-Based Progression:**
*   **How it works:** Students advance upon demonstrating mastery of a specific skill or concept, not based on seat time or age. This eliminates the boredom of waiting for peers and the stress of being left behind. A 12-year-old could be mastering high-school-level math while working on age-appropriate social-emotional skills.
*   **Why it works:** It personalizes learning and reinforces a growth mindset—the belief that ability is developed through effort. The focus is on genuine understanding.

**5. Integrated Social-Emotional Learning (SEL):**
*   **How it works:** Skills like empathy, mindfulness, collaboration, and resilience are explicitly taught and practiced daily. They are not a separate program but are woven into the fabric of every project and discussion.
*   **Why it works:** Creativity and critical thinking are stifled by fear of judgment and a fixed mindset. SEL creates a psychologically safe environment where students feel comfortable taking intellectual risks, sharing wild ideas, and being wrong.

**6. Apprenticeships and Community Engagement:**
*   **How it works:** In later years, students spend significant time learning outside the school walls—interning with scientists, artists, tradespeople, entrepreneurs, and community organizers.
*   **Why it works:** It connects learning to real-world contexts, provides authentic mentors, and allows students to test their skills and passions in a professional setting, fostering both practical critical thinking and creative application.

---

### **Assessing Success: Moving Beyond Standardized Tests**

The assessment system must be as innovative as the teaching system. It would be multi-faceted and focused on growth and capability.

**1. Digital Portfolios (The Primary Tool):**
*   **What it is:** A curated collection of a student's work over time: project write-ups, research papers, creative pieces, video presentations of their defense, code repositories, art portfolios, and, crucially, their reflections on the process.
*   **What it measures:** Depth of thought, growth over time, creativity, mastery of skills, and ability to self-reflect. It provides a rich, holistic picture of the student.

**2. Narrative Feedback and Mastery Transcripts:**
*   **What it is:** Instead of letter grades, students receive detailed, qualitative feedback from teachers and peers on specific competencies (e.g., "effectively synthesizes research from multiple sources," "demonstrates originality in problem-solving").
*   **What it measures:** A "transcript" would be a list of mastered competencies with links to portfolio evidence, providing universities and employers a much clearer picture of a student's abilities than a GPA.

**3. Performance-Based Assessments:**
*   **What it is:** Students are given a complex, novel problem (individually or in a team) and are assessed on their process and solution. For example, "Here is a dataset on local climate patterns. Analyze it and propose a solution to reduce our school's carbon footprint."
*   **What it measures:** The ability to apply critical thinking and creativity *in real-time* to an unfamiliar challenge—a true test of their skills.

**4. Metrics for System-Wide Success:**
*   **Longitudinal Studies:** Tracking graduates into higher education and the workforce to measure indicators of success: innovation (patents, startups, artistic contributions), adaptability (career changes, promotions), and civic engagement (voting, community leadership).
*   **Student Surveys:** Regularly measuring students' intrinsic motivation, curiosity, self-efficacy, and sense of purpose.
*   **"Innovation Index":** A school-level metric measuring the number and quality of student-led projects, community initiatives, and creative outputs generated by the student body.

### **Conclusion**

This redesigned system would not be easier than the current one; it would be more demanding and complex for both students and educators. However, it would be infinitely more meaningful. It would produce graduates who are not walking repositories of information, but **adaptive, innovative, and engaged thinkers**—equipped not just to pass tests, but to solve the complex problems of the future and lead fulfilling, creative lives. The ultimate measure of its success would be a generation that looks at a problem and doesn't ask "What's the answer?" but "What are the possibilities?"

In [None]:
groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = groq_llm_model

response = groq.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)


If I could redesign the educational system from scratch to maximize creativity and critical thinking in students, I would incorporate the following fundamental principles and approaches:

**Principles:**

1. **Student-centered and personalized learning**: Each student learns at their own pace and in their own way. The education system would be tailored to individual students' strengths, interests, and learning styles.
2. **Interdisciplinary and project-based learning**: Learning is an integrated, holistic process. Students would work on projects that combine multiple subjects, encouraging them to see connections and think critically.
3. **Inquiry-based and problem-solving focus**: Students would be presented with open-ended questions and real-world problems to solve, fostering critical thinking, creativity, and collaboration.
4. **Emphasis on creativity, imagination, and innovation**: The system would encourage students to explore, experiment, and take risks, with a focus on generating new ideas and solutions.
5. **Social-emotional learning and well-being**: Students' emotional intelligence, self-awareness, and well-being would be valued and supported, as these are essential for effective learning and creativity.
6. **Flexibility and adaptability**: The education system would be designed to be flexible and adaptable, allowing for continuous innovation and improvement.

**Approaches:**

1. **Competency-based progression**: Students would progress through the education system based on demonstrated mastery of skills and competencies, rather than age or grade level.
2. **Mentorship and coaching**: Students would be paired with experienced mentors and coaches who provide guidance, support, and feedback, helping them navigate the learning process.
3. **Real-world applications and community engagement**: Students would engage in real-world projects and collaborations with community organizations, businesses, and experts, making learning relevant and meaningful.
4. **Technology-integrated and immersive learning**: The education system would leverage technology to provide immersive, interactive, and personalized learning experiences.
5. **Regular feedback and self-assessment**: Students would receive regular feedback from peers, mentors, and themselves, helping them reflect on their learning, set goals, and adjust their approach.

**Assessment and Evaluation:**

To assess the success of the redesigned education system, I would use a combination of quantitative and qualitative metrics, including:

1. **Student portfolios and exhibitions**: Students would showcase their work, reflecting their progress, creativity, and critical thinking.
2. **Peer and self-assessment**: Students would regularly evaluate their own learning and that of their peers, promoting a growth mindset and meta-cognitive awareness.
3. **Mentor and coach evaluations**: Experienced mentors and coaches would assess student progress, providing feedback and guidance.
4. **Project-based evaluations**: Students' projects and solutions would be evaluated on their creativity, innovation, and real-world applicability.
5. **Standardized assessments**: Periodic, standardized assessments would be used to evaluate student mastery of essential skills and competencies.
6. **Long-term outcomes**: The education system would track graduates' success in their chosen fields, assessing the effectiveness of the system in preparing students for an ever-changing world.

**Indicators of Success:**

The following indicators would suggest the redesigned education system is successful:

1. **Increased student engagement and motivation**: Students are enthusiastic about learning, take ownership of their education, and are motivated to continue learning throughout their lives.
2. **Improved creativity and innovation**: Students demonstrate the ability to generate new ideas, think critically, and approach problems with a creative and innovative mindset.
3. **Enhanced collaboration and communication**: Students work effectively in teams, communicate ideas clearly, and demonstrate empathy and understanding of diverse perspectives.
4. **Better preparation for an uncertain future**: Graduates are equipped to adapt to an ever-changing world, with a strong foundation in critical thinking, problem-solving, and creativity.
5. **Increased community engagement and social responsibility**: Students demonstrate a sense of social responsibility, are engaged in their communities, and contribute to the betterment of society.

By incorporating these fundamental principles and approaches, and assessing success through a combination of metrics and indicators, the redesigned education system would prioritize creativity, critical thinking, and student-centered learning, preparing students to thrive in an increasingly complex and interconnected world.

## For the next cell, we will use Ollama

Ollama runs a local web service that gives an OpenAI compatible endpoint,  
and runs models locally using high performance C++ code.

If you don't have Ollama, install it here by visiting https://ollama.com then pressing Download and following the instructions.

After it's installed, you should be able to visit here: http://localhost:11434 and see the message "Ollama is running"

You might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See the <A href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [None]:
#GROK from xAI
#Bcause autogen-ext[grpc]==0.6.1 depends on grpcio>=1.70.0,<1.71.dev0 and all versions
#of xai-sdk depend on grpcio>=1.72.1,<2, we can conclude that all versions of xai-sdk and
#autogen-ext[grpc]>=0.4.9.2 are incompatible.
#And because your project depends on autogen-ext[grpc]>=0.4.9.2 and xai-sdk, we can conclude that
#your project's requirements are unsatisfiable.
from xai_sdk import Client
from xai_sdk.chat import user, system

client = Client(api_key=anthropic_api_key)

chat = client.chat.create(model=anthropic_llm_model, temperature=0, messages=messages)

answer = chat.sample().content
print(answer)

display(Markdown(answer))

In [17]:
!ollama pull llama3.2

[?2026h[?25l[1Gpulling manifest ⠋ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠙ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠹ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠸ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠼ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠴ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠦ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠧ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠇ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠏ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest ⠋ [K[?25h[?2026l[?2026h[?25l[1Gpulling manifest [K
pulling dde5aa3fc5ff... 100% ▕████████████████▏ 2.0 GB                         [K
pulling 966de95ca8a6... 100% ▕████████████████▏ 1.4 KB                         [K
pulling fcc5a6bec9da... 100% ▕████████████████▏ 7.7 KB                         [K
pulling a70ff7e570d9... 100% ▕████████████████▏ 6.0 KB                         [K
pulling 56bb8bd477a5... 100% ▕█████

In [18]:
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Redesigning the educational system to foster creativity and critical thinking requires a holistic approach that integrates various pedagogical strategies, curricular structures, and assessment methods. Here's a comprehensive outline of my hypothetical design:

**Foundational Principles:**

1. **Competency-Based Learning**: Focus on teaching specific skills and competencies rather than age-based grade levels.
2. **Personalized Learning**: Recognize individual differences in learning styles, pace, and interests to tailor instruction to each student's needs.
3. **Emphasis on Process Over Product**: Encourage experimentation, prototyping, and iteration over traditional product-oriented approaches.
4. **Culturally Responsive Teaching**: Foster an inclusive environment that values diverse perspectives, experiences, and cultural norms.
5. **Community Engagement**: Connect students with local communities to promote social responsibility and global awareness.

**Curricular Structure:**

1. **Interdisciplinary Learning Paths**: Design curricula that integrate multiple subjects, such as STEM fields with the humanities and social sciences.
2. **Theme-Based Integration**: Organize learning into thematic modules that connect across subjects, fostering a holistic understanding of complex systems.
3. **Skills-Based Progression**: Align instruction to specific skill sets (e.g., creativity, problem-solving, communication) rather than traditional content-based progression.

**Approaches to Teaching and Learning:**

1. **Project-Based Learning**: Encourage students to embark on self-directed projects that require critical thinking, analysis, and problem-solving.
2. **Collaborative Workspace Design**: Provide students with flexible, collaborative workspaces that facilitate peer-to-peer learning and idea-sharing.
3. **Experiential Education**: Incorporate hands-on experiences, real-world applications, and service-learning opportunities to build empathy and social responsibility.
4. **Teacher Trainings**: Offer ongoing professional development opportunities for educators to refine their knowledge of modern pedagogies, curriculum design, and assessment strategies.

**Assessment Methods:**

1. **Self-Assessment and Reflection**: Teach students how to evaluate and improve their own learning outcomes using authentic and introspective reflection techniques.
2. **Peer Review and Feedback**: Foster critical thinking through peer-led discussions, presentations, and workshops that promote constructive feedback.
3. **Competency-Based Assessments**: Use project-based, open-ended evaluations to assess problem-solving skills, collaboration, and innovative thinking.
4. **Emphasis on Process, Not Just Product**: Recognize the value of experimentation, iteration, and creativity in the learning process.

**Success Metrics:**

1. **Increased Creative Problem-Solving Skills**: Monitor student performance on creative challenges, prototyping exercises, and open-ended problem-solving tasks.
2. **Improved Critical Thinking Across Disciplines**: Assess students' ability to evaluate evidence, identify biases, and think critically across multiple subjects.
3. **Enhanced Collaboration and Communication**: Track student participation in group projects, peer review activities, and presentations reflecting effective communication strategies.

To assess the success of this design, I would implement a comprehensive evaluation framework that incorporates:

1. **Pilot Programs**: Launch pilot studies to test the effectiveness of these innovations with small groups of students.
2. **Longitudinal Research**: Conduct longitudinal studies to track student growth, outcomes, and perceptions over multiple years.
3. **Teacher Feedback and Training**: Continuously solicit feedback from educators to refine instruction, curriculum design, and assessment strategies.

Remember that no single approach can effectively redesign the educational system overnight. This hypothetical framework would need to be adapted, tested, and refined through ongoing collaboration with educators, policymakers, and students themselves to achieve lasting, meaningful change.

In [19]:
# So where are we?

print(competitors)
print(answers)


['gpt-4o-mini', 'claude-3-7-sonnet-latest', 'deepseek-chat', 'gemini-2.5-flash', 'llama-3.3-70b-versatile', 'llama3.2']
["Redesigning an educational system from scratch to maximize creativity and critical thinking involves integrating several fundamental principles and approaches that engage students actively, encourage collaboration, and foster a growth mindset. Here are some key elements to consider:\n\n### Fundamental Principles\n\n1. **Student-Centered Learning:**\n   - Shift the focus from standardized curricula to personalized learning experiences that resonate with students' interests, abilities, and learning styles.\n   - Encourage student agency by allowing them to choose topics or projects that excite them, promoting ownership of their educational journey.\n\n2. **Interdisciplinary Curriculum:**\n   - Break down traditional subject barriers and create an integrated curriculum where fields such as science, art, literature, and technology intersect. This encourages students to 

In [20]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")


Competitor: gpt-4o-mini

Redesigning an educational system from scratch to maximize creativity and critical thinking involves integrating several fundamental principles and approaches that engage students actively, encourage collaboration, and foster a growth mindset. Here are some key elements to consider:

### Fundamental Principles

1. **Student-Centered Learning:**
   - Shift the focus from standardized curricula to personalized learning experiences that resonate with students' interests, abilities, and learning styles.
   - Encourage student agency by allowing them to choose topics or projects that excite them, promoting ownership of their educational journey.

2. **Interdisciplinary Curriculum:**
   - Break down traditional subject barriers and create an integrated curriculum where fields such as science, art, literature, and technology intersect. This encourages students to see connections, fostering creativity and critical thinking.
   - Implement project-based learning (PBL) w

In [21]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"

In [22]:
print(together)

# Response from competitor 1

Redesigning an educational system from scratch to maximize creativity and critical thinking involves integrating several fundamental principles and approaches that engage students actively, encourage collaboration, and foster a growth mindset. Here are some key elements to consider:

### Fundamental Principles

1. **Student-Centered Learning:**
   - Shift the focus from standardized curricula to personalized learning experiences that resonate with students' interests, abilities, and learning styles.
   - Encourage student agency by allowing them to choose topics or projects that excite them, promoting ownership of their educational journey.

2. **Interdisciplinary Curriculum:**
   - Break down traditional subject barriers and create an integrated curriculum where fields such as science, art, literature, and technology intersect. This encourages students to see connections, fostering creativity and critical thinking.
   - Implement project-based learning (P

In [23]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""


In [24]:
print(judge)

You are judging a competition between 6 competitors.
Each model has been given this question:

If you could redesign the educational system from scratch to maximize creativity and critical thinking in students, what fundamental principles and approaches would you incorporate, and how would you assess the success of your design?

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}

Here are the responses from each competitor:

# Response from competitor 1

Redesigning an educational system from scratch to maximize creativity and critical thinking involves integrating several fundamental principles and approaches that engage students actively, encourage collaboration, and foster a growth mindset. Here are some key elements to consider:

### Fundamental Pri

In [25]:
judge_messages = [{"role": "user", "content": judge}]

In [None]:
# Judgement time by openAI!

openai = OpenAI()
response = openai.chat.completions.create(
    model=openai_llm_model,
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)


{"results": ["3", "1", "2", "4", "5", "6"]}


In [27]:
# OK let's turn this into results!

results_dict = json.loads(results)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")

Rank 1: deepseek-chat
Rank 2: gpt-4o-mini
Rank 3: claude-3-7-sonnet-latest
Rank 4: gemini-2.5-flash
Rank 5: llama-3.3-70b-versatile
Rank 6: llama3.2


In [None]:
# Judgement time by Anthropic!

model_name = anthropic_llm_model
claude = Anthropic(api_key=anthropic_api_key)

response = claude.messages.create(model=model_name, messages=judge_messages, max_tokens=1024)
answer = response.content[0].text

print(results)

claude-3-7-sonnet-latest
{"results": ["3", "1", "2", "4", "5", "6"]}


In [29]:
# Judgement time by groq!

groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = groq_llm_model

response = groq.chat.completions.create(model=model_name, messages=judge_messages)
answer = response.choices[0].message.content

print(results)

{"results": ["3", "1", "2", "4", "5", "6"]}


In [30]:
# Judgement time by DeepSeek!

deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = deepseek_llm_model

response = deepseek.chat.completions.create(model=model_name, messages=judge_messages)
answer = response.choices[0].message.content

print(results)

{"results": ["3", "1", "2", "4", "5", "6"]}


In [31]:
# Judgement time by Gemini!
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
model_name = google_llm_model

response = gemini.chat.completions.create(model=model_name, messages=judge_messages)
answer = response.choices[0].message.content

print(results)

{"results": ["3", "1", "2", "4", "5", "6"]}


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which pattern(s) did this use? Try updating this to add another Agentic design pattern.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">These kinds of patterns - to send a task to multiple models, and evaluate results,
            are common where you need to improve the quality of your LLM response. This approach can be universally applied
            to business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>