## What is an Agent?  

From Hugging Face:  
AI agents are programs where LLM outputs control the workflow.  
In other words, the output of an LLM call decides which tasks are carried out and in what sequence.  
      
In practice, the term describes an AI solution that involves any or all of these:
1) Multiple LLM calls
2) LLMs with ability to use Tools
3) An environment where LLMs interact
4) A Planner to coordinate activities
5) Autonomy
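
A minimal sketch of "LLM outputs control the workflow", combining items 1 and 2 above. Everything here is an assumption for illustration: `fake_llm`, `get_weather`, and `add_numbers` are hypothetical stand-ins, and the model reply is a hard-coded JSON string instead of a real API response. The point is only that the reply names a tool and its arguments, and plain code dispatches on it.

```python
import json

def get_weather(city: str) -> str:
    """Hypothetical tool: returns a canned forecast."""
    return f"Sunny in {city}"

def add_numbers(a: float, b: float) -> str:
    """Hypothetical tool: adds two numbers."""
    return str(a + b)

# Registry of tools the model is allowed to request.
TOOLS = {"get_weather": get_weather, "add_numbers": add_numbers}

def fake_llm(prompt: str) -> str:
    """Stand-in for a real LLM call; returns a JSON tool request."""
    if "weather" in prompt.lower():
        return json.dumps({"tool": "get_weather", "args": {"city": "Paris"}})
    return json.dumps({"tool": "add_numbers", "args": {"a": 2, "b": 3}})

def run(prompt: str) -> str:
    # The model's output, not the calling code, decides which tool runs.
    decision = json.loads(fake_llm(prompt))
    tool = TOOLS[decision["tool"]]
    return tool(**decision["args"])

print(run("What is the weather in Paris?"))  # Sunny in Paris
print(run("Add 2 and 3"))                    # 5
```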
      
Anthropic:       
Two types of agentic systems:  
1. **Workflows** are systems where LLMs and tools are orchestrated through predefined code paths
2. **Agents** are systems where LLMs dynamically direct their own processes and tool usage, maintaining control over how they accomplish tasks

### Workflows 
Anthropic:      
1. Prompt chaining        
   Decompose into fixed sub-tasks       
   <br />
   <img src="../images/1/workflow_1.jpg" width="600" /><br>
   Yellow boxes are LLM calls; blue boxes are code.
2. Routing        
   Direct an input into a specialized sub-task, ensuring separation of concerns          
   <img src="../images/1/workflow_2.jpg" width="600" /><br>
3. Parallelization      
   Breaking down tasks and running multiple subtasks concurrently     
   <img src="../images/1/workflow_3.jpg" width="600" /> <br>
   Components: Coordinator, Aggregator
4. Orchestrator-worker       
   Complex tasks are broken down dynamically and combined     
   <img src="../images/1/workflow_4.jpg" width="600" /><br>
   Components: Orchestrator, Synthesizer
5. Evaluator-Optimizer      
   The output of one LLM is validated by another LLM
   <img src="../images/1/workflow_5.jpg" width="600" /> <br>
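
The patterns above can be sketched in a few lines each. This is an illustrative sketch, not Anthropic's reference code: `fake_llm` is a hypothetical stub that echoes its prompt in brackets, the keyword check stands in for an LLM classifier, and the length check stands in for a second LLM acting as evaluator. Orchestrator-worker is left out because its decomposition is decided by an LLM at run time, which a stub cannot show meaningfully.

```python
from concurrent.futures import ThreadPoolExecutor

def fake_llm(prompt: str) -> str:
    """Stand-in for a real LLM call; echoes the prompt in brackets."""
    return f"[{prompt}]"

# Prompt chaining: a fixed sequence where each output feeds the next call.
def chain(topic: str) -> str:
    outline = fake_llm(f"outline {topic}")
    return fake_llm(f"draft from {outline}")

# Routing: classify the input, then hand it to a specialized handler.
def route(query: str) -> str:
    handlers = {
        "billing": lambda q: fake_llm(f"billing answer: {q}"),
        "general": lambda q: fake_llm(f"general answer: {q}"),
    }
    # A real router would ask an LLM to classify; a keyword check stands in.
    label = "billing" if "invoice" in query.lower() else "general"
    return handlers[label](query)

# Parallelization: code splits the work, runs it concurrently, aggregates.
def parallel(parts: list[str]) -> str:
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(fake_llm, parts))  # map preserves input order
    return " | ".join(results)

# Evaluator-optimizer: generate, evaluate, and retry until accepted
# or until the round limit is reached.
def evaluate_and_optimize(task: str, max_rounds: int = 3) -> str:
    draft = fake_llm(task)
    for _ in range(max_rounds):
        accepted = len(draft) > 20  # stand-in for a second LLM's verdict
        if accepted:
            break
        draft = fake_llm(f"improve {draft}")
    return draft

print(chain("agents"))
print(route("Where is my invoice?"))
print(parallel(["a", "b"]))
print(evaluate_and_optimize("hi"))
```

In every function the control flow lives in code, which is what makes these workflows and not agents.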

### Agents      
In contrast to workflows, agents are characterized by:  
1. **Open-ended** - they can keep going with no predetermined end
2. **Feedback Loops** - information can come back and be processed multiple times  
3. **No fixed path** - they are less predictable, which raises concerns about robustness and about finishing within a set time frame in production
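
A minimal sketch of an agent loop showing all three properties. The names `fake_llm`, `search`, and `run_agent` are hypothetical, and the stubbed model simply asks for two searches before finishing. A real agent would call an LLM API at that point.

```python
def fake_llm(history: list[str]) -> str:
    """Stand-in for a real LLM: picks the next action from the history."""
    observations = [h for h in history if h.startswith("observation:")]
    if len(observations) < 2:
        return "action: search"
    return "final: done after 2 searches"

def search() -> str:
    """Hypothetical tool that returns a canned observation."""
    return "observation: some result"

def run_agent(goal: str, max_steps: int = 5) -> str:
    history = [f"goal: {goal}"]
    for _ in range(max_steps):        # cap guards against an endless run
        decision = fake_llm(history)  # the model chooses the next step
        if decision.startswith("final:"):
            return decision.removeprefix("final:").strip()
        history.append(search())      # feedback: result goes back to the model
    return "stopped: step limit reached"

print(run_agent("find something"))               # done after 2 searches
print(run_agent("find something", max_steps=2))  # stopped: step limit reached
```

The `max_steps` cap is one simple answer to the time-frame concern: the loop is open-ended in principle but bounded in practice.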


### Risks of Agent Frameworks      

- Unpredictable path
- Unpredictable output
- Unpredictable costs
      
**Mitigations**      
- Monitoring       
- Guardrails, which ensure your agents behave safely, consistently, and within your intended boundaries
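
One way these mitigations might look in code, as a sketch under stated assumptions: the `Guardrail` class, its call budget, and the banned-word filter are all hypothetical and much simpler than what agent frameworks provide. It caps the number of LLM calls (cost), logs every call (monitoring), and blocks outputs that contain disallowed words (output boundaries).

```python
class BudgetExceeded(Exception):
    """Raised when a run would go over its call budget."""

class Guardrail:
    def __init__(self, max_calls: int, banned_words: set[str]):
        self.max_calls = max_calls
        self.banned_words = banned_words
        self.calls = 0
        self.log: list[str] = []  # monitoring: a trace of every call

    def call(self, llm, prompt: str) -> str:
        # Cost guardrail: refuse to exceed the call budget.
        if self.calls >= self.max_calls:
            raise BudgetExceeded(f"limit of {self.max_calls} calls reached")
        self.calls += 1
        output = llm(prompt)
        self.log.append(f"call {self.calls}: {prompt!r} -> {output!r}")
        # Output guardrail: block replies containing banned words.
        if any(word in output.lower() for word in self.banned_words):
            return "[blocked by guardrail]"
        return output

guard = Guardrail(max_calls=2, banned_words={"secret"})
print(guard.call(lambda p: "hello", "hi"))
print(guard.call(lambda p: "the SECRET is 42", "tell me"))
```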

## Orchestrating Multiple LLMs   
Comparing GPT-4o mini, Claude, Gemini, DeepSeek, and Llama (served by Groq)        
      
**Vellum Leaderboard**      
https://www.vellum.ai/llm-leaderboard

In [1]:
import os 
import json 
from dotenv import load_dotenv 
from openai import OpenAI 
from anthropic import Anthropic 
from IPython.display import Markdown, display

In [2]:
load_dotenv(override=True) 

True

In [3]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")


OpenAI API Key exists and begins sk-proj-
Anthropic API Key exists and begins sk-ant-
Google API Key exists and begins AI
DeepSeek API Key exists and begins sk-
Groq API Key exists and begins gsk_


In [4]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]   
print(messages)

[{'role': 'user', 'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]


In [5]:
openai = OpenAI()
response = openai.chat.completions.create(
    model="gpt-4o-mini",
    messages=messages,
)
question = response.choices[0].message.content
print(question)

If you could redesign the educational system from the ground up, considering factors such as technology, accessibility, social equity, and lifelong learning, what specific changes would you implement, and how would you measure the success of these changes over time?


In [6]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

In [7]:
# OpenAI
model_name = "gpt-4o-mini"

response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Redesigning the educational system from the ground up involves a multifaceted approach that addresses technology integration, accessibility, social equity, and lifelong learning. Here’s a comprehensive framework outlining specific changes and metrics for measuring success.

### Proposed Changes

1. **Personalized Learning Pathways:**
   - **Implementation:** Use adaptive learning technologies that tailor educational content to individual student needs, abilities, and interests. Allow learners to progress at their own pace.
   - **Measurement:** Assess student engagement, retention rates, and proficiency levels, tracking individual progress over time compared to traditional benchmarks.

2. **Universal Access to Technology:**
   - **Implementation:** Provide all students with devices (tablets/laptops) and internet access, ensuring that resources are available in underserved communities. Partner with tech companies and local governments.
   - **Measurement:** Evaluate digital literacy levels, frequency of technology use in education, and student performance in tech-related subjects.

3. **Equity in Education Funding:**
   - **Implementation:** Shift to a needs-based funding model that allocates resources according to demographics and economic status rather than property taxes. Ensure additional resources for low-income schools.
   - **Measurement:** Analyze funding disparities over time and corresponding academic outcomes, graduation rates, and college enrollment across different socioeconomic groups.

4. **Integration of Socio-Emotional Learning (SEL):**
   - **Implementation:** Incorporate SEL into the curriculum to foster emotional intelligence, conflict resolution, resilience, and teamwork. Provide training for teachers in SEL facilitation.
   - **Measurement:** Track social-emotional skills via surveys and behavioral assessments, correlating these with academic performance and school climate indicators.

5. **Focus on Lifelong Learning:**
   - **Implementation:** Develop a curriculum that emphasizes critical thinking, problem-solving, and adaptability. Create partnerships with businesses for internships and real-world experiences. Offer adult education programs for upskilling and reskilling.
   - **Measurement:** Monitor enrollment in lifelong learning programs, workplace satisfaction of program alumni, and employability rates for participants.

6. **Community-Based Learning Environments:**
   - **Implementation:** Create learning hubs within communities that offer resources, mentorship, and experiential learning opportunities, such as labs, arts programs, and maker spaces.
   - **Measurement:** Conduct surveys to gauge community engagement, attendance metrics, and program participation, linking these to student academic and emotional outcomes.

7. **Teacher Professional Development:**
   - **Implementation:** Establish ongoing professional development that includes mentorship, peer collaboration, and training on new educational technologies and methodologies.
   - **Measurement:** Evaluate teacher satisfaction, retention rates, and student performance in classes taught by participating educators.

8. **Flexible Learning Environments:**
   - **Implementation:** Design learning spaces that are adaptable to various teaching styles, incorporating outdoor classrooms, co-working spaces, and virtual learning environments.
   - **Measurement:** Assess student feedback on learning environments, engagement levels, and performance outcomes in relation to traditional classroom settings.

### Metrics for Measuring Success Over Time

1. **Academic Performance:**
   - Standardized test scores, grades, and educational attainment levels (high school graduation, college enrollment rates).

2. **Equity Indicators:**
   - Disparities in academic performance and access among various demographic groups (race, socioeconomic status, geographic location).

3. **Student Well-being:**
   - Surveys assessing mental health, social-emotional skills, and overall student satisfaction with their educational experience.

4. **Community and Stakeholder Engagement:**
   - Levels of community involvement in education, including parental engagement and partnerships with local businesses and organizations.

5. **Employment Outcomes:**
   - Tracking student career trajectories post-graduation, including job placement rates, income levels, and long-term career satisfaction.

6. **Lifelong Learning Participation:**
   - Rate of participation in adult education and reskilling programs, as well as the impact of these programs on career advancement.

7. **Technology Utilization:**
   - Metrics on student engagement with learning technologies, including usage rates and proficiency in tech tools.

In summary, a reimagined education system should be holistically designed to foster inclusive, equitable, and adaptable learning environments. Continuous measurement and iterative improvements based on data will ensure that the system evolves to meet the needs of all learners.

In [8]:
# Anthropic has a slightly different API, and max_tokens is required

model_name = "claude-3-7-sonnet-latest"

claude = Anthropic()
response = claude.messages.create(model=model_name, messages=messages, max_tokens=1000)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

# Reimagining Education: A Ground-Up Redesign

## Core Structural Changes

1. **Personalized Learning Pathways**: Replace rigid grade levels with competency-based progression where students advance based on mastery, not time spent.

2. **Modular Curriculum Design**: Create flexible learning modules that can be assembled into customized educational journeys while ensuring core competencies.

3. **Integrated Knowledge Application**: Dissolve subject boundaries in favor of interdisciplinary problem-solving projects that reflect real-world challenges.

4. **Community Integration**: Embed schools within community centers that serve multiple generations and purposes, making education a community resource.

## Technology Implementation

1. **AI-Enhanced Learning Companions**: Provide each student with AI tools that adapt to their learning pace, preferences, and needs while collecting meaningful data.

2. **Universal Digital Access**: Ensure every student has reliable devices and connectivity, with offline capabilities for areas with infrastructure challenges.

3. **Immersive Learning Environments**: Utilize VR/AR for experiences impossible in traditional settings, like historical recreations or molecular visualization.

## Equity Measures

1. **Resource Allocation Formula**: Fund schools based on need rather than property values, with transparent algorithms accounting for socioeconomic factors.

2. **Multilingual and Culturally Responsive Design**: Build curriculum that reflects diverse perspectives and supports language development for all students.

3. **Accessibility by Design**: Ensure all materials, spaces, and methods work for students with diverse abilities from the outset.

## Success Metrics

1. **Longitudinal Wellbeing Indicators**: Track student thriving across dimensions (cognitive, social-emotional, physical, creative) over decades.

2. **Skills Application Assessment**: Measure how effectively students apply knowledge in novel contexts rather than simply testing recall.

3. **Community Impact Measurement**: Evaluate how educational institutions improve their surrounding communities through engagement and service.

4. **Equity Gap Analysis**: Continually monitor and address disparities in outcomes across demographic groups.

5. **Adaptability Index**: Assess how well graduates navigate changing circumstances and continue learning throughout life.

What specific aspects of this reimagined system would you like me to elaborate on further?
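
The two SDKs used so far return differently shaped responses: the cells above read the text from `response.choices[0].message.content` for OpenAI and from `response.content[0].text` for Anthropic. A small sketch of a helper that hides the difference. `extract_text` is a hypothetical name, and the `SimpleNamespace` objects are mock responses built for the example, not real SDK types.

```python
from types import SimpleNamespace

def extract_text(response) -> str:
    """Return the reply text from either response shape (duck-typed)."""
    if hasattr(response, "choices"):       # OpenAI-style chat completion
        return response.choices[0].message.content
    return response.content[0].text        # Anthropic-style message

# Mock responses mimicking only the attributes the notebook cells read.
openai_like = SimpleNamespace(
    choices=[SimpleNamespace(message=SimpleNamespace(content="hello from OpenAI"))]
)
anthropic_like = SimpleNamespace(
    content=[SimpleNamespace(text="hello from Claude")]
)

print(extract_text(openai_like))     # hello from OpenAI
print(extract_text(anthropic_like))  # hello from Claude
```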

In [9]:
gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")
# model_name = "gemini-2.0-flash"
model_name = "gemini-2.5-flash-preview-04-17"

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Okay, let's imagine building an educational system completely from scratch, free from the constraints of legacy structures and outdated assumptions. This redesigned system would be centered on the learner, adaptable, equitable, and integrated with the community and the world of work.

Here are the specific changes I would implement, built around the factors you mentioned:

**Core Philosophy:** The system exists to empower every individual to reach their full potential, contribute meaningfully to society, and navigate a constantly changing world through continuous learning and critical thinking. It prioritizes well-being, creativity, collaboration, and adaptability over rote memorization and standardized test scores.

**Specific Changes:**

1.  **Personalized & Competency-Based Learning Pathways:**
    *   **Change:** Replace age-based grading with flexible, competency-based progression. Students move forward as they master skills and knowledge, not based on age or seat time. Learning pathways are highly individualized, adapting to each student's pace, style, interests, and goals.
    *   **How it addresses factors:**
        *   *Technology:* Personalized learning platforms (AI-driven tutoring, adaptive content, progress tracking dashboards) are essential.
        *   *Accessibility:* Allows students with diverse needs and paces (gifted, learning differences, etc.) to thrive without being held back or overwhelmed.
        *   *Social Equity:* Provides targeted support where needed, ensuring foundational competencies are built regardless of background. Prevents students from falling behind due to early disadvantages.
        *   *Lifelong Learning:* Fosters a mindset of continuous growth and mastery, not just passing tests. Learning becomes about acquiring and applying competencies.

2.  **Integrated Curriculum Focused on Skills, Application, and Interdisciplinarity:**
    *   **Change:** Move away from siloed subjects. The curriculum is organized around real-world challenges, projects, and themes (e.g., "Sustainable Cities," "Understanding Data," "Global Health"). Core literacies (reading, math, science, digital) are taught *within* these integrated contexts. Emphasis is placed on critical thinking, problem-solving, collaboration, communication, creativity, digital fluency, emotional intelligence, and ethical reasoning. Financial literacy, health & well-being, and civic engagement are core components from an early age.
    *   **How it addresses factors:**
        *   *Technology:* Used as a tool for research, collaboration, creation, simulation, and global connection within projects. Digital literacy is embedded.
        *   *Accessibility:* Offers multiple entry points and ways to demonstrate understanding through diverse projects.
        *   *Social Equity:* Makes learning more relevant and engaging for students from all backgrounds, connecting abstract concepts to their lives and communities.
        *   *Lifelong Learning:* Develops adaptable skills applicable to any future challenge or career path. Teaches *how* to learn and apply knowledge.

3.  **Authentic & Diverse Assessment Methods:**
    *   **Change:** Eliminate high-stakes, standardized, multiple-choice tests as the primary measure of success. Assessment is continuous, varied, and focuses on demonstrating competencies through portfolios, projects, presentations, simulations, peer evaluations, and self-reflection. Feedback is growth-oriented.
    *   **How it addresses factors:**
        *   *Technology:* Platforms track competency mastery, manage digital portfolios, and provide automated feedback on certain tasks.
        *   *Accessibility:* Allows students to demonstrate understanding in ways that suit their strengths, rather than being penalized for test anxiety or specific learning differences.
        *   *Social Equity:* Reduces bias inherent in standardized tests and provides a more holistic view of a student's capabilities and growth, not just a snapshot score.
        *   *Lifelong Learning:* Reflects how skills are demonstrated and evaluated in the real world and workplace. Promotes self-assessment as a lifelong skill.

4.  **Universal Access & Inclusive Design:**
    *   **Change:** Guarantee high-quality early childhood education for all children. Ensure all learning environments (physical and digital) are designed using Universal Design for Learning (UDL) principles to accommodate diverse physical, cognitive, and emotional needs from the outset. Provide robust support services (special education, counseling, nutrition, healthcare coordination) integrated *within* the learning environment. Equip every learner and educator with necessary devices and reliable internet access, treated as essential infrastructure.
    *   **How it addresses factors:**
        *   *Technology:* Ensures equitable access to digital tools and resources for *all*.
        *   *Accessibility:* Baked into the system's foundation, removing barriers for students with disabilities and diverse learning profiles.
        *   *Social Equity:* Directly addresses the impact of socioeconomic disadvantage by providing universal access to quality early learning and essential resources/support. Reduces the link between zip code and educational opportunity.
        *   *Lifelong Learning:* Ensures a strong foundation is built early, which is critical for future learning success.

5.  **Equitable Funding & Resource Distribution:**
    *   **Change:** Implement a progressive funding model where resources (funding, qualified educators, technology infrastructure, support staff) are allocated based on student need, rather than local property taxes or enrollment numbers alone. Fund schools and learning centers in disadvantaged areas at significantly higher levels per student to counteract out-of-school disadvantages.
    *   **How it addresses factors:**
        *   *Technology:* Ensures equitable access to infrastructure and support needed to leverage technology effectively across all locations.
        *   *Accessibility:* Provides necessary resources for specialized support staff and inclusive environments.
        *   *Social Equity:* Directly combats systemic inequality by providing targeted resources to lift up underserved communities.
        *   *Lifelong Learning:* Ensures a baseline of quality education that enables all individuals to participate in future learning opportunities.

6.  **Redefined Role of Educators & Continuous Professional Growth:**
    *   **Change:** Educators become highly skilled facilitators, mentors, subject matter experts, and learning designers. The profession is highly valued and compensated accordingly. They receive extensive, ongoing professional development focused on pedagogy, technology integration, understanding diverse learners, socio-emotional learning, and subject matter expertise. Collaborative planning time and mentorship are built into their roles.
    *   **How it addresses factors:**
        *   *Technology:* Educators are trained and supported to effectively integrate and leverage technology.
        *   *Accessibility:* Educators are equipped to implement UDL and support diverse student needs.
        *   *Social Equity:* A diverse, well-trained teaching force is better equipped to support students from all backgrounds and address implicit bias.
        *   *Lifelong Learning:* Educators model and participate in lifelong learning themselves. Their training adapts to new research and societal needs.

7.  **Integrated Lifelong Learning Ecosystem:**
    *   **Change:** Create a seamless system from early childhood through adulthood. Establish community-based learning hubs that offer resources, guidance, and flexible learning opportunities (vocational training, reskilling programs, personal interest courses, micro-credentials, degree pathways) accessible to people of all ages. Link learning credentials (competencies, skills) to a universal digital portfolio recognized by employers and other learning institutions. Provide publicly funded counseling and navigation services for adults seeking education or career changes.
    *   **How it addresses factors:**
        *   *Technology:* Platforms track credentials and skills acquired throughout life; online/blended learning options provide flexibility.
        *   *Accessibility:* Offers learning opportunities at convenient locations/times and in flexible formats for working adults or those with other constraints.
        *   *Social Equity:* Provides pathways for adults who missed opportunities earlier or need to adapt to economic changes, regardless of their financial situation.
        *   *Lifelong Learning:* This *is* the core of this change, building a system where learning is a continuous, supported journey from cradle to grave.

8.  **Community & Real-World Connection:**
    *   **Change:** Learning environments are deeply integrated with local communities and industries. Students engage in community projects, internships, apprenticeships, and mentorships from an early age. Community members (artisans, scientists, elders, business leaders) are invited into learning spaces as mentors and co-learners.
    *   **How it addresses factors:**
        *   *Technology:* Facilitates connection with experts globally, not just locally. Students use tech to document and share community projects.
        *   *Accessibility:* Provides diverse learning experiences outside traditional classrooms, potentially more accessible to some learners.
        *   *Social Equity:* Connects students to social capital and opportunities they might not otherwise access. Makes learning relevant to their community's context.
        *   *Lifelong Learning:* Creates networks and experiences that support transitions into adulthood and career pathways.

**Measuring the Success of These Changes Over Time:**

Success metrics would need to be as diverse and comprehensive as the changes themselves, moving beyond simple test scores. Measurement would be continuous, transparent, and used for iterative improvement, not just accountability.

1.  **Learner Outcomes (Beyond Scores):**
    *   **Competency Attainment:** Track mastery of defined competencies across the curriculum using digital portfolios and project assessments. Look at growth trajectories over time.
    *   **Skill Demonstrations:** Measure performance on complex tasks, collaborative projects, and real-world simulations.
    *   **Well-being & Engagement:** Regular, anonymous surveys on student engagement, motivation, sense of belonging, and emotional health. Track attendance and participation patterns.
    *   **Post-Education Pathways:** Track successful transitions to further learning (higher education, vocational), apprenticeships, meaningful employment, and civic participation. Analyze data by demographic to identify equity gaps.
    *   **Alumni Longitudinal Studies:** Follow graduates over 5, 10, 20 years to assess long-term outcomes in careers, civic life, and continued learning.

2.  **Equity & Accessibility Metrics:**
    *   **Achievement Gap Reduction:** Analyze competency attainment and pathway success data across different demographic groups (socioeconomic status, race/ethnicity, disability status, geographic location) to track the narrowing of disparities.
    *   **Resource Equity Index:** Measure the actual distribution of resources (funding, teacher quality/experience, technology access, support staff ratios) per student, weighted by need, to ensure equitable allocation.
    *   **Participation Rates:** Track enrollment and completion rates in early childhood programs, advanced/specialized pathways, and adult lifelong learning programs across demographics.
    *   **Inclusivity Surveys:** Gather feedback from students, parents, and staff on the perceived inclusivity and accessibility of learning environments and support services.

3.  **System Performance & Adaptability:**
    *   **Educator Satisfaction & Retention:** Surveys on teacher morale, perceived support, effectiveness of professional development, and retention rates.
    *   **System Responsiveness:** Measure the speed and effectiveness with which the curriculum and learning pathways are updated to reflect new knowledge, technologies, and societal needs. Track adoption rates of new pedagogical approaches.
    *   **Community & Employer Feedback:** Regular surveys and feedback loops from community partners and employers on the relevance of skills and competencies demonstrated by learners/graduates.
    *   **Innovation Diffusion:** Track the development and scaling of effective new learning technologies, pedagogies, and support models across the system.
    *   **Cost-Effectiveness:** Evaluate the efficiency of resource allocation in achieving desired outcomes over the long term.

4.  **Lifelong Learning Participation:**
    *   **Adult Participation Rates:** Track enrollment and completion in adult learning programs, reskilling initiatives, and access to public learning resources.
    *   **Credential Portability & Recognition:** Measure how widely the digital competency portfolio and acquired micro-credentials/certifications are accepted and valued by employers and higher education institutions.
    *   **Career Mobility:** Track how ongoing learning correlates with career advancement and successful transitions for adult learners.

**Implementation & Feedback:**

Success would also be measured by the effectiveness of the implementation process itself:
*   **Stakeholder Engagement:** Track the level and quality of involvement from parents, students, educators, community members, and employers in the design, implementation, and ongoing refinement of the system.
*   **Piloting & Iteration:** Measure the success of pilot programs for new models (e.g., competency frameworks, tech platforms) and the speed at which lessons learned are integrated into wider implementation.
*   **Transparency:** Ensure data on system performance, equity gaps, and resource allocation is transparent and accessible to the public.

In essence, measuring success would shift from evaluating individual students primarily on standardized academic performance to evaluating the *system's* effectiveness in fostering growth, equity, well-being, and lifelong adaptability for *all* learners. This requires a robust data infrastructure, a culture of continuous improvement, and a commitment to using measurement to inform action and truly serve the needs of every individual and society as a whole.

In [10]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = "deepseek-chat"

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

Redesigning the educational system from the ground up requires a holistic approach that prioritizes adaptability, equity, and lifelong learning. Below are key changes I would implement, along with metrics to measure their success:

### **1. Shift from Age-Based to Competency-Based Learning**
   - **Change:** Replace rigid grade levels with personalized, mastery-based progression. Students advance upon demonstrating competency in subjects, not just time spent in class.
   - **Why?** Accommodates different learning paces and reduces stigma around "falling behind."
   - **Measurement:** 
     - % of students achieving mastery benchmarks at their own pace  
     - Reduction in achievement gaps between demographic groups  

### **2. Emphasize Critical Thinking & Real-World Skills Over Standardized Testing**
   - **Change:** Replace high-stakes standardized tests with project-based assessments, problem-solving tasks, and interdisciplinary learning.
   - **Why?** Encourages deeper understanding and application of knowledge.
   - **Measurement:** 
     - Student performance in real-world simulations (e.g., entrepreneurship, civic engagement)  
     - Employer/college feedback on graduates' problem-solving abilities  

### **3. Integrate Technology for Personalized & Inclusive Learning**
   - **Change:** AI-driven adaptive learning platforms provide tailored instruction, while assistive tech ensures accessibility for students with disabilities.
   - **Why?** Reduces inequities in access to quality education.
   - **Measurement:** 
     - Usage rates & learning gains from adaptive software  
     - Improvement in accessibility for students with disabilities  

### **4. Expand Access to Early Childhood & Lifelong Learning**
   - **Change:** Free, universal early childhood education + subsidized adult education (upskilling, career transitions).
   - **Why?** Reduces long-term disparities and supports a dynamic workforce.
   - **Measurement:** 
     - Early literacy/numeracy rates  
     - Adult participation in continued education & career advancement  

### **5. Cultivate Social-Emotional Learning (SEL) & Mental Health Support**
   - **Change:** Mandate SEL curricula (empathy, resilience, collaboration) and provide school-based mental health services.
   - **Why?** Improves well-being and prepares students for modern workplaces.
   - **Measurement:** 
     - Student self-reported well-being & resilience metrics  
     - Reduction in school dropout rates linked to mental health  

### **6. Decentralize & Democratize Education Policy**
   - **Change:** Local communities, educators, and students co-design curricula to reflect cultural relevance and workforce needs.
   - **Why?** Increases engagement and relevance.
   - **Measurement:** 
     - Community satisfaction surveys  
     - Alignment between school programs and local labor market demands  

### **7. Teacher Training & Autonomy**
   - **Change:** Teachers train in adaptive pedagogy, tech integration, and trauma-informed practices. Grant them autonomy to innovate.
   - **Why?** Empowers educators as facilitators, not just lecturers.
   - **Measurement:** 
     - Teacher retention & job satisfaction rates  
     - Student engagement in classrooms with teacher-led innovations  

### **8. Alternative Credentialing & Pathways**
   - **Change:** Recognize micro-credentials, apprenticeships, and online certifications as valid alternatives to traditional degrees.
   - **Why?** Makes education more flexible and aligned with job markets.
   - **Measurement:** 
     - Employment rates of non-traditional credential holders  
     - Employer satisfaction with alternative credentialing systems  

### **Success Metrics Over Time**
- **Short-Term (1-5 years):** Increased engagement, reduced dropout rates, early mastery benchmarks.  
- **Medium-Term (5-15 years):** Narrowed equity gaps, improved workforce readiness, higher adult education participation.  
- **Long-Term (15+ years):** Societal outcomes like reduced income inequality, higher civic engagement, and innovation-driven economic growth.  

This model prioritizes **flexibility, equity, and relevance**, ensuring education evolves with societal needs rather than remaining static. Success would be measured not just by test scores, but by **lifelong learner success and societal well-being**.

In [11]:
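# Groq exposes an OpenAI-compatible endpoint, so the same OpenAI client works with a different base_url
groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = "llama-3.3-70b-versatile"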

response = groq.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)


If I could redesign the educational system from the ground up, I would implement the following changes, considering factors such as technology, accessibility, social equity, and lifelong learning:

**Changes:**

1. **Personalized Learning Paths**: Implement a competency-based education system, where students progress at their own pace, focusing on individual strengths and interests. AI-powered adaptive learning tools would help tailor the learning experience to each student's needs.
2. **Flexible Learning Environments**: Create a variety of learning environments, including online, hybrid, and traditional classroom settings, to cater to different learning styles and needs. This would include virtual reality, augmented reality, and gamification to enhance engagement.
3. **Inclusive and Accessible Curriculum**: Develop a curriculum that incorporates diverse perspectives, cultures, and histories, promoting empathy, understanding, and social equity. Ensure that all materials are accessible to students with disabilities, using technologies like text-to-speech, image recognition, and audio descriptions.
4. **Project-Based Learning**: Emphasize hands-on, project-based learning, encouraging students to apply theoretical knowledge to real-world problems. This would foster critical thinking, creativity, and collaboration skills.
5. **Lifelong Learning Opportunities**: Offer ongoing education and skill development opportunities for adults, including online courses, workshops, and apprenticeships. This would help individuals adapt to changing work environments and technological advancements.
6. **Teacher Training and Support**: Provide teachers with regular training and support to effectively integrate technology, adapt to changing student needs, and promote social-emotional learning.
7. **Community Engagement**: Foster partnerships between schools, local businesses, and community organizations to provide students with practical experiences, mentorship, and networking opportunities.
8. **Assessment and Feedback**: Develop a balanced assessment system, combining traditional evaluations with more innovative methods, such as peer review, self-assessment, and project-based evaluations. Provide regular feedback to students, helping them track progress and identify areas for improvement.
9. **Mental Health and Well-being**: Prioritize mental health and well-being, incorporating social-emotional learning, mindfulness, and stress management techniques into the curriculum.
10. **Data-Driven Decision Making**: Leverage data analytics and AI to inform educational decisions, track student progress, and identify areas for improvement.

**Measuring Success:**

To measure the success of these changes, I would track the following metrics over time:

1. **Student Engagement and Motivation**: Monitor student participation, attendance, and self-reported motivation to learn.
2. **Academic Achievement**: Evaluate student performance using a range of assessments, including standardized tests, project-based evaluations, and peer review.
3. **Social-Emotional Learning**: Assess students' social-emotional skills, such as empathy, self-awareness, and self-regulation.
4. **Equity and Access**: Track the representation of diverse student populations in programs, courses, and extracurricular activities.
5. **Graduation and Post-Secondary Education Rates**: Monitor the number of students graduating from high school and pursuing post-secondary education or career training.
6. **Employment and Career Advancement**: Follow alumni progress, tracking employment rates, career advancement, and job satisfaction.
7. **Teacher Satisfaction and Retention**: Evaluate teacher job satisfaction, retention rates, and professional development opportunities.
8. **Community Engagement and Partnerships**: Measure the number of partnerships between schools and local businesses, community organizations, and government agencies.
9. **Mental Health and Well-being**: Assess the impact of mental health and well-being initiatives on student outcomes, including stress levels, anxiety, and depression.
10. **Return on Investment**: Evaluate the financial returns on education investments, including the economic benefits of a more educated and skilled workforce.

**Evaluation Framework:**

To ensure the effectiveness of these changes, I would establish an evaluation framework with the following components:

1. **Short-term (0-3 years)**: Monitor student engagement, academic achievement, and teacher satisfaction.
2. **Mid-term (4-6 years)**: Evaluate social-emotional learning, equity and access, and community engagement.
3. **Long-term (7-10 years)**: Assess graduation and post-secondary education rates, employment and career advancement, and return on investment.
4. **Cycle of Continuous Improvement**: Regularly review and refine the educational system, incorporating feedback from students, teachers, parents, and community stakeholders.

By implementing these changes and tracking their success over time, I believe that the redesigned educational system would provide students with a more engaging, inclusive, and effective learning experience, preparing them for success in an ever-changing world.

In [12]:
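# Ollama serves an OpenAI-compatible API on localhost; the api_key is a placeholder the client requires but Ollama ignores
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
# The chosen model must already be pulled locally (e.g. with `ollama pull`)
# model_name = "llama3.2"
model_name = "llama4"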

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

What a fascinating question! Let's dive into a reimagined educational system that prioritizes accessibility, equity, and lifelong learning. Here's a comprehensive overhaul:

**Foundational Principles:**

1. **Personalized Learning**: Focus on individual students' needs, interests, and learning styles.
2. **Accessibility**: Ensure equal access to quality education for all, regardless of geographical location, socio-economic background, or abilities.
3. **Lifelong Learning**: Foster a culture of continuous learning, enabling individuals to adapt and grow throughout their lives.
4. **Technology Integration**: Leverage technology to enhance learning experiences, increase accessibility, and streamline administrative tasks.

**Structural Changes:**

1. **Modular, Competency-Based Progression**: Replace traditional grade levels with a modular system, where students progress based on demonstrated mastery of skills and knowledge.
2. **Flexible Learning Paths**: Offer diverse learning routes, including online, offline, and hybrid models, to cater to different learning styles and needs.
3. **Interdisciplinary Curricula**: Integrate subjects to promote holistic understanding and critical thinking.
4. **Emphasis on Soft Skills**: Incorporate essential life skills, such as emotional intelligence, communication, and problem-solving.

**Implementation Strategies:**

1. **Universal Access to Devices and Internet**: Ensure that all students have access to devices and reliable internet connectivity.
2. **AI-Powered Adaptive Learning Tools**: Implement AI-driven tools that adjust to individual learning needs, providing real-time feedback and recommendations.
3. **Virtual and Augmented Reality**: Incorporate immersive technologies to create engaging, interactive learning experiences.
4. **Teacher Training and Support**: Provide ongoing professional development for educators, focusing on technology integration, pedagogy, and emotional intelligence.

**Assessment and Evaluation:**

1. **Competency-Based Assessments**: Develop assessments that measure students' mastery of specific skills and knowledge.
2. **Regular Feedback and Progress Monitoring**: Implement regular check-ins to track student progress, identify areas for improvement, and adjust instruction accordingly.
3. **Standardized Assessments**: Use standardized assessments to evaluate system-wide effectiveness and identify areas for improvement.

**Social Equity and Accessibility:**

1. **Inclusive Curriculum**: Develop curricula that reflect diverse perspectives, histories, and experiences.
2. **Culturally Responsive Teaching**: Train educators to recognize and address the unique needs of students from diverse backgrounds.
3. **Accessibility Features**: Implement features such as text-to-speech software, audio descriptions, and sign language interpretation to ensure equal access.

**Lifelong Learning:**

1. **Continuing Education Programs**: Offer flexible, affordable programs for adults to upskill and reskill throughout their lives.
2. **Community-Based Learning**: Foster partnerships between educational institutions, community organizations, and local businesses to provide opportunities for lifelong learning.
3. **Personalized Learning Plans**: Encourage individuals to create personalized learning plans, setting goals and tracking progress throughout their lives.

**Measuring Success:**

1. **Student Outcomes**: Track students' academic achievement, social-emotional learning, and career readiness.
2. **Systemic Equity**: Monitor and address disparities in access, participation, and outcomes.
3. **Teacher Satisfaction**: Regularly assess teacher satisfaction, professional growth, and well-being.
4. **Alumni Feedback**: Collect feedback from program graduates to inform continuous improvement.

**Evaluation Metrics:**

1. **Graduation Rates**: Track the percentage of students completing their programs.
2. **Academic Achievement**: Monitor students' progress and achievement in core subjects.
3. **Social-Emotional Learning**: Assess students' development of essential life skills.
4. **Career Readiness**: Evaluate students' preparedness for the workforce.

**Continuous Improvement:**

1. **Regular Review and Revision**: Periodically review and update curricula, assessments, and instructional strategies.
2. **Stakeholder Engagement**: Foster ongoing dialogue with students, educators, parents, and community members to inform improvements.
3. **Data-Driven Decision-Making**: Use data and research to inform policy and practice.

By implementing these changes, we can create a more equitable, accessible, and effective educational system that prepares students for success in the 21st century and beyond.
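
Each competitor cell above repeats the same three steps: call an OpenAI-compatible client, pull the text out of the first choice, and append to `competitors` and `answers`. A minimal sketch of how that could be factored into helpers follows. The names `ask`, `record`, and `make_fake_client` are hypothetical and not part of the notebook; the fake client only mimics the `chat.completions.create` shape so the example runs without an API key.

```python
from types import SimpleNamespace


def ask(client, model_name, messages):
    """Send messages to an OpenAI-compatible client and return the reply text."""
    response = client.chat.completions.create(model=model_name, messages=messages)
    return response.choices[0].message.content


def record(competitors, answers, model_name, answer):
    """Append one competitor and its answer so both lists stay the same length."""
    competitors.append(model_name)
    answers.append(answer)


def make_fake_client(reply):
    """Hypothetical stand-in that mimics client.chat.completions.create for offline use."""
    def create(model, messages):
        message = SimpleNamespace(content=f"[{model}] {reply}")
        return SimpleNamespace(choices=[SimpleNamespace(message=message)])

    completions = SimpleNamespace(create=create)
    return SimpleNamespace(chat=SimpleNamespace(completions=completions))


competitors, answers = [], []
messages = [{"role": "user", "content": "What is 2 + 2?"}]

client = make_fake_client("four")
answer = ask(client, "demo-model", messages)
record(competitors, answers, "demo-model", answer)
print(competitors, answers)
```

With a real client, `ask(groq, "llama-3.3-70b-versatile", messages)` would replace the two lines that build `response` and `answer` in each cell.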

In [13]:
# So where are we? 

print(competitors)
print(answers)

['gpt-4o-mini', 'claude-3-7-sonnet-latest', 'gemini-2.5-flash-preview-04-17', 'deepseek-chat', 'llama-3.3-70b-versatile', 'llama4']
['Redesigning the educational system from the ground up involves a multifaceted approach that addresses technology integration, accessibility, social equity, and lifelong learning. Here’s a comprehensive framework outlining specific changes and metrics for measuring success.\n\n### Proposed Changes\n\n1. **Personalized Learning Pathways:**\n   - **Implementation:** Use adaptive learning technologies that tailor educational content to individual student needs, abilities, and interests. Allow learners to progress at their own pace.\n   - **Measurement:** Assess student engagement, retention rates, and proficiency levels, tracking individual progress over time compared to traditional benchmarks.\n\n2. **Universal Access to Technology:**\n   - **Implementation:** Provide all students with devices (tablets/laptops) and internet access, ensuring that resources a

In [14]:
# It's nice to know how to use "zip" 

for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")

Competitor: gpt-4o-mini

Redesigning the educational system from the ground up involves a multifaceted approach that addresses technology integration, accessibility, social equity, and lifelong learning. Here’s a comprehensive framework outlining specific changes and metrics for measuring success.

### Proposed Changes

1. **Personalized Learning Pathways:**
   - **Implementation:** Use adaptive learning technologies that tailor educational content to individual student needs, abilities, and interests. Allow learners to progress at their own pace.
   - **Measurement:** Assess student engagement, retention rates, and proficiency levels, tracking individual progress over time compared to traditional benchmarks.

2. **Universal Access to Technology:**
   - **Implementation:** Provide all students with devices (tablets/laptops) and internet access, ensuring that resources are available in underserved communities. Partner with tech companies and local governments.
   - **Measurement:** Eval

In [15]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"
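
The same string can be built with `enumerate(..., start=1)` and `"".join`, which avoids the `index+1` arithmetic and repeated string concatenation. This is only an equivalent sketch; `combine_responses` is a hypothetical helper name, and the sample answers are made up.

```python
def combine_responses(answers):
    """Concatenate answers under numbered headings, matching the loop above."""
    sections = [
        f"# Response from competitor {number}\n\n{answer}\n\n"
        for number, answer in enumerate(answers, start=1)
    ]
    return "".join(sections)


sample = combine_responses(["first answer", "second answer"])
print(sample)
```

Numbering from 1 matters here because the judge is asked to return competitor numbers, which are later mapped back to list positions.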

In [16]:
print(together)

# Response from competitor 1

Redesigning the educational system from the ground up involves a multifaceted approach that addresses technology integration, accessibility, social equity, and lifelong learning. Here’s a comprehensive framework outlining specific changes and metrics for measuring success.

### Proposed Changes

1. **Personalized Learning Pathways:**
   - **Implementation:** Use adaptive learning technologies that tailor educational content to individual student needs, abilities, and interests. Allow learners to progress at their own pace.
   - **Measurement:** Assess student engagement, retention rates, and proficiency levels, tracking individual progress over time compared to traditional benchmarks.

2. **Universal Access to Technology:**
   - **Implementation:** Provide all students with devices (tablets/laptops) and internet access, ensuring that resources are available in underserved communities. Partner with tech companies and local governments.
   - **Measurement:**

In [17]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""
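
Note the doubled braces in the JSON format line of the prompt. Inside an f-string, `{{` and `}}` produce literal `{` and `}`, while single braces are substituted. A small illustration with made-up values:

```python
competitors = ["model-a", "model-b"]

# Single braces interpolate a value; doubled braces become literal braces.
template = f"""There are {len(competitors)} competitors.
Respond with JSON in this format:
{{"results": ["best competitor number", "second best competitor number"]}}"""

print(template)
```

Without the doubling, Python would try to evaluate `"results": [...]` as an expression and the cell would fail.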

In [18]:
print(judge)

You are judging a competition between 6 competitors.
Each model has been given this question:

If you could redesign the educational system from the ground up, considering factors such as technology, accessibility, social equity, and lifelong learning, what specific changes would you implement, and how would you measure the success of these changes over time?

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...]}

Here are the responses from each competitor:

# Response from competitor 1

Redesigning the educational system from the ground up involves a multifaceted approach that addresses technology integration, accessibility, social equity, and lifelong learning. Here’s a comprehensive framework outlining specific changes and metrics for measuring success

In [19]:
judge_messages = [{"role": "user", "content": judge}]

In [20]:
# Judgement time!

openai = OpenAI()
response = openai.chat.completions.create(
    model="o3-mini",
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)

{"results": ["3", "1", "4", "6", "5", "2"]}


In [21]:
# OK let's turn this into results!

results_dict = json.loads(results)
ranks = results_dict["results"]
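for index, result in enumerate(ranks):
    # The judge returns 1-based competitor numbers as strings, so convert and subtract 1 to index the list
    competitor = competitors[int(result)-1]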
    print(f"Rank {index+1}: {competitor}")

Rank 1: gemini-2.5-flash-preview-04-17
Rank 2: gpt-4o-mini
Rank 3: deepseek-chat
Rank 4: llama4
Rank 5: llama-3.3-70b-versatile
Rank 6: claude-3-7-sonnet-latest
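
The parsing cell above trusts the judge to return clean JSON with every competitor listed exactly once. A judge model may occasionally wrap its reply in a Markdown code fence or repeat a number, so a more defensive version could look like the sketch below. `parse_ranking` is a hypothetical helper, not part of the notebook, and the fence-stripping rule is an assumption about how such replies are usually formatted.

```python
import json


def parse_ranking(raw, competitors):
    """Return competitor names in ranked order, validating the judge's JSON reply."""
    fence = "`" * 3  # Markdown code fence marker
    text = raw.strip()
    if text.startswith(fence):
        lines = text.splitlines()[1:]  # drop the opening fence line
        if lines and lines[-1].strip() == fence:
            lines = lines[:-1]  # drop the closing fence line
        text = "\n".join(lines)

    ranks = [int(r) for r in json.loads(text)["results"]]

    # Every competitor number from 1..N must appear exactly once.
    expected = set(range(1, len(competitors) + 1))
    if len(ranks) != len(competitors) or set(ranks) != expected:
        raise ValueError(f"Ranking {ranks} is not a permutation of 1..{len(competitors)}")

    return [competitors[r - 1] for r in ranks]


competitors = ["gpt-4o-mini", "claude-3-7-sonnet-latest", "gemini-2.5-flash-preview-04-17",
               "deepseek-chat", "llama-3.3-70b-versatile", "llama4"]
results = '{"results": ["3", "1", "4", "6", "5", "2"]}'

for position, name in enumerate(parse_ranking(results, competitors), start=1):
    print(f"Rank {position}: {name}")
```

Raising on an invalid ranking makes a bad judge reply visible immediately instead of producing an `IndexError` or a silently incomplete leaderboard.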
