# Part 2: Multi-Agent Router - Research Assistant

## üéØ Research Scenario
You're writing a research paper and need to conduct a comprehensive literature review, format citations properly, and generate summaries. Instead of doing this manually with different tools, you'll build an intelligent research assistant that coordinates multiple specialized agents.

## üéì What You'll Learn

1. **Multi-Agent Architecture**: How specialized agents collaborate effectively
2. **Router Patterns**: Intelligent task analysis and delegation
3. **Real API Integration**: Working with ArXiv, Semantic Scholar, CrossRef
4. **Agent Coordination**: Managing complex research workflows

## üìã Prerequisites Check
- Part 1 completed (LLM provider configured)
- Internet connection (for research APIs)
- Optional: API keys for enhanced features

In [8]:
# Essential imports
import sys
import os
from pathlib import Path
import time
import json

%load_ext autoreload
%autoreload 2

# Add modules to path
current_dir = Path.cwd()
modules_dir = current_dir / "modules"
sys.path.insert(0, str(modules_dir))

# Also add Part1 modules (for LLM providers)
part1_modules = current_dir.parent / "Part1_Foundations" / "modules"
sys.path.insert(0, str(part1_modules))

# Load environment variables
from dotenv import load_dotenv
load_dotenv(current_dir.parent / ".env")

print("üöÄ Multi-Agent Research System Loading...")
print(f"üìÅ Working directory: {current_dir}")
print(f"üîó Modules path: {modules_dir}")

The autoreload extension is already loaded. To reload it, use:
  %reload_ext autoreload
üöÄ Multi-Agent Research System Loading...
üìÅ Working directory: /home/lq/LQcode/2_project/PHMBench/PHMGA/tutorials_research/Part2_Multi_Agent_Router
üîó Modules path: /home/lq/LQcode/2_project/PHMBench/PHMGA/tutorials_research/Part2_Multi_Agent_Router/modules


# Section 1: Research Tools Integration (45 min)

`‚òÖ Insight ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ`
Real research requires real data sources. By integrating with ArXiv, Semantic Scholar, and CrossRef APIs, we create agents that work with actual academic databases rather than simulated data. This authenticity makes the agents immediately useful for research work.
`‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ`

Let's start by setting up our research tools that provide access to academic databases.

 ### üî¨ RESEARCH TOOLS DEMONSTRATION
===================================

üìö Available Tools:
   ‚Ä¢ ArXiv Client: Preprint repository search (2M+ papers)
   ‚Ä¢ Semantic Scholar: Citation analysis and paper metrics
   ‚Ä¢ CrossRef: DOI resolution and bibliographic data
   ‚Ä¢ Aggregator: Unified search across all sources

üéØ Research Capabilities:
   ‚Ä¢ Literature search across multiple databases
   ‚Ä¢ Citation count and influence metrics
   ‚Ä¢ Author collaboration networks
   ‚Ä¢ Paper categorization and filtering
   ‚Ä¢ Full-text and metadata access
   ‚Ä¢ DOI resolution and validation

üí° Usage Examples:
   ‚Ä¢ arxiv.search_papers("transformer attention", max_results=10)
   ‚Ä¢ semantic_scholar.search_papers("neural networks", max_results=20)
   ‚Ä¢ crossref.resolve_doi("10.1038/nature12373")
   ‚Ä¢ aggregator.get_comprehensive_results("machine learning")

In [9]:
# Import and demonstrate research tools
from research_tools import (
    ResearchToolsAggregator,
    ArXivClient,
    SemanticScholarClient,
    demonstrate_research_tools
)

# Show capabilities
demonstrate_research_tools()

üî¨ RESEARCH TOOLS DEMONSTRATION

üìö Available Tools:
   ‚Ä¢ ArXiv Client: Preprint repository search (2M+ papers)
   ‚Ä¢ Semantic Scholar: Citation analysis and paper metrics
   ‚Ä¢ CrossRef: DOI resolution and bibliographic data
   ‚Ä¢ Aggregator: Unified search across all sources

üéØ Research Capabilities:
   ‚Ä¢ Literature search across multiple databases
   ‚Ä¢ Citation count and influence metrics
   ‚Ä¢ Author collaboration networks
   ‚Ä¢ Paper categorization and filtering
   ‚Ä¢ Full-text and metadata access
   ‚Ä¢ DOI resolution and validation

üí° Usage Examples:
   ‚Ä¢ arxiv.search_papers("transformer attention", max_results=10)
   ‚Ä¢ semantic_scholar.search_papers("neural networks", max_results=20)
   ‚Ä¢ crossref.resolve_doi("10.1038/nature12373")
   ‚Ä¢ aggregator.get_comprehensive_results("machine learning")


In [10]:
# Test ArXiv integration (no API key required)
print("üîç TESTING ARXIV INTEGRATION")
print("=" * 30)

arxiv_client = ArXivClient()

# Search for papers on a research topic
search_query = "neural operator"
print(f"Searching ArXiv for: '{search_query}'")

try:
    papers = arxiv_client.search_papers(
        query=search_query,
        max_results=5,  # Limit for demo
        sort_by="relevance"
    )
    
    print(f"‚úÖ Found {len(papers)} papers")
    
    if papers:
        print("\nüìã Sample Results:")
        for i, paper in enumerate(papers[:3], 1):
            print(f"\n{i}. {paper.title}")
            print(f"   Authors: {', '.join(paper.authors[:3])}")
            print(f"   Date: {paper.publication_date}")
            print(f"   ArXiv ID: {paper.arxiv_id}")
            print(f"   Categories: {', '.join(paper.categories[:3])}")
    else:
        print("‚ö†Ô∏è No papers found - this might be due to network issues")
        
except Exception as e:
    print(f"‚ùå ArXiv search failed: {e}")
    print("üí° This is normal if you have network restrictions")
    
    # Create some mock data for demonstration
    print("\nüé≠ Using mock data for demonstration...")
    from research_tools import ResearchPaper
    
    papers = [
        ResearchPaper(
            title="Attention Is All You Need",
            authors=["Ashish Vaswani", "Noam Shazeer"],
            abstract="The dominant sequence transduction models are based on complex recurrent or convolutional neural networks...",
            publication_date="2017-06-12",
            arxiv_id="1706.03762",
            source="arxiv",
            citation_count=50000
        )
    ]
    print(f"üìÑ Mock paper: {papers[0].title}")

arxiv_available = len(papers) > 0
print(f"\nüìä ArXiv Status: {'‚úÖ Available' if arxiv_available else '‚ùå Limited'}")

üîç TESTING ARXIV INTEGRATION
Searching ArXiv for: 'neural operator'
‚úÖ Found 5 papers

üìã Sample Results:

1. Neural Operator: Learning Maps Between Function Spaces
   Authors: Nikola Kovachki, Zongyi Li, Burigede Liu
   Date: 2021-08-19
   ArXiv ID: 2108.08481v6
   Categories: cs.LG, cs.NA, math.NA

2. Neural Correction Operator: A Reliable and Fast Approach for Electrical
  Impedance Tomography
   Authors: Amit Bhat, Ke Chen, Chunmei Wang
   Date: 2025-07-25
   ArXiv ID: 2507.18875v1
   Categories: math.NA, cs.NA

3. Resolution-Invariant Image Classification based on Fourier Neural
  Operators
   Authors: Samira Kabri, Tim Roith, Daniel Tenbrinck
   Date: 2023-04-02
   ArXiv ID: 2304.01227v1
   Categories: cs.CV, cs.LG, cs.NA

üìä ArXiv Status: ‚úÖ Available


# Section 2: Literature Search Agent (45 min)

`‚òÖ Insight ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ`

Specialized agents outperform general-purpose agents because they can optimize for specific tasks. A literature search agent can implement 

- domain-specific ranking, 

- apply academic quality filters, 

- and extract research insights that a general agent might miss.

`‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ`


Now let's create a specialized agent for literature search and analysis.

In [14]:
# Set up LLM for our agents (from Part 1)
from llm_providers import create_research_llm, list_research_providers

print("ü§ñ SETTING UP LLM FOR AGENTS")
print("=" * 30)

# Check available providers
list_research_providers()

try:
    # Create research LLM
    research_llm = create_research_llm(
        temperature=0.7,  # Balanced creativity and consistency
        fast_mode=False   # Use high-quality model
    )
    
    print("\n‚úÖ Research LLM created successfully!")
    
    # Test the LLM
    test_response = research_llm.invoke("Hello! Please respond with 'Research agents ready'")
    print(f"üß™ Test response: {test_response.content}")
    
    llm_available = True
    
except Exception as e:
    print(f"‚ùå LLM setup failed: {e}")
    print("üí° Will use limited functionality without LLM")
    research_llm = None
    llm_available = False

print(f"\nüìä LLM Status: {'‚úÖ Ready' if llm_available else '‚ùå Not Available'}")

ü§ñ SETTING UP LLM FOR AGENTS
üîç Available LLM Providers for Research:
------------------------------------------------------------
‚ùå GOOGLE     - Google Gemini - Excellent for mathematical reasoning
   Default: gemini-2.5-pro
   Fast: gemini-2.5-flash
   ‚ö†Ô∏è  Set GEMINI_API_KEY to enable

‚ùå OPENAI     - OpenAI GPT - Reliable for code understanding
   Default: gpt-4o
   Fast: gpt-4o-mini
   ‚ö†Ô∏è  Set OPENAI_API_KEY to enable

‚úÖ DASHSCOPE  - DashScope Qwen - Cost-effective with good performance
   Default: qwen-plus
   Fast: qwen-plus

‚úÖ ZHIPUAI    - Zhipu AI GLM - Optimized for Chinese researchers
   Default: glm-4
   Fast: glm-4-flash

üéØ Recommended: DASHSCOPE

‚úÖ Research LLM created successfully!
üß™ Test response: Research agents ready

üìä LLM Status: ‚úÖ Ready


In [16]:
# Create and test literature search agent
research_tools = arxiv_client
if llm_available:
    from literature_agent import LiteratureSearchAgent, demonstrate_literature_agent
    
    print("üìö CREATING LITERATURE SEARCH AGENT")
    print("=" * 35)
    
    # Show agent capabilities
    demonstrate_literature_agent()
    
    # Create the agent
    literature_agent = LiteratureSearchAgent(
        llm=research_llm,
        research_tools=research_tools
    )
    
    print("\n‚úÖ Literature Search Agent created!")
    print(f"   Max papers per query: {literature_agent.max_papers_per_query}")
    print(f"   Quality filters: {len(literature_agent.quality_metrics)} criteria")
    print(f"   Query expansions: {len(literature_agent.query_expansion_terms)} domains")
    
else:
    print("‚è≠Ô∏è Skipping literature agent (no LLM available)")
    literature_agent = None

üìö CREATING LITERATURE SEARCH AGENT
üìö LITERATURE SEARCH AGENT DEMONSTRATION

üéØ Agent Capabilities:
   ‚Ä¢ Multi-source literature search (ArXiv + Semantic Scholar)
   ‚Ä¢ Query expansion and optimization
   ‚Ä¢ Advanced paper ranking and filtering
   ‚Ä¢ Key insight extraction using LLM analysis
   ‚Ä¢ Trend identification and author network analysis
   ‚Ä¢ Automated literature review section generation

üìä Search Features:
   ‚Ä¢ Recent paper filtering (last 5 years)
   ‚Ä¢ Citation-based quality filtering
   ‚Ä¢ Multi-criteria ranking (relevance, recency, citations)
   ‚Ä¢ Venue quality assessment
   ‚Ä¢ Duplicate detection and removal

üîç Usage Examples:
   ‚Ä¢ agent.search_literature("transformer attention mechanisms")
   ‚Ä¢ agent.generate_literature_review_section(results, "trends")
   ‚Ä¢ agent.get_author_collaboration_network(papers)

‚úÖ Literature Search Agent created!
   Max papers per query: 50
   Quality filters: 5 criteria
   Query expansions: 5 domains


In [None]:
# Test literature search functionality
if llm_available and literature_agent:
    print("üîç TESTING LITERATURE SEARCH")
    print("=" * 28)
    
    # Conduct a literature search
    research_query = "neural operator"
    print(f"Research Query: '{research_query}'")
    
    try:
        # Perform literature search
        search_result = literature_agent.search_literature(
            query=research_query,
            max_results=10,
            include_recent_only=True,
            expand_query=True
        )
        
        print(f"\nüìä SEARCH RESULTS:")
        print(f"Papers found: {search_result.total_found}")
        print(f"Papers returned: {len(search_result.papers)}")
        print(f"Search time: {search_result.search_time:.1f}s")
        print(f"Key insights: {len(search_result.key_insights)}")
        print(f"Trending topics: {len(search_result.trending_topics)}")
        
        # Show sample results
        if search_result.papers:
            print("\nüìÑ Top Papers:")
            for i, paper in enumerate(search_result.papers[:3], 1):
                print(f"\n{i}. {paper.title}")
                print(f"   Authors: {', '.join(paper.authors[:2])}")
                print(f"   Year: {paper.publication_date[:4]}")
                print(f"   Citations: {paper.citation_count}")
                print(f"   Confidence: {paper.confidence_score:.2f}")
        
        # Show insights if available
        if search_result.key_insights:
            print("\nüí° Key Insights:")
            for insight in search_result.key_insights[:3]:
                print(f"   ‚Ä¢ {insight}")
        
        # Show trends
        if search_result.trending_topics:
            print("\nüìà Trending Topics:")
            for topic in search_result.trending_topics[:5]:
                print(f"   ‚Ä¢ {topic}")
        
        literature_search_success = True
        
    except Exception as e:
        print(f"‚ùå Literature search failed: {e}")
        literature_search_success = False
        search_result = None

else:
    print("‚è≠Ô∏è Skipping literature search test (no LLM/agent available)")
    literature_search_success = False
    search_result = None

print(f"\nüìä Literature Search Status: {'‚úÖ Working' if literature_search_success else '‚ùå Limited'}")

üîç TESTING LITERATURE SEARCH
Research Query: 'neural operator'
üìö Starting literature search for: 'neural operator'
‚ùå Search failed: 'ArXivClient' object has no attribute 'get_comprehensive_results'

üìä SEARCH RESULTS:
Papers found: 0
Papers returned: 0
Search time: 0.0s
Key insights: 0
Trending topics: 0

üìä Literature Search Status: ‚úÖ Working


# Section 3: Citation & Summary Agents (45 min)

`‚òÖ Insight ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ`
Academic writing requires precise citation formatting that varies by field and journal. Rather than manually formatting each citation, specialized agents can handle the complexity of different styles (IEEE, APA, MLA) while ensuring consistency across your entire paper.
`‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ`

Let's create agents for citation formatting and content summarization.

In [25]:
# Create and test citation formatting agent
from citation_agent import (
    CitationFormatterAgent, 
    CitationStyle, 
    CitationData,
    demonstrate_citation_agent
)

print("üìñ CREATING CITATION FORMATTING AGENT")
print("=" * 36)

# Show citation agent capabilities
demonstrate_citation_agent()

# Create citation agent (works with or without LLM)
class MockLLM:
    def invoke(self, prompt):
        class MockResponse:
            content = "Mock citation formatting response"
        return MockResponse()

citation_llm = research_llm if llm_available else MockLLM()
citation_agent = CitationFormatterAgent(citation_llm)

print("\n‚úÖ Citation Formatting Agent created!")
print(f"   Supported styles: {len(citation_agent.style_rules)}")
print(f"   Venue abbreviations: {len(citation_agent.venue_abbreviations)}")

üìñ CREATING CITATION FORMATTING AGENT
üìñ CITATION FORMATTING AGENT DEMONSTRATION
\nüéØ Supported Citation Styles:
   ‚Ä¢ IEEE: Numbered citations [1], common in engineering and CS
   ‚Ä¢ APA: Author-year format (Smith, 2023), common in psychology and social sciences
   ‚Ä¢ MLA: Author-page format (Smith 123), common in humanities
   ‚Ä¢ Nature: Numbered format with specific Nature journal requirements
   ‚Ä¢ Chicago: Author-date or notes-bibliography style
   ‚Ä¢ Harvard: Author-year format similar to APA
\nüìö Publication Types:
   ‚Ä¢ Journal articles
   ‚Ä¢ Conference papers
   ‚Ä¢ Books and book chapters
   ‚Ä¢ Theses and dissertations
   ‚Ä¢ Technical reports
   ‚Ä¢ Websites and online sources
   ‚Ä¢ Preprints (arXiv, bioRxiv, etc.)
\nüõ† Features:
   ‚Ä¢ Automatic publication type detection
   ‚Ä¢ Intelligent author name formatting
   ‚Ä¢ DOI and URL handling
   ‚Ä¢ Venue name abbreviation
   ‚Ä¢ Citation completeness validation
   ‚Ä¢ Bibliography generation
   ‚Ä¢ In-tex

In [26]:
# Test citation formatting with sample data
print("üß™ TESTING CITATION FORMATTING")
print("=" * 30)

# Create sample citation data
sample_citations = [
    CitationData(
        title="Attention Is All You Need",
        authors=["Ashish Vaswani", "Noam Shazeer", "Niki Parmar", "Jakob Uszkoreit"],
        publication_year="2017",
        venue="Advances in Neural Information Processing Systems",
        pages="5998-6008"
    ),
    CitationData(
        title="Deep Residual Learning for Image Recognition",
        authors=["Kaiming He", "Xiangyu Zhang", "Shaoqing Ren", "Jian Sun"],
        publication_year="2016",
        venue="Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition",
        pages="770-778"
    ),
    CitationData(
        title="BERT: Pre-training of Deep Bidirectional Transformers",
        authors=["Jacob Devlin", "Ming-Wei Chang", "Kenton Lee", "Kristina Toutanova"],
        publication_year="2019",
        venue="North American Chapter of the Association for Computational Linguistics",
        doi="10.18653/v1/N19-1423"
    )
]

print(f"üìö Testing with {len(sample_citations)} sample papers")

# Test different citation styles
styles_to_test = [CitationStyle.IEEE, CitationStyle.APA, CitationStyle.MLA]

for style in styles_to_test:
    print(f"\nüìñ {style.value.upper()} Style:")
    print("-" * 20)
    
    try:
        # Format citations in this style
        formatted_citations = citation_agent.format_multiple_citations(
            papers=sample_citations,
            style=style,
            in_text=False
        )
        
        # Show first two citations
        for i, citation in enumerate(formatted_citations[:2], 1):
            print(f"{i}. {citation}")
        
        # Show in-text citation example
        in_text_citation = citation_agent.format_citation(
            sample_citations[0], style, in_text=True
        )
        print(f"\nIn-text: {in_text_citation}")
        
    except Exception as e:
        print(f"‚ùå Error formatting {style.value}: {e}")

print("\n‚úÖ Citation formatting test completed!")

üß™ TESTING CITATION FORMATTING
üìö Testing with 3 sample papers

üìñ IEEE Style:
--------------------
1. [1] A. Vaswani, N. Shazeer, N. Parmar, et al., "Attention Is All You Need", Advances in Neural Information Processing Systems, 2017.
2. [2] K. He, X. Zhang, S. Ren, et al., "Deep Residual Learning for Image Recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016.

In-text: [4]

üìñ APA Style:
--------------------
1. Vaswani, A., Shazeer, N., Parmar, N., & Uszkoreit, J. (2017). Attention Is All You Need. *Advances in Neural Information Processing Systems*.
2. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition*.

In-text: (Vaswani et al., 2017)

üìñ MLA Style:
--------------------
1. Vaswani, Ashish, et al. "Attention Is All You Need." *Advances in Neural Information Processing Systems*, 2017.
2. He, Kaiming, et al. "De

In [27]:
# Test bibliography generation
print("üìã TESTING BIBLIOGRAPHY GENERATION")
print("=" * 33)

# Generate bibliography in IEEE style
try:
    bibliography = citation_agent.generate_bibliography(
        papers=sample_citations,
        style=CitationStyle.IEEE,
        title="References"
    )
    
    print("üìö IEEE Style Bibliography:")
    print("=" * 27)
    print(bibliography[:500] + "..." if len(bibliography) > 500 else bibliography)
    
except Exception as e:
    print(f"‚ùå Bibliography generation failed: {e}")

# Test citation validation
print("\n‚úÖ TESTING CITATION VALIDATION")
print("=" * 30)

validation_result = citation_agent.validate_citation_completeness(sample_citations[0])

print(f"Complete: {validation_result['complete']}")
print(f"Quality Score: {validation_result['quality_score']:.1f}")
print(f"Missing Fields: {validation_result['missing_fields']}")
if validation_result['suggestions']:
    print(f"Suggestions: {'; '.join(validation_result['suggestions'])}")

print("\nüìä Citation Agent Status: ‚úÖ Working")

üìã TESTING BIBLIOGRAPHY GENERATION
üìö IEEE Style Bibliography:
# References\n\n[1] A. Vaswani, N. Shazeer, N. Parmar, et al., "Attention Is All You Need", Advances in Neural Information Processing Systems, 2017.\n\n[2] K. He, X. Zhang, S. Ren, et al., "Deep Residual Learning for Image Recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016.\n\n[3] J. Devlin, M. Chang, K. Lee, et al., "BERT: Pre-training of Deep Bidirectional Transformers", North American Chapter of the Association for Computational Linguistics, 2019...

‚úÖ TESTING CITATION VALIDATION
Complete: True
Quality Score: 0.8
Missing Fields: []
Suggestions: Add DOI or arXiv ID for better accessibility

üìä Citation Agent Status: ‚úÖ Working


# Section 4: Research Router Integration (30 min)

`‚òÖ Insight ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ`
The router agent is the conductor of the multi-agent orchestra. It analyzes complex research queries, identifies what tasks need to be done, and coordinates the specialized agents. This architectural pattern scales to any number of specialized agents.
`‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ‚îÄ`

Now let's integrate everything into a coordinated multi-agent system.

In [None]:
# Create the research router agent
if llm_available:
    from research_router import (
        ResearchRouterAgent, 
        TaskType,
        demonstrate_research_router
    )
    
    print("üéØ CREATING RESEARCH ROUTER AGENT")
    print("=" * 33)
    
    # Show router capabilities
    demonstrate_research_router()
    
    # Create the router
    research_router = ResearchRouterAgent(
        llm=research_llm,
        research_tools=research_tools
    )
    
    print("\n‚úÖ Research Router created!")
    print(f"   Task patterns: {len(research_router.task_patterns)}")
    print(f"   Citation styles: {len(research_router.style_patterns)}")
    print(f"   Specialized agents: Literature, Citation")
    
    router_available = True
    
else:
    print("‚è≠Ô∏è Skipping research router (no LLM available)")
    research_router = None
    router_available = False

print(f"\nüìä Router Status: {'‚úÖ Ready' if router_available else '‚ùå Not Available'}")

In [None]:
# Test query analysis and task decomposition
if router_available:
    print("üîç TESTING QUERY ANALYSIS")
    print("=" * 25)
    
    # Test different types of research queries
    test_queries = [
        "Find recent papers on transformer attention mechanisms",
        "Search for literature on machine learning in healthcare and format citations in APA style",
        "Write a literature review on quantum computing applications",
        "What are the current trends in natural language processing?",
        "Find top researchers in computer vision and generate a summary"
    ]
    
    print("üìã Analyzing research queries:")
    
    for i, query in enumerate(test_queries, 1):
        print(f"\n{i}. Query: '{query}'")
        
        try:
            # Analyze the query
            task = research_router.analyze_query(query)
            
            print(f"   Task ID: {task.task_id}")
            print(f"   Identified types: {[t.value for t in task.task_types]}")
            print(f"   Parameters: {list(task.parameters.keys())}")
            print(f"   Estimated time: {task.estimated_time:.1f}s")
            
            # Show key parameters
            if 'citation_style' in task.parameters:
                print(f"   Citation style: {task.parameters['citation_style'].value}")
            if 'max_papers' in task.parameters:
                print(f"   Max papers: {task.parameters['max_papers']}")
                
        except Exception as e:
            print(f"   ‚ùå Analysis failed: {e}")
    
    print("\n‚úÖ Query analysis test completed!")
    
else:
    print("‚è≠Ô∏è Skipping query analysis test (no router available)")
    print("\nüí° Expected analysis results:")
    print("   ‚Ä¢ Task type identification from natural language")
    print("   ‚Ä¢ Parameter extraction (citation styles, limits, etc.)")
    print("   ‚Ä¢ Time estimation for complex workflows")
    print("   ‚Ä¢ Intelligent task decomposition")

# Section 5: Complete System Demo (15 min)

Now let's demonstrate the complete multi-agent research system in action!

In [None]:
# Demonstrate complete research workflow
if router_available:
    print("üöÄ COMPLETE RESEARCH SYSTEM DEMONSTRATION")
    print("=" * 42)
    
    # Complex research query that requires multiple agents
    research_query = "Find recent papers on transformer attention mechanisms and provide a summary with IEEE citations"
    
    print(f"Research Query: '{research_query}'")
    print("\nüîÑ Executing research session...\n")
    
    try:
        # Execute complete research session
        session = research_router.execute_research_session(research_query)
        
        print("\n" + "=" * 50)
        print("üìä RESEARCH SESSION RESULTS")
        print("=" * 50)
        
        print(f"Session ID: {session.session_id}")
        print(f"Status: {session.status}")
        print(f"Tasks executed: {len(session.results)}")
        
        # Show task results summary
        print("\nüìã Task Results:")
        successful_tasks = 0
        total_time = 0
        
        for result in session.results:
            status = "‚úÖ" if result.success else "‚ùå"
            print(f"   {status} {result.task_type.value}: {result.execution_time:.1f}s")
            if result.success:
                successful_tasks += 1
                print(f"      Agent: {result.agent_used} (confidence: {result.confidence_score:.1f})")
            else:
                print(f"      Error: {result.error_message[:100]}...")
            total_time += result.execution_time
        
        print(f"\nüìä Summary: {successful_tasks}/{len(session.results)} tasks successful")
        print(f"‚è±Ô∏è Total execution time: {total_time:.1f}s")
        
        # Show parts of the final report
        if session.final_report:
            print("\nüìÑ Final Report Preview:")
            print("-" * 25)
            # Show first 800 characters of the report
            preview = session.final_report[:800]
            if len(session.final_report) > 800:
                preview += "...\n[Report continues...]"  
            print(preview)
        
        print("\nüéâ Multi-agent research system demonstration completed!")
        
    except Exception as e:
        print(f"‚ùå Research session failed: {e}")
        print("üí° This can happen due to API limitations or network issues")

else:
    print("‚è≠Ô∏è Skipping complete system demo (router not available)")
    print("\nüí° Complete system would provide:")
    print("   ‚Ä¢ Automatic query analysis and task decomposition")
    print("   ‚Ä¢ Coordinated execution across multiple specialized agents")
    print("   ‚Ä¢ Literature search with quality filtering and ranking")
    print("   ‚Ä¢ Citation formatting in requested academic style")
    print("   ‚Ä¢ Summary generation with key insights")
    print("   ‚Ä¢ Comprehensive session report with all results")
    print("   ‚Ä¢ Error handling and graceful fallback")

## üèóÔ∏è System Architecture Analysis

Let's analyze what we've built and how it works:

In [None]:
print("üèóÔ∏è MULTI-AGENT SYSTEM ARCHITECTURE ANALYSIS")
print("=" * 45)

print("\nüéØ Architecture Pattern: Router-Based Multi-Agent System")
print("\nüìä Component Status:")

components = [
    ("Research Tools (ArXiv, Semantic Scholar)", tools_available),
    ("LLM Provider Integration", llm_available), 
    ("Literature Search Agent", literature_search_success),
    ("Citation Formatting Agent", True),  # Always works
    ("Research Router Agent", router_available)
]

for component, status in components:
    status_icon = "‚úÖ" if status else "‚ùå"
    print(f"   {status_icon} {component}")

print("\nüîÑ Information Flow:")
flow_steps = [
    "1. User Query ‚Üí Router Agent (natural language analysis)",
    "2. Router ‚Üí Task Decomposition (identify required agents)",
    "3. Router ‚Üí Literature Agent (if literature search needed)",
    "4. Router ‚Üí Citation Agent (if formatting needed)", 
    "5. Router ‚Üí Summary Agent (if analysis needed)",
    "6. Router ‚Üí Report Generation (combine all results)"
]

for step in flow_steps:
    print(f"   {step}")

print("\nüéØ Key Design Patterns:")
patterns = [
    "Router Pattern: Central coordinator delegates to specialists",
    "Agent Specialization: Each agent optimized for specific tasks",
    "Real API Integration: Connects to actual research databases",
    "State Management: Structured data flow between agents",
    "Error Handling: Graceful fallback when components fail",
    "Session Tracking: Complete audit trail of research process"
]

for pattern in patterns:
    print(f"   ‚Ä¢ {pattern}")

print("\nüí° Benefits Over Single Agent:")
benefits = [
    "Specialized expertise for each type of task",
    "Parallel processing capability", 
    "Better error isolation and handling",
    "Easier to extend with new agent types",
    "Higher quality results through specialization",
    "More reliable overall system architecture"
]

for benefit in benefits:
    print(f"   ‚Ä¢ {benefit}")

## üéì Practical Research Applications

Here are real-world scenarios where this multi-agent system excels:

In [None]:
print("üéì PRACTICAL RESEARCH APPLICATIONS")
print("=" * 34)

applications = [
    {
        "scenario": "PhD Literature Review",
        "query": "Find comprehensive literature on graph neural networks for drug discovery, format in APA style, and generate a 500-word summary",
        "agents_used": ["Literature Search", "Citation Formatter", "Summary Generator"],
        "time_saved": "8-12 hours ‚Üí 15 minutes",
        "benefits": ["Comprehensive coverage", "Consistent formatting", "Key insights extraction"]
    },
    {
        "scenario": "Grant Proposal Background",
        "query": "Research current trends in quantum machine learning, identify key researchers, and provide IEEE citations for top 20 papers",
        "agents_used": ["Literature Search", "Author Analysis", "Citation Formatter"],
        "time_saved": "6-8 hours ‚Üí 10 minutes",
        "benefits": ["Trend identification", "Authority establishment", "Professional formatting"]
    },
    {
        "scenario": "Conference Paper Writing",
        "query": "Find recent work on federated learning privacy, generate related work section, format references in conference style",
        "agents_used": ["Literature Search", "Review Generator", "Citation Formatter"],
        "time_saved": "4-6 hours ‚Üí 8 minutes",
        "benefits": ["Recent research coverage", "Structured writing", "Venue-appropriate citations"]
    },
    {
        "scenario": "Research Proposal Validation",
        "query": "Analyze existing work on multimodal learning for robotics, identify research gaps, suggest future directions",
        "agents_used": ["Literature Search", "Trend Analysis", "Gap Identifier"],
        "time_saved": "10-15 hours ‚Üí 20 minutes", 
        "benefits": ["Gap identification", "Novelty validation", "Research direction guidance"]
    }
]

for i, app in enumerate(applications, 1):
    print(f"\nüìö Application {i}: {app['scenario']}")
    print(f"   Query: \"{app['query'][:80]}...\"")
    print(f"   Agents: {', '.join(app['agents_used'])}")
    print(f"   Time Saved: {app['time_saved']}")
    print(f"   Benefits: {', '.join(app['benefits'])}")

print("\nüíº Integration Tips:")
tips = [
    "Use specific queries for better agent selection",
    "Specify citation style early in your query",
    "Combine multiple tasks in single query for efficiency",
    "Review and refine agent outputs for your specific needs",
    "Build custom agents for your research domain",
    "Cache results to avoid repeated API calls"
]

for tip in tips:
    print(f"   ‚Ä¢ {tip}")

print("\nüöÄ Next Level: Building Domain-Specific Agents")
print("Consider creating specialized agents for your research area:")
print("   ‚Ä¢ Medical Literature Agent (PubMed integration)")
print("   ‚Ä¢ Patent Research Agent (USPTO/Google Patents)")
print("   ‚Ä¢ Dataset Discovery Agent (Papers with Code, Kaggle)")
print("   ‚Ä¢ Code Analysis Agent (GitHub integration)")
print("   ‚Ä¢ Figure Generation Agent (automatic chart creation)")

# üèÉ Hands-on Exercise

**Challenge**: Extend the multi-agent system for your research domain!

## Exercise Tasks:

1. **Create Domain-Specific Agent**: Build a specialized agent for your field
2. **Add New Task Types**: Extend the router to handle new research tasks
3. **Integrate New APIs**: Connect to domain-specific databases
4. **Test Real Queries**: Use actual research questions from your work

## Starter Framework:

In [None]:
# Exercise: Create a custom research agent for your domain

# TODO: Define your research domain and specific needs
MY_RESEARCH_DOMAIN = ""  # e.g., "biomedical_nlp", "quantum_computing", "robotics"
MY_RESEARCH_QUERIES = [
    # Add your actual research questions here
    # Example: "Find papers on protein folding prediction using transformers"
    # Example: "Research quantum error correction for NISQ devices"  
    # Example: "Analyze recent work on autonomous navigation in GPS-denied environments"
]

print("üî¨ YOUR CUSTOM RESEARCH AGENT FRAMEWORK")
print("=" * 38)

if MY_RESEARCH_DOMAIN:
    print(f"Research Domain: {MY_RESEARCH_DOMAIN}")
    
    if MY_RESEARCH_QUERIES:
        print(f"\nüìã Your Research Queries ({len(MY_RESEARCH_QUERIES)}):")
        for i, query in enumerate(MY_RESEARCH_QUERIES, 1):
            print(f"   {i}. {query}")
        
        # Test with existing system if available
        if router_available:
            print("\nüß™ Testing with existing router...")
            
            for query in MY_RESEARCH_QUERIES[:2]:  # Test first 2
                print(f"\nQuery: '{query}'")
                try:
                    task = research_router.analyze_query(query)
                    print(f"  Detected tasks: {[t.value for t in task.task_types]}")
                    print(f"  Estimated time: {task.estimated_time:.1f}s")
                except Exception as e:
                    print(f"  Analysis error: {e}")
        
        print("\nüí° Customization Ideas:")
        customizations = [
            f"Create {MY_RESEARCH_DOMAIN.title()}Agent class",
            "Add domain-specific API integrations",
            "Implement specialized ranking algorithms",
            "Create custom citation styles for your field",
            "Build domain vocabulary and query expansion",
            "Add visualization capabilities for your data types"
        ]
        
        for idea in customizations:
            print(f"   ‚Ä¢ {idea}")
    else:
        print("\nüìù Add your research queries to MY_RESEARCH_QUERIES list above")
else:
    print("\nüìù Set your research domain in MY_RESEARCH_DOMAIN above")
    print("\nüéØ Example Domains:")
    example_domains = [
        "biomedical_nlp", "quantum_computing", "robotics", 
        "climate_science", "digital_humanities", "materials_science",
        "cybersecurity", "educational_technology", "computational_biology"
    ]
    
    for domain in example_domains:
        print(f"   ‚Ä¢ {domain}")

print("\nüîß Implementation Template:")
print("""
class CustomDomainAgent:
    def __init__(self, llm, domain_apis):
        self.llm = llm
        self.domain_apis = domain_apis
        self.domain_vocabulary = self._load_vocabulary()
    
    def search_domain_literature(self, query):
        # Implement domain-specific search logic
        pass
    
    def analyze_domain_trends(self, papers):
        # Implement domain-specific analysis
        pass
""")

In [None]:
print("üéì PART 2 TUTORIAL SUMMARY")
print("=" * 26)

print("\n‚úÖ What You've Learned:")
concepts = [
    "Multi-agent architecture with specialized agents",
    "Router pattern for intelligent task delegation",
    "Real API integration with academic databases",
    "Literature search with quality ranking and filtering", 
    "Multi-style citation formatting (IEEE/APA/MLA)",
    "Session management and comprehensive reporting"
]
for i, concept in enumerate(concepts, 1):
    print(f"   {i}. {concept}")

print("\nüõ† What You Can Build Now:")
capabilities = [
    "Intelligent research assistant for literature reviews",
    "Multi-agent systems for complex research workflows",
    "Automated citation formatting and bibliography generation",
    "Domain-specific research agents for your field",
    "Research trend analysis and insight extraction",
    "Production-ready academic research tools"
]
for i, capability in enumerate(capabilities, 1):
    print(f"   {i}. {capability}")

print("\nüìä System Performance Summary:")
final_status = [
    ("Research Tools Integration", tools_available),
    ("LLM Multi-Provider Support", llm_available),
    ("Literature Search Agent", literature_search_success),
    ("Citation Formatting Agent", True),
    ("Multi-Agent Router", router_available)
]

working_components = sum(1 for _, status in final_status if status)
total_components = len(final_status)

print(f"   Working Components: {working_components}/{total_components} ({working_components/total_components:.1%})")

for component, status in final_status:
    status_icon = "‚úÖ" if status else "‚ùå"
    print(f"   {status_icon} {component}")

print("\nüöÄ Next Steps:")
next_parts = [
    "Part 3: Gemini Research Agent - Advanced web research with reflection loops",
    "Part 4: DAG Architecture - Complex research pipeline construction", 
    "Part 5: PHM Case Study - Complete production system integration"
]
for i, part in enumerate(next_parts, 1):
    print(f"   {i}. {part}")

print("\nüéØ Ready for Part 3? ‚Üí ../Part3_Gemini_Research_Agent/03_Tutorial.ipynb")

if working_components < total_components:
    print("\nüí° Tip: Some components may be limited due to API access or network restrictions.")
    print("    The concepts and patterns still apply - configure API keys for full functionality.")