In [1]:
from lionagi import Branch, iModel, BaseModel

In [2]:
class Source(BaseModel):
    title: str
    url: str


class ResearchReport(BaseModel):
    title: str | None = None
    content: str | None = None
    source: list[Source] | None = None

In [3]:
system_message = """
You are a specialized research assistant. Your task is to thoroughly research the user's query using your available tools.
"""

sonnet = iModel(
    provider="openrouter",
    model="anthropic/claude-3.5-sonnet",
    max_tokens=8000,  # required for anthropic models
    invoke_with_endpoint=False,
    temperature=0.2,
)

branch = Branch(system=system_message, chat_model=sonnet)

In [4]:
branch.connect(
    name="search_exa",
    provider="exa",
    endpoint="search",
    queue_capacity=5,
    capacity_refresh_time=1,
    description="Search the exa database for relevant information",
)

branch.connect(
    name="search_perplexity",
    provider="perplexity",
    queue_capacity=100,
    capacity_refresh_time=60,
    description="Search the perplexity database for relevant information",
)
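
Both tools are registered with `queue_capacity` and `capacity_refresh_time`, which the notebook does not explain. One plausible reading, assumed here, is a request budget that refills at a fixed interval: 5 calls per 1 second for Exa and 100 calls per 60 seconds for Perplexity. The `RefreshingCapacity` class below is a hypothetical illustration of that idea only; it is not lionagi's implementation and its names are invented for this sketch.

```python
class RefreshingCapacity:
    """Toy limiter: `capacity` calls are allowed per `refresh_time` seconds.

    Assumption: this mirrors what queue_capacity / capacity_refresh_time
    might mean. It is not taken from lionagi's source code.
    """

    def __init__(self, capacity: int, refresh_time: float) -> None:
        self.capacity = capacity
        self.refresh_time = refresh_time
        self.available = capacity
        self.window_start = 0.0

    def try_acquire(self, now: float) -> bool:
        """Return True if a call may run at time `now` (in seconds)."""
        # Refill the whole budget once the current window has elapsed.
        if now - self.window_start >= self.refresh_time:
            self.available = self.capacity
            self.window_start = now
        if self.available > 0:
            self.available -= 1
            return True
        return False


# Seven calls inside the first second against a budget of five per second.
exa_like = RefreshingCapacity(capacity=5, refresh_time=1)
burst = [exa_like.try_acquire(0.1 * i) for i in range(7)]
after_refresh = exa_like.try_acquire(1.0)
print(burst, after_refresh)
```

Passing the clock in as `now` keeps the sketch deterministic; a real limiter would read a monotonic clock and usually queue rejected calls rather than drop them.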

In [5]:
user_msg = """I want a research paper on the latest trends in using AI to predict the stock market"""

In [6]:
result = await branch.ReAct(
    instruct={
        "instruction": user_msg,
    },
    interpret=True,
    interpret_domain="research",
    interpret_style="exhaustive",
    response_format=ResearchReport,
    extension_allowed=True,
    max_extensions=5,
)
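
The call above asks the branch to reason, call tools, and repeat, with `max_extensions=5` limiting the extra rounds. As a conceptual aid, here is a minimal reason-act loop under stated assumptions: `react_loop`, `stub_think`, and `stub_tools` are hypothetical names, the tools return canned strings rather than real search results, and treating `max_extensions` as a plain cap on rounds is an assumption. It is a sketch of the general ReAct pattern, not of how `Branch.ReAct` is implemented.

```python
def react_loop(question, think, tools, max_extensions=5):
    """Alternate between reasoning and tool calls until a final answer.

    `think(question, observations)` must return either
    ("act", tool_name, query) or ("final", answer).
    """
    observations = []
    for _ in range(max_extensions):
        step = think(question, observations)
        if step[0] == "final":
            return step[1], observations
        _, tool_name, query = step
        # Run the requested tool and record what it returned.
        observations.append((tool_name, tools[tool_name](query)))
    # Round budget exhausted without a final answer.
    return None, observations


# Stub tools that only echo the query (no network access, no real data).
stub_tools = {
    "search_exa": lambda q: f"stub exa result for {q!r}",
    "search_perplexity": lambda q: f"stub perplexity result for {q!r}",
}


def stub_think(question, observations):
    """Stand-in for the model: search twice, then answer."""
    if len(observations) == 0:
        return ("act", "search_exa", question)
    if len(observations) == 1:
        return ("act", "search_perplexity", question)
    return ("final", f"answer built from {len(observations)} observations")


answer, trace = react_loop("AI stock prediction", stub_think, stub_tools)
print(answer)
```

The cap matters because a model that keeps requesting tools would otherwise loop indefinitely; here the loop simply returns `None` as the answer when the budget runs out.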

In [7]:
from IPython.display import Markdown

msgs = f"Title: {result.title}\n\nContent: {result.content}\n\n"
# `source` is optional on ResearchReport, so fall back to an empty list
for src in result.source or []:
    msgs += f"Source: [{src.title}]({src.url})\n\n"

Markdown(msgs)

Title: Research Overview: AI in Stock Market Prediction (2021-2024)

Content: 1. Latest AI Methodologies (2021-2024)

Machine Learning & Deep Learning Approaches:
- LSTM networks demonstrate superior performance for time series prediction, showing highest model fitting ability across multiple studies
- XGBoost achieves exceptional accuracy (99.47% R-squared) after hyperparameter tuning
- Hybrid approaches combining CNN-LSTM show promise for feature extraction and long-term dependency handling

Natural Language Processing & Sentiment Analysis:
- FinBERT and VADER implementations for financial news sentiment analysis
- Integration of news headlines and social media sentiment with technical indicators
- Real-time news processing capabilities reducing storage requirements

2. Technical Implementation

Data Sources & Feature Engineering:
- Historical price data (open, high, low, close, volume)
- Technical indicators (RSI, MACD, moving averages)
- News sentiment scores
- Social media sentiment analysis
- Google Trends data (showing minimal impact)

Model Architectures:
- Stacked LSTM networks for time series analysis
- Hybrid CNN-LSTM for feature extraction
- XGBoost with hyperparameter optimization
- Ensemble methods combining multiple approaches

3. Critical Analysis

Accuracy & Performance:
- XGBoost: 99.47% R-squared, MAE 15.98, RMSE 27.34
- LSTM-based models: 83% improvement in portfolio value
- SVR for Indonesian stocks: 0.50% MAPE

Implementation Challenges:
- Data quality and integration issues
- Model drift requiring regular retraining
- Overfitting concerns
- Market anomaly handling
- Cybersecurity risks and data poisoning threats

Regulatory Considerations:
- Data privacy compliance
- Ethical AI usage guidelines
- Shadow AI monitoring
- Market manipulation prevention

4. Comparative Analysis

Traditional vs AI Methods:
- Surprising effectiveness of simple random walk (Naïve) models for monthly predictions
- AI models show superior performance in daily predictions
- Hybrid approaches outperform pure statistical methods

Market Conditions Impact:
- Models struggle with black swan events
- Need for separate anomaly detection systems
- Performance varies across different market conditions

5. Future Directions

Methodological Improvements:
- Integration of quantum computing
- Explainable AI implementation
- Enhanced anomaly detection systems

Scalability & Integration:
- Cloud-based deployment
- Real-time processing capabilities
- Multi-source data integration

Conclusion:
While AI shows promising results in stock market prediction, particularly through hybrid approaches and ensemble methods, challenges remain in handling market anomalies and ensuring regulatory compliance. The field continues to evolve with integration of new technologies and improved methodologies.

Source: [Deep Learning for Stock Market Prediction](https://arxiv.org/pdf/2004.01497.pdf)

Source: [Predicting Stock Prices: A Comparative Study of LSTM, ARIMA, XGBOOST](https://github.com/giftyaiyegbeni/Predicting_Stock_Prices)

Source: [Stock Market Prediction using Reinforcement Learning with Sentiment Analysis](https://ijcionline.com/paper/12/12123ijci01.pdf)

Source: [Google Trends and Technical Indicator based Machine Learning for Stock Market Prediction](https://journal.universitasbumigora.ac.id/index.php/matrik/article/view/2287)
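
Every field on `ResearchReport` is optional, so the rendering step may receive `None` for the title, the content, or the source list. The helper below is a minimal sketch of a renderer that tolerates that. `render_report` and the placeholder strings are hypothetical, and the demo uses `SimpleNamespace` stand-ins with made-up values instead of a real model response.

```python
from types import SimpleNamespace


def render_report(report) -> str:
    """Build Markdown from any object with title, content and source attributes."""
    parts = [
        f"Title: {report.title or '(untitled)'}",
        f"Content: {report.content or '(no content)'}",
    ]
    # A missing source list is treated the same as an empty one.
    for src in report.source or []:
        parts.append(f"Source: [{src.title}]({src.url})")
    return "\n\n".join(parts) + "\n\n"


# Stand-in objects with illustrative values only.
demo = SimpleNamespace(
    title="Demo report",
    content=None,
    source=[SimpleNamespace(title="Example", url="https://example.com")],
)
empty = SimpleNamespace(title=None, content=None, source=None)

print(render_report(demo))
print(render_report(empty))
```

In the notebook, the returned string could be passed to `Markdown(...)` in place of the hand-built `msgs`.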



In [8]:
branch.to_df()

Unnamed: 0,created_at,role,content,id,sender,recipient,metadata
0,2025-01-20 12:37:39.272502,system,{'system_message': ' You are a specialized res...,0421205b-2699-4dec-9e04-bb01a97da344,system,a9fdbb06-1cd7-4656-aa04-702d009dce54,{'lion_class': 'lionagi.protocols.messages.sys...
1,2025-01-20 12:37:44.665248,user,"{'context': [], 'instruction': '## IMPROVED PR...",25690e9f-a930-4d8e-9a26-382e9eb268dc,user,a9fdbb06-1cd7-4656-aa04-702d009dce54,{'lion_class': 'lionagi.protocols.messages.ins...
2,2025-01-20 12:37:58.180626,assistant,"{'assistant_response': '```json {  ""analysis""...",38ed84f5-846b-4cc9-8798-6531644db023,a9fdbb06-1cd7-4656-aa04-702d009dce54,user,{'model_response': {'id': 'gen-1737394665-7HvE...
3,2025-01-20 12:38:02.375560,action,"{'action_request': {'function': 'search_exa', ...",8eaaba95-ee0e-4494-8395-8ebab38b8bee,a9fdbb06-1cd7-4656-aa04-702d009dce54,7d6e25e8-32d5-4f28-aded-f04de7dc3838,{'lion_class': 'lionagi.protocols.messages.act...
4,2025-01-20 12:38:02.375719,action,{'action_request_id': '8eaaba95-ee0e-4494-8395...,f44e5bc2-ae2e-4df5-91a3-db59a48d845c,7d6e25e8-32d5-4f28-aded-f04de7dc3838,a9fdbb06-1cd7-4656-aa04-702d009dce54,{'lion_class': 'lionagi.protocols.messages.act...
5,2025-01-20 12:38:03.452335,action,{'action_request': {'function': 'search_perple...,47f1156b-0c2d-4a34-86be-0da4ed0154b0,a9fdbb06-1cd7-4656-aa04-702d009dce54,0f8273c6-7d0c-4897-8e31-cc926dd9e5a1,{'lion_class': 'lionagi.protocols.messages.act...
6,2025-01-20 12:38:03.452443,action,{'action_request_id': '47f1156b-0c2d-4a34-86be...,ab697680-36d0-42f9-8113-08bf5d4d635f,0f8273c6-7d0c-4897-8e31-cc926dd9e5a1,a9fdbb06-1cd7-4656-aa04-702d009dce54,{'lion_class': 'lionagi.protocols.messages.act...
7,2025-01-20 12:38:03.456814,user,{'context': [{'action_request_id': '47f1156b-0...,592f9997-5960-46d7-bc6e-c5aeb201b6fa,user,a9fdbb06-1cd7-4656-aa04-702d009dce54,{'lion_class': 'lionagi.protocols.messages.ins...
8,2025-01-20 12:38:15.498436,assistant,"{'assistant_response': '```json {  ""analysis""...",84d8aede-96e3-4df3-93f5-261a800d7253,a9fdbb06-1cd7-4656-aa04-702d009dce54,user,{'model_response': {'id': 'gen-1737394683-u0sc...
9,2025-01-20 12:38:19.564153,action,"{'action_request': {'function': 'search_exa', ...",af8727d3-da33-4060-a9f6-25ecf87991d2,a9fdbb06-1cd7-4656-aa04-702d009dce54,7d6e25e8-32d5-4f28-aded-f04de7dc3838,{'lion_class': 'lionagi.protocols.messages.act...
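
The message log above can be summarized to see where the run spent its time. The sketch below copies the `created_at` and `role` values of the ten rows shown and uses only the standard library; the variable names are illustrative. It counts messages per role and measures how long each assistant message took to arrive after the message before it.

```python
from collections import Counter
from datetime import datetime

# (created_at, role) pairs copied from the ten rows displayed above.
rows = [
    ("2025-01-20 12:37:39.272502", "system"),
    ("2025-01-20 12:37:44.665248", "user"),
    ("2025-01-20 12:37:58.180626", "assistant"),
    ("2025-01-20 12:38:02.375560", "action"),
    ("2025-01-20 12:38:02.375719", "action"),
    ("2025-01-20 12:38:03.452335", "action"),
    ("2025-01-20 12:38:03.452443", "action"),
    ("2025-01-20 12:38:03.456814", "user"),
    ("2025-01-20 12:38:15.498436", "assistant"),
    ("2025-01-20 12:38:19.564153", "action"),
]

stamps = [datetime.fromisoformat(ts) for ts, _ in rows]
roles = [role for _, role in rows]

# How many messages of each role were logged.
role_counts = Counter(roles)

# Wall-clock time covered by the rows shown.
elapsed = (stamps[-1] - stamps[0]).total_seconds()

# Delay between each assistant message and the message just before it.
assistant_waits = [
    (stamps[i] - stamps[i - 1]).total_seconds()
    for i in range(1, len(rows))
    if roles[i] == "assistant"
]

print(role_counts)
print(f"elapsed: {elapsed:.2f}s")
print([f"{w:.2f}s" for w in assistant_waits])
```

With a live branch, the same counts should be obtainable directly from the DataFrame, for example with `branch.to_df()["role"].value_counts()`.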
