Important
Check out our newly released prediction engine: MiroFish - A Simple and Universal Swarm Intelligence Engine for Predicting Everything
The "Data Analysis Three-Step Approach" is now fully connected: We are excited to announce the official release of MiroFish! With the final piece of the puzzle in place, we have built a complete pipeline from BettaFish (data collection and analysis) to MiroFish (panoramic prediction). The closed loop from raw data to intelligent decision-making is now complete, making it possible to foresee the future!
"BettaFish" is an innovative multi-agent public opinion analysis system built from scratch. It helps break information cocoons, restore the original public sentiment, predict future trends, and assist decision-making. Users only need to raise analysis needs like chatting; the agents automatically analyze 30+ mainstream social platforms at home and abroad and millions of public comments.
Betta is a small yet combative and beautiful fish, symbolizing "small but powerful, fearless of challenges".
See the system-generated research report on "Wuhan University Public Opinion": In-depth Analysis Report on Wuhan University's Brand Reputation
See a complete system run example on "Wuhan University Public Opinion": Video - In-depth Analysis Report on Wuhan University's Brand Reputation
Beyond report quality alone, compared to similar products we have six major advantages:

- AI-Driven Comprehensive Monitoring: AI crawler clusters operate 24/7 non-stop, covering 10+ key domestic and international social media platforms including Weibo, Xiaohongshu, TikTok, and Kuaishou. They not only capture trending content in real time but also drill down into massive volumes of user comments, letting you hear the most authentic and widespread public voice.
- Composite Analysis Engine Beyond a Single LLM: We rely not only on 5 types of professionally designed Agents but also integrate middleware such as fine-tuned models and statistical models. Multi-model collaboration ensures the depth, accuracy, and multi-dimensional perspective of the analysis results.
- Powerful Multimodal Capabilities: Going beyond text and images, the system can deeply analyze short-video content from TikTok, Kuaishou, and others, and precisely extract structured multimodal information cards (weather, calendar, stocks, etc.) from modern search engines, giving you comprehensive command of public opinion dynamics.
- Agent "Forum" Collaboration Mechanism: Each Agent is endowed with a unique toolset and thinking pattern, and a debate-moderator model is introduced so that chains of thought collide and are debated through the "forum" mechanism (see the sketch after this list). This avoids both the blind spots of single models and the homogenization caused by unchecked communication, catalyzing higher-quality collective intelligence and decision support.
- Seamless Integration of Public and Private Domain Data: The platform not only analyzes public opinion but also provides high-security interfaces for seamlessly integrating your internal business databases with public opinion data, breaking down data barriers and delivering "external trends + internal insights" analysis for vertical businesses.
- Lightweight and Highly Extensible Framework: A pure-Python modular design enables lightweight, one-click deployment. The clear code structure lets developers easily integrate custom models and business logic for rapid platform expansion and deep customization.
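To make the forum mechanism concrete, here is a minimal, hypothetical sketch of the pattern (not the actual ForumEngine code; the file path and function names are invented for illustration): each Agent appends findings to a shared log, and a moderator LLM periodically reads the new entries and posts guidance that the agents read back through a forum_reader-style tool.

```python
# Hypothetical sketch of the forum collaboration pattern (not the real ForumEngine API).
import json, time
from pathlib import Path

FORUM_LOG = Path("logs/forum.jsonl")  # shared, append-only forum log (path is illustrative)

def post(author: str, content: str) -> None:
    """Append one forum message as a JSON line."""
    with FORUM_LOG.open("a", encoding="utf-8") as f:
        f.write(json.dumps({"ts": time.time(), "author": author, "content": content},
                           ensure_ascii=False) + "\n")

def read_since(offset: int) -> tuple[list[dict], int]:
    """Read messages appended after a byte offset, returning them and the new offset."""
    with FORUM_LOG.open("r", encoding="utf-8") as f:
        f.seek(offset)
        messages = [json.loads(line) for line in f]
        return messages, f.tell()

def moderator_round(llm, offset: int) -> int:
    """One moderator pass: summarize fresh agent messages into posted guidance."""
    messages, offset = read_since(offset)
    if messages:
        digest = "\n".join(f'{m["author"]}: {m["content"]}' for m in messages)
        guidance = llm(f"As the debate moderator, point out gaps and assign next steps:\n{digest}")
        post("HOST", guidance)
    return offset
```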
Starting with public opinion, but not limited to public opinion. The goal of "WeiYu" is to become a simple and universal data analysis engine that drives all business scenarios.
For example, simply modifying the API parameters and prompts of the Agent toolsets transforms it into a financial market analysis system, as illustrated in the sketch below.
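As a hedged illustration of how small that change can be (the constant names and values below are invented for this example; in the repo you would edit the corresponding entries under each Engine's prompts/ and tools/ directories):

```python
# Hypothetical illustration: repurposing an Agent toolset for financial analysis.
# These constant names are invented for this example, not the repo's actual identifiers.

SEARCH_DOMAIN_HINT = "earnings OR stock OR analyst rating"   # was: social-media trending keywords
SYSTEM_PROMPT = (
    "You are a financial market analyst. Track price-moving events, "
    "earnings releases, and analyst sentiment for the target company."
    # was: "You are a public opinion analyst. Track trending topics and comment sentiment."
)
SEARCH_API_PARAMS = {"category": "finance", "recency_days": 7}  # was: {"category": "social"}
```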
Here's a relatively active Linux.do project discussion thread: https://linux.do/t/topic/1009280
Check out a comparison by a Linux.do community member: Open Source Project (BettaFish) vs manus|minimax|ChatGPT Comparison
Say goodbye to traditional data dashboards. In "WeiYu", everything starts with a simple question: just describe your analysis needs as you would in a conversation.
Provider of core agent capabilities including AI web search, file parsing, and web content scraping: 
Anspire Open is a leading infrastructure provider for the agent era. We offer developers the core capability stack needed to build powerful agents. Currently available services include AI web search (multiple versions, highly competitive pricing), file parsing (limited-time free), web content scraping (limited-time free), cloud browser automation (Anspire Browser Agent, in beta), multi-turn rewriting, and more. We continue to provide a solid foundation for agents to connect and operate in complex digital worlds. Seamlessly integrates with mainstream agent platforms such as Dify, Coze, and Yuanqi. Through a transparent credit-based billing system and modular design, we provide enterprises with efficient, low-cost customized support to accelerate intelligent transformation.
Enterprise-grade global proxy IP and Scraper API solution provider. Register now to claim a 1 GB free trial:
Thordata helps enterprises easily access publicly available web data through highly reliable proxy networks and automated scraping solutions, with a commitment to 99.9% uptime and 99.7% success rate.
- Insight Agent (Private Database Mining): AI agent for in-depth analysis of private public opinion databases
- Media Agent (Multimodal Content Analysis): AI agent with powerful multimodal capabilities
- Query Agent (Precise Information Search): AI agent with domestic and international web search capabilities
- Report Agent (Intelligent Report Generation): Multi-round report generation AI agent with built-in templates
| Step | Phase Name | Main Operations | Participating Components | Cycle Nature |
|---|---|---|---|---|
| 1 | User Query | Flask main application receives the query | Flask Main Application | - |
| 2 | Parallel Launch | Three Agents start working simultaneously | Query Agent, Media Agent, Insight Agent | - |
| 3 | Preliminary Analysis | Each Agent uses dedicated tools for overview search | Each Agent + Dedicated Toolsets | - |
| 4 | Strategy Formulation | Develop segmented research strategies based on preliminary results | Internal Decision Modules of Each Agent | - |
| 5-N | Iterative Phase | Forum Collaboration + In-depth Research | ForumEngine + All Agents | Multi-round cycles |
| 5.1 | In-depth Research | Each Agent conducts specialized search guided by forum host | Each Agent + Reflection Mechanisms + Forum Guidance | Each cycle |
| 5.2 | Forum Collaboration | ForumEngine monitors Agent communications and generates host guidance | ForumEngine + LLM Host | Each cycle |
| 5.3 | Communication Integration | Each Agent adjusts research directions based on discussions | Each Agent + forum_reader tool | Each cycle |
| N+1 | Result Integration | Report Agent collects all analysis results and forum content | Report Agent | - |
| N+2 | IR Intermediate Representation | Dynamically select templates and styles, generate metadata through multiple rounds, assemble into IR intermediate representation | Report Agent + Template Engine | - |
| N+3 | Report Generation | Perform quality checks on chunks, render into interactive HTML report based on IR | Report Agent + Stitching Engine | - |
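Read as code, the table above is a simple orchestration loop. The sketch below is a hedged approximation of that control flow (the agent and forum interfaces are stand-ins, not the repo's actual API):

```python
# Hedged sketch of the analysis pipeline in the table above (names are stand-ins).
from concurrent.futures import ThreadPoolExecutor

def run_pipeline(query: str, agents: list, forum, report_agent, max_rounds: int = 3):
    with ThreadPoolExecutor(max_workers=len(agents)) as pool:
        # Steps 2-4: all agents start in parallel, do an overview search,
        # and derive their own segmented research strategies.
        list(pool.map(lambda a: a.preliminary_analysis(query), agents))

        # Steps 5-N: iterative rounds of in-depth research plus forum collaboration.
        for _ in range(max_rounds):
            list(pool.map(lambda a: a.deep_research(forum.latest_guidance()), agents))
            forum.moderate()  # 5.2: LLM host reads agent logs and posts guidance
            for a in agents:
                a.adjust_direction(forum.read_new_messages())  # 5.3: forum_reader step

    # Steps N+1..N+3: the Report Agent integrates results, builds the IR,
    # and renders the interactive HTML report.
    ir = report_agent.build_ir([a.results() for a in agents], forum.transcript())
    return report_agent.render_html(ir)
```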
```
BettaFish/
├── QueryEngine/                       # Domestic and international news breadth search Agent
│   ├── agent.py                       # Agent main logic, coordinates search and analysis workflow
│   ├── llms/                          # LLM interface wrapper
│   ├── nodes/                         # Processing nodes: search, formatting, summarization, etc.
│   ├── tools/                         # Domestic and international news search toolkit
│   ├── utils/                         # Utility functions
│   ├── state/                         # State management
│   ├── prompts/                       # Prompt templates
│   └── ...
├── MediaEngine/                       # Powerful multimodal understanding Agent
│   ├── agent.py                       # Agent main logic, handles video/image multimodal content
│   ├── llms/                          # LLM interface wrapper
│   ├── nodes/                         # Processing nodes: search, formatting, summarization, etc.
│   ├── tools/                         # Multimodal search toolkit
│   ├── utils/                         # Utility functions
│   ├── state/                         # State management
│   ├── prompts/                       # Prompt templates
│   └── ...
├── InsightEngine/                     # Private database mining Agent
│   ├── agent.py                       # Agent main logic, coordinates database queries and analysis
│   ├── llms/                          # LLM interface wrapper
│   │   └── base.py                    # Unified OpenAI-compatible client
│   ├── nodes/                         # Processing nodes: search, formatting, summarization, etc.
│   │   ├── base_node.py               # Base node class
│   │   ├── search_node.py             # Search node
│   │   ├── formatting_node.py         # Formatting node
│   │   ├── report_structure_node.py   # Report structure node
│   │   └── summary_node.py            # Summary node
│   ├── tools/                         # Database query and analysis toolkit
│   │   ├── keyword_optimizer.py       # Qwen keyword optimization middleware
│   │   ├── search.py                  # Database operation toolkit (topic search, comment retrieval, etc.)
│   │   └── sentiment_analyzer.py      # Sentiment analysis integration tool
│   ├── utils/                         # Utility functions
│   │   ├── config.py                  # Configuration management
│   │   ├── db.py                      # SQLAlchemy async engine + read-only query wrapper
│   │   └── text_processing.py         # Text processing utilities
│   ├── state/                         # State management
│   │   └── state.py                   # Agent state definition
│   ├── prompts/                       # Prompt templates
│   │   └── prompts.py                 # Various prompt templates
│   └── __init__.py
├── ReportEngine/                      # Multi-round report generation Agent
│   ├── agent.py                       # Master orchestrator: template selection → layout → budget → chapter → render
│   ├── flask_interface.py             # Flask/SSE entry point, manages task queuing and streaming events
│   ├── llms/                          # OpenAI-compatible LLM wrappers
│   │   └── base.py                    # Unified streaming/retry client
│   ├── core/                          # Core functionality: template parsing, chapter storage, document stitching
│   │   ├── template_parser.py         # Markdown template slicer and slug generator
│   │   ├── chapter_storage.py         # Chapter run directory, manifest, and raw stream writer
│   │   └── stitcher.py                # Document IR stitcher, adds anchors/metadata
│   ├── ir/                            # Report Intermediate Representation (IR) contract & validation
│   │   ├── schema.py                  # Block/mark schema constant definitions
│   │   └── validator.py               # Chapter JSON structure validator
│   ├── graphrag/                      # GraphRAG graph building and search
│   │   ├── graph_builder.py           # Build graph from state JSON + forum logs
│   │   ├── graph_storage.py           # Graph object manager and graphrag.json I/O
│   │   ├── query_engine.py            # Keyword/type/depth query entry
│   │   └── ...                        # Forum/state parsers, prompts, etc.
│   ├── nodes/                         # Full workflow reasoning nodes
│   │   ├── base_node.py               # Node base class + logging/state hooks
│   │   ├── template_selection_node.py # Template candidate collection and LLM selection
│   │   ├── document_layout_node.py    # Title/TOC/theme designer
│   │   ├── word_budget_node.py        # Word budget planning and chapter directive generation
│   │   └── chapter_generation_node.py # Chapter-level JSON generation + validation
│   ├── prompts/                       # Prompt library and schema descriptions
│   │   └── prompts.py                 # Template selection/layout/budget/chapter prompts
│   ├── renderers/                     # IR renderers
│   │   ├── html_renderer.py           # Document IR → interactive HTML
│   │   ├── pdf_renderer.py            # HTML → PDF export (WeasyPrint)
│   │   ├── pdf_layout_optimizer.py    # PDF layout optimizer
│   │   └── chart_to_svg.py            # Chart to SVG conversion tool
│   ├── state/                         # Task/metadata state models
│   │   └── state.py                   # ReportState and serialization utilities
│   ├── utils/                         # Configuration and helper utilities
│   │   ├── config.py                  # Pydantic settings + printer helper
│   │   ├── dependency_check.py        # Dependency checking tool
│   │   ├── json_parser.py             # JSON parsing utilities
│   │   ├── chart_validator.py         # Chart validation tool
│   │   └── chart_repair_api.py        # Chart repair API
│   ├── report_template/               # Markdown template library
│   │   ├── 企业品牌声誉分析报告.md     # Enterprise brand reputation analysis report template
│   │   └── ...
│   └── __init__.py
├── ForumEngine/                       # Forum engine: Agent collaboration mechanism
│   ├── monitor.py                     # Log monitoring and forum management core
│   ├── llm_host.py                    # Forum moderator LLM module
│   └── __init__.py
├── MindSpider/                        # Social media crawler system
│   ├── main.py                        # Crawler main program entry
│   ├── config.py                      # Crawler configuration file
│   ├── BroadTopicExtraction/          # Topic extraction module
│   │   ├── main.py                    # Topic extraction main program
│   │   ├── database_manager.py        # Database manager
│   │   ├── get_today_news.py          # Today's news fetcher
│   │   └── topic_extractor.py         # Topic extractor
│   ├── DeepSentimentCrawling/         # Deep sentiment crawling module
│   │   ├── main.py                    # Deep crawling main program
│   │   ├── keyword_manager.py         # Keyword manager
│   │   ├── platform_crawler.py        # Platform crawler manager
│   │   └── MediaCrawler/              # Media crawler core
│   │       ├── main.py
│   │       ├── config/                # Platform configurations
│   │       ├── media_platform/        # Platform crawler implementations
│   │       └── ...
│   └── schema/                        # Database schema definitions
│       ├── db_manager.py              # Database manager
│       ├── init_database.py           # Database initialization script
│       ├── mindspider_tables.sql      # Database table structure SQL
│       ├── models_bigdata.py          # SQLAlchemy mappings for large-scale media opinion tables
│       └── models_sa.py               # ORM models for DailyTopic/Task extension tables
├── SentimentAnalysisModel/            # Sentiment analysis model collection
│   ├── WeiboSentiment_Finetuned/      # Fine-tuned BERT/GPT-2 models
│   │   ├── BertChinese-Lora/          # BERT Chinese LoRA fine-tuning
│   │   │   ├── train.py
│   │   │   ├── predict.py
│   │   │   └── ...
│   │   └── GPT2-Lora/                 # GPT-2 LoRA fine-tuning
│   │       ├── train.py
│   │       ├── predict.py
│   │       └── ...
│   ├── WeiboMultilingualSentiment/    # Multilingual sentiment analysis
│   │   ├── train.py
│   │   ├── predict.py
│   │   └── ...
│   ├── WeiboSentiment_SmallQwen/      # Small-parameter Qwen3 fine-tuning
│   │   ├── train.py
│   │   ├── predict_universal.py
│   │   └── ...
│   └── WeiboSentiment_MachineLearning/ # Traditional machine learning methods
│       ├── train.py
│       ├── predict.py
│       └── ...
├── SingleEngineApp/                   # Individual Agent Streamlit applications
│   ├── query_engine_streamlit_app.py  # QueryEngine standalone app
│   ├── media_engine_streamlit_app.py  # MediaEngine standalone app
│   └── insight_engine_streamlit_app.py # InsightEngine standalone app
├── query_engine_streamlit_reports/    # QueryEngine standalone app outputs
├── media_engine_streamlit_reports/    # MediaEngine standalone app outputs
├── insight_engine_streamlit_reports/  # InsightEngine standalone app outputs
├── templates/                         # Flask frontend templates
│   └── index.html                     # Main interface HTML
├── static/                            # Static resources
│   ├── image/                         # Image resources
│   │   └── ...
│   ├── Partial README for PDF Exporting/ # PDF export dependency setup guide
│   ├── v2_report_example/             # Report rendering examples
│   └── report_all_blocks_demo/        # Full block types demo (HTML/PDF/MD)
├── logs/                              # Runtime log directory
├── final_reports/                     # Final generated report files
│   ├── ir/                            # Report IR JSON files
│   └── *.html                         # Final HTML reports
├── utils/                             # Common utility functions
│   ├── forum_reader.py                # Agent inter-communication forum tool
│   ├── github_issues.py               # Unified GitHub issue link generator and error formatter
│   └── retry_helper.py                # Network request retry mechanism utility
├── tests/                             # Unit tests and integration tests
│   ├── run_tests.py                   # pytest entry script
│   ├── test_monitor.py                # ForumEngine monitoring unit tests
│   ├── test_report_engine_sanitization.py # ReportEngine security tests
│   └── ...
├── app.py                             # Flask main application entry point
├── config.py                          # Global configuration file
├── .env.example                       # Environment variable example file
├── docker-compose.yml                 # Docker multi-service orchestration config
├── Dockerfile                         # Docker image build file
├── requirements.txt                   # Python dependency list
├── regenerate_latest_html.py          # Rebind latest chapters and render HTML
├── regenerate_latest_md.py            # Rebind latest chapters and render Markdown
├── regenerate_latest_pdf.py           # PDF regeneration utility script
├── report_engine_only.py              # Report Engine CLI version
├── README.md                          # Chinese documentation
├── README-EN.md                       # English documentation
├── CONTRIBUTING.md                    # Chinese contribution guide
├── CONTRIBUTING-EN.md                 # English contribution guide
└── LICENSE                            # GPL-2.0 open source license
```
Run Command: Execute the following command to start all services in the background:

```bash
docker compose up -d
```

Note: If image pulls are slow, the docker-compose.yml file includes alternative mirror registry addresses as comments; replace the defaults with them as needed.
Configure the database connection information with the following parameters. The system also supports MySQL, so you can adjust the settings as needed:
| Configuration Item | Value to Use | Description |
|---|---|---|
| `DB_HOST` | `db` | Database service name (as defined in docker-compose.yml) |
| `DB_PORT` | `5432` | Default PostgreSQL port |
| `DB_USER` | `bettafish` | Database username |
| `DB_PASSWORD` | `bettafish` | Database password |
| `DB_NAME` | `bettafish` | Database name |
| Others | Keep defaults | Keep other parameters, such as database connection pool settings, at their default values |
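If you want to sanity-check these values before using the full stack, here is a minimal connectivity probe using SQLAlchemy (the project already uses SQLAlchemy elsewhere; a PostgreSQL driver such as psycopg2 is assumed to be installed). Note that the `db` hostname only resolves inside the Docker Compose network, so run this from within a container or substitute `localhost` and a published port:

```python
# Minimal sketch: verify the database settings above (illustrative, not part of the repo).
from sqlalchemy import create_engine, text

# DSN mirrors the table: user/password/db "bettafish" on host "db", port 5432
engine = create_engine("postgresql://bettafish:bettafish@db:5432/bettafish")
with engine.connect() as conn:
    # SELECT 1 round-trips to the server; prints 1 if the connection works
    print(conn.execute(text("SELECT 1")).scalar())
```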
All LLM calls use the OpenAI API interface standard. After you finish the database configuration, continue to configure all LLM-related parameters so the system can connect to your selected LLM service.
Once you complete and save the configurations above, the system will be ready to run normally.
If you are new to building Agent systems, you can start with a very simple demo: Deep Search Agent Demo
- Operating System: Windows, Linux, macOS
- Python Version: 3.9+
- Conda: Anaconda or Miniconda
- Database: PostgreSQL (recommended) or MySQL
- Memory: 2GB+ recommended
```bash
# Create conda environment
conda create -n your_conda_name python=3.11
conda activate your_conda_name
```

```bash
# Create uv environment (Python 3.11)
uv venv --python 3.11
```

This section contains detailed configuration instructions: Configure the dependencies
If Step 2 is skipped, the WeasyPrint library may not install correctly, and the PDF functionality may be unavailable.
```bash
# Basic dependency installation
pip install -r requirements.txt

# uv version command (faster installation)
uv pip install -r requirements.txt
# If you do not want to use the local sentiment analysis model (which has low computational
# requirements and defaults to the CPU version), you can comment out the 'Machine Learning'
# section in this file before executing the command.
```

```bash
# Install browser drivers (for crawler functionality)
playwright install chromium
```

Copy the `.env.example` file in the project root directory and rename it to `.env`. Edit the `.env` file and fill in your API keys (you can also choose your own models and search proxies; see `.env.example` in the project root directory or `config.py` for details):
```bash
# ====================== Database Configuration ======================
# Database host, e.g., localhost or 127.0.0.1
DB_HOST=your_db_host
# Database port number, e.g., 3306 for MySQL or 5432 for PostgreSQL
DB_PORT=3306
# Database username
DB_USER=your_db_user
# Database password
DB_PASSWORD=your_db_password
# Database name
DB_NAME=your_db_name
# Database character set, utf8mb4 is recommended for emoji compatibility
DB_CHARSET=utf8mb4
# Database type: postgresql or mysql
DB_DIALECT=postgresql
# Database initialization is not required; it is checked automatically when app.py runs

# ====================== LLM Configuration ======================
# You can switch each Engine's LLM provider as long as it follows the OpenAI-compatible request format
# The configuration file provides recommended LLMs for each Agent. For initial deployment, please refer to the recommended settings first

# Insight Agent
INSIGHT_ENGINE_API_KEY=
INSIGHT_ENGINE_BASE_URL=
INSIGHT_ENGINE_MODEL_NAME=

# Media Agent
...
```

conda version startup command:

```bash
# In project root directory, activate conda environment
conda activate your_conda_name

# Start main application
python app.py
```

uv version startup command:

```bash
# In project root directory, activate uv environment
.venv\Scripts\activate

# Start main application
python app.py
```

Note 1: After a run is terminated, the Streamlit apps might not shut down correctly and may still occupy their ports. If this occurs, find the process holding the port and kill it.
Note 2: Data scraping needs to be performed as a separate operation. Please refer to the instructions in section 5.3.
Visit http://localhost:5000 to use the complete system
```bash
# Start QueryEngine
streamlit run SingleEngineApp/query_engine_streamlit_app.py --server.port 8503

# Start MediaEngine
streamlit run SingleEngineApp/media_engine_streamlit_app.py --server.port 8502

# Start InsightEngine
streamlit run SingleEngineApp/insight_engine_streamlit_app.py --server.port 8501
```

This section has detailed configuration documentation: MindSpider Usage Guide
```bash
# Enter crawler directory
cd MindSpider

# Project initialization
python main.py --setup

# Run topic extraction (get hot news and keywords)
python main.py --broad-topic

# Run complete crawler workflow
python main.py --complete --date 2024-01-20

# Run topic extraction only
python main.py --broad-topic --date 2024-01-20

# Run deep crawling only
python main.py --deep-sentiment --platforms xhs dy wb
```

This tool bypasses the execution phase of all three analysis engines, directly loads their most recent log files, and generates a consolidated report without the Web interface (it also skips the incremental file-validation steps). By default it produces a Markdown copy after the PDF (toggle via a CLI flag). It is typically used for rapid retries when a report turns out unsatisfactory, or when debugging the Report Engine.
```bash
# Basic usage (automatically extract topic from filename)
python report_engine_only.py

# Specify report topic
python report_engine_only.py --query "Civil Engineering Industry Analysis"

# Skip PDF generation (even if the system supports it)
python report_engine_only.py --skip-pdf

# Skip Markdown generation
python report_engine_only.py --skip-markdown

# Show verbose logging
python report_engine_only.py --verbose

# Turn on GraphRAG via CLI and adjust the query cap (falls back to .env; default off)
python report_engine_only.py --graphrag-enabled true --graphrag-max-queries 3

# Show help information
python report_engine_only.py --help
```

Features:
- Automatic Dependency Check: The program automatically checks the system dependencies required for PDF generation and provides installation instructions if any are missing
- Get Latest Files: Automatically retrieves the latest analysis reports from the three engine directories (`insight_engine_streamlit_reports`, `media_engine_streamlit_reports`, `query_engine_streamlit_reports`)
- File Confirmation: Displays all selected file names, paths, and modification times, then waits for user confirmation (enter `y` to continue or `n` to exit; defaults to `y`)
- Direct Report Generation: Skips file-addition verification and directly calls the Report Engine to generate a comprehensive report
- Automatic File Saving:
  - HTML reports saved to the `final_reports/` directory
  - PDF reports (if dependencies are available) saved to the `final_reports/pdf/` directory
  - Markdown reports (disable with `--skip-markdown`) saved to the `final_reports/md/` directory
  - File naming format: `final_report_{topic}_{timestamp}.html/pdf/md`
- Optional GraphRAG: CLI flags override `.env`; if neither is set, the feature stays off
Notes:

- Ensure at least one of the three engine directories contains `.md` report files
- The command-line tool is independent of the Web interface; the two do not interfere with each other
- PDF generation requires system dependencies; see the "Install PDF Export System Dependencies" section above
Quickly re-render the latest outputs:

- `regenerate_latest_html.py` / `regenerate_latest_md.py`: Re-stitch the latest chapter JSON from `CHAPTER_OUTPUT_DIR` into a Document IR and render it directly to HTML or Markdown.
- `regenerate_latest_pdf.py`: Read the newest IR under `final_reports/ir` and re-export a PDF with SVG vector charts.
⚙️ Advanced Configuration (Deprecated: configuration has been unified into the `.env` file in the project root directory, and the sub-agents automatically inherit the root configuration)
Each agent has dedicated configuration files that can be adjusted according to needs:
```python
# QueryEngine/utils/config.py
class Config:
    max_reflections = 2        # Reflection rounds
    max_search_results = 15    # Maximum search results
    max_content_length = 8000  # Maximum content length
```

```python
# MediaEngine/utils/config.py
class Config:
    comprehensive_search_limit = 10  # Comprehensive search limit
    web_search_limit = 15            # Web search limit
```

```python
# InsightEngine/utils/config.py
class Config:
    default_search_topic_globally_limit = 200  # Global search limit
    default_get_comments_limit = 500           # Comment retrieval limit
    max_search_results_for_llm = 50            # Max results for LLM
```

```python
# InsightEngine/tools/sentiment_analyzer.py
SENTIMENT_CONFIG = {
    'model_type': 'multilingual',   # Options: 'bert', 'multilingual', 'qwen'
    'confidence_threshold': 0.8,    # Confidence threshold
    'batch_size': 32,               # Batch size
    'max_sequence_length': 512,     # Max sequence length
}
```
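As a hedged illustration of how the `confidence_threshold` in this config can gate predictions (the `_predict` stub below stands in for a real model call; the actual entry points are the `predict*.py` scripts shown later in this section):

```python
# Illustrative only: config-driven sentiment dispatch with a confidence gate.
SENTIMENT_CONFIG = {
    'model_type': 'multilingual',   # which backend to dispatch to
    'confidence_threshold': 0.8,    # predictions below this are flagged for review
}

def _predict(model_type: str, text: str) -> tuple[str, float]:
    """Stub standing in for the fine-tuned model call."""
    return ("positive", 0.93)  # a real implementation would load and run the chosen model

def analyze(texts):
    threshold = SENTIMENT_CONFIG['confidence_threshold']
    model = SENTIMENT_CONFIG['model_type']
    results = []
    for t in texts:
        label, conf = _predict(model, t)
        # Keep the label but mark low-confidence predictions instead of trusting them blindly
        results.append({"text": t, "label": label, "confident": conf >= threshold})
    return results

print(analyze(["This product is amazing!"]))
```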
The system supports any LLM provider that follows the OpenAI request format; you only need to fill in KEY, BASE_URL, and MODEL_NAME in config.py.

What is the OpenAI request format? Here's a simple example:

```python
from openai import OpenAI

client = OpenAI(api_key="your_api_key", base_url="https://aihubmix.com/v1")

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {'role': 'user', 'content': "What new opportunities will reasoning models bring to the market?"}
    ],
)

complete_response = response.choices[0].message.content
print(complete_response)
```
The system integrates multiple sentiment analysis methods, selectable based on needs:
```bash
# Multilingual sentiment analysis
cd SentimentAnalysisModel/WeiboMultilingualSentiment
python predict.py --text "This product is amazing!" --lang "en"
```

```bash
# Small-parameter Qwen3 fine-tuned model
cd SentimentAnalysisModel/WeiboSentiment_SmallQwen
python predict_universal.py --text "This event was very successful"
```

```bash
# Use BERT Chinese model
cd SentimentAnalysisModel/WeiboSentiment_Finetuned/BertChinese-Lora
python predict.py --text "This product is really great"
```

```bash
# GPT-2 LoRA fine-tuned model
cd SentimentAnalysisModel/WeiboSentiment_Finetuned/GPT2-Lora
python predict.py --text "I'm not feeling great today"
```

```bash
# Traditional machine learning methods (e.g., SVM)
cd SentimentAnalysisModel/WeiboSentiment_MachineLearning
python predict.py --model_type "svm" --text "Service attitude needs improvement"
```
```python
# Add your business database configuration in config.py
BUSINESS_DB_HOST = "your_business_db_host"
BUSINESS_DB_PORT = 3306
BUSINESS_DB_USER = "your_business_user"
BUSINESS_DB_PASSWORD = "your_business_password"
BUSINESS_DB_NAME = "your_business_database"
```

```python
# InsightEngine/tools/custom_db_tool.py
class CustomBusinessDBTool:
    """Custom business database query tool"""

    def __init__(self):
        self.connection_config = {
            'host': config.BUSINESS_DB_HOST,
            'port': config.BUSINESS_DB_PORT,
            'user': config.BUSINESS_DB_USER,
            'password': config.BUSINESS_DB_PASSWORD,
            'database': config.BUSINESS_DB_NAME,
        }

    def search_business_data(self, query: str, table: str):
        """Query business data"""
        # Implement your business logic
        pass

    def get_customer_feedback(self, product_id: str):
        """Get customer feedback data"""
        # Implement customer feedback query logic
        pass
```

```python
# Integrate custom tools in InsightEngine/agent.py
from .tools.custom_db_tool import CustomBusinessDBTool

class DeepSearchAgent:
    def __init__(self, config=None):
        # ... other initialization code
        self.custom_db_tool = CustomBusinessDBTool()

    def execute_custom_search(self, query: str):
        """Execute custom business data search"""
        return self.custom_db_tool.search_business_data(query, "your_table")
```

The system supports uploading custom template files (.md or .txt format), selectable when generating reports.
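A short usage sketch for the integration above (hedged: it assumes the stub methods are filled in; the class and method names come from the snippets in this section, while the query string is made up):

```python
# Hypothetical usage of the custom business-data tool (assumes the stubs above are implemented).
from InsightEngine.agent import DeepSearchAgent

agent = DeepSearchAgent()
# Query internal business data alongside the public-opinion workflow
rows = agent.execute_custom_search("after-sales complaints in Q3")
print(rows)
```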
Create new templates in the ReportEngine/report_template/ directory, and our Agent will automatically select the most appropriate template.
We welcome all forms of contributions!
Please read the following contribution guidelines:
The system has now completed the final prediction step! Visit 【MiroFish - Predict Everything】: https://github.com/666ghj/MiroFish
Important Notice: This project is for educational, academic research, and learning purposes only
- Compliance Statement:
  - All code, tools, and functionalities in this project are intended solely for educational, academic research, and learning purposes
  - Commercial use or profit-making activities are strictly prohibited
  - Any illegal, non-compliant, or rights-infringing activities are strictly prohibited
- Web Scraping Disclaimer:
  - The web scraping functionality in this project is intended only for technical learning and research purposes
  - Users must comply with the target websites' robots.txt protocols and terms of use
  - Users must comply with relevant laws and regulations and must not engage in malicious scraping or data abuse
  - Users are solely responsible for any legal consequences arising from the use of the web scraping functionality
- Data Usage Disclaimer:
  - The data analysis functionality in this project is intended only for academic research purposes
  - Using analysis results for commercial decision-making or profit-making purposes is strictly prohibited
  - Users should ensure the legality and compliance of the data being analyzed
- Technical Disclaimer:
  - This project is provided "as is" without any express or implied warranties
  - The authors are not responsible for any direct or indirect losses caused by the use of this project
  - Users should independently evaluate the applicability and risks of this project
- Liability Limitation:
  - Users should fully understand relevant laws and regulations before using this project
  - Users should ensure their usage complies with local legal and regulatory requirements
  - Users are solely responsible for any consequences arising from illegal use of this project
Please carefully read and understand the above disclaimer before using this project. Using this project indicates that you have agreed to and accepted all the above terms.
This project is licensed under the GPL-2.0 License. Please see the LICENSE file for details.
FAQ: 666ghj#185
- Project Homepage: GitHub Repository
- Issue Reporting: Issues Page
- Feature Requests: Discussions Page
- ๐ง Email: hangjiang@bupt.edu.cn
- Enterprise Custom Development
- Big Data Services
- Academic Collaboration
- Technical Training
Thanks to these excellent contributors:




