This project focuses on analyzing data from the Human Gut Microbiome Project and training AI models to extract insights and make predictions based on microbiome data.
├── data/
│ ├── raw/ # Raw data from the Human Gut Microbiome Project
│ └── processed/ # Processed data ready for analysis and modeling
├── notebooks/ # Jupyter notebooks for exploration and visualization
├── src/
│ ├── data/ # Scripts for data downloading and processing
│ ├── eda/ # Scripts for exploratory data analysis
│ ├── features/ # Scripts for feature engineering and preprocessing
│ ├── models/ # Scripts for model training and evaluation
│ └── utils/ # Utility functions used across the project
└── requirements.txt # Python dependencies
-
Create a virtual environment:
python -m venv venv venv\Scripts\activate # Windows -
Install dependencies:
pip install -r requirements.txt -
Download the data (see data/README.md for details)
The Human Gut Microbiome Project (HMP) provides a wealth of data on the microorganisms that live in the human gut. This project uses data from:
- Exploratory Data Analysis (EDA)
- Feature Engineering and Preprocessing
- Model Training and Evaluation
- Interpretation and Visualization
This project explores various AI models for microbiome data analysis:
- Classification models for disease prediction
- Regression models for quantitative trait prediction
- Clustering models for microbiome community identification
- Deep learning models for complex pattern recognition
A Streamlit chat application that allows users to ask questions about biomedicine and gut microbiome research. The application uses SmolAgents to create an intelligent agent that can answer questions directly or search PubMed for relevant research papers.
- Interactive chat interface for asking questions about biomedicine and gut microbiome research
- Intelligent agent that decides whether to answer directly or search for research papers
- PubMed integration for searching and retrieving scientific articles
- Ability to extract article abstracts, citation counts, and links
- Balanced presentation of research that both supports and challenges user hypotheses
- Clone this repository
- Install the required dependencies:
pip install -r requirements.txt- Update the email address in
pubmed_tools.pywith your valid email (required by NCBI):
Entrez.email = "your_email@example.com" # Replace with a valid email- Run the Streamlit app:
streamlit run app.py-
Open your web browser and navigate to the URL displayed in the terminal (usually http://localhost:8501)
-
Start asking questions about biomedicine and gut microbiome research!
- "What is the gut microbiome?"
- "How does the gut microbiome affect mental health?"
- "What are the latest research findings on probiotics and gut health?"
- "Is there evidence linking gut bacteria to autoimmune diseases?"
- "What research exists on the relationship between diet and microbiome diversity?"
The application is built with the following components:
- Streamlit: Provides the web interface and chat functionality
- SmolAgents: Framework for creating intelligent agents with tool-using capabilities
- BiomedicalAgent: Custom agent implementation that specializes in biomedical research
- PubMed Tools: Custom tools for searching PubMed and extracting article information
- streamlit
- smol-agents
- biopython
- requests
- beautifulsoup4
- lxml