BioVix: An Integrated Large Language Model Framework for Data Visualization, Graph Interpretation, and Literature-Aware Scientific Validation
- overview
- Features
- Prerequisites
- BioVix Installation
- Running BioVix
- Outputs
- Deployment
- Tips for Success
- Reference
- License and Issues
- Authors and Contacts
BioVix is an AI-assisted visualization tool built on Streamlit that streamlines the workflow from data analysis to literature discovery. By integrating Plotly for visualization, DeepSeek V3.1 for query processing, and Semantic Scholar for bibliographic search, it offers a robust analytical environment. Furthermore, the system incorporates GPT-OSS-20B for structured dialogue and Qwen2.5-VL-32B-Instruct for visual graph reasoning, enabling users to gain a deeper understanding of data trends through natural language interaction.
- Data Upload & Management: Supports CSV, TSV, and Excel (.xlsx) formats.
- AI-Powered Chart Generation: Create interactive Plotly visualizations from natural language queries.
- AI Insights: Automatically analyze and provide explanations for generated charts.
- Academic Research Integration: Discover relevant research papers via Semantic Scholar.
- Graph Interpreter: Analyze uploaded graph images using AI.
- Data Q&A: Ask questions about your dataset and receive AI-driven answers.
- Sample Datasets: Access pre-loaded datasets for quick testing (Apple Stock, Gene Expression, Hospital Data).
- Python <3.9 and >=3.11
- API keys for:
git clone https://github.com/MuhammadZain-Butt/BioVix.git
cd BioVixIt is highly recommended to use a virtual environment to avoid dependency conflicts.
python -m venv env
env\Scripts\activateNote: Replace env_name with your preferred name for the virtual environment.
python3 -m venv env_name
source env_name/bin/activateNote: Replace env_name with your preferred name for the virtual environment.
pip install -r requirements.txtCreate a .env file in the project root directory (as in BioVix):
DEEPSEEK_API_KEY="your_deepseek_key_here"
GPT_API_KEY="your_gpt_key_here"
QWEN_API_KEY="your_qwen_key_here"
SEMANTIC_SCHOLAR_API_KEY="your_semantic_scholar_key_here"
Tip: If you do not have these API keys, you can create accounts here to generate them:
After installing dependencies and setting up your environment, you can start BioVix using Streamlit.
streamlit run app.pyOnce the command runs, the app will automatically open in your default browser at: http://localhost:8501
The following panels illustrate the outputs of BioVix across varying datasets. (A) displays the raw input data, while (B,C,D) presents the corresponding interactive visualization rendered with Plotly. (E) provides the AI-generated interpretation of the graph, along with the derived search query. Finally, (F) lists the relevant research papers retrieved from Semantic Scholar using the formulated query:
BioVix is deployed on Hugging Face and can be tested or used directly, click here
- Ensure that input files are correctly formatted (e.g., CSV, XLSX, or TSV) and contain all information required for visualization.
- Write queries in a clear and detailed manner, and avoid using informal language.
- Use consistent naming conventions for columns and variables to improve clarity and interpretation.
In Process.
This BioVix is licensed under the MIT License - see the LICENSE file for details. Submit issues or contributions via GitHub Issues.
Mr. Muhammad Zain Butt
Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad, 38000, Pakistan
Email: zain.202302328@gcuf.edu.pk
Mr. Rana Sheraz Ahmad
Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad, 38000, Pakistan
Email: ranasheraz.202101902@gcuf.edu.pk
Ms. Eman Fatima
Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad, 38000, Pakistan
Email: eman.202204127@gcuf.edu.pk
Dr. Muhammad Tahir ul Qamar (Correspondence)
Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad, 38000, Pakistan
Email: m.tahirulqamar@hotmail.com



