AIDx Model Development is an innovative repository focused on developing AI-driven models for healthcare applications. This project specializes in processing medical data, particularly from the MIMIC-IV database, and offers tools for model training and dataset management.
- Data Processing: Scripts for tokenizing and transforming healthcare data such as MIMIC-IV 🏥.
- Model Training & Fine-Tuning: Utilizes fine-tuning to create AI models on medical datasets 🧬.
- Dataset Management: Tools to create and handle datasets, including Parquet datasets, for healthcare AI 📊.
aidx.py
: The gateway to various functionalities like dataset creation and model tuning 🌐.- Docker Support: Facilitates hosting the Jupyter Notebook environment 🐳.
- Notebooks: Step-by-step guides in Jupyter notebooks for data processing and analysis 📓.
- Scripts: Python scripts for comprehensive processes including database interactions and data workflows 📜.
- Dependencies:
requirements.txt
for easy installation of necessary packages 📌.
- Google Drive and
rclone
for dataset storage- Make sure the
rclone.conf
file is in thescripts/utils
directory
- Make sure the
- MIMIC-IV Database with PostgreSQL
- Clone the repository.
- Set up the Docker environment, which hosts the Jupyter Notebook.
- Install dependencies from
requirements.txt
. - Run
python aidx.py
to access the command menu and start exploring the functionalities.