⚡️ MLE is an AI agent that uses LLMs (OpenAI, Anthropic, Ollama, etc.) to assist with machine learning development tasks. 🔥 Chat with your AI projects to build, reproduce, and optimize them. 🚀 Uses RAG, web search, tool calling, and more. 🍻 Integrates with many data/MLOps tools such as wandb and mlflow.

MLSysOps/MLE-agent

Kaia: A Pair Agent for AI Engineers and Researchers

💌 Fathers' love for Kaia 💌

Overview

MLE-Agent is a pair-programming LLM agent for machine learning engineers and researchers. It offers three major modes:

  • 🚀 Kaggle Mode helps you participate in Kaggle competitions, prepare submissions, and aim for a good score.
  • ☕ Baseline Mode quickly builds a baseline model for your AI project according to your requirements.
  • 🔥 Advanced Mode (Coming Soon) is designed to use your favorite MLOps tools, understand SOTA methods, and suggest optimizations for your machine learning projects.

Milestones

🚀 June 16th, 2024: Pre-released the Kaggle Mode (must be installed from source)

🚀 June 1st, 2024: Released the Baseline Mode (v0.1.0)

Get started

Installation

pip install mle-agent

Configuration

You must set up an LLM and choose tools before using the agent.

mle config

Usage (Baseline Mode)

Create a new project

mle new <project name>

A workspace named <project name> will be created in the directory where you run the mle new command.

Start a project

mle start

Note

  • Debugging on the cloud may incur high costs. Please make sure you have enough budget.
  • You can start a project from any path. The generated code and data will be stored in the target workspace.

Project-related operations

mle project ls # show all the available projects
mle project delete <project name> # delete a given project
mle project switch # switch the current working project
mle project show # show the status of the current project
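
The Baseline Mode commands above can be chained from a script. The following is a minimal sketch, not part of MLE-Agent itself: the helper names `baseline_commands` and `run_baseline` are hypothetical, and it assumes the `mle` subcommands behave exactly as listed in this README. By default it only prints the commands (a dry run), since `mle config` and `mle start` are assumed to be interactive.

```python
import shlex
import shutil
import subprocess


def baseline_commands(project_name: str) -> list[list[str]]:
    """Return the documented Baseline Mode commands, in order, as argv lists."""
    return [
        ["mle", "config"],             # set up an LLM and choose tools
        ["mle", "new", project_name],  # create a workspace in the current directory
        ["mle", "start"],              # start the project
        ["mle", "project", "show"],    # show the status of the current project
    ]


def run_baseline(project_name: str, dry_run: bool = True) -> list[str]:
    """Print each command; execute it only when dry_run is False."""
    if not dry_run and shutil.which("mle") is None:
        # Hypothetical guard: the CLI must already be installed.
        raise RuntimeError("mle not found on PATH; run `pip install mle-agent` first")
    printed = []
    for argv in baseline_commands(project_name):
        line = shlex.join(argv)  # shell-safe rendering of the command
        print(f"$ {line}")
        printed.append(line)
        if not dry_run:
            # stdin/stdout are inherited, so any interactive prompts still work.
            subprocess.run(argv, check=True)
    return printed
```

Calling `run_baseline("my-first-project")` prints the four commands without running anything. Passing `dry_run=False` executes them in order and stops at the first failure.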

Roadmap

The following is a list of the tasks we plan to work on. You are welcome to propose something new!

🔨 Plan, Generate, Execute and Debug Code
  • An easy-to-use CLI
  • Create/Select/Delete a project
  • Understand users' requirements to suggest the file name, dataset, task, model architecture, etc.
  • Generate a detailed coding plan
  • Write baseline model code
  • Execute the code on the local machine/cloud
  • Debug and revise the code
  • Search Google for the error message to debug the code
  • Data Augmentation
  • Hyperparameter tuning
  • Model evaluation
⭐ More LLMs and Serving Tools
  • Ollama Llama 2/3
  • OpenAI GPT-3.5
  • OpenAI GPT-4
  • Codellama
  • Codemitral
  • Anthropic Codex
💖 Better user experience
  • Web UI (coming soon)
  • Discord
🧩 Integrations
  • SkyPilot
  • Snowflake
  • Databricks
  • Wandb
  • MLflow
  • DBT
  • Airflow
🌐 Search Enhancement
  • Google
  • GitHub
  • HuggingFace
  • Papers with Code
  • arXiv

Contributing

We welcome contributions from the community. We are looking for contributors to help us with the following tasks:

  • Benchmark and Evaluate the agent
  • Add more features to the agent
  • Improve the documentation
  • Write tests

If you are interested in contributing, please check the CONTRIBUTING.md file.

Support and Community

  • Discord community. If you have any questions, feel free to ask there.
  • Twitter. Follow us to get the latest updates.

License

Check the LICENSE file for more information.
