Skip to content

Analyze, structure and clean unstructured data with AI

License

Notifications You must be signed in to change notification settings

Contributorrandom/lilac

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🌸 Lilac

Static Badge Downloads License Twitter

NEW: Try the Lilac hosted demo with pre-loaded datasets

👋 Welcome

Lilac is an open-source product that helps you analyze, structure, and clean unstructured data with AI.

Lilac can be used from our UI or from Python.

lilac-welcome.mp4

💻 Install

To install Lilac on your machine:

pip install lilac

You can also use Lilac with no installation by forking our public HuggingFace Spaces demo.

🔥 Getting started

Start the Lilac webserver from the CLI:

lilac start

Or start the Lilac webserver from Python:

import lilac as ll

ll.start_server()

This will open start a webserver at http://localhost:5432/.

📁 Documentation

Visit our website: lilacml.com

💻 Why Lilac?

Lilac is a visual tool and a Python API that helps you:

  • Explore datasets with natural language (e.g. documents)
  • Enrich your dataset with metadata (e.g. PII detection, profanity, text statistics, etc.)
  • Conceptually search and tag your data (e.g. find paragraphs about injury)
  • Remove unwanted or problematic data based on your own criteria
  • Analyze patterns in your data

Lilac runs completely on device using powerful open-source LLM technologies.

💬 Contact

For bugs and feature requests, please file an issue on GitHub.

For general questions, please visit our Discord.

About

Analyze, structure and clean unstructured data with AI

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 52.4%
  • Svelte 19.0%
  • TypeScript 15.4%
  • Jupyter Notebook 11.9%
  • Shell 0.6%
  • CSS 0.3%
  • Other 0.4%