
🚕 An agnostic data profiling GUI to make your data science tasks easier.


Pankajomkar/Bumblebee

 
 


built_by iron

Bumblebee

With Bumblebee, you can easily clean and prepare big data using a visual interface. It is built on Optimus/PySpark, so it can handle both small and large datasets efficiently.

For more info about Bumblebee, please go to: https://hi-bumblebee.com/

Installation

Docker

https://medium.com/hi-bumblebee/how-to-run-bumblebee-in-a-docker-container-c9da047d1ff1

Ubuntu/Digital Ocean

https://medium.com/hi-bumblebee/how-to-install-bumblebee-on-digital-ocean-ef77138f1838

Load and explore

Get data from CSV, JSON, Parquet, and Avro files, and from databases. Then get histograms, frequency charts, and advanced statistics.
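To give a feel for what a frequency chart and a histogram compute, here is a minimal plain-Python sketch. It is not Bumblebee's or Optimus's API: the sample data, the `frequency` and `histogram` helpers, and the column names are all made up for illustration.

```python
import csv
import io
from collections import Counter

# Hypothetical sample data standing in for a loaded CSV file.
SAMPLE_CSV = """city,age
Lima,23
Quito,35
Lima,41
Bogota,29
Lima,35
"""

def frequency(rows, column):
    """Count how often each distinct value appears in a column."""
    return Counter(row[column] for row in rows)

def histogram(values, bins):
    """Count values into `bins` equal-width buckets between min and max."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1  # fall back to 1 when all values are equal
    counts = [0] * bins
    for v in values:
        # Clamp so the maximum value lands in the last bucket.
        index = min(int((v - lo) / width), bins - 1)
        counts[index] += 1
    return counts

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
city_counts = frequency(rows, "city")
age_hist = histogram([int(r["age"]) for r in rows], bins=3)
```

Here `city_counts` holds the data behind a frequency chart and `age_hist` the bucket counts behind a histogram. On real big data the same counting would be done by Spark rather than in a Python loop.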

Transform and Clean

Convert unstructured data, standardize strings, unify date formats, impute missing data, handle outliers, and create custom functions.
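The cleaning operations listed above can be sketched in plain Python as follows. This only illustrates the ideas and is not how Bumblebee implements them; the helper names, the list of accepted date layouts, and the choice of the median for imputation are assumptions.

```python
import unicodedata
from datetime import datetime
from statistics import median

# Date layouts this sketch tries, in order; the list is an assumption.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%m-%d-%Y")

def standardize(text):
    """Strip accents, collapse whitespace, and lowercase a string."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_accents.split()).lower()

def unify_date(text):
    """Return the date as YYYY-MM-DD, or None if no layout matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None

def impute_median(values):
    """Replace None entries with the median of the known values."""
    known = [v for v in values if v is not None]
    fill = median(known)
    return [fill if v is None else v for v in values]
```

For example, `standardize("  São   Paulo ")` gives `"sao paulo"` and `unify_date("31/12/2020")` gives `"2020-12-31"`. Ambiguous inputs such as `01/02/2020` are resolved by the order of `DATE_FORMATS`, so a real pipeline would need to know the source convention.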

Prepare for Machine Learning

Bin columns, cluster strings, one-hot encode, scale, and split data into training and test sets.
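As a rough illustration of what these preparation steps do, the sketch below implements binning, one-hot encoding, min-max scaling, and a train/test split in plain Python. The function names and defaults are hypothetical and do not reflect Bumblebee's or Optimus's API; string clustering is left out to keep the example short.

```python
import random

def bin_value(value, edges):
    """Return the index of the first bin whose upper edge is >= value."""
    for index, upper in enumerate(edges):
        if value <= upper:
            return index
    return len(edges)  # values above the last edge fall in an overflow bin

def one_hot(values):
    """Encode categories as 0/1 vectors, one position per sorted category."""
    categories = sorted(set(values))
    return [[1 if v == c else 0 for c in categories] for v in values]

def min_max_scale(values):
    """Rescale numbers linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    span = (hi - lo) or 1  # avoid division by zero for constant columns
    return [(v - lo) / span for v in values]

def train_test_split(rows, test_ratio=0.25, seed=0):
    """Shuffle a copy of the rows and split off a test fraction."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * (1 - test_ratio))
    return shuffled[:cut], shuffled[cut:]
```

A fixed `seed` makes the split reproducible, which helps when comparing models trained on the same data.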

Interact with code as in a Jupyter notebook

Every action on your data is recorded as a transformation step in Python code that you can modify at any time. You can also add your own Python code to build complex Apache Spark transformations.
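One way to picture the idea of editable transformation steps is an ordered list of named functions that are replayed over the data. The `Recipe` class below is a hypothetical sketch, not Bumblebee's internal representation, and it works on plain lists of dictionaries instead of Spark DataFrames.

```python
class Recipe:
    """An ordered, editable list of named transformation steps."""

    def __init__(self):
        self.steps = []  # list of (name, function) pairs

    def add(self, name, func):
        """Append a step; each function maps a list of rows to a new list."""
        self.steps.append((name, func))
        return self  # allow chaining

    def replace(self, name, func):
        """Swap the function of an existing step, keeping its position."""
        self.steps = [(n, func if n == name else f) for n, f in self.steps]

    def run(self, rows):
        """Apply every step in order and return the final rows."""
        for _, func in self.steps:
            rows = func(rows)
        return rows

recipe = Recipe()
recipe.add("drop_empty", lambda rows: [r for r in rows if r["name"].strip()])
recipe.add("upper_name", lambda rows: [{**r, "name": r["name"].upper()} for r in rows])

data = [{"name": "ana"}, {"name": " "}, {"name": "luis"}]
result = recipe.run(data)
```

Because the steps are stored rather than applied once, editing one of them with `replace` and calling `run` again reproduces the whole pipeline with the change.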

See Bumblebee in action

https://www.loom.com/embed/c2cfb6a2e0a549e2afeb8d484865b968

Contributing to Bumblebee

Contributions go far beyond pull requests and commits. We are very happy to receive any kind of contribution, including:

  • Documentation updates, enhancements, designs, or bugfixes.
  • Spelling or grammar fixes.
  • README.md corrections or redesigns.
  • Adding unit or functional tests.
  • Triaging GitHub issues, especially determining whether an issue still persists or is reproducible.
  • Searching #hibumblebee on Twitter and helping someone else who needs help.
  • Blogging, speaking about, or creating tutorials about Bumblebee and its many features.
