PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
-
Updated
Jun 20, 2024 - Python
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
ML project focused on predicting Titanic passenger survival using various algorithms and extensive data analysis techniques. This project includes detailed data visualization and interpretation to uncover key factors affecting survival. By leveraging various ML models the analysis aims to achieve high predictive accuracy.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Data Exploration is the initial step in data analysis, where users explore a large data set in an unstructured way to uncover initial patterns, characteristics, and points of interest.
General purpose AI Studio. Mix AI models, connect to databases, APIs (like Stripe), CSV/Excel files, even email. Solve daily problems and share your workflows :)
Group project for the course Business Analytics Applications with Python for my MScBA in Business Analytics & Management.
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Travel Data Analysis Internship Project at iNeuron.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
Customer Segmentation using R
Grep through all Grafana entities in the spirit of git-wtf.
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Successfully fine-tuned a pretrained DistilBERT transformer model that can classify social media text data into one of 4 cyberbullying labels i.e. ethnicity/race, gender/sexual, religion and not cyberbullying with a remarkable accuracy of 99%.
Telecom Data Analysis with Apache Hive
This project involves a comprehensive comparative analysis of various machine learning models to classify activities based on a given dataset. The analysis follows a structured approach, including data exploration, model training, model evaluation, and results interpretation to identify the best performing model.
Unlock insights into the U.S. healthcare landscape from 2019 to 2020. Our PowerBI-driven analysis delves into hospital performance, patient outcomes, and payer-provider dynamics. Dive into detailed reports and visualizations for informed decision-making, empowering healthcare stakeholders, and shaping the industry's future.
SQL Data Cleaning And Exploration: Analysis Practice. #SQL #DataCleaning #DataExploring #DataScience
Pheno-Ranker is a tool designed for performing semantic similarity analysis on phenotypic data structured in JSON format, such as Beacon v2 Models or Phenopackets v2.
Add a description, image, and links to the data-exploration topic page so that developers can more easily learn about it.
To associate your repository with the data-exploration topic, visit your repo's landing page and select "manage topics."