The Trends in Data Jobs project is a web scraping and data visualization tool designed to track and analyze trends in data-related job postings.
Updated Jun 8, 2024 - Jupyter Notebook
A daily digest of the articles and videos I've found interesting and want to share with you.
This project implements an end-to-end tech stack for a data platform that can be used in production.
A Cloud Native Batch System (Project under CNCF)
scBubbletree: quantitative tool for visual exploration of scRNA-seq data
Upserts, Deletes And Incremental Processing on Big Data.
Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.
A curated list of awesome big data frameworks, resources and other awesomeness. With repository stars⭐ and forks🍴
A cross-platform, PowerPoint-like ECharts dashboard application driven by Excel data, with the capability to update data remotely. Supports line, spline, area, areaspline, column, bar, pie, scatter, angular gauge, arearange, areasplinerange, columnrange, bubble, box plot, error bar, funnel, and waterfall charts.
Data, Analytics & AI. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
A utility for calculating geometric means from a spreadsheet, picking them into separate worksheets, and returning the calculations as an Excel workbook.
Web scraper that extracts all daily tennis matches and analyzes them to predict the probability in the "First Set Player To Break Serve" market.
A general purpose Distributed Systems Framework
DataWave is an ingest/query framework that leverages Apache Accumulo to provide fast, secure data access.
TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.
IT Knowledge Base from 20 years in DevOps, Linux, Cloud, Big Data, AWS, GCP etc - gradually porting my large private knowledge base to public
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Class that provides a high-precision floating-point arithmetic library