- Ventura, CA, United States
Stars
This repository provides builds of the parquet-cli JAR utility, a component of the parquet-java project.
📈 GCPlot - all-in-one JVM GC Logs Analyzer (Server)
TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.
Currently the only webapp to access the TP-Link smarthome api
Apache Hive Metastore as a Standalone server in Docker
XML data source for Spark SQL and DataFrames
A Python module for learning all major algorithms
TPC-H queries in Apache Spark SQL using native DataFrames API
Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared …
🖼️ A command-line system information tool written in bash 3.2+
DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS
Example code for running Spark and Hive jobs on EMR Serverless.
This repo contains samples for EMR Studio feature.
An awesome README template to jumpstart your projects!
Convert Cloudformation templates to Terraform.
Powerful and versatile multi-purpose calculator for the Android platform
Example Spark streaming sample codes with Custom Listeners to push streaming metrics into Amazon CloudWatch metrics
A colorful, dark color scheme for Vim.
Simple PII data check using PySpark
Terraform module for Amazon MWAA(Apache Airflow)
Render markdown on the CLI, with pizzazz! 💅🏻
Transparent proxy server that works as a poor man's VPN. Forwards over ssh. Doesn't require admin. Works with Linux and MacOS. Supports DNS tunneling.
Context aware, pluggable and customizable data protection and de-identification SDK for text and images