Skip to content
View Pushkr's full-sized avatar
👨‍💻
Solving Data Engineering problems..
👨‍💻
Solving Data Engineering problems..

Block or report Pushkr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The Python library for names.

Python 887 154 Updated Feb 6, 2025

List of Computer Science courses with video lectures.

68,418 9,260 Updated Mar 26, 2025

My notes for AWS Solutions Architect Associate.

1,662 478 Updated Jul 26, 2023

This is a repo documenting the best practices in PySpark.

Jupyter Notebook 462 77 Updated Dec 8, 2022

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

161,710 10,165 Updated Nov 19, 2024

An evolving how-to guide for securing a Linux server.

17,958 1,148 Updated Oct 19, 2024

Resumes generated using the GitHub informations

JavaScript 62,325 1,358 Updated Feb 15, 2023

😎 Awesome lists about all kinds of interesting topics

353,453 28,803 Updated Mar 13, 2025

A curated list of awesome big data frameworks, ressources and other awesomeness.

13,524 2,565 Updated Feb 14, 2025

A convenient Python wrapper for Apache NiFi

Python 255 75 Updated Mar 25, 2025

A curated list of data engineering tools for software developers

7,227 1,299 Updated Mar 14, 2025

Retrying library for Python

Python 7,234 291 Updated Mar 25, 2025

Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.

Java 88 15 Updated Mar 5, 2024

ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.

Java 283 97 Updated Feb 27, 2019

Data cleansing tutorial for chipy scientific SIG

Jupyter Notebook 8 8 Updated Feb 18, 2016

📚 Parameterize, execute, and analyze notebooks

Python 6,116 437 Updated Jan 7, 2025

📘 The interactive computing suite for you! ✨

TypeScript 6,240 552 Updated Dec 30, 2023

SparkOnHBase

Scala 279 177 Updated Mar 30, 2021

A python Web HDFS based tool for inter/intra-cluster data copying.

Python 9 4 Updated Aug 27, 2020

The Python micro framework for building web applications.

Python 69,186 16,350 Updated Jan 5, 2025

the only cheat sheet you need

Python 39,154 1,809 Updated Feb 1, 2025

50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra,…

Shell 1,338 472 Updated Mar 14, 2025

Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers

69 27 Updated Oct 18, 2022

Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures.

Go 15,641 1,192 Updated Jan 6, 2025

📖 A collection of pure bash alternatives to external processes.

Shell 36,867 3,310 Updated Nov 28, 2023

关于Python的面试题

Shell 16,768 5,549 Updated Mar 5, 2025

A list of helpful Scala related questions you can use to interview potential candidates.

502 86 Updated Mar 21, 2017

A curated list of awesome Apache Spark packages and resources.

Shell 1,773 336 Updated Oct 24, 2024

Examples for High Performance Spark

Scala 506 234 Updated Nov 3, 2024
Next
Showing results