Skip to content
View ShaileshKumar97's full-sized avatar
🧐
🧐

Block or report ShaileshKumar97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ShaileshKumar97/README.md

Hi there, I'm Shailesh Kumar 👋!

Who I am?

  • A Machine Learning / GenAI Engineer based in India.
  • Working as Lead Data Scientist at Katonic AI.
  • I love math, programming, data science, and books.
  • Creator of ExplainIt, an opensource package for drift detection & data quality management.
  • Open Source Enthusiast.
  • See my portfolio at shaileshkumar97.github.io.

What I'm doing?

  • Writing Python, SQL, HTML/CSS, PostgreSQL, MySQL, Redis etc...
  • Contributing to Open Source.
  • Mostly active on LinkedIn.
  • Currently, building end-to-end production ready generative ai assistants/agents to handle different types of knowledge base for variety of usecases.
  • Previously, built an end-to-end production pipeline for processing short videos with different usecases.

What are my skill sets?

  • 🎛 Machine Learning Operations (MLOps):

    • Language: PythonSQL
    • Framework: MlflowKubeflowElyraDashFastAPIStreamlit
    • Databases: PostgreSQLMySQLRedisSnowflake
    • Concepts: Data PipelineFeature StoreData GovernanceModel PipelineModel DeploymentApp DeploymentModel MonitoringDrift DetectionModel Explainability
  • 👨‍💻 Python Developer:

    • Open Source Projects:

      • Explainit: A modern enterprise-ready business intelligence web application SDK for Drift Detection, Monitoring & Data Quality Management.
    • In-House SDK: (for katonic.ai)

      • Feature Store: To manage end-to-end life-cycle of features & integrate with existing data stores, feature pipelines, data governance, and ML platforms.
      • Connectors: To access the data from different databases/warehouses and stores to a given destination.
      • FileManager: To access, store and update/manipulate objects within the katonic file browser.
      • Pipeline: To convert an existing notebook into a Kubeflow pipeline.
      • AutoML: To build, Train & Log different Machine Learning, Deep Learning models.
      • Log: To quick register the trained models with mlflow in platform for deployment to the production environment.
  • 🧮 Machine Learning:

    • Language: PythonSQL
    • Framework: Scikit-LearnXgboostCatboostPandasPlotlyMatplotlibPyspark
    • Databases: PostgreSQLMySQLPostgreSQL
    • Big Data: SparkData Lake (Delta, Hudi, Hive)
    • Protocol: REST
  • 🤖 Deep Learning:

    • Language: Python
    • Framework: PyTorchTensorflowKerasOpenCVLibrosa
  • 🗄️ Backend:

    • Language: Python
    • Framework: FastAPIFlaskStreamlit, Dash
    • Databases: PostgreSQLMySQLAWS S3RedisSnowFlake
    • System Architecture: MonolithicModular
    • Protocol: REST
  • 🖥 Frontend:

    • Language: HTMLCSSPython
    • Framework/Library: Dash, Streamlit
    • Utils: BootstrapModular CSS
  • 🎡 Ecosystem:

    • Containerization: Docker
    • Version Control: GitGitHub
    • CI/CD: Github Actions
    • Project Management: GitHub Projects

How to reach me?

Twitter LinkedIn Medium Mail


Shailesh Kumar's GitHub stats

Pinned Loading

  1. Microservice-Architectures Microservice-Architectures Public

    Python 1

  2. ChatQL ChatQL Public

    ChatQL: Querying Databases Through Conversation with Power of LLMs

    Python 1

  3. Speech-Enhancement Speech-Enhancement Public

    Jupyter Notebook

  4. Deep-Learning-Projects Deep-Learning-Projects Public

    Jupyter Notebook 2

  5. Machine-Learning-Projects Machine-Learning-Projects Public

    Jupyter Notebook 1

  6. Vision-Transformer-for-ECG-Classification Vision-Transformer-for-ECG-Classification Public

    Jupyter Notebook