Skip to content
View Rinub's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report Rinub

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Rinub/README.md

A Software Engineer's repository

Abhishek Naidu | Twitter Abhishek's LinkedIN

Hi, i'm Ibrahim Rinub Babu, • Software Engineer with 4+ years of experience in design and development of data-intensive applications. • Well-acquainted with SQL, database engineering, improving database performance, server-side technology. • Specialized in python frameworks for designing large-scale data processing systems and ETL data pipelines. • Deployed and Administered Hadoop cluster using Cloud platforms in AWS EMR to developed streaming ETL pipelines and process big data in Airflow, Spark and created REST API’s to provide data access using Django. • Built Machine Learning Pipelines in PySpark MLlib to generate computed columns for recommendation system. • Certified in IBM Data Science Professional and Apache Spark. Experience in developing data infrastructure for financial and health insurance domain. Currently exploring technologies such as DevOps, CI/CD, Kubernetes.

GIF

Languages and Frameworks:

🚧 SKILLS

➲ Programming Languages ► Python, SQL, PL/SQL, PowerShell scripting.

➲ Database and servers ► MySQL, PostgreSQL, Oracle 21c, IBM DB2, AWS RDS.

➲ ETL Tools ► Python (Pyspark, SQLAlchemy Pandas, Airflow), IBM Datastage.

➲ Cloud services ► Redshift, AWS EMR, AWS EC2, S3, Cloud watch, Lambda.

➲ Python Frameworks ► RESTful API(Django, Flask), Machine Learning (scikit-learn, TensorFlow, Keras).

➲ Big Data ► distributed data processing frameworks such as Apache Spark, and Hadoop.

➲ DevOps & BI Tools ► Docker, Git, Jenkins, CI/CD Pipeline and Power BI for KPI, Postman.

⋙ RESEARCH ► Data science methods, Data mining, Survey Creations, Focus Groups, Machine learning, Deep learning, Business Intelligence, Project Management Techniques, Knowledge of business structure.

⋙ DATA MANAGEMENT ► Database Design & Management, Data Quality, Pattern & trend identification, visualization of data insights

◆ CERTIFICATION ◆ ⋙ Data Science ► IBM Data Science Professional, Coursera. - 2021 ⋙ Apache Spark ► Building Machine Learning Pipelines in PySpark MLlib, Coursera. - 2022 ⋙ C Programming ► Certification Course in C Programming, Bharathidasan University - 2016 ⋙ Pandas and Python ► Learn Data Analysis using Pandas and Python, Udemy - 2017

◆ WORK PROJECTS ◆

Designed cost efficient Pyspark ETL Data pipeline for creating realtime Dashboards for Inventory stock management using PostgreSQL, AWS EMR Hadoop cluster and Lambda. • In order to reduce the cost by 60% a Transit AWS EMR cluster has been used to develop a Pyspark ETL Pipeline. • The Cluster will be created, run the Pyspark ELT script and terminate after processing and transfering the data to the Data Warehouse. • To achieve this a python script using Packages such as boto3, S3, has been deployed in AWS Lambda to create a cluster periodically an run the PYspark script stored in S3 and terminate the cluster after the process is complete. Developed Real-time Dashboards for Inventory stock for supply chain management using Power-BI. • Integrated AWS Redshift data source to Power BI using the Power BI data connector for Redshift and Configure real-time streaming in the Power BI Service using the Redshift data source. • Create a Power BI report and visualize your data and Publish the report to the Power BI Service regarding Inventory stocks for supply chain management and to identify investment levels.

Pinned Loading

  1. IMPLEMENTATION-OF-TOUCHLESS-HAND-GESTURE-RECOGNITION-ATM-BASED-ON-DEEP-LEARNING-APPROACH- IMPLEMENTATION-OF-TOUCHLESS-HAND-GESTURE-RECOGNITION-ATM-BASED-ON-DEEP-LEARNING-APPROACH- Public

    Jupyter Notebook 1

  2. IMPLEMENTINNG-A-BUSINESS-INTELLIGENCE-AND-BUSINESS-ANALYTIC-SYSTEM-FOR-TEXTILE-MANUFACTURERS IMPLEMENTINNG-A-BUSINESS-INTELLIGENCE-AND-BUSINESS-ANALYTIC-SYSTEM-FOR-TEXTILE-MANUFACTURERS Public

    1