Skip to content

ibnunowshad/digital-cv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

91 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Ibrahim Nowshad

Infrastructure pro turned cloud architect, crafting robust, secure solutions on leading platforms. Proven experience in migrations, automation, and security. Passionate learner, global explorer, ready to build cloud kingdoms. Taiwan Employment Gold Card holder based in Dubai 🇦🇪

Email . Website . LinkedIn . GitHub . Telegram . Mastodon

👩🏼‍💻 Engineering Experience

Manager - Reliability & Security @ Cult.Sport (Feb 2023 - Aug 2023)
Sports e‐tailer for smart fitness products, sportswear, at‐home workout equipment, and bicycles

  • Reduced downtime by 35% by migrating legacy services to a highly available Proxmox-based infrastructure with containerized deployments using Docker, Portainer and Ubuntu.
  • Developed Prometheus alerts and Grafana dashboards to monitor key infrastructure metrics and trigger proactive notifications for potential issues.
  • Improved security incident response time by 20% by implementing Wazuh and ELK stack for centralized logging, analysis, and threat detection.
  • Led incident response efforts for 7 major outages, resolving them within 2 hours on average thanks to Wazuh and ELK-based situational awareness.
  • Collaborated with developers and operations teams to implement DevOps practices and automate infrastructure rollouts and scaling, reducing mean time to deployment by 40%.
  • Established SRE culture and practices within the IT team, empowering engineers to own incident response and proactive monitoring while fostering cross-functional collaboration.
  • Technologies used: Jira, Confluence, Git, Bind9, Wazuh, Jenkins, SAML, Nessus, Puppet, Proxmox, Palo Alto, Cisco.

Technology Manager - Reliability & Security @ Lazada (Jan 2020 - Feb 2023)
South East Asia’s e‐tailer incubated by Rocket Internet and acquired by Alibaba Group

  • Redesigned the fulfillment infrastructure with improved operational resilience and uptime as key objectives. Achieved uninterrupted deliveries even during multiple major outages by implementing rearchitectured infrastructure, Prometheus alerts, automated failover.
  • Collaborated with product teams to integrate SRE principles into custom internal tools (Gitlab, Lark, Aone).
  • Developed custom Prometheus exporters to monitor application health and user experience metrics, enabling proactive identification and resolution of performance issues.
  • Improved observability and reduced customer impact by resolving UX (user experience) issues 20% faster.
  • Demonstrated exceptional problem-solving skills by breaking down complex business issues, prioritizing tasks, and developing effective solutions for technical challenges and initiatives though Kanban boards.
  • Implemented a comprehensive DevSecOps framework for continuous security monitoring and threat detection.
  • Maintained compliance for 12,000+ Lazada endpoints (including BYOD devices) using DLP (Data Loss Protection), EDR (Endpoint Detection and Response), and MITRE knowledgebases (CVE and ATT&CK).
  • Reduced identified vulnerabilities by 65% and improved incident response time by 12%.
  • Technologies used: Jira, Confluence, Git, Bind9, SAML, Nessus, Puppet, SQL.

Technical Project Manager @ Alibaba Cloud (Mar 2018 - Jan 2020)
Alicloud provides reliable and secure cloud computing and data processing capabilities as a part of its online solutions.

  • Led the construction of a highly available and resilient Datacenter (Private, Public and Hybrid Cloud), exceeding industry standards for uptime and disaster recovery.
  • Achieved 99.99% uptime SLA through redundant power systems, automated self-healing infrastructure, and comprehensive disaster recovery plan.
  • Increased network deployment speed by 100% (from 100 to 200 switches/day) through innovative rack design and automation initiatives.
  • Utilized Infrastructure Management tools like Device42 and Racktables to manage rack provisioning and configuration.
  • Reduced resource requirements and minimized deployment disruption while increasing network scalability.
  • Successfully executed large-scale network projects (USD 7M) involving intricate cabling upgrades and expansions, ensuring uninterrupted service delivery.
  • Implemented rolling deployments and automated failover to minimize downtime during critical network changes.
  • Completed projects on time and within budget, contributing to the expansion of Alicloud's Availability Zones (AZs), Content Delivery Network (Alicloud CDN) in APAC.
  • Proficiently utilized project management methodologies such as Scrum, Kanban and Gantt charts, along with tools like Jira, Confluence and Gitlab, to ensure smooth coordination and collaboration throughout the projects.
  • Technologies used: Jira, Confluence, Racktables, Device42, Git, AWS, Route53, IAM, RDS, EC2, Puppet.

Lead Network Engineer - Site Reliability @ Versé (Jun 2016 - Mar 2018)
Regional news aggregator in India through a mobile app called dailyhunt

  • Implemented Grafana and Cacti to monitor network traffic and trigger automated auto scalling at 99% accuracy, ensuring optimal resource utilization and service availability.
  • Increased network capacity and reduced maintenance windows, leading to 60% improvement in user experience metrics such as response times, sessions stability.
  • Collaborated with Content Delivery Partners (Akamai, Cloudflare) to implement traffic offloading, reducing internal network load by 40%.
  • Enhanced network performance and scalability while achieving 30% cost savings on network resources.
  • Introduced Git version control for network configurations, enabling faster rollout and rollback of changes, minimizing risk and downtime.
  • Increased efficiency of planned maintenance tasks in the cloud from 65% to 95%, reducing service disruptions and operational overhead.
  • Promoted CI/CD practices within the team, empowering engineers to automate network deployments and improve code quality.
  • Built and led a collaborative team of 4 Reliability Engineers, fostering a culture of learning and skill development.
  • Technologies used: Jira, Confluence, Git, Akamai, Cloudflare, Citrix, Brocade, Openstack, VMware, Ceph.

Lead Network Engineer @ Myntra (Apr 2014 - Jul 2016)
Fashion e‐tailer of India, acquired by Flipkart and Walmart

  • Established and led a dynamic team of 8 System/Network Engineers, fostering a culture of SRE principles and continuous improvement.
  • Streamlined operations and achieved 25% increase in operational efficiency through automated monitoring, incident response practices, and knowledge sharing initiatives.
  • Implemented predictive maintenance and inventory management best practices, reducing surplus equipment by 20% and optimizing resource allocation.
  • Achieved a 99.99% uptime rate, minimizing service disruptions and exceeding SLAs, while optimizing infrastructure cost.
  • Designed a highly reliable, performant, and secure platform infrastructure that prioritized continuous monitoring, automation, and disaster recovery.
  • Utilized Infrastructure opensource tools and cloud-native technologies to enable rapid and secure deployments.
  • Enhanced user experience and ensured data protection, supporting future growth and scalability.
  • Instituted monthly network and security audits, increasing incident detection accuracy by 2% and proactive resolution rate by 95%.
  • Minimized incident downtime and improved overall system resilience through effective root cause analysis and automated remediation workflows.
  • Partnered with technical management to define the platform roadmap and resource allocation plans, aligning infrastructure with SRE principles and scaling needs.
  • Technologies used: Jira, Confluence, Git, Slack, Ruckus, Fortinet, Extreme Networks, VMWare, Shell.

🚀 Other Positions Held

Senior Executive - IT Infrastructure @ Café Coffee Day (Jan 2012 - Apr 2014)
Junior Network Engineer @ iTech India (Oct 2010 - Jan 2012)

🎓 Education

B.Tech in Information Technology, First Class
Anna University - Chennai, India (2007 - 2010)
Couse Work: Analysis of Password Login Phishing-Based Protocols for Security Improvements or 2FA Auth with JSP and MSSQL. This project was based on IEEE 2009 and published in the International Conference on Emerging Technologies. Link to the paper

Diploma in Information Technology, First Class (Honors)
State Board of Technical Education - Chennai, India (2004 - 2007)
Course Work: 'Information about my Institution' - Basic HTML and MSSQL website with student registration online

📚 Certifications

(Cloud Native Computing Foundation and Linux Foundation) Certified Kubernetes Administrator (In Preparation)
(Schneider) Data Center University Associate Development Path
(Cybrary) Nessus Fundamentals
(Cybrary) Manage a Network Infrastructure
(Rackspace) CloudU Rackspace Certified
VMware Certified Associate - Cloud, Data Center Virtualization and Workforce Mobility
Microsoft Certified IT Professional, Technology Specialist, Solution Associate

About

A fancy, new cv that I can send to all future empoyers

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages