DevOps engineer based in Wrocław, Poland, CET
Highly skilled and motivated Site Reliability Engineer (SRE) / DevOps with more than 10 years of experience in designing, building, and maintaining highly scalable and reliable systems.
- Programming Languages: Perl, Python, Golang, Shell scripting
- Cloud Technologies: Yandex cloud, Digital Ocean
- Containerization and Orchestration: OpenVZ, Docker, Kubernetes, Podman, LXC, nspawn
- Virtualization: KVM, VMware ESxi, Xen, Virtuozzo
- Infrastructure as Code (IaC): Puppet, Ansible, Terraform
- Continuous Integration/Continuous Delivery (CI/CD): Jenkins, Github/GitLab CI/CD
- CI/CD Sec: SonarQube, FindSecBugs, Checkov, OpenSCAP, Vuls, grype, Trivy, clair, Falco, syft, cosign
- Monitoring and Alerting: Nagios + mrtg, Zabbix, Prometheus, Grafana, ELK Stack
- Databases: MySQL, PostgreSQL, Oracle, MongoDB, ClickHouse, Redis/Memcached, tarantool
- Load balancing: LVS, haproxy, nginx, maxscale
- High availability: RHCS, Corosync/Pacemaker, keepalived/VRRP
- Networking: Cisco, Juniper, Brocade
- VoIP: Kamailio, asterisk
- Collaboration and Communication: Jira, Confluence
- Cloud Services Engineer, Yandex Ltd, 16.05.2024
- Completion of the course "celestial mechanics", ITMO University, 10.06.2015
- Engineering, SPbPU University, 11.05.2015
- SQL server for beginners, UDEMY, 13.03.2015
- SQL language online cource, UDEMY, 13.03.2015
- M202: MongoDB Advanced Deployment and Operations, MongoDB University, 26.02.2015
- LFS101x.2: Introduction to Linux, edX.org, 02.02.2015
- Programming for Everybody (Python), coursera.org, 12.01.2015
- Introduction to Linux, stepik.org, 10.12.2014
- VoIP course for beginners, skillfactory, 06.12.2014
- LFS101x: Introduction to Linux, edX.org, 15.08.2014
- Technical specialist Radmin, RADMIN, 17.01.2014
- Technical specialist ESET, ESET NOD32, 24.12.2013
- M102: MongoDB for DBAs, MongoDB University, 26.11.2013
- Competition: "System Administration PRO", IT Starz 2013, 27.09.2013
- Automated infrastructure management using Puppet and Ansible, leading to a 70% reduction in manual provisioning time and enhanced environment consistency.
- Designed and implemented scalable pacemaker/corosync clusters for deploying (micro)services, containers and virtual machines, improving application scalability and fault tolerance.
- Developed and maintained CI/CD pipelines using Jenkins and GitLab CI/CD, enabling automated building, testing, and deployment of applications.
- Set up comprehensive monitoring and alerting systems using Prometheus, VictoriaMetrics, Grafana, and the ELK Stack, which enabled proactive issue identification and decreased mean time to resolution.
- Collaborated with development teams to boost application performance and reliability through meticulous performance tuning, load testing, and code optimization.
- Led incident response and troubleshooting efforts, ensuring quick resolution of critical incidents and minimizing system downtime.
- Conducted Chaos Engineering experiments to proactively identify system weaknesses and improve resilience.
- Participated in on-call rotations, handling incident responses and performing root cause analyses to prevent future occurrences.
- Managed bare-metal infrastructure in proprietary data centers, ensuring high availability, scalability and security.
- Automated infrastructure provisioning and configuration using Puppet and Ansible, reducing deployment time by 50% and improving infrastructure consistency.
- Implemented centralized logging and log analysis using rsyslog, improving troubleshooting and monitoring capabilities.
- Collaborated closely with development teams to devise and implement performance monitoring and optimization techniques.
- Worked in conjunction with security teams to implement and maintain security controls and ensure compliance with industry standards.
- Organized and executed disaster recovery planning and testing to guarantee business continuity.
- Developed scripts for Adrenaline bot reducing manual ingame windows provisioning time by 60% which allowed 24/7 farming.
- Managed more than 20 hosts with hundreds ingame windows.
- Implemented automated backup solutions to safeguard data and ensure quick recovery in case of failures.
- Built a computer network for approximately 30 hosts with 2 internet providers for fault tolerance that allowed to speed up the document flow.
- Set up and maintained Squid proxy server for caching that allowed reduce internet traffic and payments to providers.
- Implemented Windows Active Directory for streamlined network management and improved security.
- Installed and updated software applications on all company computers, ensuring compliance with licensing agreements and reducing security vulnerabilities.
- Provided technical support and troubleshooting for hardware and software issues, minimizing downtime and maintaining smooth operations.
- Established network security firewalls and antivirus solutions, to protect against external and internal threats.
- Trained employees on best practices for using network resources, improving overall efficiency and reducing the occurrence of user-related issues.
- Managed data backup and recovery processes to ensure data integrity and quick restoration in case of system failures.
- Russian: Native
- English: B1
- Polish: B1
Available upon request
Actual version of this CV, you can always find here: https://iglov.github.io/cv or you can just download it as a file.