Head of Data
- Definition, planning and execution of the Generative AI strategy in the company. Since the last Q of 2023, we have started developing Generative AI solutions. Advanced technologies such as Azure OpenAI, Copilot Studio, Microsoft Teams, AI Search, App Services, Blob Storage, Streamlit, LangChain, LangSmith and Azure DevOps are used, marking a milestone in the company's technological history.
- The benefited areas range from: Human resources, logistics, IT, legal, environment, occupational health and safety to UM, maintenance, metallurgy.
- Training on Data, Analytics and GenAI topics for the company's collaborators.
Feb 2022 – Mar 2023 | Peru
Data Management Lead
- Provide training to C Level, Business Managers on the importance of Data Management, Data Governance and Data Quality, Data Literacy.
- Conduct a Data Management Maturity Assessment (DMMA).
- Establish Data Strategy
- Build Data Governance Program
- Evaluate Data Governance tools before implementation.
- Definition of scalable data architectures.
- Creation of formats (policies, standards) for the data governance program.
- Proofs of concept of data integration technologies, data transformation, data orchestation, data catalog, data integration, data quality, data observability.
Feb 2021 – Jan 2022 | USA / Mexico
Data Architect Senior (Globant)
- DataOps
- Data Platform, Data Mesh (DDD)
- Dataform (Process ELT)
- Technologies :
- Google Cloud : Big Query, Cloud Storage, Cloud Build, BigTable, Dataflow
- Azure : Azure DevOps (Repos, Pipelines, CICD)
- Open Source : Terraform, SonarQube, Dataform
Jul 2019 – Feb 2021 | Peru
Chapter Lead Big Data
- Implementation Serverless Data Lake Framework - SDLF (https://catalog.us-east-1.prod.workshops.aws/workshops/501cb14c-91b3-455c-a2a9-d0a21ce68114/en-US/20-production/100-multi-env)
- Pipeline DataOps and MLOps
- POCs tools Data & Analytics
- Technologies :
- AWS : S3, Elastic Kubernetes Services (EKS), Fargate, DynamoDB, Step Functions, Lambda, SageMaker, CodeCommit, Code Build, Code Deploy, Code Pipeline, CloudFormation
- Azure : Functions, Table Storage, Blob Storage, Speech to text
- Open Source : Spark, Python, Scala
Jan 2019 – Jun 2020 | USA
Senior Data Engineer
- Management of the Big Data ecosystem from a cluster on AWS (HortonWorks)
- Shell programming in linux.
- Technologies :
- Open Source : Spark, Sqoop, Kafka, Nifi, Hive, Hadoop, Hbase
- DevOps : Gitlab, Jenkins
- Other : Talent Data Studio, Birst, Zabbix, Grafana, Oracle, Shell
Sep 2018 – Jan 2019 | Peru
Senior Data Engineer
- Management of the Big Data ecosystem from a Hadoop cluster in EMR using Hive and Spark.
- Development of scripts in Python to access AWS services.
- Ingesting data from Redshift to Hive with Spark.
- Technologies :
- AWS : Lambda, SQS, EMR, DynamoDB, Step Functions, RedShift, S3
- DevOps : Github
- Open Source : Spark, Sqoop, Nifi, Hive, Hadoop, Hbase, Scala, Python