Skip to content
View lrdsouza's full-sized avatar
🏠
Working from home
🏠
Working from home
  • Cora
  • Manaus, Amazonas, Brazil

Highlights

  • Pro

Block or report lrdsouza

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
lrdsouza/README.md

Leonardo Rodrigues de Souza

Senior AI Engineer | Machine Learning Architect

LinkedIn Email Location


Typing SVG

> whoami

AI/ML Engineer building mission-critical GenAI systems in regulated fintech environments.

  • πŸ›‘οΈ Designing LLM security middleware β€” adversarial defense, PII anonymization, prompt lifecycle auditing
  • πŸ“‰ Cut 85% inference costs by replacing managed services with in-house distilled models
  • πŸš€ Built RTGen (INPI-registered software) β€” GenAI report generation 39x faster than human baseline
  • πŸŽ“ M.Sc. Computer Science + B.Sc. Statistics β€” Federal University of Amazonas

> featured-work

πŸ›‘οΈ GenAI Security Middleware

Banco Cora β€” Fintech

End-to-end security layer for LLM interactions in a regulated banking environment.

Python FastAPI LangChain Redis

Red Teaming NER/PII Prompt Injection Defense

95% attack detection Β |Β  96% PII accuracy Β |Β  14 sensitive data types

πŸ“„ RTGen β€” AI Report Generator

Registered Software (INPI)

GenAI-powered technical report generation system using RAG architecture.

Python LlamaIndex Docker GCP

RAG Document Generation Automation

39x faster than human Β |Β  15% OpEx savings

πŸ₯‡ Toxic Language Detection

1st Place β€” Brazilian ML Olympiad 2024

Optimized BERT model that outperformed commercial LLM-based solutions for toxic language detection in PT-BR.

PyTorch HuggingFace

BERT Fine-tuning NLP LoRA

1st place Β |Β  Cost-efficient Β |Β  Beat commercial LLMs

πŸ“‰ NLP FinOps Pipeline

Instituto de Pesquisas Eldorado

Re-engineered NLP inference pipelines replacing expensive managed services with distilled in-house models.

Kubernetes MLflow AWS

Model Distillation MLOps CI/CD

85% cost reduction Β |Β  92%+ accuracy maintained


> tech-stack

Core Engineering

Python FastAPI PostgreSQL MySQL Redis Pydantic

GenAI & NLP Security

LangChain LangGraph LlamaIndex vLLM Pinecone FAISS

Models: Gemini GPT LLaMA Claude BERT GLiNER Grok Techniques: Fine-tuning (LoRA/QLoRA) RAG Red Teaming Purple Teaming LLM-as-a-Judge

Machine Learning & Data Science

PyTorch scikit-learn Pandas NumPy

Methods: Bayesian Inference A/B Testing SHAP Classification Regression Hybrid Filtering

Infrastructure & MLOps

Docker Kubernetes GitHub Actions ArgoCD GCP AWS Azure HuggingFace

Observability

Prometheus Grafana MLflow LangSmith


> experience

🏦 Banco Cora β€” Fintech

AI Engineer & Solutions Architect 2025 - Present

Security middleware for Generative AI in a highly regulated financial environment

  • Designed async processing core for message interception with LLM provider decoupling
  • Led Red Teaming strategies and fine-tuning for adversarial attack detection (Jailbreaks/Prompt Injection)
  • Built proprietary NER models for real-time anonymization of 14 sensitive data types (LGPD compliance)
  • Established Tracing/Logging standards for full prompt lifecycle auditability

πŸ“‘ Instituto de Pesquisas Eldorado β€” Telecom & Innovation

Senior Software Analyst 2021 - 2025

AI solutions for global tech players focused on ROI maximization

  • Re-engineered NLP pipelines: 85% cost reduction replacing managed services with distilled models (92%+ accuracy)
  • Created RTGen (INPI-registered): GenAI report system 39x faster than human, 15% OpEx savings
  • Optimized recommendation algorithms (Hybrid Filtering): 3% CTR uplift on end-user engagement
  • Structured scalable ETL/ELT pipelines and CI/CD for ML with scientific reproducibility

> achievements

Achievement Year
πŸ₯‡ 1st Place β€” Brazilian Machine Learning Olympiad β€” Toxic language detection, outperforming commercial LLMs with optimized BERT 2024
πŸ† EldFlash Award β€” E2E architecture with LLMs for strategic risk metric 2025
🌟 Eldorado Excellence Award β€” LLM automation architecture with high ROI 2024
πŸ“„ Registered Software (INPI) β€” Author of RTGen, AI-powered report generation 2024
⚑ EldFlash Award β€” Tracking and resolution of critical pipeline failure 2022
πŸŽ“ M.Sc. Computer Science β€” Federal University of Amazonas 2021
πŸ“Š B.Sc. Statistics β€” Federal University of Amazonas 2019

Popular repositories Loading

  1. denv_genomes_severity denv_genomes_severity Public

    Database containing .csv files with samples of dengue protein sequences labeled with the severity degree of infection in human hosts.

    1

  2. lrdsouza lrdsouza Public

    Profile README