
Refactor metrics pipeline: services, counter, opensearch #111

Merged
pitangainnovare merged 43 commits into main from v2.0.0-pr4 on May 2, 2026


@pitangainnovare
Contributor

What does this PR do?

Completely restructures the metrics pipeline into 4 packages:

  • metrics/services/: parse/export pipeline. jobs.py (DailyMetricJob lifecycle), resources.py (fetches robots/mmdb), parser.py (parse + heartbeat), export.py (OpenSearch), daily_payloads.py (disk I/O)
  • metrics/opensearch/: OpenSearch client. client.py, mappings.py (4 index schemas), scripts.py (Painless upsert), names.py (index naming)
  • metrics/counter/: COUNTER R5.1 logic. access.py (extraction/validation), aggregation.py (unique metrics), documents.py (builds the OpenSearch documents), identifiers.py, parser.py
  • metrics/tasks/: Celery tasks, now split up. parse.py, process.py, resume.py, index.py, cleanup.py
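
To give reviewers a feel for what "unique metrics" means in the counter package, here is a minimal sketch of COUNTER-style total versus unique item request counting. It is not the code in aggregation.py: the function names, the `(user_id, item_id, timestamp)` tuple shape, and the use of user plus date plus hour as a session key are all assumptions made for illustration.

```python
from collections import Counter
from datetime import datetime


def session_key(user_id, timestamp):
    """Hypothetical session key: same user, same date, same hour of day."""
    return (user_id, timestamp.strftime("%Y-%m-%d"), timestamp.hour)


def aggregate_item_requests(hits):
    """Return (total, unique) request counts per item.

    `hits` is an iterable of (user_id, item_id, timestamp) tuples.
    Every hit adds to the total; a hit adds to the unique count only the
    first time that item is seen within a given session.
    """
    total = Counter()
    unique = Counter()
    seen = set()  # (session key, item_id) pairs already counted as unique
    for user_id, item_id, timestamp in hits:
        total[item_id] += 1
        pair = (session_key(user_id, timestamp), item_id)
        if pair not in seen:
            seen.add(pair)
            unique[item_id] += 1
    return dict(total), dict(unique)
```

With this sketch, two requests for the same item by the same user within the same hour count twice toward the total but only once toward the unique count.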

Removes metrics/es.py and metrics/utils/ (indexing, parser_utils, file_utils). Fixes imports and mock paths in the tests.

Where could the review start?

  1. metrics/counter/access.py: extraction of COUNTER R5.1 access data
  2. metrics/opensearch/client.py: OpenSearchUsageClient
  3. metrics/services/parser.py: pipeline orchestration
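
As background for the second entry point, the description mentions a Painless upsert script. The sketch below shows the general shape of a scripted-upsert request body for the OpenSearch Update API. It is an illustration under assumptions, not the contents of scripts.py or client.py: `build_upsert_body` and the field names `total_item_requests` and `unique_item_requests` are hypothetical.

```python
def build_upsert_body(total, unique):
    """Build an Update API body that increments counters on an existing
    document, or creates the document with these values if it is missing.

    The function name and the field names are hypothetical.
    """
    source = (
        "ctx._source.total_item_requests += params.total; "
        "ctx._source.unique_item_requests += params.unique;"
    )
    return {
        # Runs only when the document already exists.
        "script": {
            "lang": "painless",
            "source": source,
            "params": {"total": total, "unique": unique},
        },
        # Used as the initial document when it does not exist yet.
        "upsert": {
            "total_item_requests": total,
            "unique_item_requests": unique,
        },
    }
```

A body of this shape lets repeated exports for the same document accumulate counts instead of overwriting them, which is presumably why the pipeline relies on an upsert script rather than plain indexing.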

Screenshots

N/A

Which tickets are relevant?

N/A

References

N/A

@pitangainnovare force-pushed the v2.0.0-pr4 branch 2 times, most recently from ab15601 to 8429109, on May 2, 2026 01:52
@pitangainnovare self-assigned this on May 2, 2026
@pitangainnovare merged commit c048ebf into main on May 2, 2026
1 of 2 checks passed
@pitangainnovare deleted the v2.0.0-pr4 branch on May 4, 2026 19:30


Development

Successfully merging this pull request may close these issues.

  • Refactoring for version 2.0
  • Optimize database usage to process logs from larger collections
