Pulling changes from BYK WIP to RAG-17 #11
Merged
This pull request adds a centralized logging and monitoring stack for the project, built on Grafana and Loki plus supporting services. The changes include new configuration files, Docker Compose service definitions, and documentation that enable real-time log aggregation, visualization, and alerting for the deployment and model orchestration processes. The stack is designed to integrate with Python services and supports log filtering and monitoring through Grafana dashboards.
Logging and Monitoring Stack Setup
docker-compose.yml with service definitions for Loki, Grafana, Qdrant, Langfuse (worker/web), Clickhouse, Minio, Redis, and Postgres, including health checks, persistent volumes, and network configuration for orchestration.
.env file with environment variables for all services, including credentials, endpoints, and configuration options for S3, Redis, Clickhouse, Grafana, and Langfuse.

Grafana and Loki Configuration
grafana-configs/loki-config.yaml for Loki server setup (filesystem storage, schema, cache, and analytics opt-out), and grafana-configs/grafana-datasources.yaml to auto-provision Loki as the default Grafana datasource.
grafana-configs/grafana-dashboards.yaml to organize dashboards under the "RAG Module" folder for easier navigation.

Monitoring Dashboard
grafana-configs/grafana-dashboard-deployment.json with a ready-to-use Grafana dashboard for the RAG Module Orchestrator, featuring log count graphs by level/service and a real-time log viewer with filtering by service and log level.

Documentation
grafana-configs/README.md explaining the architecture, setup, usage, integration instructions for Python services, troubleshooting tips, and best practices for log monitoring and retention.
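
As a rough illustration of what integrating a Python service with this stack could look like, the sketch below builds a request body for Loki's HTTP push endpoint (`/loki/api/v1/push`) and posts it using only the standard library. It is not taken from this pull request: the URL `http://localhost:3100` (Loki's default HTTP port), the label names `service` and `level`, and the helper names `build_loki_payload` and `push_log` are all assumptions chosen to match the dashboard's service and log level filters. The actual integration described in grafana-configs/README.md may differ.

```python
import json
import time
import urllib.request

# Assumption: Loki is reachable on its default HTTP port; adjust to the
# host/port actually exposed by docker-compose.yml.
LOKI_PUSH_URL = "http://localhost:3100/loki/api/v1/push"


def build_loki_payload(service, level, message, ts_ns=None):
    """Build a JSON-serializable body for Loki's push API.

    The label names "service" and "level" are hypothetical; they should
    match whatever labels the Grafana dashboard filters on.
    """
    if ts_ns is None:
        ts_ns = time.time_ns()
    return {
        "streams": [
            {
                # Labels identify the log stream in Loki.
                "stream": {"service": service, "level": level},
                # Each entry is [timestamp in nanoseconds as a string, log line].
                "values": [[str(ts_ns), message]],
            }
        ]
    }


def push_log(service, level, message, url=LOKI_PUSH_URL, timeout=2.0):
    """POST a single log line to Loki and return the HTTP status code."""
    body = json.dumps(build_loki_payload(service, level, message)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status


if __name__ == "__main__":
    # Only prints the payload; call push_log(...) once Loki is running.
    example = build_loki_payload("orchestrator", "INFO", "deployment started")
    print(json.dumps(example, indent=2))
```

With the stack running, a call such as `push_log("orchestrator", "INFO", "deployment started")` would send one line under those labels. Keeping the label set small (service and level only) avoids creating a large number of distinct streams in Loki.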