The pipeline collects data from multiple blockchains, processes them, then provides as data-inputs for reputation ranking system and warehouse to build an onchain analytics platform.
Support cloud native deployment (kubernetes) with autoscaler Fork from blockchain-etl
- evm indexer:
./indexer
- export file format:
- csv
- json new line
- parquet
- iceberg (depend on issue)
- export sink:
- s3
- local
- other filesystem follow fsspec (not tested, but highly compatible)
- clickhouse
- starrocks
- streaming sink:
- kafka
- kinesis
- postgres
- clickhouse
- trace specs:
- parity traces
- geth traces (only export)
- export file format:
- scheduler/monitor: airflow
- labeling engine: graphql
- graph processing
- graphframes (deprecated)
- graphscope
- resilient in memory distributed storage
- vineyard v6d
- algorithm:
- contract classification
- in degree
- pagerank
- fraud detection
- bot detection
- docker: build of docker image
- k8s: yaml resources for k8s components
- docs