CacheRoute-v0.1.3
The basic CacheRoute architecture.
Finish the KDN_server.
Finish the interface between the scheduler and KDN servers. During its lifespan, the scheduler initiates the control plane[7002] to listen for registration requests from KDN. The scheduler supports dynamic maintenance of knowledge within the KDN pool.
Finish the interface between the scheduler and the proxy. The scheduler can maintain a dynamic proxy pool in the control plane. Improve scheduler output, strategy, and resource maintenance.
Finish integrating the scheduler with the KDN and proxy selection strategies, supporting simple round-robin scheduling.
Finish the interface between the proxy and the instance. The proxy can maintain a dynamic instance pool in the control plane[8002].
Build the prepare-ready parallel task queue for proxy, prepare queue for knowledge injection. The ready queue forward task to instances.
The knowledge-oriented routing of the scheduler strategy TBD.
The scheduler maintains the status of tasks TBD.
Proxy parallel queue strategy TBD.
Scheduler/proxy resource updater TBD.
The KDN UI for easy use, TBD.
The instance resource collector above vLLM TBD.