@andthattoo andthattoo commented Oct 19, 2025

Migration from initial repo.

Workflow:
Run shards + API via CLI.

  • Run discovery as a subprocess to prevent memory bloat during profiling
  • Add Streaming gRPC
  • Move the embedding and LM head to the start and end shards
  • Add Shard config class
  • Use compression/ folder for compression
  • Bump MLX to 0.29.2
  • Submodule updates
  • Cleaner logs
  • Fixed serialization
  • Repacking for offloaded layers
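The subprocess item above can be illustrated with a minimal sketch. It assumes the goal is simply that memory allocated during a heavy step is returned to the OS when the child exits. `run_isolated` and `CHILD_SNIPPET` are hypothetical names for illustration, not dnet code, and the child body is a stand-in for the real work.

```python
import json
import subprocess
import sys

# Hypothetical stand-in for the heavy step; the actual dnet code is not shown in this PR.
CHILD_SNIPPET = """
import json
buf = bytearray(8 * 1024 * 1024)  # temporary allocation that is freed when the child exits
print(json.dumps({"bytes_touched": len(buf)}))
"""


def run_isolated(snippet: str, timeout: float = 30.0) -> dict:
    """Run `snippet` in a fresh interpreter and parse the JSON it prints on stdout."""
    proc = subprocess.run(
        [sys.executable, "-c", snippet],
        capture_output=True,  # collect stdout/stderr instead of inheriting them
        text=True,
        timeout=timeout,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return json.loads(proc.stdout)


result = run_isolated(CHILD_SNIPPET)
print(result["bytes_touched"])
```

Only the small JSON result crosses the process boundary, so the parent's resident memory is unaffected by whatever the child allocated.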

…_step_for_ring_with_grpc

- added streaming support
- receive-activation migration fixed
- missing compression func added
- asyncio.Queue is now in place
- shard instance name standardized, removed mDNS suffix
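For readers unfamiliar with the queue pattern mentioned in the commit message, here is a minimal sketch of handing streamed frames to a consumer through `asyncio.Queue`. It assumes a single producer and a single consumer with a sentinel marking end of stream. The names `producer`, `consumer`, and `_DONE` and the byte payloads are hypothetical and do not reflect the actual dnet handlers.

```python
import asyncio

_DONE = object()  # sentinel marking end of stream (hypothetical convention)


async def producer(queue: asyncio.Queue, frames: list) -> None:
    """Stand-in for a streaming receive loop: enqueue each frame as it arrives."""
    for frame in frames:
        await queue.put(frame)  # blocks when the queue is full (backpressure)
    await queue.put(_DONE)


async def consumer(queue: asyncio.Queue) -> list:
    """Stand-in for the compute side: drain frames in arrival order."""
    received = []
    while True:
        item = await queue.get()
        if item is _DONE:
            break
        received.append(item)
    return received


async def main() -> list:
    queue: asyncio.Queue = asyncio.Queue(maxsize=2)  # bounded so a slow consumer throttles the producer
    frames = [b"act-0", b"act-1", b"act-2"]
    _, received = await asyncio.gather(producer(queue, frames), consumer(queue))
    return received


received = asyncio.run(main())
print(received)
```

The bounded queue decouples the receive path from processing while preserving order.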
@andthattoo andthattoo merged commit 41a82b3 into master Oct 19, 2025
Yuvrajxms09 pushed a commit to Yuvrajxms09/dnet that referenced this pull request Dec 21, 2025