Pull requests: deepseek-ai/DeepSeek-V3
Add CI/CD workflows for multiple environments and languages
#886
opened Jun 1, 2025 by nodoubtz
Fix apply_chat_template for function calling (Issue #860)
#877
opened May 21, 2025 by ritik4ever
doc: Update deployment instructions using TensorRT-LLM in README.
#876
opened May 21, 2025 by bobboli
gh repo clone deepseek-ai/DeepSeek-V3Create devcontainer.jsonj
#867
opened May 10, 2025 by xxxyalaxx90xxx
Fix: safer and cleaner forward() in distributed embedding layer
#834
opened Apr 5, 2025 by saro1993
Fix: Add metadata to bf16 safetensors for compatibility with transformers
#749
opened Mar 6, 2025 by tflsxyy
Critical Improvements for Model Correctness, Efficiency, and Robustness
#717
opened Feb 25, 2025 by abdurrahman482937
Optimize Multi-head Latent Attention (MLA) with Fast Path for Short Sequences
#684
opened Feb 19, 2025 by XxAlonexX
7 tasks done
Fix incorrect comment in linear function regarding weight.element_size()
#662
opened Feb 14, 2025 by iamvalenciia