-
Notifications
You must be signed in to change notification settings - Fork 14.9k
Insights: deepseek-ai/DeepSeek-V3
Overview
-
- 0 Merged pull requests
- 5 Open pull requests
- 6 Closed issues
- 11 New issues
There hasn’t been any commit activity on deepseek-ai/DeepSeek-V3 in the last week.
Want to help out?
5 Pull requests opened by 5 people
-
Add zh version of README
#747 opened
Mar 5, 2025 -
Fix: Add metadata to bf16 safetensors for compatibility with transformers
#749 opened
Mar 6, 2025 -
NoneType check
#751 opened
Mar 6, 2025 -
Update fp8_cast_bf16.py
#753 opened
Mar 8, 2025 -
deepseek ai chatbot
#762 opened
Mar 11, 2025
6 Issues closed by 3 people
-
Feature Request: Profile Customization and Integration with Google/Microsoft Accounts
#351 closed
Mar 12, 2025 -
could u provide some fp8 sft demo scripts?
#306 closed
Mar 9, 2025 -
可否给几个纯RL训练的数据示例?
#744 closed
Mar 7, 2025 -
调用api测试模型时,参数n只能设置为1
#291 closed
Mar 6, 2025 -
An issue about pretraining deepseek v3
#293 closed
Mar 6, 2025 -
官方API 如何开启联网搜索、上传图片解析、上传附件解析?有没有真人在线客服、技术支持群?
#745 closed
Mar 5, 2025
11 Issues opened by 11 people
-
[Question] How to compare the similarity of two PPT contents based on the V3 model
#763 opened
Mar 11, 2025 -
负载均衡是怎么更新bias偏置量的?
#761 opened
Mar 11, 2025 -
[BUG]为什么v3的function calling很慢?
#760 opened
Mar 11, 2025 -
[BUG]
#759 opened
Mar 10, 2025 -
关于deepseek官网定制AI的黑箱模式
#758 opened
Mar 10, 2025 -
[Features request] Generating "Explain Top Topics" Tokens for Enhanced User Engagement
#757 opened
Mar 10, 2025 -
[BUG]
#755 opened
Mar 9, 2025 -
[BUG]
#752 opened
Mar 6, 2025 -
[Question] Questions about MMA FP8 accumulator precision
#750 opened
Mar 6, 2025 -
[BUG]当对话长度足够久时,AI开始显得不再活灵活现,甚至感觉有些像机器人
#748 opened
Mar 6, 2025 -
sharing my study notes about DeepSeekV3 分享一下我的学习笔记
#746 opened
Mar 5, 2025
30 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
关于DualPipe的问题,看论文查阅资料后仍不清楚,请贵司回答下
#734 commented on
Mar 5, 2025 • 0 new comments -
route_scale是怎么得出的?代码里有这个超参数但是论文中没有提到?
#235 commented on
Mar 6, 2025 • 0 new comments -
[BUG] Accessibility Improvement for Screen Reader Users in DeepSeek v3 Chat Feature**
#233 commented on
Mar 6, 2025 • 0 new comments -
here windows installation
#173 commented on
Mar 6, 2025 • 0 new comments -
v3 repetitive function call ?
#15 commented on
Mar 6, 2025 • 0 new comments -
Issue: Incorrect comment in `linear` function
#579 commented on
Mar 8, 2025 • 0 new comments -
Question about SMs partitions?
#574 commented on
Mar 8, 2025 • 0 new comments -
[BUG] torchrun subprocess received Signal 8 (SIGFPE)
#547 commented on
Mar 8, 2025 • 0 new comments -
Unable to register, policy risk control?
#365 commented on
Mar 8, 2025 • 0 new comments -
[BUG]json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0).
#721 commented on
Mar 8, 2025 • 0 new comments -
[BUG] Login or signup on Arc Browser loading forever
#350 commented on
Mar 8, 2025 • 0 new comments -
generator_model = AutoModelForCausalLM.from_pretrained('deepseek-ai/DeepSeek-R1', trust_remote_code=True) throws error in RAG model/产生错误
#335 commented on
Mar 9, 2025 • 0 new comments -
Fix Critical Bug in Right-to-Left Language Support and Add Persian, Arabic, and Hebrew Languages
#326 commented on
Mar 9, 2025 • 0 new comments -
[BUG] No option to login in via google
#322 commented on
Mar 9, 2025 • 0 new comments -
Cline插件调用卡在Api request
#265 commented on
Mar 9, 2025 • 0 new comments -
[BUG] Accessibility Enhancement: Screen Reader Support for Toggle Buttons in DeepSeek Chat
#246 commented on
Mar 9, 2025 • 0 new comments -
Create an AI that can write a WORKING SIMPLE Python APP.
#743 commented on
Mar 9, 2025 • 0 new comments -
网页版深度思考默认语言需要默认中文
#221 commented on
Mar 10, 2025 • 0 new comments -
Converted bf16 Model on Hugging Face
#4 commented on
Mar 10, 2025 • 0 new comments -
Delete Meaningless issues #2
#707 commented on
Mar 10, 2025 • 0 new comments -
Numerical Stability in Scaling Factor Computation (s = tl.max(tl.abs(x)) / 448.)
#619 commented on
Mar 11, 2025 • 0 new comments -
[Paper BUG] Conflict between Figure 3, formula 21 and formula 22
#592 commented on
Mar 11, 2025 • 0 new comments -
[BUG]safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge
#557 commented on
Mar 11, 2025 • 0 new comments -
关于deepseek中Multi Head Latent Attention 中的一些问题
#658 commented on
Mar 11, 2025 • 0 new comments -
[BUG] (Due to technical issues, the search service is temporarily unavailable.)
#711 commented on
Mar 11, 2025 • 0 new comments -
Language capabilities
#247 commented on
Mar 12, 2025 • 0 new comments -
Update kernel.py
#612 commented on
Mar 11, 2025 • 0 new comments -
Add me as contributor of Amharic and Oromo Language Translator
#701 commented on
Mar 5, 2025 • 0 new comments -
add intro file
#729 commented on
Mar 11, 2025 • 0 new comments -
Docs: add LightLLM as supported engine
#736 commented on
Mar 6, 2025 • 0 new comments