Skip to content

Conversation

Jintao-Huang
Copy link
Collaborator

  1. 不再使用gradient_checkpointing, 加快训练速度. (如何要使用3090训练, 还是需要开启)
  2. eval_steps从50 -> 100
  3. full的实验环境 -> A100
  4. fix chatglm2 gloo不支持bf16的问题
  5. 增加seqgpt, 增加template: text-generation, 增加dataset: cmnli
  6. 等...

@tastelikefeet tastelikefeet merged commit a467046 into modelscope:main Sep 8, 2023
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request Sep 8, 2023
…lace_lora

* commit 'f925f4297268bbc6a14a12157cbb23a06a225cfb':
  Add internlm (modelscope#59)
  Add bloom (modelscope#55)
  Add baichuan2 (modelscope#40)
  add feat: only save model (modelscope#49)
  Add openbuddy llama2 (modelscope#47)
  fix ddp bug (modelscope#45)
  fix template bug2 (modelscope#44)
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request Sep 8, 2023
* feat/replace_lora:
  Add internlm (modelscope#59)
  Add bloom (modelscope#55)
  Add baichuan2 (modelscope#40)
  add feat: only save model (modelscope#49)
  Add openbuddy llama2 (modelscope#47)
  fix ddp bug (modelscope#45)
  fix template bug2 (modelscope#44)

# Conflicts:
#	examples/pytorch/llm/src/llm_sft.py
#	examples/pytorch/llm/src/utils/preprocess.py
hjh0119 pushed a commit to hjh0119/swift that referenced this pull request Jul 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants