Skip to content

Releases: InternLM/InternLM

InternLM-v0.2.1dev20240102

02 Jan 05:05
5539f9d
Compare
Choose a tag to compare

What's Changed

  • fix(timeout): larger timeout by @JiaoPL in #495
  • feat(doc): add GPU memory info for 7B & 20B models by @li126com in #507
  • feat(model): add rope_base interface by @00INDEX in #512
  • Feat(QA): Check loss when swapping micro_num and micro_bsz && Check grad norm by @li126com in #510
  • Fix(QA): the py name in main is wrong by @li126com in #514
  • fix/feat: small fix and enhancement by @SolenoidWGT in #515
  • test(workflow): add workflow for loss test and change trigger event by @kkscilife in #513
  • fix(ci): fix test model ckpt ci test by @SolenoidWGT in #518
  • test(workflow): add unit test case by @kkscilife in #524
  • feat(storage): use multipart upload when using oss by @li126com in #520
  • Fix (QA checkpoint): fix test_model_checkpoint singleton import by @li126com in #526
  • fix(model): add IS_SEQUENCE_PARALLEL check for norm module by @yingtongxiong in #528
  • feat(model): add output embedding tf32 option by @JiaoPL in #523
  • feat(grad_norm): vocab grad norm profiling by @JiaoPL in #519
  • fix(data): fix the unpack for type_ids when use_flash_attn=False by @yingtongxiong in #516
  • fix(storage): unify the name of AK and SK by @li126com in #527
  • fix(test): fix type_ids unpack bug by @SolenoidWGT in #530
  • feat(model): support llama model with checkpoint loading by @li126com in #532
  • fix(metric): add metric dtype control by @Pryest in #533
  • feat(ckpt): support auto resume in Volc and Ali by @li126com in #529
  • fix(sequence_parallel): fix norm all-reduce in seq_parallel when not overlaping by @yingtongxiong in #534
  • fix(pp): fix no-packed dataset load micro batch error by @SolenoidWGT in #538
  • fix(model): change model_type LLAMA to LLAMA2 by @li126com in #539
  • fix(moe): fix moe zero mode bug by @blankde in #548
  • fix(grad_norm): token grad norm with tp by @JiaoPL in #547
  • test(workflow): change into reserved by @kkscilife in #550
  • fix(model): add ckpt_type constraint when loading ckpts by @li126com in #542
  • feat(logger): add tensorboard key value buffer by @SolenoidWGT in #549
  • fix(metrics): remove redundant cuda memory in metric calculations by @SolenoidWGT in #557
  • fix(lr_scheduler): fix when resuming lr_scheduler without loading optimizer by @gaoyang07 in #565

Full Changelog: v0.2.1dev20231121...v0.2.1dev20240102

InternLM-v0.2.1dev20230922

21 Nov 09:00
eba2b85
Compare
Choose a tag to compare
Pre-release

TBD

InternLM-v0.2.1dev20230915

15 Sep 11:31
2710fa7
Compare
Choose a tag to compare

Highlights

  • fix the bug that may have grad overflow when total_steps is small
  • fix the rotary_emb.inv_freq KeyError in tool convert2hf.py
  • add unit test for model

What's Changed

🚀 Features

  • feat(core/trainer.py): add more tgs metrics by @li126com in #310

🐞 Bug fixes

  • fix(convert2hf.py): fix the rotary_emb.inv_freq KeyError by @jiangtann in #299
  • fix(configs/7B_sft.py): model dtype float16 to bfloat16 by @huangting4201 in #302
  • fix(chat): fix stream_chat to return generator by @zhjunqin in #123

📚 Documentations

✅ Tests

  • tests(tests/test_model): add unit test for model by @li126com in #300
  • tests(tests/test_solver): add unit test for optimizer by @li126com in #303

🌐 Other

Known issues

Full Changelog: v0.2.1dev20230909...v0.2.1dev20230915

InternLM-v0.2.1dev20230909

08 Sep 12:45
Compare
Choose a tag to compare

What's Changed

  • fix(ckpt): fix snapshot none load error and remove file lock by @SolenoidWGT in #298

Full Changelog: v0.2.1dev20230908...v0.2.1dev20230909

InternLM-v0.2.1dev20230908

08 Sep 10:12
Compare
Choose a tag to compare

Highlights

  • fix the bug that may have NaN value when overlap gradients' allreduce with backward
  • support timeout wrapper and runtime diagnosis
  • support readthedocs Chinese version

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

  • fix(doc/code-docs): autodoc shown error by @huangting4201 in #265
  • fix(eval): no need to check length of valid_dl when using streaming dataset by @00INDEX in #274
  • fix/broadcast should not in commu stream by @sunpengsdu in #276
  • fix(model): set tensor parallel attribute for mlp by @yingtongxiong in #271
  • feat(ckpt): checkpoint bug fixes and feature enhancements. by @SolenoidWGT in #259
  • fix(ckpt): fix checkpoint reload bug by @SolenoidWGT in #282
  • fix(core/context): use dummy mode to generate random numbers in model construction by @blankde in #266
  • fix(monitor): add alert switch and refactor monitor config by @JiaoPL in #285
  • fix: fix the bug to do bcast in a stream by @sunpengsdu in #294

📚 Documentations

🌐 Other

Known issues

New Contributors

Full Changelog: v0.2.1dev20230901...v0.2.1dev20230908

InternLM-v0.2.1dev20230901

01 Sep 09:27
Compare
Choose a tag to compare

Highlights

  • Support centos and ubuntu dockerfile
  • Support runtime gpu flops and nccl allreduce speed test

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

  • Fix StreamingDataset does not have an len method by @00INDEX in #251
  • Fix argument missing in getting loss metrics by @MagicDevilZhang in #256
  • Fix the error that RotaryEmbedding is converted to a non-fp32 format during operation by @YWMditto in #239

📚 Documentations

🌐 Other

Known issues

InternLM-v0.2.0

24 Aug 14:07
54f85a6
Compare
Choose a tag to compare

Features:

  1. Support pipeline parallel, including interleaved and non-interleaved pipeline scheduler.
  2. Support sequence parallel.
  3. Support model evaluation.
  4. Support tf32 with flash-attention.
  5. Support tensorboard writer for recording training performance metrics.
  6. Support customed uniscale logger.
  7. Support calculating model's accuracy and perplexity metrics.
  8. Support oss storage and checkpoint asynchronous uploading.
  9. Support automatically loading the latest checkpoint.
  10. Support checkpoint snapshot.
  11. Support monitoring the status of training jobs, and alarm abnormal status.
  12. Support torch profiler.
  13. Support simple memory profiler.

Optimizations:

  1. Overlapping optimizer parameters broadcast with model forward.
  2. Overlapping optimizer last bucket gradients allreduce with compute norm.

InternLM-v0.1.0

24 Aug 05:45
e1cefae
Compare
Choose a tag to compare
fix huggingface link (#219)