Merge branch 'main' into 'docs' #463

LZHgrla · 2024-03-11T11:57:14Z

No description provided.

* update
* update cfgs
* update
* fix bugs
* upload docs
* rename
* update
* Revert "update cfgs"
  This reverts commit 93966aa.
* update cfgs
* update
* rename
* rename
* fix bc
* fix stop_word
* fix
* fix
* Update prompt_template.md

* support ddp mmbench evaluate
* Update xtuner/tools/mmbench.py
  Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
* Update xtuner/tools/mmbench.py
  Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
* update minimum version of mmengine
* Update runtime.txt
---------
Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>

* add local_attn_args_to_messagehub_hook
* add internlm repo sampler
* add internlm repo dataset and collate_fn
* dispatch internlm1 and internlm2 local attn
* add internlm2 config
* add internlm1 and internlm2 config
* add internlm2 template
* fix replace_internlm1_rote bugs
* add internlm1 and internlm2 config templates
* change priority of EvaluateChatHook
* fix docs
* fix config
* fix bug
* set rotary_base according to the latest internlm2 config
* add llama local attn
* add llama local attn
* update intern_repo_dataset docs when using aliyun
* support using both hf load_dataset and intern_repo packed_dataset
* add configs
* add opencompass doc
* update opencompass doc
* use T data order
* use T data order
* add config
* add a tool to get data order
* support offline processing untokenized dataset
* add docs
* add doc about only saving model weights
* add doc about only saving model weights
* dispatch mistral
* add mistral template
* add mistral template
* fix torch_dtype
* reset pre-commit-config
* fix config
* fix internlm_7b_full_intern_repo_dataset_template
* update local_attn to varlen_attn
* rename local_attn
* fix InternlmRepoSampler and train.py to support resume
* modify Packer to support varlen attn
* support varlen attn in default pipeline
* update mmengine version requirement to 0.10.3
* Update ceph.md
* delete intern_repo_collate_fn
* delete intern_repo_collate_fn
* delete useless files
* assert pack_to_max_length=True if use_varlen_attn=True
* add varlen attn doc
* add varlen attn to configs
* delete useless codes
* update
* update
* update configs
* fix priority of ThroughputHook and flake8 ignore W504
* using map_fn to set length attr to dataset
* support split=None in process_hf_dataset
* add dataset_format_mapping
* support preprocess ftdp and normal dataset
* refactor process_hf_dataset
* support pack dataset in process_untokenized_datasets
* add xtuner_dataset_timeout
* using gloo backend for monitored barrier
* set gloo timeout
* fix bugs
* fix configs
* refactor intern repo dataset docs
* fix doc
* fix lint
---------
Co-authored-by: pppppM <67539920+pppppM@users.noreply.github.com>
Co-authored-by: pppppM <gjf_mail@126.com>

…ternLM#404)

* [Fix] Fix no space in chat output using InternLM2. (InternLM#357)
* Update chat.py
* Update utils.py
* Update utils.py
* fix pre-commit
---------
Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
Co-authored-by: LZHgrla <linzhihao@pjlab.org.cn>

* added gemma config and template
* check config and make sure the consistency
* Update xtuner/configs/gemma/gemma_2b_base/gemma_2b_base_qlora_alpaca_e3.py
  Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
* Update xtuner/configs/gemma/gemma_2b_base/gemma_2b_base_full_alpaca_e3.py
  Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
* Update xtuner/configs/gemma/gemma_7b_base/gemma_7b_base_full_alpaca_e3.py
  Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
* Update xtuner/configs/gemma/gemma_7b_base/gemma_7b_base_qlora_alpaca_e3.py
  Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
* Update xtuner/utils/templates.py
  Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
* update
* added required version
* update
* update
---------
Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>
Co-authored-by: LZHgrla <linzhihao@pjlab.org.cn>

* add base dataset
* update dataset generation
* update refcoco
* add convert refcoco
* add eval_refcoco
* add config
* update dataset
* fix bug
* fix bug
* update data prepare
* fix error
* refactor eval_refcoco
* fix bug
* fix error
* update readme
* add entry_point
* update config
* update config
* update entry point
* update
* update doc
* update
---------
Co-authored-by: jacky <jacky@xx.com>

…nternLM#410)

* support smart_tokenizer_and_embedding_resize
* replace ast with json.loads
* support list_dataset_format cli
* add doc about ftdp and custom dataset
* add custom dataset template
* add args name to process_hf_dataset
* use new process_untokenized_datasets
* support tokenize_ftdp_datasets
* add mistral_7b_w_tokenized_dataset config
* update doc
* update doc
* add comments
* fix data save path
* smart_tokenizer_and_embedding_resize support zero3
* fix lint
* add data format to internlm2_7b_full_finetune_custom_dataset_e1.py
* add a data format example to configs associated with finetuning custom dataset
* add a data format example to configs associated with finetuning custom dataset
* fix lint

LZHgrla and others added 30 commits January 11, 2024 23:29

[Fix] Fix errors about stop_words (InternLM#313)

28c0556

* fix bugs
* Update mmbench.py

[Fix] Fix Mixtral LoRA setting (InternLM#312)

8ab2762

set target_modules

[Feature] Support DeepSeek-MoE (InternLM#311)

ceeb9be

* support deepseek moe
* update docs
* update
* update

[Fix] Set torch.optim.AdamW as the default optimizer (InternLM#318)

fa73895

fix

[Fix] Fix pth_to_hf for LLaVA model (InternLM#316)

ff69f4c

Update pth_to_hf.py

[Improve] Add demo_data examples (InternLM#278)

31e13d4

* update examples
* add examples
* add json template config
* rename
* update
* update
* update

[Feature] Support InternLM2 (InternLM#321)

4aea1e4

* add cfgs
* add internlm2 template
* add dispatch
* add docs
* update readme
* update

[Fix] Fix the resume of seed (InternLM#309)

80a71bc

* fix
* Update utils.py

[Feature] Accelerate xtuner xxx (InternLM#307)

6377c79

* accelerate cli
* Update entry_point.py
* Update entry_point.py
---------
Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>

[Fix] Fix InternLM2 url (InternLM#325)

9ddf308

* fix
* update
* Update README.md
* Update README_zh-CN.md

[Fix] Limit the version of python, >=3.8, <3.11 (InternLM#327)

7a7d000

update

[Fix] Add trust_remote_code=True for AutoModel (InternLM#328)

89fb330

update

[Docs] Improve README (InternLM#326)

1a3b492

* update
* Update README.md
* Update README.md
* Update README.md
* Update README_zh-CN.md
* update
* update
* fix pre-commit
* update

bump version to v0.1.12 (InternLM#323)

6c4c73b

bump v0.1.12

set dev version (InternLM#329)

2d90521

Update version.py

[Docs] Add LLaVA-InternLM2 results (InternLM#332)

f9dd540

* update results
* update

Update internlm2_chat template (InternLM#339)

582404c

Update internlm2 template

[Fix] Fix examples demo_data configs (InternLM#334)

d2f7a59

fix

bump version to v0.1.13 (InternLM#340)

0633939

update

set dev version (InternLM#341)

60fabeb

update

[Feature] More flexible TrainLoop (InternLM#348)

b0f36f3

* add new loop
* rename
* fix pre-commit
* add max_keep_ckpts
* fix
* update cfgs
* update examples
* fix
* update
* update llava
* update
* update
* update
* update

[Feature]Support CEPH (InternLM#266)

076375d

* support petrelfs
* fix deepspeed save/load/resume
* add ENV to toggle petrelfs
* support hf save_pretrained
* patch deepspeed engine

[Improve] Add --repetition-penalty for xtuner chat (InternLM#351)

f225761

fix

[Fix] KeyError of encode_fn (InternLM#361)

9f84746

fix

[Fix] Fix batch_size of full fine-tuning LLaVA-InternLM2 (InternLM#360)

5951cbc

fix

[Fix] Remove system for alpaca_map_fn (InternLM#363)

e53b55e

update

[Fix] Use DEFAULT_IMAGE_TOKEN instead of '<image>' (InternLM#353)

7f813e3

Update utils.py

HIT-cwh and others added 27 commits January 30, 2024 15:29

[Fix] Fix rotary_base, add colors_map_fn to `DATASET_FORMAT_MAPPING` and rename 'internlm_repo' to 'intern_repo' (InternLM#372)

a37b344

* fix
* rename internlm_repo to intern_repo
* add InternlmRepoSampler for preventing bc break
* add how to install flash_attn to doc

update (InternLM#377)

8ed299b

Delete useless codes and refactor process_untokenized_datasets (InternLM#379)

fde32e8

* delete useless codes
* refactor process_untokenized_datasets: add ftdp to dataset-format
* fix lint

[Feature] support flash attn 2 in internlm1, internlm2 and llama (InternLM#381)

d489f09

support flash attn 2 in internlm1, internlm2 and llama

[Fix] Fix installation docs of mmengine in intern_repo_dataset.md (InternLM#384)

0197953

update

[Fix] Update InternLM2 apply_rotary_pos_emb (InternLM#383)

58537c3

update

[Feature] support saving eval output before save checkpoint (InternLM#385)

47c08d8

* support saving eval output before save checkpoint
* refactor

[Fix] lr scheduler setting (InternLM#394)

fd8522f

* fix lr scheduler setting
* fix more
---------
Co-authored-by: zilong.guo <zilong.guo@zeron.ai>
Co-authored-by: LZHgrla <linzhihao@pjlab.org.cn>

[Fix] Remove pre-defined system of alpaca_zh_map_fn (InternLM#395)

f63859b

fix

[Feature] Support Qwen1.5 (InternLM#407)

9aecaf3

* rename
* update docs
* update template
* update
* add cfgs
* update
* update

[Fix] typo: --system-prompt to --system-template (InternLM#406)

1db633b

fix

[Improve] Add output_with_loss for dataset process (InternLM#408)

544b534

update

[Fix] Fix dispatch to support transformers>=4.36 & Add USE_TRITON_KERNEL environment variable (InternLM#411)

fad4cee

* dispatch support transformers>=4.36
* add USE_TRITON_KERNEL environment variable
* raise RuntimeError use triton kernels on cpu
* fix lint

[Feature]Add InternLM2-1_8b configs (InternLM#396)

4cbaf54

* [Feature]Add InternLM2-Chat-1_8b full config
* [Feature]Add InternLM2-Chat-1_8b full config
* update
---------
Co-authored-by: LZHgrla <linzhihao@pjlab.org.cn>
Co-authored-by: Zhihao Lin <36994684+LZHgrla@users.noreply.github.com>

[Fix] Fix extract_json_objects (InternLM#419)

a515bb8

[Fix] Fix pth_to_hf error (InternLM#426)

f8e9dc1

fix

[Fix] Inconsistent BatchSize of LengthGroupedSampler (InternLM#436)

be75353

update

bump version to v0.1.14 (InternLM#431)

e6fcce1

update

set dev version (InternLM#437)

1136707

* Update version.py
* Update version.py

[Bugs] Fix bugs when using EpochBasedRunner (InternLM#439)

bba639e

fix bugs when using epochbasedrunner

Update prompt_template.md (InternLM#441)

56dbdd7

Fixed a typo

[Doc] Split finetune_custom_dataset.md to 6 parts (InternLM#445)

770bac3

* split finetune_custom_dataset.md to 6 parts
* refactor custom_dataset and ftdp_dataset related docs
* fix comments

Merge branch 'main' into docs

0c3a08f

LZHgrla changed the base branch from main to docs March 11, 2024 11:57

fix pre-commit

f8efd94

LZHgrla merged commit c360cb8 into InternLM:docs Mar 11, 2024
1 check passed
