Releases: modelscope/modelscope

v1.14.0

27 Apr 12:59

New models

No. Model-id and links
1 Qwen1.5-110B-Chat
2 CodeQwen1.5-7B-Chat
3 WizardLM-2-8x22B
4 c4ai-command-r-v01
5 Qwen1.5-32B-Chat
6 dbrx-instruct

Highlight features

  1. Dataset refactoring for compatibility with the HF datasets structure; requires datasets<2.19.0 (Breaking Changes)
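
The `datasets<2.19.0` constraint above can be verified before loading any dataset. The following is a minimal sketch and not part of modelscope: the helper names `version_tuple`, `below_bound`, and `check_datasets_version` are hypothetical, and the comparison is a simplified numeric one that ignores pre-release suffixes.

```python
from importlib import metadata


def version_tuple(version: str) -> tuple:
    """Parse leading numeric components, e.g. '2.18.0' -> (2, 18, 0).

    Simplified on purpose: suffixes such as 'rc1' or '.dev0' are ignored.
    """
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    # Pad so that '2.19' and '2.19.0' compare as equal.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def below_bound(installed: str, bound: str = "2.19.0") -> bool:
    """Return True when `installed` is strictly lower than `bound`."""
    return version_tuple(installed) < version_tuple(bound)


def check_datasets_version(bound: str = "2.19.0") -> str:
    """Report whether the installed `datasets` package satisfies datasets<bound."""
    try:
        installed = metadata.version("datasets")
    except metadata.PackageNotFoundError:
        return "datasets is not installed"
    if below_bound(installed, bound):
        return f"datasets {installed} satisfies <{bound}"
    return f"datasets {installed} does not satisfy <{bound}"


if __name__ == "__main__":
    print(check_datasets_version())
```

If the check fails, pinning the dependency with `pip install "datasets<2.19.0"` is one way to meet the constraint.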

What's Changed

New Contributors

Full Changelog: v1.13.2...v1.14.0

v1.13.2

22 Mar 09:57

Highlight features

  1. Dataset refactoring for compatibility with the HF datasets structure. (Breaking Changes)
  2. Unified dataset storage and management with Git. (Breaking Changes)

What's Changed

New Contributors

Full Changelog: v1.13.1...v1.13.2

v1.13.1

18 Mar 12:02

New models

No. Model-id and links
1 GeoMVSNet: geometry-aware multi-view depth estimation
2 Res2Net speaker verification-Chinese-3D-Speaker-16k
3 ResNet34 speaker verification-Chinese-3D-Speaker-16k
4 Self-supervised depth completion

Highlight features

  1. Support importing AWQConfig from modelscope
  2. Support stream_generate for LLMPipeline

What's Changed

New Contributors

Full Changelog: v1.12.0...v1.13.1

v1.12.0 release

06 Feb 02:35

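Chinese Version

New Models Recommended

No Model Name & Link
1 Support for the qwen1.5 model series
2 RIFE video frame interpolation
3 VFI-RAFT video frame interpolation
4 Lightweight fast image feature point matching

Highlight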

  • add rife-video-frame-interpolation and model (#685)
  • image normal estimation (#683)
  • add image matching fast model based on lightglue (#694)
  • Feature/LoFTR_image_local_feature_matching (#687)
  • support qwen1.5 models
  • upgrade funasr1.0

BugFix

  • fix anydoor pre-commit flake8 and isort errors (#707)
  • fix some CI case issues

v1.11.0 release

22 Jan 08:10

New Models Recommended

No Model Name & Link
0 Emu2-Gen
1 qanything_models
2 Emu2-Chat
3 Emu2
4 TinyLlama-1.1B-Chat-v1.0
5 notux-8x7b-v1
6 Machine_Mindset_en_ENFJ
7 Machine_Mindset_en_ENFP
8 Machine_Mindset_en_ENTJ
9 Machine_Mindset_en_ENTP
10 Machine_Mindset_en_ESFJ
11 Machine_Mindset_en_ESFP
12 Machine_Mindset_en_ESTJ
13 Machine_Mindset_en_ESTP
14 Machine_Mindset_en_INFJ
15 Machine_Mindset_en_INFP
16 Machine_Mindset_en_INTJ
17 Machine_Mindset_en_INTP
18 Machine_Mindset_en_ISFJ
19 Machine_Mindset_en_ISFP
20 Machine_Mindset_en_ISTJ
21 Machine_Mindset_en_ISTP
22 Machine_Mindset_zh_ENFJ
23 Machine_Mindset_zh_ENFP
24 Machine_Mindset_zh_ENTJ
25 Machine_Mindset_zh_ENTP
26 Machine_Mindset_zh_ESFJ
27 Machine_Mindset_zh_ESFP
28 Machine_Mindset_zh_ESTJ
29 Machine_Mindset_zh_ESTP
30 Machine_Mindset_zh_INFJ
31 Machine_Mindset_zh_INFP
32 Machine_Mindset_zh_INTJ
33 Machine_Mindset_zh_ISFJ
34 Machine_Mindset_zh_ISFP
35 Machine_Mindset_zh_ISTJ
36 Machine_Mindset_zh_ISTP
37 WavMark
38 speech_eres2net_large_200k_sv_zh-cn_16k-common
39 emotion2vec_base
40 QAnything
41 speech_fsmn_vad_zh-cn-8k-common-onnx
42 speech_paraformer_asr_nat-zh-cn-8k-common-vocab8358-tensorflow1-onnx
43 dolphin-2.6-mistral-7b
44 scepter_scedit
45 deepseek-moe-16b-base
46 deepseek-moe-16b-chat
47 phi-2
48 llava-internlm-7b
49 llava-v1.5-7b-xtuner
50 llava-v1.5-7b-xtuner-pretrain
51 llava-internlm-7b-pretrain
52 speech_ngram_lm_zh-cn-ai-wesp-fst-token8358
53 realisticVisionV51_v51VAE
54 THUDM_chatglm-6b
55 AnyDoor
56 cv_gaussian-splatting-recon_damo
57 AnimateDiff_ms
58 cv_omnidata_image-normal-estimation_normal
59 cv_rife_video-frame-interpolation
60 Qwen-7B-Chat-GGUF
61 stable-zero123
62 cv_adabins_image-depth-prediction_indoor
63 Qwen-14B-Chat-GGUF
64 wav2vec2-large-xlsr-53-english
65 speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch
66 cv_resnet-transformer_local-feature-matching_outdoor-data
67 naturalspeech2_libritts
68 text_to_audio
69 valle_libritts
70 vits_ljspeech
71 hifigan_speech_bigdata
72 BigVGAN_singing_bigdata
73 singing_voice_conversion
74 Machine_Mindset_zh_INTP
75 bel_canto
76 music_genre
77 chest_falsetto
78 llava-v1.5-13b-xtuner
79 llava-v1.5-13b-xtuner-pretrain
80 Mistral-7B-Instruct-v0.2-GGUF
81 cv_transformer_image-matching_fast
82 cv_efficientsam-s_image-instance-segmentation_sa1b
83 Ziya-Visual-Lyrics-14B
84 dpo-sdxl-text2image-v1
85 mistral-ft-optimized-1218
86 IP-Adapter-FaceID
87 [mask_refine](https://modelscope.cn/models/dam...

v1.10.0 release

08 Dec 15:19

Chinese Version

New Models Recommended

No Model Name & Link
0 Yi-34B-Chat-4bits
1 Yi-34B-Chat-8bits
2 Yi-6B-Chat
3 Yi-6B-Chat-4bits
4 Yi-6B-Chat-8bits
5 Yi-34B-Chat
6 Video-LLaVA-V1.5
7 Video-LLaVA-7B
8 LanguageBind_Video
9 LanguageBind_Video_FT
10 LanguageBind_Image
11 LanguageBind_Video_merge
12 LanguageBind_Audio
13 LanguageBind_Depth
14 LanguageBind_Thermal
15 Video-LLaVA-Pretrain-7B
16 Aquila2-70B-Expr
17 AquilaChat2-70B-Expr
18 AquilaChat2-34B-Int4-GPTQ
19 AquilaChat2-34B-16K
20 speech_rwkv_transducer_asr-en-16k-gigaspeech-vocab5001-pytorch-online
21 funasr-runtime-win-cpu-x64
22 ModelScope-Agent-14B
23 speech_sambert-hifigan_nsf_tts_donna_en-us_24k
24 speech_sambert-hifigan_nsf_tts_david_en-us_24k
25 MiniGPT-v2
26 speech_sambert-hifigan_tts_waan_Thai_16k
27 cv_background_generation_sd
28 speech_eres2net_base_250k_sv_zh-cn_16k-common
29 Pose-driven-image-generation-HumanSD
30 cv_stable-diffusion-v2_image-feature
31 nlp_minilm_ibkd_sentence-embedding_english-sts
32 nlp_minilm_ibkd_sentence-embedding_english-msmarco
33 speech_eres2net_large_mej_lre_16k_common
34 speech_eres2net_base_mej_lre_16k_common
35 Ziya2-13B-Base
36 SUS-Chat-34B
37 animatediff-motion-adapter-v1-5
38 animatediff-motion-adapter-v1-4
39 animatediff-motion-adapter-v1-5-2
40 animatediff-motion-lora-zoom-in
41 animatediff-motion-lora-pan-left
42 animatediff-motion-lora-tilt-up
43 animatediff-motion-lora-rolling-clockwise
44 animatediff-motion-lora-zoom-out
45 animatediff-motion-lora-pan-right
46 animatediff-motion-lora-tilt-down
47 animatediff-motion-lora-rolling-anticlockwise
48 Qwen-1_8B-Chat-Int4
49 Qwen-1_8B-Chat-Int8
50 Qwen-72B-Chat-Int4
51 Qwen-72B-Chat-Int8
52 Qwen-Audio-Chat
53 Qwen-72B-Chat
54 Qwen-72B
55 Qwen-1_8B
56 Qwen-1_8B-Chat
57 Qwen-Audio
58 cogvlm-chat
59 cogvlm-base-224
60 cogvlm-base-490
61 cogvlm-grounding-base
62 cogvlm-grounding-generalist
63 deepseek-llm-7b-base
64 deepseek-llm-67b-base
65 deepseek-llm-7b-chat
66 deepseek-llm-67b-chat
67 xlm-MLM-en-2048
68 OrionStar-Yi-34B-Chat
69 tigerbot-180b-base-v2
70 tigerbot-13b-chat-v5
71 tigerbot-13b-chat-v5-4bit-exl2
72 tigerbot-70b-chat-v4-4bit-exl2
73 tigerbot-180b-chat-v2
74 tigerbot-13b-chat-v5-4k
75 tigerbot-13b-base-v3
76 tigerbot-70b-chat-v4-4k
77 tigerbot-70b-base-v2
78 tigerbot-70b-chat-v4
79 BlueLM-7B-Base
80 BlueLM-7B-Chat-32K
81 BlueLM-7B-Chat-4bits
82 Sunsimiao-Qwen-7B
83 MindChat-Qwen-7B-v2-self_lora
84 jina-embeddings-v2-base-en
85 jina-embeddings-v2-small-en
86 qwen-chat-7B-ggml
87 qwen-chat-14B-ggml
88 bge-reranker-large
89 bge-reranker-base

Highlight

  • Support launching a local inference service for testing
  • Support vllm inference
  • LLMPipeline supports vllm
  • Official images upgraded to python3.10, pytorch 2.1.0, tensorflow 1.14.0, ubuntu22.04
  • upgrade to python3.10

Feature

  • Support VLLM in LLMPipeline (#604)
  • add bpemodel path in asr_trainer
  • add llm riddles (#621)
  • feat: deploy checker for swingdeploy

Improvements

  • python311 support for whl
  • llm pipeline support chatglm3 (#618)
  • Support transformers==4.35.0 (#633)

BugFix

  • Fix _set_gradient_checkpointing bug (#660)
  • fix test reliability issue (#657)
  • fix: Doc...

v1.9.4

27 Oct 13:08

Chinese Version

Feature

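  • Added sentence embedding models, supporting gte and bloom
  • Added the freeU method for stable diffusion
  • LLMPipeline supports Swift adapter model inference
  • Automatically upgrade to the latest version of funasr transformer when building images
  • Removed the forced venv dependency to better support Windows #575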

bugfix

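  • Fixed shop_segmentation pipeline compatibility with timm 0.5.2
  • Fixed huggingface position_ids compatibility issue
  • Fixed the missing sp_tokenizer attribute in chatglm
  • Fixed ofa model compatibility with newer transformers versions
  • Fixed the work_dir setting in trainer not taking effect #573
  • Fixed hf-related bugs #569 #567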

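New Models Recommended

No Model Name & Link
1 GTE text embedding-Chinese-general domain-large
2 GTE text embedding-English-general domain-large
3 GTE text embedding-English-general domain-small
4 GTE text embedding-English-general domain-base
5 GTE text embedding-Chinese-general domain-small
6 X-vector speaker change point localization-two speakers-Chinese
7 Udever multilingual general text representation model 3b
8 Udever multilingual general text representation model 1b1
9 GTE text embedding-Chinese-general domain-base
10 Diffusion-based multi-view human generation model
11 Udever multilingual general text representation model 560m
12 Udever multilingual general text representation model 7b1
13 Qwen-14B-Chat-Int8
14 Qwen-7B-Chat-Int8
15 CT-Transformer punctuation-Chinese and English-general-large-onnx
16 CodeFuse-QWen-14B
17 ECAPA-TDNN speaker verification-Chinese-CNCeleb-16k
18 ECAPA-TDNN speaker verification-Chinese-3D-Speaker-16k
19 Chinese font style transfer model
20 Whisper speech recognition-English-small
21 Whisper speech recognition-multilingual-large
22 Chinese font generation base model
23 FreeU text-to-image generation model
24 Paraformer speech recognition-English-general-16k-offline-long audio version
25 Paraformer speaker-attributed speech recognition-Chinese-general
26 Paraformer speech recognition-English-general-16k-offline-large-onnx
27 PASDv2 image super-resolution
28 Transducer speech recognition-English-gigaspeech-16k-real-time
29 PMR-base
30 PMR-large
31 EQA-PMR-large
32 CodeFuse-StarCoder-15B
33 NER fine-tuning based machine reading comprehension model
34 CodeFuse-CodeLlama-34B-4bits
35 Zero-shot text classification-SSTuning-base-multilingual
36 Qwen-14B-Chat-Int4
37 Zero-shot text classification-SSTuning-base-English
38 Face detection and facial landmark localization
39 Multilingual Conformer Listener
40 SambertHifigan speech synthesis-multilingual-multi-speaker pretrained-16k
41 Audio quantization codec-freqcodec_magphase-English-libritts-16k-gr8nq32ds320-pytorch
42 Audio quantization codec-freqcodec_magphase-English-libritts-16k-gr1nq32ds320-pytorch
43 Audio quantization codec-Encodec-Chinese and English-general-16k-nq32ds640-pytorch
44 Audio quantization codec-Encodec-Chinese and English-general-16k-nq32ds320-pytorch
45 Audio quantization codec-Encodec-English-libritts-16k-nq32ds320-pytorch
46 ERes2Net-Large speaker diarization-conversational role separation-general
47 3DHuman-Syn 3D character animation
48 Audio quantization codec-Encodec-English-libritts-16k-nq32ds640-pytorch
49 Text-to-3D head model generation
50 Text-guided model texture generation-3D vision
51 3DHuman-Syn generative 3D human model library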

English Version

Feature

  • Added sentence vector model, supporting gte and bloom.
  • Stable diffusion introduces a new freeU method.
  • LLMPipeline now supports Swift adapter model inference.
  • Automatically upgrade to the latest version of funasr transformer during image creation.
  • Forced venv dependency removed to better support Windows system. #575

bugfix

  • Fixed shop_segmentation pipeline compatibility with timm 0.5.2.
  • Resolved compatibility issues with huggingface position_ids.
  • Fixed the missing sp_tokenizer attribute in chatglm.
  • Addressed compatibility issues of ofa model with newer transformers version.
  • Resolved the issue where the work_dir setting in trainer was not taking effect. #573
  • Fixed hf-related bugs. #569 #567.

New Models Recommended

No Model Name & Link
1 nlp_gte_sentence-embedding_chinese-large
2 nlp_gte_sentence-embedding_english-large
3 nlp_gte_sentence-embedding_english-small
4 nlp_gte_sentence-embedding_english-base
5 nlp_gte_sentence-embedding_chinese-small
6 speech_xvector_transformer_scl_zh-cn_16k-common
7 udever-bloom-3b
8 udever-bloom-1b1
9 nlp_gte_sentence-embedding_chinese-base
10 multimodal_multiview_avatar_gen
11 udever-bloom-560m
12 udever-bloom-7b1
13 Qwen-14B-Chat-Int8
14 Qwen-7B-Chat-Int8
15 punc_ct-transformer_cn-en-common-vocab471067-large-onnx
16 CodeFuse-QWen-14B
17 speech_ecapa-tdnn_sv_zh-cn_cnceleb_16k
18 speech_ecapa-tdnn_sv_zh-cn_3dspeaker_16k
19 font_style_transfer_model
20 [speech_whisper-small_asr_english](https://mode...

v1.9.3 release

19 Oct 11:16

Chinese Version

Highlight

  • optimize ci
  • compatible with transformers 4.34.0
  • Support int4 model for llm_pipeline

BugFix

  • fix merge error (#582)
  • move venv import from file level to class level to avoid import error… (#575)

English Version

Highlight

  • optimize ci
  • compatible with transformers 4.34.0
  • Support int4 model for llm_pipeline

BugFix

  • fix merge error (#582)
  • move venv import from file level to class level to avoid import error… (#575)
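
The venv fix above describes a general pattern: deferring an import from module load time to the point of use, so that importing the file does not fail when the dependency is missing. A minimal sketch of that pattern follows. The class `LazyModuleUser` and its methods are hypothetical and are not modelscope code.

```python
import importlib


class LazyModuleUser:
    """Hypothetical helper that needs a module only when it is actually used."""

    def __init__(self, module_name: str = "venv"):
        # Only the name is stored here; nothing is imported yet, so creating
        # the object (or importing this file) cannot raise ImportError.
        self.module_name = module_name

    def load(self):
        # Deferred import: a missing module raises here, at call time.
        return importlib.import_module(self.module_name)

    def is_available(self) -> bool:
        """Return True if the module can be imported in this environment."""
        try:
            self.load()
        except ImportError:
            return False
        return True


if __name__ == "__main__":
    print(LazyModuleUser("venv").is_available())
```

The trade-off is that a missing dependency surfaces later, at first use, so callers that need an early failure can call `is_available()` explicitly.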

v1.9.2 release

07 Oct 03:18

Chinese Version

New Models Recommended

Highlight

  • Add image_control_3d_portrait model
  • Add 3dhuman render and animation models
  • Add LLMPipeline to support LLM inference

Feature

  • Support swift trainer and pipeline
  • Add image_control_3d_portrait model
  • Add 3dhuman render and animation models
  • Add model for card correction
  • Add head_reconstruction and text_to_head model
  • Add LLMPipeline to support LLM inference
  • Add onnx exporter for ocr recognition model
  • Add onnx exporter for ocr_detection db model

Improvements

BugFix

  • Fix compatibility issue with newer onnxruntime versions
  • Fix huggingface compatibility issue
  • asr supports local models
  • Fix ci issue

English Version

New Model List and Quick Access

Highlight

  • Add 3dhuman render and animation models
  • Add image_control_3d_portrait
  • Add LLMPipeline to support LLM inference

Breaking changes

Feature

  • support swift trainer and pipeline (#547)
  • add image_control_3d_portrait
  • add 3dhuman render and animation models
  • add model for card correction
  • add head_reconstruction and text_to_head model
  • add LLMPipeline to support LLM inference
  • add onnx exporter for ocr recognition model
  • add onnx exporter for ocr_detection db model

Improvements

BugFix

  • Fix onnxruntime providers parameter compatibility issue
  • Fix hf bug (#569)
  • Support local asr models (#556)
  • Fix ci issue

v1.9.1

16 Sep 09:20

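Chinese Version

New Models Recommended

Highlight

  • Added retry on failure for model downloads

Feature

Improvements

  • Support retrying failed model downloads

BugFix

  • Fix position_ids compatibility issue with newer transformers versions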

English Version

New Model List and Quick Access

Highlight

  • Retry model download on failure.

Breaking changes

Feature

  • Retry model download on failure.
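
The release notes do not say how the download retry is implemented. As an illustration only, a generic retry loop with exponential backoff might look like the sketch below. The function `retry_call` and all of its parameters are hypothetical and are not the modelscope API.

```python
import time


def retry_call(func, max_attempts=3, base_delay=0.5, exceptions=(Exception,)):
    """Call `func()` until it succeeds or `max_attempts` is reached.

    Waits base_delay * 2**(attempt - 1) seconds between attempts and
    re-raises the last error once all attempts have failed.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except exceptions as err:
            last_error = err
            # Sleep only if another attempt will follow.
            if attempt < max_attempts:
                time.sleep(base_delay * (2 ** (attempt - 1)))
    raise last_error
```

A caller would wrap its own download function, for example `retry_call(lambda: fetch(url), exceptions=(ConnectionError,))`, where `fetch` stands for whatever download routine is in use. Restricting `exceptions` to transient network errors avoids retrying failures that will never succeed.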

Improvements

BugFix

  • Fix position_ids compatibility issue with the latest transformers.