Conversation

@RbRe145 (Collaborator) commented on Sep 26, 2025

PR Category

Added new models, along with the corresponding model load functions:
```python
def get_t5_model_and_inputs(model_name, text, dtype):
    import paddle
    from paddlenlp.transformers import T5ForConditionalGeneration, T5Tokenizer

    # 1) Tokenizer (build it first so the pad/eos ids are easy to fetch)
    tokenizer = T5Tokenizer.from_pretrained(model_name)

    # 2) Encode the input (supports a single string or a batch of texts)
    enc = tokenizer(
        text,
        return_tensors="pd",
        padding=True,
        truncation=True,
        max_length=512,
    )

    # Fill in attention_mask if absent (0 at pad positions, 1 elsewhere)
    if "attention_mask" not in enc:
        input_ids = enc["input_ids"]
        attn_mask = (input_ids != tokenizer.pad_token_id).astype("int64")
        enc["attention_mask"] = attn_mask

    # Build decoder_input_ids:
    # T5 uses pad_token_id as its decoder_start_token_id
    batch_size = enc["input_ids"].shape[0]
    decoder_input_ids = paddle.full(
        shape=[batch_size, 1],
        fill_value=tokenizer.pad_token_id,
        dtype="int64",
    )

    # 3) Load the model
    model = T5ForConditionalGeneration.from_pretrained(model_name)
    if dtype == "float16":
        model = model.astype(paddle.float16)
    model.eval()

    # 4) Assemble the inputs fed to the model
    inputs = {
        "input_ids": enc["input_ids"],
        "attention_mask": enc["attention_mask"],
        "decoder_input_ids": decoder_input_ids,
    }
    return model, inputs


def get_albert_model_and_inputs(model_name, text, dtype):
    """
    Build an ALBERT backbone (AlbertModel) and construct its inputs.
    - model_name: e.g. "albert-base-v2", "albert-xxlarge-v1" (built-in PaddleNLP names)
    - dtype: "float32" or "float16"
    Returns: (model, inputs_dict)
    """
    import paddle
    from paddlenlp.transformers import AlbertConfig, AlbertModel, AlbertTokenizer

    # 1) Read the config (does not trigger a weight download)
    config = AlbertConfig.from_pretrained(model_name)

    # 2) Model, built from the config only (randomly initialized weights);
    #    use AlbertModel.from_pretrained(model_name) instead to load pretrained weights
    model = AlbertModel(config)
    if dtype == "float16":
        model = model.astype(paddle.float16)
    model.eval()

    # 3) Tokenizer
    tokenizer = AlbertTokenizer.from_pretrained(model_name)

    # If there is no pad_token, fall back to unk_token
    # (ALBERT has no eos_token, so do not set pad = eos)
    if tokenizer.pad_token is None:
        tokenizer.pad_token = tokenizer.unk_token

    enc = tokenizer(
        text,
        return_tensors="pd",
        padding=True,
        truncation=True,
        max_length=512,
    )

    if "attention_mask" not in enc:
        input_ids = enc["input_ids"]
        enc["attention_mask"] = (input_ids != tokenizer.pad_token_id).astype("int64")

    return model, enc
```
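A minimal usage sketch, not part of the PR itself: it assumes the built-in PaddleNLP model names `"t5-small"` and `"albert-base-v2"`, and that the returned inputs match the standard PaddleNLP forward signatures, which both functions above construct them for:

```python
import paddle

# Hypothetical example inputs; "t5-small" / "albert-base-v2" are assumed
# built-in PaddleNLP names, not names confirmed by this PR.
t5_model, t5_inputs = get_t5_model_and_inputs(
    "t5-small", ["translate English to German: Hello world"], "float32"
)
with paddle.no_grad():
    t5_out = t5_model(**t5_inputs)  # logits over the target vocabulary

albert_model, albert_inputs = get_albert_model_and_inputs(
    "albert-base-v2", "Paddle makes deep learning easy.", "float32"
)
with paddle.no_grad():
    albert_out = albert_model(**albert_inputs)  # sequence (and pooled) output
```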

Description

@RbRe145 requested a review from @Xreki on September 26, 2025 06:14
@Xreki merged commit 1aaf229 into PaddlePaddle:develop on Sep 26, 2025
3 checks passed