Vulnerability Report:trust_remote_code param is enabled, causing unsafe execution of Python files

I think it's necessary to provide a security vulnerability report for the project
[https://github.com/lm-sys/FastChat/blob/main/fastchat/model/model_adapter.py#L108](https://github.com/lm-sys/FastChat/blob/main/fastchat/model/model_adapter.py#L108)
`
            tokenizer = AutoTokenizer.from_pretrained(
                model_path,
                use_fast=self.use_fast_tokenizer,
                revision=revision,
                trust_remote_code=True,
            )
`
There is a security risk in using the AutoTokenizer.from_pretrained function if 'trust_remote_code' is equal to True. When there is a tokenizer_config.json file in the model's project directory and the "auto_map" field in this file contains an array parameter that starts with the character value "AutoTokenizer", the function will try to parse the following values ​​of the array as Python files.
![config](https://github.com/kyo-w/ai_config_attack/blob/main/1902fa20000fa4e506ce7fdf5ae4c88.png?raw=true)

The attack effects are as follows
![target ](https://github.com/kyo-w/ai_config_attack/blob/main/94b67ac279fdf754b9baf88e0712759.png?raw=true)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vulnerability Report:trust_remote_code param is enabled, causing unsafe execution of Python files #3720

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Vulnerability Report:trust_remote_code param is enabled, causing unsafe execution of Python files #3720

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions