### 🐛 Describe the bug

### Environment

pytorch==1.11, cuda==11.3, colossalai==0.3.0, transformers==4.28.1