how to sft support larger context length? #48
Labels
doc-required
Your PR changes impact docs and you will update later.
question
Further information is requested
sft
if set max_sequence_len to 4k, does the model able to do extrapolation automatically?
The text was updated successfully, but these errors were encountered: