yangjianxin1 · eltociear · Nov 12, 2023
diff --git a/README.MD b/README.MD
@@ -126,11 +126,11 @@ python evaluate.py \
 
 The examples generated by [LongQLoRA-Vicuna-13b-8k](https://huggingface.co/YeungNLP/LongQLoRA-Vicuna-13b-8k) ars as follows.
 
-Examples of long context generartion, the input context lengths are between 4096 and 8192 which are larger than original context length of LLaMA2.
+Examples of long context generation, the input context lengths are between 4096 and 8192 which are larger than original context length of LLaMA2.
 
 <img src="figure/longdemo.png" width="800">
 
-Examples of short context generartion, model keep the performance of short instruction following.
+Examples of short context generation, model keep the performance of short instruction following.
 
 <img src="figure/shortdemo.png" width="800">
 
@@ -150,4 +150,4 @@ Examples of short context generartion, model keep the performance of short instr
 ## Acknowledgement
 - This work combines the advantages of [QLoRA](https://github.com/artidoro/qlora), [Position Interpolation](https://arxiv.org/abs/2306.15595) and [LongLoRA](https://github.com/dvlab-research/LongLoRA)
 - The code of Shift Short Attention is implemented by [LongLoRA](https://github.com/dvlab-research/LongLoRA)
-- The training code is modified upon [Firefly](https://github.com/yangjianxin1/Firefly)
+- The training code is modified upon [Firefly](https://github.com/yangjianxin1/Firefly)