Commit
update alpaca gpt4 to use dataset entry (#2869)
#2827 Update alpaca gpt4 to use dataset entry.

I ran

```bash
~$ python check_dataset_appearances.py -d alpaca_gpt4 --cache_dir .cache --mode sft
'Found the following occurances in TRAIN alpaca_gpt4:'
{   re.compile('\\[\\d+(?:,\\s*\\d+)*?\\]'): [   '[3, 45, 99, 2, 8, 6, 72]',
                                                 '[10, 8, 7, 4]',
                                                 '[1, 2, 3, 4, 5]',
                                                 '[7, 3, 4, 6, 2]']}
'Found the following occurances in VAL alpaca_gpt4:'
{   'openai': [   'u’re approved, get your API key.\n'
                  '\n'
                  '2. Install the `openai` library in your Python environment '
                  'using p']}
```

I checked every occurrence that was flagged as a possible reference. All of them are programming-related and have nothing to do with references, so this looks fine:

```python
DatasetEntry(
    questions=['Re-order the integer list given in the input field such that all odd numbers are first and even numbers are last. \n[2, 3, 8, 45, 6, 99, 72]'],
    answers=['[3, 45, 99, 2, 8, 6, 72]'],
    context=None,
    lang=None,
    length=None,
    quality=None,
    humor=None,
    creativity=None,
)
```
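To illustrate why the bracketed integer lists were flagged, here is a minimal sketch that applies the same regular expression shown in the output above to a few strings. The helper name `find_occurrences` and the sample strings are assumptions made for illustration; this is not the implementation of `check_dataset_appearances.py`.

```python
import re

# Pattern copied from the script output above. It matches bracketed,
# comma-separated integers, which is what a numeric citation like "[12]"
# looks like, but also what a plain list of integers looks like.
CITATION_LIKE = re.compile(r"\[\d+(?:,\s*\d+)*?\]")


def find_occurrences(texts):
    """Hypothetical helper: collect every substring matching CITATION_LIKE."""
    hits = []
    for text in texts:
        hits.extend(CITATION_LIKE.findall(text))
    return hits


# Sample strings (illustrative only, apart from the first, which is the
# answer quoted in the commit message).
answers = [
    "[3, 45, 99, 2, 8, 6, 72]",
    "No brackets here.",
    "See [12] for details.",
]

print(find_occurrences(answers))
```

Because the pattern cannot tell a citation from a list of integers, each hit still needs the kind of manual review described in the commit message.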