Writing prompt #1274
Add writing prompt dataset
@fabraz thanks! Can you add a README in the folder and an "Open in Colab" button to the notebook? The idea is that someone can quickly scan the README and then just open the notebook in Google Colab if they want to start playing with it. https://github.com/LAION-AI/Open-Assistant/tree/main/notebooks/example
@andrewm4894, I just realized that this notebook would be better located under data-augmentation. Do you agree?
Yep, maybe so. Kind of a lot of stuff is ending up under data augmentation since it is very general, but it makes sense. I'm sure at some stage down the road we might reorder things in various ways, but I'm happy to keep it fairly organic for now. So yep, it makes sense to add it under data augmentation for consistency as of now.
❌ pre-commit failed.
Colab button and README.md cleared.
The notebook is looking a bit funny for some reason. It seems like some code cells and markdown are getting messed up. Is it just me?
So weird. It seems that it converted all the cells into a single one. Checking…
❌ pre-commit failed.
@andrewm4894, I've found that the messy Jupyter rendering was due to a GitHub bug. When GH renders the following code, it loses all Jupyter style settings thereafter. I made some changes to the code, and now GH manages to present it correctly. Check it out. I made many commits for troubleshooting purposes. Let me know if it would be better to open another PR to get rid of this messy commit history.
Oh interesting, I've seen markdown cells break GitHub rendering before but never code cells lol
@andrewm4894 I removed unnecessary comments and fixed the cell with the error output. Please check!
This should have had the
Waiting to merge this
This PR delivers the code used to sample the writing prompt dataset, which can be downloaded at https://huggingface.co/datasets/fabraz/writingPromptAug.