Check out my project TinyNews: I generated ~90 million tokens of synthetic text data and used it to train several small GPT-2-style models from scratch.
I recently completed the Data Science program at Bloomtech. I previously worked in materials science research and development after receiving my degree in chemical engineering. My first patent was granted in 2019.