Understanding the building blocks of ChatGPT: from the parameters that make up the model to the way in which OpenAI, the company that made ChatGPT, has grown (and shrunk) over the last decade, will make you appreciate just how much goes into a single query of the model.
I created a poster as a compliment this article, with help from Cornell Data Journal (CDJ) member Cindy Weng, to present during CDJ's end-of-semester Symposium event.
While I sourced information from a number of different places, the ones that played the most significant role in this project were:
- Wealth shown to scale: This website provided a large amount of visual inspiration for the layout of my article
- Info 4940: Neural Networks in Practice: A that provides a high-level overview of neural networks, including the transformer model
- Attention Is All You Need: The seminal paper that introduced transformer models
- The Illustrated Transformer: A visual walkthrough of transformer models
- Neural Networks / Deep Learning Video Series: 3Blue1Brown's very useful animations of the innards of GPT models
For citations relating to content within the article itself, most claims are hyperlinked to reliable sources. Data used for visuals and sources for other information can likely be found within the article script.
- HTML, CSS, TypeScript
- I used HTML and CSS to make the website, including some interactive elements with TypeScript.
- Figma
- The layout of the website was designed in Figma, and I also used it to create most of the visual images (outside of hand-drawn ones)
