Final Project

Abstract

Every startup claims to be disrupting an industry or changing the world. Most startup ideas are destined to fail, but some truly change the world. By training on thousands of startup taglines, articles, and interviews, this project aims to generate a lot of bad startup ideas, and a few good ones.

Most startup ideas can be summarized in just one paragraph. The tagline describes the overarching concept (e.g., "Uber is finding you better ways to move, work, and succeed"), and the next few sentences can provide a more detailed description of the product, as well as context on the startup's history, people, and industry.

The tagline can be created by finetuning GPT-2 with a dataset of startup taglines (https://github.com/winsonluk/gpt_pitches), and the subsequent sentences can be generated by feeding this tagline as a prefix into a other GPT-2 models finetuned with startup descriptions and company analyses (https://github.com/winsonluk/gpt_descriptions and https://github.com/winsonluk/gpt_summaries).

The ideas generated have been fairly realistic (most are bad, some are good), and I incorporated them into faux startup website similar to https://tiffzhang.com/startup, with a few thousand permutations of ideas. The value of these ideas depend solely on the reader's interpretation (see reader-response theory), but hopefully some of these ideas are cohesive enough to serve as inspiration.

For the final project presentation, I will do a live demo of the interactive startup generator, and ask the audience to think of a realistic app idea from the generated startups. I'll also present some slides to share my thoughts about how ML ties in with creativity (PDF).

Project Report

Report

Model/Data

I created three models finetuned with the gpt-2-simple library.
The first model is trained on startup taglines from Startups List. I used Rick Hennessy's scraped dataset. Download: https://winsonluk.com/assets/ideas.zip
The second model is from the previous dataset, but with startup descriptions rather than taglines. Download: https://winsonluk.com/assets/gpt_summaries.zip
The third model is trained on TechCrunch posts, which focus on the latest developments in technology. The scraped data is from Kaggle. Download: https://winsonluk.com/assets/gpt.zip

Code

Results

Website: http://recuria.com
Taglines only: https://github.com/winsonluk/gpt_pitches/blob/master/io/pitches.txt
Taglines with descriptions: https://github.com/winsonluk/gpt_summaries/blob/master/io/summaries.txt
Taglines with TechCrunch commentary: https://github.com/winsonluk/gpt_descriptions/blob/master/io/descriptions.txt
Taglines with descriptions and TechCrunch commentary: https://github.com/winsonluk/gpt_descriptions/blob/master/io/descriptions_with_summaries.txt

Technical Notes

~~The multi-gpu fork of gpt-2-simple needs to be installed to train with the 774M model.~~ This fork was merged to master in v0.7, so pip install gpt-2-simple v0.7+.
I used 4 x Tesla V100 GPUs and 16 GB of RAM on Vast.ai to train the models. Training will fail with single GPUs or less than 16 GB of RAM. After training, generation can be performed with a single GPU, though 16 GB of RAM is still necessary.
The startup tagline and description models are finetuned to a loss of around 0.1, while the larger TechCrunch model is finetuned to a loss of 1.8.
I sampled all models with temperature ranges from 0.2 to 2.0 and top-p from 0.1 to 1.0 (higher values translate to more "creativity" in the text) to find the optimal parameters for realistic text generation.

Examples

Bloopers

Lowering unemployment

Strategic arms sales

Internet of things

Workers of the world, unite!

10,000 hours

Reference

References to any papers, techniques, repositories you used:

Papers
- GPT-2
Repositories
- https://github.com/huntrontrakkr/gpt-2-simple
- https://github.com/tiffz/startup
Blog posts
- https://minimaxir.com/2019/09/howto-gpt2/
- https://towardsdatascience.com/how-to-sample-from-language-models-682bceb97277

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
1.png		1.png
10.png		10.png
11.png		11.png
2.png		2.png
4.png		4.png
5.png		5.png
7.png		7.png
8.png		8.png
9.png		9.png
README.md		README.md
final.key		final.key
final.pdf		final.pdf
latinos.png		latinos.png
nazis.png		nazis.png
report.pdf		report.pdf
smart.png		smart.png
strip.png		strip.png
tinder.png		tinder.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation