-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Resources #1
Comments
Drawing on last year's projects, there's some things to learn from: Emily Short's Annals of the Parrigues was an interesting project, but from a Resources perspective her end notes on writing for the generator are full of excellent suggestions for how to write your text and when to use different approaches. Another project I was impressed with was A Time For Destiny by @cpressey. The "Story Compiler" write-up certainly suggests some avenues for future exploration. (And the discussion around it already inspired @enkiv2 to make a goal-driven planner plot generator.) The Deserts of the West by @mewo2 has also been a source of inspiration; this National Geographic writeup is particularly illuminating, as are the blog posts about its language generator and map generator. I could go on, there are a ton of interest projects to learn from. But I also want to emphasize that you don't need to get that fancy: if you're looking for an accessible way to jump into making your first book generator, you might want to check out Tracery. (There's also a Python port, if you need that.) Also, the Gutenberg Python library is back in active development, should you need texts or metadata from Project Gutenberg. |
I wrote a series of blog posts on dev.to that focuses on NaNoGenMo and text generation. The emphasis of these blog posts is on how to produce "readable" computer-generated text that a human may theoretically like.
As a side-note, all of these blog posts are computer-generated as well (and a link to their source code is provided with each blog post) -- though the techniques I used here are hard to scale, since I needed to handwrite the corpus beforehand. Still useful as proofs of concepts. I also provided links to several NaNoGenMo novels as well, so you could use these blog posts as a reference guide. |
I should probably also mention these libraries:
|
This sense2vec thing (using SpaCy + word2vec) seems very promising. |
Emily Short posted a resource list yesterday on her blog: On Fri, Oct 28, 2016 at 12:41 AM Darius Kazemi notifications@github.com
|
Ooh, @greg-kennedy just added to Corpora a list of adjectives to describe people. dariusk/corpora@ccf894c |
Created for last year's NaNoGenMo, I've updated this JSON of Project Gutenberg metadata: |
If you need meter or rhyme for your poetic projects, some resources (mostly found after Emily Short asked a question on Twitter and many people replied with suggestions): A set of word lists, organized by rythmic feet. The CMU Pronouncing Dictionary is a dictionary of 134K English words and their pronunciations, including stress. NLTK and Allison Parrish's pronouncingpy library provide Python interfaces to it. poem-gen, a 2014 NaNoGenMo project by Camden Segal may also be of interest. As is NaPoGenMo 2015 and NaPoGenMo 2016. Some other resources (which may or may not have been mentioned in previous years): textacy: higher-level NLP built on spaCy: streaming documents, filter linguistic elements, vectorized and semantic network representations, topic models, language identification... TextBlob is another Python option for processing textual data and NLP. KoNLPy: Korean NLP in Python RiTa: JavaScript/Processing/Node NLP tools for computational literature Pressagio text prediction system: word completions in Python, etc. (A Python port of Presage) Lexeme: A constructed language word database, generation, and declension program. moby: Javascript interface for the Moby Thesaurus |
I have a pattern recognizer I'm working on that will look at text and create 'templates' for phrases, with 'variables' where you can insert names, etc. Example: "Hello, Bob" -> "Hello, {1}". Currently it's only able to generate templates given two line of text, but I'm working on expanding it so it can scan an entire corpus and find the best template candidates, and convert them to templates. I'll post it here when I'm done. |
I should note that my scene-sequel project from last year has been broken On Thu, Nov 3, 2016 at 8:31 AM Matthew D. notifications@github.com wrote:
|
For those of you working in Java (or the JVM) the Stanford CoreNLP library just released a beta of version 3.7.0. (Interfaces also exist for many other programming languages.) |
If anybody is working on poetry (or something where meter matters), this list of words grouped by part of speech and syllable count might be useful: http://www.ashley-bovan.co.uk/words/partsofspeech.html |
Need character names? Here's a NodeJS module that spits out different names of characters from Infinite Jest by David Foster Wallace: https://github.com/accraze/infinitejest-names |
@accraze what is the license on this? It might be a good addition to On Tue, Nov 22, 2016 at 12:29 AM Andy Craze notifications@github.com
|
From #114 (comment):
|
This is an open issue where you can comment and add resources that might come in handy for NaNoGenMo.
There are already a ton of resources on the old resources threads for the 2013 edition, the 2014 edition, and the 2015 edition.
The text was updated successfully, but these errors were encountered: