Question about combining models #37

anjabeth · 2016-10-31T23:19:54Z

If I do markovify.combine() with no weighting, is it effectively the same as training one model on the texts of all the combined models? Asking because I'd like to train a model on a lot of text files, and it works out easier to create a bunch of different ones and then combine them, as long as that works the way I'm expecting it to.

anjabeth · 2016-10-31T23:29:20Z

Clarifying: I've been playing around with "combine" and have gotten it to work, but I'm curious - when it "combines" the models with weighting, is it just using those weights to choose which corpus the words come from, or do the corpuses actually mix? (For example, if I trained a model on the KJV and Moby Dick, could I get sentences that combine both texts? Or would I just get the right fraction of sentences that come from each text?)

jsvine · 2016-11-01T03:19:37Z

If I do markovify.combine() with no weighting, is it effectively the same as training one model on the texts of all the combined models?

Yep!

I'm curious - when it "combines" the models with weighting, is it just using those weights to choose which corpus the words come from, or do the corpuses actually mix?

The latter. The corpuses are, effectively, mixed.

For example, if I trained a model on the KJV and Moby Dick, could I get sentences that combine both texts?

Yep! That's what should happen. (Would be curious to see the output.)

Or would I just get the right fraction of sentences that come from each text?

Nope! There's currently no way to do that with markovify.

anjabeth · 2016-11-02T00:13:20Z

Thanks so much! That's what I was guessing - I think the length difference between the texts was just giving me lots more Bible words, but I wanted to make sure that it wasn't a weighting mistake.

KJV/Moby Dick didn't produce anything terribly interesting on the couple of test runs I did (I'm currently just setting up the skeleton of my project), but I got some pretty fun results with Moby Dick + Pride and Prejudice:

"I was sure you could not be married all day"
"The envelope contained a sheet of blubber."
"Hold the steak in one hand, and a still slighter shuffling of women's shoes, and all was soon right again."

jsvine · 2016-11-02T02:41:17Z

Love those examples. Thanks for sharing!

jsvine closed this as completed Nov 2, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about combining models #37

Question about combining models #37

anjabeth commented Oct 31, 2016

anjabeth commented Oct 31, 2016

jsvine commented Nov 1, 2016

anjabeth commented Nov 2, 2016

jsvine commented Nov 2, 2016

Question about combining models #37

Question about combining models #37

Comments

anjabeth commented Oct 31, 2016

anjabeth commented Oct 31, 2016

jsvine commented Nov 1, 2016

anjabeth commented Nov 2, 2016

jsvine commented Nov 2, 2016