2017 icon indicating copy to clipboard operation
2017 copied to clipboard

Feed Me A Stray Cat

Open cmkawa opened this issue 8 years ago • 6 comments

Named after the scene from American Psycho... because this is probably going to be random, weird, and a little violent.

My program will generate a novel based on source material fed into it, using next-word prediction and style analysis with some adjustable settings for randomness and arc structure. The goal is to get it to produce something that is (at a minimum) readable; reasonably enjoyable; and to see how often it actually surprises me.

cmkawa avatar Oct 24 '17 07:10 cmkawa

Had a thought today -- I'll need to develop some requirements so that I have specific metrics to hit. That way I'll know when I'm done and (hopefully) won't suffer from feature creep. Some thoughts to guide that:

  • Word prediction generates a novel that looks like normal sentences/paragraphs/chapters, a la lorem ipsum, but with real words. First major milestone before any of the pseudo-intelligence can be implemented.
  • Getting copycat sentences below a certain threshold. Check generated text against source material to reduce the length of unoriginal chains (straight copies).
  • Avoiding repetition of uncommon words. Based on source material, give each word a uniqueness count (number of words since last occurrence) and then arbitrarily set an acceptable percentage of that value before it can appear again.
  • Reducing the number of nonsensical sentences. Train FMASC to avoid certain combinations of words that have been flagged during consumption of source material. This might be a stretch goal, and also depends on how good the word prediction algorithm actually turns out to be.

cmkawa avatar Oct 29 '17 08:10 cmkawa

Philosophical question: Is creativity really just a matter of finding that sweet spot between convention and chaos?

cmkawa avatar Nov 02 '17 04:11 cmkawa

I'm fascinated by your idea of quantifying the success of your novel-generation process! I thought about calculating the "novelity" (?) of generated text before, but have never collected ideas on how to do that, and yours seem pretty solid. Would be interested to see your creativity-gauge code when you have it!

eseyffarth avatar Nov 02 '17 15:11 eseyffarth

Just sharing a little progress.

image

cmkawa avatar Nov 08 '17 06:11 cmkawa

image

Char char char, char is not a word anymore char.

cmkawa avatar Nov 11 '17 18:11 cmkawa

Today, it became smarter than me.

image

I couldn't figure out where the third occurrence of "the great" was showing up without adding some debugging messages, and was convinced I had made a mistake in counting/entering somewhere. Upon closer inspection, lines 194 and 195 of the txt file show a newline delimited instance that CTRL-F wasn't catching. So it works!

cmkawa avatar Nov 12 '17 21:11 cmkawa