AJ Tran

Results 48 comments of AJ Tran

I am having the same result, a csv file with only "tweets". The account I'm scraping from is not explicit.. It has 109160 tweets, is that too much to scrape...

# Dev Diary ## Obtaining the corpus 1. I created a new JavaScript project with `npm init` 2. I downloaded the entire series as a PDF from reddit. [source](https://www.reddit.com/r/Animorphs/comments/3litxl/reformatted_ebook_editions_download_links/) 3....

## ~Additional consideration~ ~I think I will split my novel into 20 chapters. 20 chapters of 2500 words would meet the 50,000 word requirement.~

## Fine-tuning GPT-2 [This tutorial](https://minimaxir.com/2019/09/howto-gpt2/) is pretty spot on and I've used it before to train on other data sets (Taco-related, Poetry, Gothic Lit). The main thing is that I...

## Sample output from Step 1850 > Chapter 4 > When it was clear she’d be late for her night shift, Rachel and I set off. We couldn’t wait. We...

## gpt-2-simple documentation Take advantage of the "cloud" resources available while working on this Machine Learning project. Procedural text generation is a wonderful gateway into Natural Language Processing. So check...

## Findings from Documentation > GPT-2 can only generate a maximum of 1024 tokens per request (about 3-4 paragraphs of English text). > GPT-2 cannot stop early upon reaching a...

## First GPT-2 Output Okay! Let's make some invocations. I will use the prefix of the very first book, `"My name is"` ### Input Parameters ```python gpt2.generate(sess, length=1024, temperature=0.7, prefix="My...

## Analysis of First GPT-2 Output Let's grab some statistics from a free online [Word Counter Tool](https://wordcounter.net/) ### Statistics #### Details ``` 3,111 Words 780 Unique Words 15,066 Characters 12,255...