Preprocessing
In the opening comments to this notebook, there is a reference to "our next two weeks". Depending on how the workshop is taught, this probably needs to be updated to the next two days or lessons.
Reading in .csv - In the question "How many entries are in 'raw'?", "raw" should be "tweets".
This notebook imports numerous packages and libraries, such as codecs, BeautifulSoup, and glob, without introducing them or explaining how to use them. This could be confusing for learners who have not used these libraries before.
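As a rough illustration of what a short introductory cell for two of these libraries might look like, here is a minimal sketch. The file names and sample text are hypothetical and are not taken from the notebook. BeautifulSoup is left out because it is a third-party install.

```python
import codecs
import glob
import os
import tempfile

# Hypothetical sample data: two small text files in a temporary folder.
tmp_dir = tempfile.mkdtemp()
for name, text in [("a.txt", "first file"), ("b.txt", "second file")]:
    with codecs.open(os.path.join(tmp_dir, name), "w", encoding="utf-8") as f:
        f.write(text)

# glob.glob returns the list of paths that match a wildcard pattern.
paths = sorted(glob.glob(os.path.join(tmp_dir, "*.txt")))

# codecs.open opens a file with an explicit text encoding.
texts = []
for path in paths:
    with codecs.open(path, "r", encoding="utf-8") as f:
        texts.append(f.read())

print(len(texts), "files read")
```

A cell like this, placed before the libraries are first used, would let learners see each call in isolation before it appears in a longer pipeline.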
Add some open cells after the challenges for learners to insert their code.
Reading in multiple files - There is missing text after "How..."
This notebook has many sections. Numbering them might help learners keep their place and follow along.
Need to explain/contextualize use of multiple libraries - or drop?
Anything helpful to re-streamline the Text Analysis Fundamentals materials in Notebooks 4-4, 5-1, and 5-2 here? https://github.com/dlab-berkeley/DIGHUM101-2020
Hi @brooksjessup -- Can you commit and push these changes? Please close this comment when you are done. Let me know if you have any questions. Thanks!