Results 3 issues of Ferris Tseng

Right now, the training data all comes as a single package. It might be better to include it as compiled code that is generated from a JSON document.

NLTK has a way to realign sentences ending with characters such as ), }, ], ", etc...

enhancement

- Correct spelling of "SPECIMEN_SOURCE" - Set data type of nationality to WD because it's a withdrawn field