tatom
Quantitative Text Analysis for the Digital Humanities
Hi Allen, I refer to this material in my courses, but unfortunately the links are broken, e.g.: https://de.dariah.eu/tatom/visualizing_trends.html I mailed the DARIAH team about this several times, and their response...
https://de.dariah.eu/tatom/getting_started.html For R and Octave/Matlab users I think it is and between R and Octave.
In the chapter on preprocessing, NLTK's PunktWordTokenizer is used directly (input 11). This no longer seems to work in NLTK version 3.0.3. In fact, this word tokenizer [was not supposed...
Hi Allen, https://de.dariah.eu/tatom/preprocessing.html#every-1-000-words

    def split_text(filename, n_words):
        """Split a text into chunks approximately `n_words` words in length."""
        input = open(filename, 'r')
        words = input.read().split(' ')
        input.close()

At...
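The `split_text` excerpt quoted in this issue opens the file by hand and splits on single spaces. As a rough illustration only, here is a minimal sketch of the same chunking idea. It is not the tutorial's code: the names `split_words_into_chunks` and `split_text_file` are hypothetical, and the choices to split on any whitespace and to read the file as UTF-8 are assumptions.

```python
def split_words_into_chunks(text, n_words):
    """Split `text` into lists of at most `n_words` whitespace-separated words."""
    if n_words < 1:
        raise ValueError("n_words must be a positive integer")
    # str.split() with no argument splits on any run of whitespace,
    # so newlines and repeated spaces do not produce empty "words".
    words = text.split()
    return [words[i:i + n_words] for i in range(0, len(words), n_words)]


def split_text_file(filename, n_words, encoding="utf-8"):
    """Read `filename` and chunk its contents (encoding is an assumption)."""
    # The context manager closes the file even if reading fails.
    with open(filename, "r", encoding=encoding) as handle:
        return split_words_into_chunks(handle.read(), n_words)
```

Every chunk holds exactly `n_words` words except possibly the last, which holds whatever remains.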
https://de.dariah.eu/tatom/feature_selection.html Determine values for hyperparameters: Let us consider μ₀ and σ₀² first. In keeping with this observation we will set μ₀ to be 3 and γ₀² to be 1.5². In...
Pandas does make many operations much easier. Need to find sensible ways of integrating mentions of its uses. In principle, I think the tutorials should only require familiarity with the...
> Done. You would then still need to give the img tags of the images that should be displayed the CSS class 'fancybox', the type 'image' and, if needed, a caption. How that...
> The presentation of the libraries and the installation support is aimed more at beginners; on the other hand, for beginners the rest of the tutorial contains too much implicit...