Will Lowe
Will Lowe
The small adjustment to Snowball's algorithm would be to *Swissify* the umlauts by adding a e, as in ü -> ue, ö -> oe, ä -> ae. Snowball's standard German...
The fix: ```{r} lmF
In `biblib/bib.py`, changing the third import to ``` import collections.abc ``` and line 97 to read ``` elif isinstance(str_or_fp_or_iter, collections.abc.Iterable) and ``` appears to fix (tested on python 3.11)
FYI I was `str`ing in the first place so I could sketch out a `corpus_merge_docvars` function that I have now needed several times and hacked around. (Something like [`tmaptools::append_data`](https://www.rdocumentation.org/packages/tmaptools/versions/1.2/topics/append_data)). If...
Definitely not `+` or `c`. As in the `SpatialPolygonDataFrame` function I linked to above it's about having maybe incomplete or overcomplete hand constructed document metadata in a `data.frame` and (left)...
Yes, that would do it. Two small caveats. 1. Seems awkward to be required to key on rownames, but that's a minor thing. I guess it ensures they're unique :-)...
Four questions and proposed answers for the semantics of `+` with `corpus` 'corp' and `data.frame` 'newdocvars'. 1. Are matches determined exclusively (keyed on) the `rownames` of `corpus` and `data.frame`? 2....
Second suggestion: All this goes into an augmented `docvars` command instead: `docvars(corp)
@kbenoit Thoughts on these semantics or should I assume they're fine and send a PR?
Getting point estimates might well be 'as trivial as "plugging in" beta and psi then recovering theta (and alpha) accordingly', although you'd probably want to flip to multinomial form first....