Add Stockholm University Swedish POS Tagger
Stagger is a Swedish POS tagger that uses the Stockholm-Umeå Corpus and tag set (http://www.ling.su.se/english/nlp/tools/stagger). It could then be used with MaltParser and a model that uses the same corpus and tag set. It is mostly implemented and I am currently testing it.
What would be the name/artifactId of the new module? I would suggest dkpro-core-stockholm.
Please note that for new components we now use the package name org.dkpro.core.XXX, and the Maven groupId in new modules should be org.dkpro.core.
I'd prefer the upstream name "stagger" over "stockholm". Apart from that, I've followed the naming scheme of the Stanford NLP module, i.e.:
- artifactId: de.tudarmstadt.ukp.dkpro.core.stagger-gpl
- folder name: dkpro-core-stagger-gpl
- package name: de.tudarmstadt.ukp.dkpro.core.stagger
I can change that to:
- artifactId: org.dkpro.core.stagger
- groupId: org.dkpro.core
- folder name: dkpro-core-stagger
- package name: org.dkpro.core.stagger
Would that work?
That's fine.
@munterkalmsteiner I wonder if you somehow forgot about this? ;)
@reckart Indeed. I'll check the email you sent me on December 1, 2016 with instructions on how to package the model. I think this was the last thing that was missing. A PR will follow in the coming days/weeks.