
Add Stockholm University Swedish POS Tagger

Open munterkalmsteiner opened this issue 9 years ago • 5 comments

A Swedish POS tagger that uses the Stockholm-Umeå Corpus and tag set (http://www.ling.su.se/english/nlp/tools/stagger). It could then be used with MaltParser and its model, which uses the same corpus and tag set. It is mostly implemented and currently being tested.

Dec 01 '16 08:12 munterkalmsteiner

What would be the name/artifactId of the new module? I would suggest dkpro-core-stockholm.

Please note that for new components we now use the package name org.dkpro.core.XXX, and the Maven groupId of new modules should be org.dkpro.core.

Dec 01 '16 08:12 reckart

I'd prefer the upstream name "stagger" over "stockholm". So far, I've followed the naming scheme of the Stanford NLP module, i.e.:

  • artifactId: de.tudarmstadt.ukp.dkpro.core.stagger-gpl
  • folder name: dkpro-core-stagger-gpl
  • package name: de.tudarmstadt.ukp.dkpro.core.stagger

I can change that to:

  • artifactId: org.dkpro.core.stagger
  • groupId: org.dkpro.core
  • folder name: dkpro-core-stagger
  • package name: org.dkpro.core.stagger

Would that work?

Dec 01 '16 09:12 munterkalmsteiner

That's fine.

Dec 01 '16 10:12 reckart

@munterkalmsteiner I wonder if you somehow forgot about this? ;)

Sep 08 '19 15:09 reckart

@reckart Indeed. I'll check the email you sent me on December 1, 2016 with instructions on how to package the model. I think this was the last thing that was missing. A PR will follow in the coming days/weeks.

Sep 19 '19 18:09 munterkalmsteiner