Leon Derczynski

Results 322 issues of Leon Derczynski

Dates should be formatted with hyphens separating years and months/weeks from their sub-parts. e.g: ``` 199 => 199 1993 => 1993 199307 => 1993-07 BC0045 => BC0045 BC004508 => BC0045-08...

When set to TimeML output, the "mod" attribute uses the TIMEX2 values "EARLY" and "LATE" instead of TIMEX3 values "START" and "END" (http://timeml.org/site/publications/timeMLdocs/timeml_1.2.1.html#timex3)

From TAC_2010_KBP_Source_Data/data/2010/wb/eng-WL-11-174596-12957493.sgm (http://pastebin.com/Wz2QKEAZ): Traceback (most recent call last): File "/usr/local/bin/annotate_timex", line 154, in print str(doc) UnicodeEncodeError: 'ascii' codec can't encode character u'\u201c' in position 662: ordinal not in range(128)

From TimeBank: wsj_0586.tml TERNIP: WARNING: Error whilst attempting to add TIMEX Traceback (most recent call last): File "/usr/local/lib/python2.6/dist-packages/ternip/formats/xml_doc.py", line 640, in reconcile self._add_timex(timex, sents[i], s_nodes[i]) File "/usr/local/lib/python2.6/dist-packages/ternip/formats/xml_doc.py", line 521, in...

It's possible to extract DCT (at day granularity) from filenames - is this attemped? From TimeBank: VOA19980331.1700.1533.tml WARNING: Could not determine document creation time, use -c to override

When running extras/terneval.py, the following output is included: .... chtb_245.eng.sgm recognition 0.083 extent 0.0 normalisation 0.0 TERNIP: WARNING: Malformed rule expression Traceback (most recent call last): File "../ternip/rule_engine/normalisation_rule.py", line 139,...

GATE's ANNIE tokeniser splits on different boundaries to TERNIP's (NLTK). This can cause many TERNIP rules to not match. For example, > > > nltk.word_tokenize('Example 31/12/2010 text.') > > >...

Tensorflow has an easy API - beware version problems (2.7 vs 3.4)

enhancement

Support timexes and events; start point for replication is Hector Llorens' NLDB paper: "Evaluating EmotiBlog Robustness for Sentiment Analysis Tasks" doi:10.1007/978-3-642-22327-3_41

enhancement

Add to code, and connect with something like readthedocs

enhancement