
Japanese: Additional expressions to be considered for the sub-word split hint attribute

Open makorin0315 opened this issue 3 years ago • 0 comments

The implementation for issue #234 addressed only expressions that include the particle の, i.e., numbered item 1 of that issue. It currently uses a new attribute label, JPno_join_Con. Additional expressions to consider include:

  • The particle な when it is preceded by an Adjectival Noun and followed by another Concept. For example: 無力な国連 (this was numbered item 2 in issue #234)

  • Expressions that include での within the Concept: ロームは半導体の原材料調達や生産での協業を探る見通し。 → での within 生産での協業

  • Expressions that include からの within the Concept: 記者団にロシアからの攻撃か聞かれた。 → からの within ロシアからの攻撃

  • Expressions that include pronouns such as どこか or 誰か (i.e., somewhere, someone): 加盟国のどこかが武力攻撃を受ける可能性。 → 加盟国のどこか; 誰かのいたずらかも知れない。 → 誰かのいたずら

  • Expressions that include すべて (i.e., all): すべての締結国への攻撃とみなされる。 → すべての締結国

  • Other expressions: 内需のもう一つの柱である設備投資。 → もう一つの柱; 台湾海峡の平和と安定の重要性も改めて強調した。 → 安定の重要性; 抗体陽性のlong-covidはまだ報告されていません。 → 抗体陽性のlong-covid
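
To help with triage, the candidate categories above could be checked against sample sentences with a plain string search. The following is only a rough sketch, not how the iKnow engine applies the attribute: the category names and the `find_candidate_markers` helper are hypothetical, and the な case is left out because recognizing a preceding Adjectival Noun needs part-of-speech information that substring matching cannot provide.

```python
# Hypothetical category names mapped to surface markers taken from the examples above.
# These names are illustrative only; they are not iKnow attribute labels.
CANDIDATE_MARKERS = {
    "deno": ("での",),
    "karano": ("からの",),
    "pronoun": ("どこか", "誰か"),
    "subete": ("すべて",),
}


def find_candidate_markers(sentence):
    """Return (category, marker) pairs for every marker that occurs in the sentence."""
    hits = []
    for category, markers in CANDIDATE_MARKERS.items():
        for marker in markers:
            # Plain substring test: no tokenization or part-of-speech check is done.
            if marker in sentence:
                hits.append((category, marker))
    return hits


if __name__ == "__main__":
    samples = [
        "ロームは半導体の原材料調達や生産での協業を探る見通し。",
        "記者団にロシアからの攻撃か聞かれた。",
        "加盟国のどこかが武力攻撃を受ける可能性。",
        "誰かのいたずらかも知れない。",
        "すべての締結国への攻撃とみなされる。",
    ]
    for sample in samples:
        print(sample, find_candidate_markers(sample))
```

A substring test like this will over-match in real text (for example, での also occurs outside Concepts), so it is only useful for collecting sentences to review, not for deciding where a split hint belongs.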

To be discussed with @Rei-hub and Dr. Torikai:

  • whether to implement each of these expressions
  • whether they should all use different attribute labels

makorin0315, Dec 10 '22 00:12