mtdata
mtdata copied to clipboard
Add MaCoCu corpora
List of available pairs:
- English-Turkish
- English-Bulgarian
- English-Croatian
- English-Slovene
- English-Macedonian
- English-Icelandic
- English-Maltese
English-Spanish and English-Dutch are Paracrawl 9 enriched DSI (domain) data, so there's no need to add them. More languages will come next year (Albanian, Serbian, Montenegrin and Bosnian).
EDIT: forgot the link macocu.eu