Jim O’Regan

Results 11 comments of Jim O’Regan

@nguyenq - does the new Vietnamese language pack fix this issue?

> I might be able to, though I'd much prefer to re-run the original scripts – @jimregan I don't suppose you've kept any records? I lost that laptop 4 years...

I lost my laptop three weeks ago, so it'll be a while before I can look at this. On Thursday, 25 October 2018, Kevin Brubeck Unhammer < [email protected]> wrote: >...

It seems I didn't make this clear before: ICX is used at _runtime_ — the characters in the ICX file are discarded by lt-proc before the string is parsed. The...

I've started doing some of this, because it's helpful for adding labels in other languages. And in finding overlap.

Also, wikidata has https://www.wikidata.org/wiki/Property:P1628 which has been used to add some links to DBpedia properties (e.g., https://www.wikidata.org/wiki/Property:P17)

Ok, well that mapping needs to go. And never be mentioned again!

This is what 02b0bba5fa8c92d131b8374b9f6ff3ab3ab3e5a1 did — you give lt-proc a .icx file with the characters you want to ignore. Soft hyphen was just a special-case default that Gema asked for.

FWIW, this is the Dockerfile I'm sending my students: ``` FROM ubuntu:22.04 RUN apt update RUN apt install -y git python3-pip libsndfile1 RUN apt install -y automake autoconf libtool RUN...