Santiago Castro

Results 109 comments of Santiago Castro

I think this is the same issue as #43

I was also having the same problem with Guake until I enabled an option called "Bold text is also bright (VTE >=0.52)".

> Hey Santiago! You bring up a very understandable concern. It's slightly discussed on calver.org, in so far as Twisted and Ubuntu LTS provide a time-based SLA on supported APIs....

> It's not at all just JavaScript, I can't even begin to enumerate the times I've seen this happen. > > Depending on the scale of the API break, the...

No idea. I'm using my branch in a repo: https://github.com/bryant1410/resume/blob/4b15222/.github/workflows/deploy.yml#L22

It doesn't, as the interface is the same.

It seems from the code :man_shrugging: https://github.com/huggingface/tokenizers/blob/11dd6c8baef9ae2b836d594215f14a208dbacfb2/tokenizers/src/tokenizer/mod.rs#L364

It could be a big impact if 1) your whole dataset fits in memory (you can sent chunks also) and 2) you tokenize altogether.