Benjamin Hoover

Results 10 issues of Benjamin Hoover

First off, I love this library. It is so much more elegant and intuitive than flax while being more fully featured than equinox (I guess it helps that I use...

Tokenization is perfectly aligned for many english sentences, but breaks whenever a SPACY_EXCEPTION is part of a larger, hyphenated word. For example, "whatve-you-dont" would produce two different tokenizations: ``` alnr...

![image](https://user-images.githubusercontent.com/24350185/123656012-86036800-d7fd-11eb-81d9-278233241431.png) Should also allow `Top-10 Diff`, corresponding to the search results ![image](https://user-images.githubusercontent.com/24350185/123656118-a0d5dc80-d7fd-11eb-9440-aa745be234a3.png) It is returned as part of the following packet: ``` return { "text": text, "tokens": tokens, "m1": {...

> how difficult would it be to show distribution over all the data and highlight the picked examples? ![image](https://user-images.githubusercontent.com/24350185/123303478-d4f28a00-d4eb-11eb-9cd1-273de5345cac.png)

Create datasets and analysis results of `gpt-gen` and `distillgpt2-gen`. Questions - How would you generate diverse phrases? (one per line)? Would you have a prompt dataset?

Consider porting documentation to `mkdocs` to have a professional feel for this tool

1. `mrpc` is a sufficient description of `glue_mrpc` since glue is the task name encompassing several datasets 2. We would like to provide a popup near the dataset name that...

I am having trouble integrating sverto into an existing Quarto project. I am encountering the following error: ``` hoo@Hoo-M1 test-sverto-2 % quarto preview Preparing to preview sverto-prerender.lua Error running filter...

Addresses #1461 Using quarto and its VS code extension, I find that writing `.qmd` files to be a smoother interactive alternative to `.ipynb` files. That `.qmd` files are plain text...