Brian Arbuckle

15 comments by Brian Arbuckle

Let me know if you would like to add a cocktail data set. I could create a new issue to work out the format of the data I currently have. It...

I should add that there are a handful of "made famous by" items. Might be a nice item to add.

Hi @ontocord, thanks! It is up: #1286

Hi @dctanner looks great! I have grappled with the question of what to do with the same title and different recipe. Unique ids solve the question, and that may be...

Thank you, @ontocord ! I just started reading through the [submitting datasets guide](https://github.com/LAION-AI/Open-Assistant/blob/main/openassistant/datasets/README.md). Since a pull request is involved, I'll start small and get some eyes on it. Then I...

Hi @ontocord, I have submitted my PR. I was unsure of where to add the dataset folder, so I placed it in `Open-Assistant/openassistant/datasets/cocktail_recipes`. I was also unsure of the Python...

Do you mean something like the Python [unicodedata](https://docs.python.org/3/library/unicodedata.html#module-unicodedata) module?

```python
import unicodedata

# Look up the official Unicode character name for a symbol.
print(unicodedata.name("∀"))
```

prints

```
FOR ALL
```
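To make the idea concrete, here is a rough sketch of how that lookup might be applied to a whole string rather than a single character. The function name `spell_out_math_symbols` is made up for illustration, and restricting the replacement to the Unicode "Sm" (math symbol) category is my assumption, not something settled in this thread. Note that ASCII characters such as `+` and `=` are also in that category, so they get spelled out too.

```python
import unicodedata


def spell_out_math_symbols(text: str) -> str:
    """Replace each math symbol (Unicode category "Sm") with its lowercase name.

    Hypothetical helper for illustration only; it works on raw text.
    """
    parts = []
    for ch in text:
        if unicodedata.category(ch) == "Sm":
            # Pass a default so characters without a name do not raise ValueError.
            name = unicodedata.name(ch, "")
            parts.append(f" {name.lower()} " if name else ch)
        else:
            parts.append(ch)
    # Collapse the padding added around names (this also normalizes
    # all other whitespace runs to single spaces).
    return " ".join("".join(parts).split())


print(spell_out_math_symbols("∀x ∈ S"))
```

The output is just the official character names strung together, so the phrasing will still read as mechanical.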

If the math is in LaTeX, I see there is [flatlatex](https://pypi.org/project/flatlatex/). Since the grammar of the `unicodedata.name` output feels stilted, and the `unicodedata` module is just a glorified dictionary, there is no...

Makes sense.

Will the input text be a token from spaCy or just the raw text?