Johannes Hötter
Johannes Hötter
**Is your feature request related to a problem? Please describe.** The creation of embeddings can range from straight-forward to super customized. Similar to labeling functions, the creation of embeddings should...
**Is your feature request related to a problem? Please describe.** I have multiple files which I want to combine, e.g. source_a and source_b. Or I want to modify data before...
**Is your feature request related to a problem? Please describe.** I have e.g. some tweets that I want to clean for my model. I have identified and built a weak...
**Is your feature request related to a problem? Please describe.** Similar to pandas, I want to be able to create new attributes given some logic to apply. **Describe the solution...
**Is your feature request related to a problem? Please describe.** The spaCy tokenizers sometimes lead to wrong tokens, e.g. for HTML data, tweets or often domain-specific terms. For instance, `'refinery...
**What is missing in the docs?** There is no description about the configuration page and its initial settings, and how to change them. For instance, see discussions thread #58
**Is your feature request related to a problem? Please describe.** Weak supervision is not only one specific algorithm, but you can actually choose from a set of formulas. Let users...
**Is your feature request related to a problem? Please describe.** If I create new heuristics or improve my active learner, I want to versionize my weakly supervised labels, as I...
**Is your feature request related to a problem? Please describe.** There are a lot of background tasks running, e.g. embedding creation, tokenization or zero-shot. It can become difficult to keep...
**Is your feature request related to a problem? Please describe.** Currently, refinery only supports zero-shot classification. **Describe the solution you'd like** Embed zero-shot models for extraction tasks from HuggingFace **Describe...