Writing weak supervision results into accessible metadata
Is your feature request related to a problem? Please describe. I have e.g. some tweets that I want to clean for my model. I have identified and built a weak supervision procedure to tag them automatically. Still, I can't get rid of these links inside the application.
Describe the solution you'd like
Feed the weakly supervised data back into the record data. For instance, if I know that "Check out this tool! https://github.com/code-kern-ai/refinery" contains the link "https://github.com/code-kern-ai/refinery", write that metadata into the record itself:
{
"tweet": "Check out this tool! https://github.com/code-kern-ai/refinery",
"tweet__entities": [
"link": [5, 6]
]
}
Describe alternatives you've considered Doing that modification outside of the application, e.g. in a jupyter notebook. However, this makes iteration harder and it currently is not possible to update the attributes themselves.
Additional context related to #40