Writing weak supervision results into accessible metadata

Open jhoetter opened this issue 3 years ago • 0 comments

Is your feature request related to a problem? Please describe. I have e.g. some tweets that I want to clean for my model. I have identified and built a weak supervision procedure to tag them automatically. Still, I can't get rid of these links inside the application.

Describe the solution you'd like Feed the weakly supervised data back into the record data. For instance, if I know that "Check out this tool! https://github.com/code-kern-ai/refinery" contains the link "https://github.com/code-kern-ai/refinery", write that metadata into the record itself:

{
  "tweet": "Check out this tool! https://github.com/code-kern-ai/refinery",
  "tweet__entities": [
    "link": [5, 6]
  ]
}

Describe alternatives you've considered Doing that modification outside of the application, e.g. in a jupyter notebook. However, this makes iteration harder and it currently is not possible to update the attributes themselves.

Additional context related to #40

Aug 02 '22 12:08 jhoetter