bricks
bricks copied to clipboard
[MODULE] - Text preprocessing using texthero
Please describe the module you would like to add to bricks Texthero is a cool library that might be useful for text preprocessing.
Do you already have an implementation? If so, please share it here. For instance:
import texthero as hero
import pandas as pd
text = "This sèntencé (123 /) needs to [OK!] be cleaned! "
s = pd.Series(text)
s = hero.remove_digits(s)
Additional context Library can be found here: https://github.com/jbesomi/texthero