Process for pretraining BERT and preprocessing the unlabeled data
Thanks for your code! Could you please share the code that pretrains BERT on the unlabeled review data from Amazon and Yelp? It would also help to see the code that preprocesses the data beforehand. Looking forward to your reply. Thank you!
https://github.com/howardhsu/BERT-for-RRC-ABSA/blob/master/pytorch-pretrained-bert/src/gen_pt_review.py
Getting the raw text should be straightforward, so we didn't release code for that step.
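For readers who want a starting point for the raw-text step, here is a minimal sketch. It is not the authors' code. It assumes the reviews come as JSON-lines dumps in which the review body sits under a `text` key (as in the Yelp dataset) or a `reviewText` key (as in the Amazon review dumps); the function names `extract_review_text` and `write_raw_text` are hypothetical, and the one-review-per-line output is only an assumption about what a later pretraining-data script might want, so check it against `gen_pt_review.py` before relying on it.

```python
import json
from typing import Iterable, Iterator, Sequence


def extract_review_text(
    lines: Iterable[str],
    text_keys: Sequence[str] = ("text", "reviewText"),  # assumed field names
) -> Iterator[str]:
    """Yield the review body of each JSON-lines record, whitespace-normalized."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed records instead of aborting
        if not isinstance(record, dict):
            continue
        for key in text_keys:
            value = record.get(key)
            if isinstance(value, str) and value.strip():
                # Collapse newlines and repeated spaces so each review fits on one line.
                yield " ".join(value.split())
                break


def write_raw_text(in_path: str, out_path: str) -> int:
    """Write one review per line to out_path and return the number written."""
    count = 0
    with open(in_path, encoding="utf-8") as src, \
            open(out_path, "w", encoding="utf-8") as dst:
        for text in extract_review_text(src):
            dst.write(text + "\n")
            count += 1
    return count
```

Calling `write_raw_text("reviews.json", "reviews.txt")` (both paths are placeholders) would produce a plain-text file with one review per line. Records without a recognized text field are silently dropped.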
(In reply to zjj0266's message of Thu, Aug 19, 2021 at 1:56 AM.)