Parag Ekbote
Parag Ekbote
You could use the following libraries for parsing PDF & DOCX Documents: 1)PyPDF2: A pure-python library built as a PDF toolkit. It can be used to extract text, metadata, and...
Hi, can I please move the required files to examples/research_projects? Maybe that could close this pending PR? cc: @sayakpaul
I have opened a PR with the requested changes. Could you please review? cc: @sayakpaul
Thank you for @kashif for allowing me to lead the PR to its completion. Could we please close this PR as #9935 has been merged. cc: @sayakpaul
Could you please point to the files/ folder you'd like to document? I'd love to open a PR to fix this. cc: @clefourrier
I've removed the pre-training datasets from the general-purpose section. Do let me know if further modifications are required. cc: @mlabonne
I've updated the PR to remove the duplicated dataset. Could you please review? cc: @mlabonne
Can you please review? cc: @mlabonne
Can you please review? cc: @mlabonne
Can you please review the PR and suggest any improvements? cc: @mlabonne