node-word-extractor
Read data from a Word document using node.js
Thanks for making such a great lib. I wonder whether there is a way to know that some text is prefixed with a numbering indicator or bullet point? For example: I...
Bumps [minimist](https://github.com/substack/minimist) from 1.2.5 to 1.2.6. Commits: 7efb22a 1.2.6; ef88b93 security notice for additional prototype pollution issue; c2b9819 isConstructorOrProto adapted from PR; bc8ecee test from prototype pollution PR. See full...
Bumps [tmpl](https://github.com/daaku/nodejs-tmpl) from 1.0.4 to 1.0.5. Commits: see full diff in compare view. Dependabot will resolve any conflicts with this PR as long as you don't alter...
A second common use case, added by request in `Text::Extract::Word`, was to read form data from protected Word files. Again, the code for this still exists in the Perl component,...
The **handleEntry()** function in the **open-office-extractor.js** file reads streams in 4096-byte chunks: `const chunk = readStream.read(0x1000);` Given the text in the *.docx file is not in...
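
To illustrate what fixed-size chunking can do to text in general, here is a minimal sketch, not the library's code. It assumes UTF-8 input and uses two hypothetical helpers, `decodeInChunks` and `decodeNaively`, to compare decoding each chunk on its own with decoding through Node's `StringDecoder`, which holds back incomplete multi-byte sequences until the next chunk arrives. Whether this is the problem the issue describes is an assumption.

```javascript
const { StringDecoder } = require('string_decoder');

// Hypothetical helper: decode a Buffer slice by slice, the way a chunked
// reader would see it, while carrying partial characters across slices.
function decodeInChunks(buffer, chunkSize = 0x1000) {
  const decoder = new StringDecoder('utf8');
  let text = '';
  for (let offset = 0; offset < buffer.length; offset += chunkSize) {
    // subarray() returns a view on the same memory; no bytes are copied.
    text += decoder.write(buffer.subarray(offset, offset + chunkSize));
  }
  // end() flushes anything still buffered inside the decoder.
  return text + decoder.end();
}

// Naive variant for comparison: every slice is decoded independently, so a
// character split across two slices becomes replacement characters.
function decodeNaively(buffer, chunkSize = 0x1000) {
  let text = '';
  for (let offset = 0; offset < buffer.length; offset += chunkSize) {
    text += buffer.subarray(offset, offset + chunkSize).toString('utf8');
  }
  return text;
}
```

With a small chunk size that cuts a two-byte character in half, `decodeInChunks` returns the original string while `decodeNaively` does not. Concatenating all chunks with `Buffer.concat` before decoding is another way to avoid the split.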
Regarding issue https://github.com/morungos/node-word-extractor/issues/54. Unfortunately, it seems we can only get the encoding from the XML header once we read the stream, and we had better assume an encoding before that...
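
As a rough sketch of the idea of assuming an encoding first and then checking the XML declaration, the hypothetical helper below (`sniffXmlEncoding` is not part of the library) inspects the first bytes of an already-read buffer and falls back to a default when no declaration is found. It only handles ASCII-compatible encodings; a UTF-16 document would not match and would get the fallback.

```javascript
// Hypothetical helper: look for encoding="..." in the XML declaration at the
// start of a buffer, returning `fallback` when none is present.
function sniffXmlEncoding(buffer, fallback = 'utf-8') {
  // latin1 maps every byte to exactly one character, so decoding never fails,
  // whatever the real encoding of the document turns out to be.
  const head = buffer
    .subarray(0, 200)
    .toString('latin1')
    .replace(/^\u00ef\u00bb\u00bf/, ''); // drop a UTF-8 byte order mark if present
  const match = /^\s*<\?xml[^>]*\bencoding\s*=\s*["']([A-Za-z0-9._-]+)["']/.exec(head);
  return match ? match[1].toLowerCase() : fallback;
}
```

The assumed default (`utf-8` here) is what a caller would use until enough of the stream has been read to run the check.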
Hello! I was wondering if it would be possible to add some paging functionality. This issue could serve as three related requests: 1. A way to iterate through pages 2....
Bumps [ws](https://github.com/websockets/ws) from 7.4.6 to 7.5.10. Release notes, sourced from ws's releases. 7.5.10 bug fixes: backported e55e5106 to the 7.x release line (22c28763). 7.5.9 bug fixes: backported bc8bd34e to the...
Cannot read properties of null (reading 'open') while using Uint8Array as params for extract method.
If I pass Uint8Array data to the extract method, as in this example: `const extracted = await extractor.extract(data);` I get this error: `TypeError: Cannot read properties of null (reading 'open')`
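
One thing to try, sketched here under the assumption that `extract` accepts a Node.js `Buffer` as well as a file path, is to wrap the `Uint8Array` in a `Buffer` before passing it in. The helper name `toBuffer` is hypothetical, and whether this resolves the error above has not been verified.

```javascript
// Hypothetical helper: wrap a Uint8Array in a Node.js Buffer without copying.
function toBuffer(data) {
  if (Buffer.isBuffer(data)) {
    return data; // already a Buffer, nothing to do
  }
  if (data instanceof Uint8Array) {
    // Respect byteOffset/byteLength so a view onto a larger ArrayBuffer
    // is not widened to the whole underlying buffer.
    return Buffer.from(data.buffer, data.byteOffset, data.byteLength);
  }
  throw new TypeError('Expected a Buffer or Uint8Array');
}
```

The call from the report would then become `const extracted = await extractor.extract(toBuffer(data));`.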
How do I extract Word to HTML?