sbd
Sentence Boundary Detection in JavaScript for Node. http://tessmore.github.io/sbd/
Example: Input: Here is a question about this lesson: \n\n What is one of the key methods to effectively express and seek opinions in a professional or conversational context? Output:...
Bumps [postcss](https://github.com/postcss/postcss) from 8.2.14 to 8.4.31. Release notes Sourced from postcss's releases. 8.4.31 Fixed \r parsing to fix CVE-2023-44270. 8.4.30 Improved source map performance (by @romainmenke). 8.4.29 Fixed Node#source.offset (by...
With `newline_boundaries: false` set, the text below is interpreted as a single sentence. If you lowercase `alien`, it splits into two sentences. ``` I love Alien. It's a great...
Bumps [semver](https://github.com/npm/node-semver) from 5.7.1 to 5.7.2. Release notes Sourced from semver's releases. v5.7.2 5.7.2 (2023-07-10) Bug Fixes 2f8fd41 #585 better handling of whitespace (#585) (@joaomoreno, @lukekarrys) Changelog Sourced from semver's...
How can I recognize and split a sentence if the input sentence does not have punctuation? For example: ``` On Jan 20 former Sen Barack Obama became the 44th President...
Go to the test website http://tessmore.github.io/sbd/ and paste the TWO sentences below into the editor. It returns a count of THREE: On Jan. 20, former Sen. Barack Obama became...
Hi @Tessmore, first of all, your library is quite helpful, nice work. I was working with this code and got some unexpected results. Code: ``` const sbd = require('sbd'); const...
Hey! ### Context: ``` const options = { newline_boundaries: true, html_boundaries: true, html_boundaries_tags: [ 'br', 'p', 'h1', 'h2', 'h3', 'h4', 'h5', 'h6', 'ul', 'div', 'figcaption', ], sanitize: true, preserve_whitespace: true,...
I recently came across an edge case: if a text has a typo that puts end-of-sentence punctuation directly against the next sentence without whitespace, the result in the...
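One possible workaround for the missing-whitespace case is to normalize the text before handing it to the tokenizer. The sketch below is only an illustration: `insertMissingSpaces` is a hypothetical helper, not part of sbd, and it assumes that a lowercase letter, then `.`, `!` or `?`, then a capitalized word marks a glued sentence boundary.

```javascript
// Hypothetical pre-processing helper (not part of sbd).
// Assumption: "lowercase letter + [.!?] + Capitalized word" with no space
// in between is a typo at a sentence boundary.
function insertMissingSpaces(text) {
  // The lookahead (?=[A-Z][a-z]) checks the next word without consuming it,
  // so only a single space is added after the punctuation.
  return text.replace(/([a-z])([.!?])(?=[A-Z][a-z])/g, '$1$2 ');
}

const fixed = insertMissingSpaces('This is a typo.The next sentence starts here.');
console.log(fixed);
// prints "This is a typo. The next sentence starts here."
```

The pattern leaves strings such as `example.com` or `U.S.A` untouched, because the character after the punctuation is lowercase or the character before it is uppercase. It would still misfire on identifiers like `node.Js`, so it is probably best applied only to prose input.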