Bump tika-version from 1.5 to 1.22 in /parent
Bumps tika-version from 1.5 to 1.22.
Updates tika-core from 1.5 to 1.22
Changelog
Sourced from tika-core's changelog.
Release 2.0.0 - ??? BREAKING CHANGES in 2.0.0
- Remove deprecated Metadata keys/properties (TIKA-1974).
Other changes
Release 1.23
Upgrade to POI 4.1.1 (TIKA-2851).
Upgrade to PDFBox 2.0.17 (TIKA-2951).
Ensure that the PDFParser respects custom configuration of Tesseract from tika-config.xml via Eric Pugh (TIKA-2970).
Add parser for XLIFF v1.2 files (TIKA-2975).
Add mime type detection support for WebAssembly (TIKA-2894).
Add an XLZ Parser (TIKA-2976).
Release 1.22 - 07/29/2019
... (truncated)
NOTE: Known regression: PDFBOX-4587 -- PDF passwords with codepoints between 0xF000 and 0XF0000 will cause an exception.
Add parser for HWP v5 files via SooMyung Lee (soomyung) and JinSup Kim (ddoleye) (TIKA-2909).
Fix order of closing streams to avoid "Failed to close temporary resource" exception (TIKA-2908).
Improve AutoDetectReader performance by caching encoding detector (TIKA-1568).
Prevent RTFParser from outputting illegal tag combinations (TIKA-2889).
Fix RereadableInputStream to release all resources (TIKA-2903).
Implement custom language identifier in the tika-eval module based on OpenNLP's language detector; add 18 languages and add common words lists for all 121 languages (TIKA-2790).
Fix NPE in MimeTypesReader.releaseParser() via Eamonn Saunders (TIKA-2896).
Fix RTFParser to extract more content (TIKA-2883).
Add clientSubmitTime to the metadata extracted from PST files (TIKA-2898).
Commits
-
aa2a385[maven-release-plugin] prepare release 1.22-rc4 -
de0fca9roll back for rc#4...update date -
4db132eroll back for rc#4 -
c5daaf4Merge remote-tracking branch 'origin/branch_1x' into branch_1x -
357c163include opennlp lang model in tika-eval during assembly -
0f3790e[maven-release-plugin] prepare for next development iteration -
c23f47e[maven-release-plugin] prepare release 1.23-rc3 -
c25b81dMerge remote-tracking branch 'origin/branch_1x' into branch_1x -
fd40040roll back for rc#3, again... -
950ee35[maven-release-plugin] prepare for next development iteration - Additional commits viewable in compare view
Updates tika-parsers from 1.5 to 1.22
Changelog
Sourced from tika-parsers's changelog.
Release 2.0.0 - ??? BREAKING CHANGES in 2.0.0
- Remove deprecated Metadata keys/properties (TIKA-1974).
Other changes
Release 1.23
Upgrade to POI 4.1.1 (TIKA-2851).
Upgrade to PDFBox 2.0.17 (TIKA-2951).
Ensure that the PDFParser respects custom configuration of Tesseract from tika-config.xml via Eric Pugh (TIKA-2970).
Add parser for XLIFF v1.2 files (TIKA-2975).
Add mime type detection support for WebAssembly (TIKA-2894).
Add an XLZ Parser (TIKA-2976).
Release 1.22 - 07/29/2019
... (truncated)
NOTE: Known regression: PDFBOX-4587 -- PDF passwords with codepoints between 0xF000 and 0XF0000 will cause an exception.
Add parser for HWP v5 files via SooMyung Lee (soomyung) and JinSup Kim (ddoleye) (TIKA-2909).
Fix order of closing streams to avoid "Failed to close temporary resource" exception (TIKA-2908).
Improve AutoDetectReader performance by caching encoding detector (TIKA-1568).
Prevent RTFParser from outputting illegal tag combinations (TIKA-2889).
Fix RereadableInputStream to release all resources (TIKA-2903).
Implement custom language identifier in the tika-eval module based on OpenNLP's language detector; add 18 languages and add common words lists for all 121 languages (TIKA-2790).
Fix NPE in MimeTypesReader.releaseParser() via Eamonn Saunders (TIKA-2896).
Fix RTFParser to extract more content (TIKA-2883).
Add clientSubmitTime to the metadata extracted from PST files (TIKA-2898).
Commits
-
aa2a385[maven-release-plugin] prepare release 1.22-rc4 -
de0fca9roll back for rc#4...update date -
4db132eroll back for rc#4 -
c5daaf4Merge remote-tracking branch 'origin/branch_1x' into branch_1x -
357c163include opennlp lang model in tika-eval during assembly -
0f3790e[maven-release-plugin] prepare for next development iteration -
c23f47e[maven-release-plugin] prepare release 1.23-rc3 -
c25b81dMerge remote-tracking branch 'origin/branch_1x' into branch_1x -
fd40040roll back for rc#3, again... -
950ee35[maven-release-plugin] prepare for next development iteration - Additional commits viewable in compare view
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
-
@dependabot rebasewill rebase this PR -
@dependabot recreatewill recreate this PR, overwriting any edits that have been made to it -
@dependabot mergewill merge this PR after your CI passes on it -
@dependabot squash and mergewill squash and merge this PR after your CI passes on it -
@dependabot cancel mergewill cancel a previously requested merge and block automerging -
@dependabot reopenwill reopen this PR if it is closed -
@dependabot ignore this [patch|minor|major] versionwill close this PR and stop Dependabot creating any more for this minor/major version (unless you reopen the PR or upgrade to it yourself) -
@dependabot ignore this dependencywill close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) -
@dependabot use these labelswill set the current labels as the default for future PRs for this repo and language -
@dependabot use these reviewerswill set the current reviewers as the default for future PRs for this repo and language -
@dependabot use these assigneeswill set the current assignees as the default for future PRs for this repo and language -
@dependabot use this milestonewill set the current milestone as the default for future PRs for this repo and language
You can disable automated security fix PRs for this repo from the Security Alerts page.