Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump tika-parsers from 1.18 to 1.21 #20

Conversation

dependabot-preview[bot]
Copy link
Contributor

Bumps tika-parsers from 1.18 to 1.21.

Changelog

Sourced from tika-parsers's changelog.

Release 2.0.0 - ???
BREAKING CHANGES in 2.0.0

  • Remove deprecated Metadata keys/properties (TIKA-1974).

Other changes

Release 1.21 - ????

  • Add optional AUTO mode to OCR'ing of PDFs. If tesseract is installed
    and on the path, and this option is selected programmatically
    or via TikaConfig(), the PDFParser will use heuristics to decide
    whether or not to run OCR per page on PDFs. (TIKA-2749)

  • The ZipContainerDetector's default behavior was changed to run
    streaming detection up to its markLimit. Users can get the
    legacy behavior (spool-to-file/rely-on-underlying-file-in-TikaInputStream)
    by setting markLimit=-1. The POIFSContainerDetector requires an underlying file;
    it will try to spool the file to disk; if the file's length is > markLimit,
    it will not attempt detection; set markLimit to -1 for legacy behavior (TIKA-2849).

  • Upgrade PDFBox to 2.0.14 (TIKA-2834).

  • Add CSV detection and replace TXTParser with TextAndCSVParser;
    users can turn off CSV detection by excluding the TextAndCSVParser
    and adding back the TXTParser via tika-config (TIKA-2833).

  • Add a CSVParser. CSV detection is currently based solely on filename
    and/or information conveyed via Metadata (TIKA-2826).

  • General upgrades: asm, bouncycastle, commons-codec, commons-lang3, cxf,
    guava, h2, httpcomponents, jackcess, junrar, Lucene, mime4j, opennlp, parso,
    sqlite-jdbc (provided), zstd-jni (provided) (TIKA-2824)

  • Bundle xerces2 with tika-parsers (TIKA-2802).

  • Upgrade jaxb to 2.3.2 (TIKA-2819).

  • Upgrade jackson to 2.9.8 (TIKA-2717).

  • Update tika-eval's common tokens lists (TIKA-2822).

  • Handle bad tags in tika-eval more robustly (TIKA-2810).

  • Add reports for tags in tika-eval (TIKA-2809).

  • Extract text from SDT element within textboxes in .docx files (TIKA-2807).

  • Try to handle truncated OOXML files more robustly (TIKA-2765).

... (truncated)
Commits
  • 7d4dd43 [maven-release-plugin] prepare release 1.21-rc2
  • 9f2bb6f Update CHANGES.txt for 1.21 release.
  • f1b7ee6 rollback to 1.21-SNAPSHOT for rc2
  • a597eb6 TIKA-2873 -- workaround for newly re-discovered bug in POI's ChunkedCipherInp...
  • dd17e10 TIKA-2872 -- add slf4j-log4j12 to tika-dl. Thank you, Oleg Tikhonov!
  • c4a9cc9 TIKA-2871 -- don't include testChm_oom.chm in multithreaded tests.
  • 698d5b3 [maven-release-plugin] prepare for next development iteration
  • 84e9af9 [maven-release-plugin] prepare release 1.21-rc1
  • 750b390 update CHANGES.txt for 1.21 release
  • 93512ee update license headers/exclusions for RAT
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Note: This repo was added to Dependabot recently, so you'll receive a maximum of 5 PRs for your first few update runs. Once an update run creates fewer than 5 PRs we'll remove that limit.

You can always request more updates by clicking Bump now in your Dependabot dashboard.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot ignore this [patch|minor|major] version will close this PR and stop Dependabot creating any more for this minor/major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
  • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
  • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
  • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
  • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language
  • @dependabot badge me will comment on this PR with code to add a "Dependabot enabled" badge to your readme

Additionally, you can set the following in your Dependabot dashboard:

  • Update frequency (including time of day and day of week)
  • Automerge options (never/patch/minor, and dev/runtime dependencies)
  • Pull request limits (per update run and/or open at any time)
  • Out-of-range updates (receive only lockfile updates, if desired)
  • Security updates (receive only security updates, if desired)

Finally, you can contact us by mentioning @dependabot.


Dependabot has been acquired by GitHub  🎉

@dependabot-preview dependabot-preview bot added the dependencies Pull requests that update a dependency file label May 23, 2019
@dependabot-preview
Copy link
Contributor Author

Superseded by #49.

@dependabot-preview dependabot-preview bot deleted the dependabot/maven/org.apache.tika-tika-parsers-1.21 branch August 2, 2019 10:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dependencies Pull requests that update a dependency file
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant