Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump tika-core from 1.21 to 1.22 #5166

Merged

Conversation

dependabot-preview[bot]
Copy link
Contributor

Bumps tika-core from 1.21 to 1.22.

Changelog

Sourced from tika-core's changelog.

Release 2.0.0 - ???
BREAKING CHANGES in 2.0.0

  • Remove deprecated Metadata keys/properties (TIKA-1974).

Other changes

Release 1.22 - ???

  • NOTE: Known regression: PDFBOX-4587 -- PDF passwords with codepoints
    between 0xF000 and 0XF0000 will cause an exception.

  • Add parser for HWP v5 files via SooMyung Lee (soomyung) and
    JinSup Kim (ddoleye) (TIKA-2909).

  • Fix order of closing streams to avoid "Failed to close temporary resource"
    exception (TIKA-2908).

  • Improve AutoDetectReader performance by caching encoding
    detector (TIKA-1568).

  • Prevent RTFParser from outputting illegal tag combinations (TIKA-2889).

  • Fix RereadableInputStream to release all resources (TIKA-2903).

  • Implement custom language identifier in the tika-eval module based on
    OpenNLP's language detector; add 18 languages and add common words
    lists for all 121 languages (TIKA-2790).

  • Fix NPE in MimeTypesReader.releaseParser() via Eamonn Saunders (TIKA-2896).

  • Fix RTFParser to extract more content (TIKA-2883).

  • Add clientSubmitTime to the metadata extracted from PST files (TIKA-2898).

  • Improve StreamingZipContainerDetector for xltx, xltm and
    several other file formats (TIKA-2886).

Release 1.21 - 05/14/2019

  • Add optional AUTO mode to OCR'ing of PDFs. If tesseract is installed
    and on the path, and this option is selected programmatically
    or via TikaConfig(), the PDFParser will use heuristics to decide
    whether or not to run OCR per page on PDFs. (TIKA-2749)

  • The ZipContainerDetector's default behavior was changed to run
    streaming detection up to its markLimit. Users can get the
    legacy behavior (spool-to-file/rely-on-underlying-file-in-TikaInputStream)
    by setting markLimit=-1. The POIFSContainerDetector requires an underlying file;
    it will try to spool the file to disk; if the file's length is > markLimit,

... (truncated)
Commits
  • aa2a385 [maven-release-plugin] prepare release 1.22-rc4
  • de0fca9 roll back for rc#4...update date
  • 4db132e roll back for rc#4
  • c5daaf4 Merge remote-tracking branch 'origin/branch_1x' into branch_1x
  • 357c163 include opennlp lang model in tika-eval during assembly
  • 0f3790e [maven-release-plugin] prepare for next development iteration
  • c23f47e [maven-release-plugin] prepare release 1.23-rc3
  • c25b81d Merge remote-tracking branch 'origin/branch_1x' into branch_1x
  • fd40040 roll back for rc#3, again...
  • 950ee35 [maven-release-plugin] prepare for next development iteration
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot ignore this [patch|minor|major] version will close this PR and stop Dependabot creating any more for this minor/major version (unless you reopen the PR or upgrade to it). To ignore the version in this PR you can just close it
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
  • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
  • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
  • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
  • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language
  • @dependabot badge me will comment on this PR with code to add a "Dependabot enabled" badge to your readme

Additionally, you can set the following in your Dependabot dashboard:

  • Update frequency (including time of day and day of week)
  • Automerge options (never/patch/minor, and dev/runtime dependencies)
  • Pull request limits (per update run and/or open at any time)
  • Out-of-range updates (receive only lockfile updates, if desired)
  • Security updates (receive only security updates, if desired)

Finally, you can contact us by mentioning @dependabot.

@LinusDietz LinusDietz merged commit 839b8a9 into master Aug 6, 2019
@LinusDietz LinusDietz deleted the dependabot/gradle/org.apache.tika-tika-core-1.22 branch August 6, 2019 12:39
Siedlerchr added a commit that referenced this pull request Aug 9, 2019
…rter

# By David Méndez (47) and others
# Via GitHub (5) and David Méndez (3)
* upstream/master: (57 commits)
  fix wrong package (#5181)
  Remove logging message for non-existing nested files
  Bump applicationinsights-core from 2.4.0 to 2.4.1 (#5171)
  Bump archunit-junit5-engine from 0.10.2 to 0.11.0 (#5157)
  Bump applicationinsights-logging-log4j2 from 2.4.0 to 2.4.1 (#5172)
  Bump tika-core from 1.21 to 1.22 (#5166)
  Fix fail on testPerformExportForSingleEntry from DocBook5ExporterTest (#5168)
  Add a check for nested files and improve the code to skip lines (DefaultTexParser)
  Add latest changes to latexintegration (#5170)
  LaTeX integration latest changes (#5167)
  Move to extended enums for fields and entry types (#5148)
  Bump archunit-junit5-api from 0.10.2 to 0.11.0 (#5158)
  Revert temporal change
  Fix all issues from reviews of #5137
  Bump com.simonharrer.modernizer from 1.6.0-1 to 1.8.0-1 (#5154)
  Bump checkstyle from 8.22 to 8.23 (#5153)
  Add a new JabRefIcons.LATEX_CITATIONS
  Change toString() methods
  Update DefaultTexParser for explaining when and why it skips the citation matching
  Update TexParserResult for avoiding 'orElse(null)'
  ...

# Conflicts:
#	src/main/java/org/jabref/logic/importer/fileformat/PdfContentImporter.java
#	src/test/java/org/jabref/logic/importer/fileformat/PdfContentImporterTest.java
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant