Uploaded image for project: 'Hippo CMS'
  1. Hippo CMS
  2. CMS-10830

Upgrade to Tika 1.7 (from 1.3)

    XMLWordPrintable

Details

    • Improvement
    • Status: Closed
    • Normal
    • Resolution: Fixed
    • None
    • 12-GA
    • cms7-project
    • None
    • Turing
    • Turing Sprint 160, Turing Sprint 161 (7/18-8/1), Turing Sprint 162

    Description

      Current version of Jackrabbit (2.10.x) already uses Tika 1.7, and definitely when we upgrade to Jackrabbit 2.14 (REPO-1489), which also still uses Tika 1.7, we need to verify and validate the impact of upgrading/aligning Tika version as well.

      Tika is used in the CMS, repository and services-webfiles. Pdfbox is used in EForms and the repository

      Tika 1.7 depends on pdfbox 1.8.8. As we already override pdfbox in the repository to be 1.8.8 no conflict should be expected. However depending on he project pdfbox might be 1.7.1 from the 1.3 Tika dependency

      Update documentation from 1.3 -> 1.7
      https://tika.apache.org/1.4/index.html
      https://tika.apache.org/1.5/index.html
      https://tika.apache.org/1.6/index.html
      https://tika.apache.org/1.7/index.html

      Tika 1.3

      <properties>
        <poi.version>3.8</poi.version>
        <codec.version>1.5</codec.version> <!-- NOTE: sync with POI -->
        <mime4j.version>0.7.2</mime4j.version>
        <vorbis.version>0.1</vorbis.version>
        </properties>
      
      <artifactId>pdfbox</artifactId>
      <version>1.7.1</version>
      

      Tika 1.7

      <properties>
        <poi.version>3.11</poi.version>
        <codec.version>1.9</codec.version>
        <!-- NOTE: sync with POI -->
        <compress.version>1.8.1</compress.version>
        <tukaani.version>1.5</tukaani.version>
        <!-- NOTE: sync with commons-compress -->
        <mime4j.version>0.7.2</mime4j.version>
        <vorbis.version>0.6</vorbis.version>
        <pdfbox.version>1.8.8</pdfbox.version>
      </properties>
      

       

      Attachments

        Activity

          People

            Unassigned Unassigned
            abanck Arent-Jan Banck (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: