[jira] Created: (TIKA-151) Stream compression support

[jira] Created: (TIKA-151) Stream compression support

by JIRA jira@apache.org

Stream compression support
--------------------------

                 Key: TIKA-151
                 URL: https://issues.apache.org/jira/browse/TIKA-151
             Project: Tika
          Issue Type: New Feature
          Components: parser
            Reporter: Jukka Zitting


Tika should automatically detect and decode stream compression formats like gzip or bzip2. When parsing, such compression should be mentioned in the resulting metadata (compression=gzip), but should not otherwise affect the result of the parsing. In other words, the extracted text content should be the same regardless of whether the input stream has been compressed or not.
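
For illustration only, the requested behaviour could be sketched roughly as follows. This is not the code committed to Tika: the class name CompressionSniffer and its methods are hypothetical, a plain Map stands in for Tika's metadata object, and "extracting text" is reduced to reading the stream as UTF-8. Detection relies on the well-known leading magic bytes (0x1f 0x8b for gzip, "BZh" for bzip2). Only gzip is decoded here, because the JDK ships no bzip2 decoder.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Map;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical helper, not part of Tika.
public class CompressionSniffer {

    /** Returns "gzip", "bzip2", or null, judging by the first bytes of the stream. */
    static String detect(InputStream in) throws IOException {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        in.mark(3);
        byte[] head = new byte[3];
        int n = 0;
        while (n < head.length) {
            int r = in.read(head, n, head.length - n);
            if (r < 0) {
                break; // stream is shorter than the magic prefix
            }
            n += r;
        }
        in.reset(); // leave the stream positioned at its start
        if (n >= 2 && (head[0] & 0xff) == 0x1f && (head[1] & 0xff) == 0x8b) {
            return "gzip";
        }
        if (n >= 3 && head[0] == 'B' && head[1] == 'Z' && head[2] == 'h') {
            return "bzip2";
        }
        return null;
    }

    /** Reads the (possibly gzip-compressed) stream as UTF-8 text and records the compression. */
    static String extractText(InputStream raw, Map<String, String> metadata) throws IOException {
        InputStream in = new BufferedInputStream(raw);
        String compression = detect(in);
        if (compression != null) {
            metadata.put("compression", compression);
        }
        if ("gzip".equals(compression)) {
            in = new GZIPInputStream(in);
        } else if ("bzip2".equals(compression)) {
            // Assumption: a real implementation would plug in an external bzip2 decoder here.
            throw new UnsupportedOperationException("bzip2 decoding is not available in the JDK");
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[4096];
        for (int r; (r = in.read(buf)) != -1; ) {
            out.write(buf, 0, r);
        }
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    /** Test helper: gzip-compresses a string. */
    static byte[] gzip(String text) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(bytes)) {
            gz.write(text.getBytes(StandardCharsets.UTF_8));
        }
        return bytes.toByteArray();
    }
}
```

With such a helper, feeding the same text once as plain bytes and once through gzip() should yield identical extracted text, with the metadata map differing only by the compression entry.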

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (TIKA-151) Stream compression support

by JIRA jira@apache.org


     [ https://issues.apache.org/jira/browse/TIKA-151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-151.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 0.2-incubating
         Assignee: Jukka Zitting

Implemented gzip and bzip2 support in revision 692281.

> Stream compression support
> --------------------------
>
>                 Key: TIKA-151
>                 URL: https://issues.apache.org/jira/browse/TIKA-151
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>             Fix For: 0.2-incubating
>
>
> Tika should automatically detect and decode stream compression formats like gzip or bzip2. When parsing, such compression should be mentioned in the resulting metadata (compression=gzip), but should not otherwise affect the result of the parsing. In other words, the extracted text content should be the same regardless of whether the input stream has been compressed or not.