[jira] Created: (TIKA-54) Outlook msg parser

View: New views
10 Messages — Rating Filter:   Alert me  

[jira] Created: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Outlook msg parser
------------------

                 Key: TIKA-54
                 URL: https://issues.apache.org/jira/browse/TIKA-54
             Project: Tika
          Issue Type: New Feature
    Affects Versions: 0.1-incubator
            Reporter: Rida Benjelloun
             Fix For: 0.1-incubator


Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
Rida.



--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


     [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rida Benjelloun updated TIKA-54:
--------------------------------

    Attachment: OutlookMsgParser.zip

The zip file contain the OutlookMsgParser class, test file and libraries.

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>    Affects Versions: 0.1-incubator
>            Reporter: Rida Benjelloun
>             Fix For: 0.1-incubator
>
>         Attachments: OutlookMsgParser.zip
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


     [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rida Benjelloun updated TIKA-54:
--------------------------------

    Component/s: general

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>            Reporter: Rida Benjelloun
>             Fix For: 0.1-incubator
>
>         Attachments: OutlookMsgParser.zip
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


    [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12533975 ]

Jukka Zitting commented on TIKA-54:
-----------------------------------

Sorry, msgparser is GPL software, so we can't depend on it directly (doing so would force us to use the GPL for all of Tika).

Can we find an alternative Outlook parser?

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>            Reporter: Rida Benjelloun
>             Fix For: 0.1-incubator
>
>         Attachments: OutlookMsgParser.zip
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


    [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12534028 ]

Sami Siren commented on TIKA-54:
--------------------------------

how about using poi directly:
http://poi.apache.org/apidocs/org/apache/poi/hsmf/package-summary.html

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>            Reporter: Rida Benjelloun
>             Fix For: 0.1-incubator
>
>         Attachments: OutlookMsgParser.zip
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


    [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12534113 ]

Rida Benjelloun commented on TIKA-54:
-------------------------------------

I will try to implement it using POI. Thanks sami for the suggestion.
Regards

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>            Reporter: Rida Benjelloun
>             Fix For: 0.1-incubator
>
>         Attachments: OutlookMsgParser.zip
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


     [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dave Meikle updated TIKA-54:
----------------------------

    Attachment: TIKA54.patch

Patch containing a simple Outlook Message Parser implementation using POI

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubating
>            Reporter: Rida Benjelloun
>             Fix For: 0.2-incubating
>
>         Attachments: OutlookMsgParser.zip, TIKA54.patch
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Issue Comment Edited: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


    [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12610645#action_12610645 ]

davemeikle edited comment on TIKA-54 at 7/4/08 12:21 PM:
----------------------------------------------------------

Patch containing a simple Outlook Message Parser implementation using POI

NOTE: The test relies on the test message in the previously attached ZIP

      was (Author: davemeikle):
    Patch containing a simple Outlook Message Parser implementation using POI
 

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubating
>            Reporter: Rida Benjelloun
>             Fix For: 0.2-incubating
>
>         Attachments: OutlookMsgParser.zip, TIKA54.patch
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


    [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12630814#action_12630814 ]

Dave Meikle commented on TIKA-54:
---------------------------------

This has been checked in on 10/07/2008. Is there anything else that needs to be done on this issue?

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubating
>            Reporter: Rida Benjelloun
>             Fix For: 0.2-incubating
>
>         Attachments: OutlookMsgParser.zip, TIKA54.patch
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (TIKA-54) Outlook msg parser

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message


     [ https://issues.apache.org/jira/browse/TIKA-54?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-54.
-------------------------------

    Resolution: Fixed
      Assignee: Jukka Zitting

Ah, sorry for not updating the issue after committing the patch. I wanted to integrate Outlook parsing with OfficeParser so we could make the magic autodetection work also for Outlook files.

See revision 695223 for the relevant changes.  With that I think we can resolve this as Fixed.

> Outlook msg parser
> ------------------
>
>                 Key: TIKA-54
>                 URL: https://issues.apache.org/jira/browse/TIKA-54
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubating
>            Reporter: Rida Benjelloun
>            Assignee: Jukka Zitting
>             Fix For: 0.2-incubating
>
>         Attachments: OutlookMsgParser.zip, TIKA54.patch
>
>
> Attached Outlook msg parser. I didn't create a patch for this issue because there is no mimetype detector for application/msoutlook in tika-mimetypes.xml.
> I have used the library (http://auxilii.com/msgparser/) for parsing msg file witch is based on POI.
> Rida.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

LightInTheBox - Buy quality products at wholesale price