Accessing contents from Nutch crawls

View: New views
1 Messages — Rating Filter:   Alert me  

Accessing contents from Nutch crawls

by schiffinho :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hello,

I,'ve got the follwing problem:
I've setup Nutch to crawl some web pages and it works fine. Using Zend_Search_Lucene to search this index works also. The problem is that I don't know how to retrieve other fields than url and title from the search result (I want to output a google-like summary in the result). If I use the nutch software it works and I can retrieve the summary
So my question is, is it possible to retrieve the content field from the nutch created index in PHP?


Regards

LightInTheBox - Buy quality products at wholesale price!