logo       

Re: Dumping what I have?: msg#00294

nutch-user.lucene.apache.org

Subject: Re: Dumping what I have?

Hi Paul,

yeah there is a dump command

bin/nutch readlinkdb crawl/linkdb/ -dump dumpdir
You can also dump the CrawlDB, but I dont know if the complete data are
dumpable and this is usefull for you...

HTH

Mario

Paul Tomblin wrote:
> The nutch data files are pretty opaque, and even "strings" can't extract
> anything except the occasional URL. Is there any code to dump the contents
> of the various files in a human readable form?
>
>

--

Mario Schröder | http://www.finanz-checks.de
Office: +49 361 2152062
Phone: +49 34464 62301 Cell: +49 163 27 09 807
http://www.xing.com/go/invite/6035007.9c143c

<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | Mail Home | sitemap | FAQ | advertise