

nutch-user.lucene.apache.org

Subject: Re: nutch -threads in hadoop

Brian Tingle wrote:
Hey,


I'm playing around with Nutch on Hadoop. When I run
hadoop jar nutch-1.0.job org.apache.nutch.crawl.Crawl -threads ...
is that threads per node or total threads for all nodes?

It is threads per map task. If you run multiple map tasks per node, you will get numThreads * numMapTasks threads per node.

So be careful to set it to a number that doesn't overwhelm your network ;)
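
To make the arithmetic concrete, here is a rough sketch in Python. The function names and the example numbers (10 threads, 2 map tasks per node, 4 nodes) are made up for illustration and are not part of Nutch or Hadoop. The cluster-wide figure also assumes every node runs the same number of map tasks at the same time, which may not hold on a real cluster.

```python
def threads_per_node(num_threads: int, num_map_tasks: int) -> int:
    """Fetcher threads running on one node.

    num_threads:   the value passed to -threads (threads per map task)
    num_map_tasks: map tasks running concurrently on that node (assumed)
    """
    if num_threads < 1 or num_map_tasks < 0:
        raise ValueError("num_threads must be >= 1 and num_map_tasks >= 0")
    return num_threads * num_map_tasks


def threads_in_cluster(num_threads: int, num_map_tasks: int, num_nodes: int) -> int:
    """Upper estimate for the whole cluster, assuming identical nodes."""
    if num_nodes < 0:
        raise ValueError("num_nodes must be >= 0")
    return threads_per_node(num_threads, num_map_tasks) * num_nodes


if __name__ == "__main__":
    # Hypothetical setup: -threads 10, 2 map tasks per node, 4 nodes.
    print(threads_per_node(10, 2))       # 20 threads on each node
    print(threads_in_cluster(10, 2, 4))  # 80 threads across the cluster
```

So with -threads 10 and two map tasks per node, each node would be fetching with 20 threads, and that per-node number is the one to size against your network.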

--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com
