logo       

Re: Nutch crawling status: msg#00268

nutch-user.lucene.apache.org

Subject: Re: Nutch crawling status


I've found the script here
http://wiki.apache.org/nutch/MonitoringNutchCrawls. But I'm not sure how can
I use it, when hadoop is on the farm of 15 machines? May be I should use
hadoop tasktracker instead of this script somehow?

caezar wrote:
>
> Hi All,
>
> Is there a way, to retrieve nutch crawling status at runtime? Let me
> describe what I mean. For instance if currently fetch job is running, I
> want to retrieve that fetch is running, how many URLs already fetched, how
> many errors occured. Hadoop farm is used.
>
> Thanks for any ideas.
>

--
View this message in context:
http://www.nabble.com/Nutch-crawling-status-tp24681707p24681949.html
Sent from the Nutch - User mailing list archive at Nabble.com.

<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | Mail Home | sitemap | FAQ | advertise