logo       

Crawling: msg#00216

nutch-user.lucene.apache.org

Subject: Crawling


Can anyone suggest me how may i know a page is updated before it is being
downloaded, so that i can recrawl it.
although i have used page info but that is not reliable.
--
View this message in context:
http://www.nabble.com/Crawling-tp24566308p24566308.html
Sent from the Nutch - User mailing list archive at Nabble.com.

<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

News | Mail Home | sitemap | FAQ | advertise