osdir.com
mailing list archive
F.A.Q.
-since 2001!
Monthly Indexes
2012-05
2012-04
2012-03
2012-02
2012-01
2011-12
2011-11
2011-10
2011-09
2011-08
2011-07
2011-06
2011-05
2011-04
2011-03
2011-02
2011-01
2010-12
2010-11
dev.nutch.apache.org (thread)
Page(s): 1 |
2
of 2
[jira] [Commented] (NUTCH-923) Multilingual support for Solr-index-mapping
[jira] [Created] (NUTCH-1368) SolrDeleteDuplicates.java:270
[jira] [Closed] (NUTCH-1368) SolrDeleteDuplicates.java:270
[jira] [Commented] (NUTCH-1100) SolrDedup broken
Unsubscribe me please
[jira] [Commented] (NUTCH-1323) AjaxNormalizer
[jira] [Commented] (NUTCH-1323) AjaxNormalizer
[jira] [Commented] (NUTCH-1323) AjaxNormalizer
[jira] [Commented] (NUTCH-1323) AjaxNormalizer
Build failed in Jenkins: nutch-trunk-maven #264
Jenkins build is back to normal : nutch-trunk-maven #265
[jira] [Created] (NUTCH-1367) Port ParserChecker to Nutchgora
[jira] [Commented] (NUTCH-1367) Port ParserChecker to Nutchgora
[jira] [Closed] (NUTCH-1367) Port ParserChecker to Nutchgora
[jira] [Resolved] (NUTCH-1367) Port ParserChecker to Nutchgora
[jira] [Created] (NUTCH-1366) speed up indexing by eliminating the indexreducer
[jira] [Updated] (NUTCH-1366) speed up indexing by eliminating the indexreducer
[jira] [Commented] (NUTCH-1366) speed up indexing by eliminating the indexreducer
[jira] [Commented] (NUTCH-1366) speed up indexing by eliminating the indexreducer
[jira] [Commented] (NUTCH-1366) speed up indexing by eliminating the indexreducer
[jira] [Closed] (NUTCH-1366) speed up indexing by eliminating the indexreducer
[jira] [Commented] (NUTCH-1366) speed up indexing by eliminating the indexreducer
[jira] [Updated] (NUTCH-1086) Rewrite protocol-httpclient
[jira] [Closed] (NUTCH-1077) Nutch 2 DbUpdateMapper throws ArrayOutOfBoundsException when running update
[jira] [Updated] (NUTCH-1077) Nutch 2 DbUpdateMapper throws ArrayOutOfBoundsException when running update
[jira] [Updated] (NUTCH-1306) Commit after finished writing to solr index
[jira] [Updated] (NUTCH-1306) Commit after finished writing to solr index
[jira] [Commented] (NUTCH-1026) Strip UTF-8 non-character codepoints
[jira] [Commented] (NUTCH-1026) Strip UTF-8 non-character codepoints
[jira] [Closed] (NUTCH-1026) Strip UTF-8 non-character codepoints
[jira] [Updated] (NUTCH-1325) HostDB for Nutch
[jira] [Commented] (NUTCH-1306) Commit after finished writing to solr index
[jira] [Commented] (NUTCH-1306) Commit after finished writing to solr index
[jira] [Commented] (NUTCH-1306) Commit after finished writing to solr index
[no subject]
Re:
Re:
[jira] [Created] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration
[jira] [Updated] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration
Date format issue Nutch-Solr with NUTCH-809 Parse-metatags plugin
[jira] [Created] (NUTCH-1364) Add a counter for malformed urls
[jira] [Created] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Commented] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Issue Comment Edited] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Commented] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Commented] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Commented] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Commented] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Closed] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Resolved] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Commented] (NUTCH-1363) Make parsing in FetcherJob actually work.
[jira] [Created] (NUTCH-1362) Fix error handling of urls with empty fields
[jira] [Updated] (NUTCH-1362) Fix error handling of urls with empty fields
[jira] [Commented] (NUTCH-1362) Fix error handling of urls with empty fields
[jira] [Commented] (NUTCH-1362) Fix error handling of urls with empty fields
[jira] [Closed] (NUTCH-1362) Fix error handling of urls with empty fields
[jira] [Commented] (NUTCH-1362) Fix error handling of urls with empty fields
[jira] [Created] (NUTCH-1361) Fix mishandling of malformed urls in generator job
[jira] [Created] (NUTCH-1360) Suport the storing of IP address connected to when web crawling
[jira] [Updated] (NUTCH-1360) Suport the storing of IP address connected to when web crawling
[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling
[jira] [Created] (NUTCH-1359) Add raw_headers support
[jira] [Created] (NUTCH-1358) Do not accept bogus arguments
[jira] [Updated] (NUTCH-1358) Do not accept bogus arguments
[jira] [Closed] (NUTCH-1358) Do not accept bogus arguments
[jira] [Commented] (NUTCH-1358) Do not accept bogus arguments
[jira] [Commented] (NUTCH-1358) Do not accept bogus arguments
[jira] [Created] (NUTCH-1357) All gora mapreduce functionality should go through StorageUtils
[jira] [Commented] (NUTCH-1357) All gora mapreduce functionality should go through StorageUtils
[jira] [Updated] (NUTCH-1357) All gora mapreduce functionality should go through StorageUtils
[jira] [Updated] (NUTCH-1357) All gora mapreduce functionality should go through StorageUtils
[jira] [Commented] (NUTCH-1016) Strip UTF-8 non-character codepoints
[jira] [Commented] (NUTCH-1016) Strip UTF-8 non-character codepoints
[jira] [Commented] (NUTCH-1016) Strip UTF-8 non-character codepoints
[jira] [Commented] (NUTCH-1016) Strip UTF-8 non-character codepoints
Re: [VOTE] Apache Nutch 1.5 release rc #1
Re: [VOTE] Apache Nutch 1.5 release rc #1
store additional information from page at outlinks - topic specific crawl
Re: store additional information from page at outlinks - topic specific crawl
Re: store additional information from page at outlinks - topic specific crawl
Re: store additional information from page at outlinks - topic specific crawl
Jason Trost Nutchgora Fork
[jira] [Commented] (NUTCH-1301) Index job resume switch to resume a failed job
[jira] [Commented] (NUTCH-1301) Index job resume switch to resume a failed job
[jira] [Updated] (NUTCH-1342) Read time out protocol-http
[jira] [Created] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Updated] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Commented] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Updated] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Updated] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Commented] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Updated] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Commented] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Commented] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.
[jira] [Created] (NUTCH-1355) nutchgora Configure minimum throughput for fetcher
[jira] [Updated] (NUTCH-1355) nutchgora Configure minimum throughput for fetcher
[jira] [Closed] (NUTCH-1355) nutchgora Configure minimum throughput for fetcher
[jira] [Commented] (NUTCH-1355) nutchgora Configure minimum throughput for fetcher
How to crawl https sites with certificate
[jira] [Created] (NUTCH-1354) nutchgora support fetcher.queue.depth.multiplier property
[jira] [Updated] (NUTCH-1354) nutchgora support fetcher.queue.depth.multiplier property
[jira] [Closed] (NUTCH-1354) nutchgora support fetcher.queue.depth.multiplier property
[jira] [Commented] (NUTCH-1354) nutchgora support fetcher.queue.depth.multiplier property
[jira] [Created] (NUTCH-1353) nutchgora DomainStatistics support crawlId, counter bug and reformatting
[jira] [Updated] (NUTCH-1353) nutchgora DomainStatistics support crawlId, counter bug and reformatting
[jira] [Closed] (NUTCH-1353) nutchgora DomainStatistics support crawlId, counter bug and reformatting
[jira] [Commented] (NUTCH-1353) nutchgora DomainStatistics support crawlId, counter bug and reformatting
[jira] [Created] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Updated] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Updated] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Updated] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Updated] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
[jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization
Build failed in Jenkins: Nutch-nutchgora #246
Jenkins build is back to normal : Nutch-nutchgora #247
[jira] [Closed] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box
[jira] [Commented] (NUTCH-809) Parse-metatags plugin
[jira] [Commented] (NUTCH-809) Parse-metatags plugin
[jira] [Commented] (NUTCH-809) Parse-metatags plugin
[jira] [Commented] (NUTCH-809) Parse-metatags plugin
[jira] [Commented] (NUTCH-809) Parse-metatags plugin
Mapping file specifics
Re: Mapping file specifics
Re: Mapping file specifics
[jira] [Created] (NUTCH-1351) DomainStatistics to aggregate by TLD
[jira] [Updated] (NUTCH-1351) DomainStatistics to aggregate by TLD
[jira] [Created] (NUTCH-1350) remove unused dependancy because of access restriction
[jira] [Closed] (NUTCH-1350) remove unused dependancy because of access restriction
[jira] [Commented] (NUTCH-1350) remove unused dependancy because of access restriction
[jira] [Created] (NUTCH-1349) Make batchId explcit within debug logging.
[jira] [Commented] (NUTCH-1349) Make batchId explcit within debug logging.
[jira] [Updated] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Updated] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Commented] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Updated] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Updated] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Commented] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Closed] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Resolved] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Commented] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI
[jira] [Updated] (NUTCH-1294) IndexClean job with solr implementation.
[Nutch Wiki] Trivial Update of "CommandLineOptions" by LewisJohnMcgibbney
[Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney
[jira] [Closed] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
[jira] [Commented] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box
[jira] [Closed] (NUTCH-896) Gora-based tests need to have their own config files
[jira] [Updated] (NUTCH-896) Gora-based tests need to have their own config files
[jira] [Commented] (NUTCH-1321) IDNNormalizer
[jira] [Resolved] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
[jira] [Commented] (NUTCH-1293) IndexingFiltersChecker to store detected content type in crawldatum metadata
[jira] [Resolved] (NUTCH-1293) IndexingFiltersChecker to store detected content type in crawldatum metadata
[jira] [Reopened] (NUTCH-1293) IndexingFiltersChecker to store detected content type in crawldatum metadata
[jira] [Commented] (NUTCH-1294) IndexClean job with solr implementation.
[jira] [Commented] (NUTCH-1294) IndexClean job with solr implementation.
[jira] [Commented] (NUTCH-1300) Indexer to normalize URL's
[jira] [Commented] (NUTCH-1300) Indexer to normalize URL's
[jira] [Updated] (NUTCH-1323) AjaxNormalizer
[jira] [Updated] (NUTCH-1323) AjaxNormalizer
[jira] [Commented] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
[jira] [Commented] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
[jira] [Commented] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
[jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml
Page(s): 1 |
2
of 2