|
Re: Host specific parsing: msg#00280nutch-user.lucene.apache.org
Koch Martina wrote: Hi, Yes, and yes. With the current plugin system you can create a new "dispatcher" plugin, and then add other necessary plugins as <import> elements. This way they will be accessible from the same classloader, so that you can instantiate them directly in your dispatcher plugin. As for the lookup ... many solutions are possible. DB connections from map tasks may be problematic, both because of latency and the cost of setting up so many DB connections. OTOH, if you add local caching (using JCS or Ehcache) the hit/miss ratio should be decent enough. If the mapping of host names to plugins can be expressed by rules then maybe a simple rule set would be enough. -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
|
|
||||||||||||||||||||||||||
|
|
|
| News | Mail Home | sitemap | FAQ | advertise |