web search engine
Nutch is an effort to build an open source search engine based on Lucene Java for the search and index component. The fetcher ("robot" or "web crawler") has been written from scratch solely for this project. Nutch has a highly modular architecture allowing developers to create plugins for the following activities: media-type parsing, data retrieval, querying and clustering.
|nutch-0.9.tar.gz||007079790967.5 MB||1188429806almost 11 years ago|
|nutch.spec||00000039663.87 KB||1223052966almost 10 years ago|