web search engine
Nutch is an effort to build an open source search engine that uses Lucene Java for its search and index components. The fetcher ("robot" or "web crawler") was written from scratch for this project. Nutch has a highly modular architecture that lets developers create plugins for media-type parsing, data retrieval, querying, and clustering.
|nutch-0.9.tar.gz|67.5 MB|over 10 years ago|
|nutch.spec|3.87 KB|over 9 years ago|