LogoopenSUSE Build Service > Projects
Sign Up | Log In

Hadoop

Hadoop is a software platform that lets one easily write and
run applications that process vast amounts of data.

Here's what makes Hadoop especially useful:
* Scalable: Hadoop can reliably store and process petabytes.
* Economical: It distributes the data and processing across clusters
              of commonly available computers. These clusters can number
              into the thousands of nodes.
* Efficient: By distributing the data, Hadoop can process it in parallel
             on the nodes where the data is located. This makes it
             extremely rapid.
* Reliable: Hadoop automatically maintains multiple copies of data and
            automatically redeploys computing tasks based on failures.

Hadoop implements MapReduce, using the Hadoop Distributed File System (HDFS).
MapReduce divides applications into many small blocks of work. HDFS creates
multiple replicas of data blocks for reliability, placing them on compute
nodes around the cluster. MapReduce can then process the data where it is
located.
  

Source Files

Filename Size Changed Actions
hadoop-0.20-fuse.default 31 Bytes over 6 years ago Download File
hadoop-0.20.2-cdh3u2.tar.gz 68.1 MB over 6 years ago
hadoop-0.20.default 257 Bytes about 7 years ago Download File
hadoop-init.tmpl 3.05 KB over 6 years ago Download File
hadoop-init.tmpl.suse 4.39 KB over 6 years ago Download File
hadoop.nofiles.conf 42 Bytes over 6 years ago Download File
hadoop.spec 16.9 KB over 6 years ago Download File

Comments for home:BotKeeper:cdh:3u2 (0)