worker321 发表于 2016-12-5 09:47:41

Cloudera’s Distribution for Apache Hadoop

  记得从0.1版本就使用过,当时还是用的是Apache Hadoop,现在都已经有自己的增强版本了,真的不错。

  HDFS – Self healing distributed file system
  MapReduce – Powerful, parallel data processing framework
  Hadoop Common – a set of utilities that support the Hadoop subprojects
  HBase – Hadoop database for random read/write access
  Hive – SQL-like queries and tables on large datasets
  Pig – Dataflow language and compiler
  Oozie – Workflow for interdependent Hadoop jobs
  Sqoop – Integrate databases and data warehouses with Hadoop
  Flume – Highly reliable, configurable streaming data collection
  Zookeeper – Coordination service for distributed applications
  Hue – User interface framework and SDK for visual Hadoop applications
  下载:http://www.cloudera.com/downloads/
  Hadoop 介绍:http://www.sfbayacm.org/wp/wp-content/uploads/2010/01/amr-hadoop-acm-dm-sig-jan2010.pdf
页: [1]
查看完整版本: Cloudera’s Distribution for Apache Hadoop