Cloudera’s Distribution for Apache Hadoop

    技术2024-12-28  52

    记得从0.1版本就使用过,当时还是用的是Apache Hadoop,现在都已经有自己的增强版本了,真的不错。

     

    HDFS – Self healing distributed file system

    MapReduce – Powerful, parallel data processing framework

    Hadoop Common – a set of utilities that support the Hadoop subprojects

    HBase – Hadoop database for random read/write access

    Hive – SQL-like queries and tables on large datasets

    Pig – Dataflow language and compiler

    Oozie – Workflow for interdependent Hadoop jobs

    Sqoop – Integrate databases and data warehouses with Hadoop

    Flume – Highly reliable, configurable streaming data collection

    Zookeeper – Coordination service for distributed applications

    Hue – User interface framework and SDK for visual Hadoop applications

     

     

    下载:http://www.cloudera.com/downloads/

    Hadoop 介绍:http://www.sfbayacm.org/wp/wp-content/uploads/2010/01/amr-hadoop-acm-dm-sig-jan2010.pdf

     

    最新回复(0)