Just Analytics Blog | Performance Management News, Views and Op-ed

Hadoop Ecosystem

Written by Umesh Kakkad | Nov 22, 2013 1:59:00 PM

Various components of Hadoop ecosystem

1. Hadoop File System (HDFS): Clustered redundant file system for Hadoop

2. HBASE: Column oriented database scaling to billions of rows

3. MAPREDUCE: Parallel computation on cluster of servers

4. MAHOUT: Library of machine learning & data mining algorithms

5. HIVE: Datawarehouse with SQL like access

6. PIG: high level programming language for Hadoop

7. HCATALOG: Schema and datatype sharing over PIG, HIVE and MAPREDUCE

8. SQOOP: Imports data from relational databases

9. FLUME: Collection and import of log and event data

10. WHIRR: Cloud agnostic deployment of clusters

11. OOZIE: Orchestration and workflow management

12. AMBARI: Deployment configuration and Monitoring

13. ZOOKEEPER: Configuration Management and co-ordination.