Pages

Tuesday 23 July 2013

Hadoop-Big data | training

Magnific Training design customized training services. According to actual learning needs, you can choose courses of special interests, in-depth technologies, and methods of delivery to reflect needs in your business strategy. TCloud Computing instructional designers can tailor the training content to include examples and scenarios relative to your needs. 

Visit: www.magnifictraining.com

Hadoop Training Course Content:

1. Understanding Big Data – What is Big Data ?

  • Real world issues with BIG Data – Ex: How facebook manage peta bytes of data.

  • Will regular traditional approach works?

2. How Hadoop Evolved

  • Back to Hadoop evolution.

  • The ecosystem and stack: HDFS, MapReduce, Hive, Pig…

  • Cluster architecture overview

3. Environment for Hadoop development

  • Hadoop distribution and basic commands

  • Eclipse development

4. Understanding HDFS

  • Command line and web interfaces for HDFS

  • Exercises on HDFS Java API

5. Understanding MapReduce

  • Core Logic: move computation, not data

  • Base concepts: Mappers, reducers, drivers

  • The MapReduce Java API (lab).



6. Real-World MapReduce

  • Optimizing with Combiners and Partitioners (lab)

  • More common algorithms: sorting, indexing and searching (lab)

  • Relational manipulation: map-side and reduce-side joins (lab)

  • Chaining Jobs

  • Testing with MRUnit

7. Higher-level Tools

  • Patterns to abstract “thinking in MapReduce”

  • The Cascading library (lab)

  • The Hive database (lab)

Interested ? Enroll into our online Apache Hadoop training program now.


1 comment:

 

Sample text

Sample Text

Sample Text