Thursday 26 September 2013

Big Data School (Hive, MapReduce, HDFS) | Training
Course Contents
The course covers the following topics:
  • The Motivation For Hadoop

    • Problems with traditional large-scale systems
    • Requirements for a new approach

  • Hadoop: Basic Concepts

    • What is Hadoop?
    • The Hadoop Distributed File System
    • How MapReduce Works
    • Anatomy of a Hadoop Cluster

  • Writing a MapReduce Program

    • Examining a Sample MapReduce Program
    • Basic API Concepts
    • The Driver Code
    • The Mapper
    • The Reducer
    • Hadoop's Streaming API

  • The Hadoop Ecosystem

    • Hive and Pig
    • HBase
    • Flume
    • Other Ecosystem Projects

  • Integrating Hadoop Into The Workflow

    • Relational Database Management Systems
    • Storage Systems
    • Importing Data from RDBMSs With Sqoop
    • Importing Real-Time Data with Flume

  • Delving Deeper Into The Hadoop API

    • Using Combiners
    • The configure and close Methods
    • SequenceFiles
    • Partitioners
    • Counters
    • Directly Accessing HDFS
    • ToolRunner
    • Using The Distributed Cache

  • Common MapReduce Algorithms

    • Sorting and Searching
    • Indexing
    • Classification/Machine Learning
    • Term Frequency - Inverse Document Frequency
    • Word Co-Occurrence

  • Using Hive and Pig

    • Hive Basics
    • Pig Basics

  • Debugging MapReduce Programs

    • Testing with MRUnit
    • Logging
    • Other Debugging Strategies

  • Advanced MapReduce Programming

    • A Recap of the MapReduce Flow
    • Custom Writables and WritableComparables
    • The Secondary Sort
    • Creating InputFormats and OutputFormats
    • Pipelining Jobs With Oozie

  • Joining Data Sets in MapReduce Jobs

    • Map-Side Joins
    • Reduce-Side Joins

  • Graph Manipulation in Hadoop

    • Introduction to Graph Techniques
    • Representing Graphs in Hadoop
    • Implementing a Sample Algorithm: Single Source Shortest Path

    • For full course details, please visit our website: www.hadooponlinetraining.net

    • The course runs for 30 days (45 hours), and special care will be taken with each student. It is one-to-one training with hands-on experience.

    • Resume preparation and interview assistance will be provided.

    • For any further details, please contact us:
    •   India: +91-9052666559
    •   USA: +1-678-693-3475

    • Visit www.hadooponlinetraining.net or mail all queries to info@magnifictraining.com
