BigData

Big Data Hadoop


Hadoop is a platform written in java where we work with large amount of data. Hadoop eco system has lots of tools which makes data processing easy with the help of the bigdata. Big Data Hadoop training will make you an expert in HDFS, MapReduce, Hbase, Hive, Pig, Yarn, Oozie, Flume and Sqoop using real-time use cases on Retail, Social Media, Aviation, Tourism, Finance domain.

    • Mon – Fri ( 6 Weeks ) | 06.30 AM - 6.30 PM Time (IST)   (any 2 hours)

    • Sat – Sun ( 8 Weeks ) | 07.30 AM - 07:00 PM Time (IST)   (any 3 hours)


Why this course?

  • Global Hadoop Market to Reach $84.6 Billion by 2021 – Allied Market Research
  • Shortage of 1.4 -1.9 million Hadoop Data Analysts in US alone by 2018– Mckinsey
  • Hadoop Administrator in the US can get a salary of $123,000 – indeed.com
  • Hadoop practitioners are among the highest paid IT professionals today with salaries ranging till $85K (source: indeed job portal), and the market demand for them is growing rapidly.

Course Objective

By the end of this course, you will be able to:

  • Master fundamentals of Hadoop 2.7 and YARN and write applications using them
  • Setting up Pseudo node and Multi node cluster on Amazon EC2
  • Master HDFS, MapReduce, Hive, Pig, Oozie, Sqoop, Flume, Zookeeper, HBase
  • Learn Spark, Spark RDD, Graphx, MLlib writing Spark applications
  • Master Hadoop administration activities like cluster managing,monitoring,administration and troubleshooting
  • Configuring ETL tools like Pentaho/Talend to work with MapReduce, Hive, Pig, etc
  • Detailed understanding of Big Data analytics
  • Hadoop testing applications using MR Unit and other automation tools.
  • Work with Avro data formats
  • Practice real-life projects using Hadoop and Apache Spark
  • Be equipped to clear Big Data Hadoop Certification.

Who should take the course?

  • Programming Developers and System Administrators
  • Architects
  • Experienced working professionals , Project managers
  • Big DataHadoop Developers eager to learn other verticals like Testing, Analytics, Administration
  • Mainframe Professionals, Architects & Testing Professionals
  • Business Intelligence, Data warehousing and Analytics Professionals
  • Graduates, undergraduates eager to learn the latest Big Data technology can take this Big Data Hadoop Certification online training

Pre-requisite

There is no pre-requisite to take this Big data training and to master Hadoop. But basics of UNIX, SQL and java would be good.At Greens Technology, we provide complimentary unix and Java course with our Big Data certification training to brush-up the required skills so that you are good on you Hadoop learning path.


Big Data Hadoop Course Syllabus

Introduction to hadoop world
  • 1.1 Dataaaaaaa.....Bigdata..!
  • 1.2 What is bigdata? 3 + 1 Vs.
  • 1.3 What is Hadoop , why hadoop & Its history.
  • 1.4 Hadoop Eco System an overview.
    (HDFS,MAPREDUCE,SQOOP,FLUME,PIG,HIVE,OOZIE,HBASE..etc)
  • 1.5 Current Requirements and Future possibilities in Hadoop.
  • 1.5 Wait..Finally what hadoop is not?
  • 1.6 Hadoop installation
Hadoop Architecture In-depth travel
  • 2.1 HDFS - An introduction.
  • 2.2 How data is stored in hdfs? (Travel of a byte).
  • 2.3 Hadoop Daemons:
  • 2.3.1 Name node.
  • 2.3.2 Data node.
  • 2.3.3 Job Tracker.
  • 2.3.4 Task tracker.
  • 2.4 Fault tolerance in hadoop.
  • 2.5 Download Hadoop
Map Reduce 1.0 & Yarn
  • 3.1 Mapreduce history
  • 3.2 Mapreduce architecture,Key-Value pair.
  • 3.3 YARN 2.0 architecture.
  • 3.4 Java Implementation of map reduce.
  • 3.5 Mapper, Reducer, Combiner Different combination.
Data Injection
Sqoop & flume:
  • 4.1 Sqoop Introduction.
  • 4.2 Sqoop configuration.
  • 4.3 Sqoop Sample project.
  • 4.4 Flume introduction.
  • 4.5 Flume configuration.
  • 4.6 Flume sample Project.
Data Transformation and analysis
Pig & Hive:
  • 5.1 Hive introduction.
  • 5.2 Hive data model.
  • 5.3 Hive implementation of sample project.
  • 5.4 Pig Introduction.
  • 5.5 Pig Data structure.
  • 5.6 Pig Implementation on sample project.
  • 5.7 How pig & hive is used in real time project?
  • Module 5 assignment.
  • Module 6:

Addition data base and work flow Hbase,oozie & Zookeeper:

  • 6.1 oozie introduction.
  • 6.2 oozie Overview and configuration.
  • 6.3 zookeeper overview.
  • 6.4 HBASE Introduction.
  • 6.5 HBASE Overview.
  • 7. SPARK Over view