Big Data Introduction Course

Unlock the potential of big data with our Big Data Introduction Course, designed for IT Managers, Systems Analysts, and anyone eager to master real-time data processing and storage technologies using Hadoop.

Essential Skills Gained

Introduce attendees to Big Data and its significance.

Explore the Apache Hadoop project and its ecosystem.

Explain the functions of HDFS, MapReduce, and related technologies.

Provide real-world big data use cases and applications.

Format

  • Instructor-led
  • 1 day with lectures and hands-on labs.

Audience

  • IT Managers
  • Systems Analysts
  • Data Analysts
  • System Administrators

Description

Big data is an important technology trend that is changing the way organizations view information and the technology used to capture, process, and store data. Big data is the capability to manage a high volume of disparate data, at the right speed and within the right timeframe, to allow real-time analysis and reaction.

Course Objective: The objective of this 3-hour course is to introduce attendees to big data and the associated big data products. At a high level, the course covers an introduction to big data, the history of big data, and the Apache Hadoop project. Attendees will also learn about the Hadoop Distributed File System (HDFS), MapReduce, and the Hadoop ecosystem of open source products. We will also explore some big data use cases. Attendees will come away with an understanding of what big data is and how it works.

Topics Covered:

  • Introduction to Big Data
  • Big Data Design Criteria
  • HDFS
  • The Big Data Processing Engine
  • Big Data Players
  • Big Data History
  • Apache Hadoop Project
  • MapReduce
  • Apache Hadoop Ecosystem
  • Big Data Use Cases

Upcoming Course Dates

No upcoming dates. Please check back later.

Course Outline

Module 01: Introduction to Big Data

  1. What is Big Data?

  2. Big Data History

  3. Big Data Design Criteria

  4. Big Data in Practice

Module 02: Apache Hadoop Project

  1. Architecture Overview

  2. Hadoop Distributed File System

  3. MapReduce

  4. The Big Data Processing Engine

Module 03: Apache Hadoop Ecosystem

  1. Hadoop Ecosystem Overview

  2. Hadoop Open Source Products

  3. Pig

  4. Impala

  5. Hive

  6. Oozie

  7. Mahout

  8. HBase

  9. Flume

  10. ZooKeeper

  11. Sqoop

  12. YARN

Module 04: Big Data Players

  1. Cloudera

  2. Hortonworks

  3. MongoDB

  4. Greenplum

  5. Splunk

Module 05: Big Data Use Cases

  1. Google

  2. Big Data Around the World

Your Team has Unique Training Needs.

Your team deserves training as unique as they are.

Let us tailor the course to your needs at no extra cost.