Big Data on AWS

Unlock the potential of cloud-based big data solutions with our Big Data on AWS course, designed for solutions architects, data scientists, and data analysts to master AWS tools like EMR, Redshift, and Kinesis for secure and cost-effective big data environments.

Course Thumbnail

Essential Skills Gained

Checkmark

Design secure and cost-effective big data environments on AWS.

Checkmark

Implement cloud-based big data solutions using Amazon EMR, Redshift, and Kinesis.

Checkmark

Understand the integration of Apache Hadoop tools with AWS services.

Checkmark

Leverage best practices for data analysis and visualization on AWS.

Format

  • Instructor-led
  • 3 days with lectures and hands-on labs.

Audience

  • Solutions Architects
  • System Operator Administrators
  • Data Scientists
  • Data Analysts

Description

In this course, you will learn about cloud-based big data solutions such as Amazon Elastic MapReduce (EMR), Amazon Redshift, Amazon Kinesis, and the rest of the AWS big data platform. You will learn how to use Amazon EMR to process data using the broad ecosystem of Apache Hadoop tools like Hive and Hue. Additionally, you will learn how to create big data environments, work with Amazon DynamoDB, Amazon Redshift, and Amazon Kinesis, and leverage best practices to design big data environments for security and cost-effectiveness.

Calendar icon

Upcoming Course Dates

No upcoming dates. Please check back later.

Course Outline

Download PDF

Lesson 1: Overview of Big Data

Lesson 2: Data Ingestion, Transfer, and Compression

Lesson 3: AWS Data Storage Options

Lesson 4: Using DynamoDB with Amazon EMR

Lesson 5: Using Kinesis for Near Real-Time Big Data Processing

Lesson 6: Introduction to Apache Hadoop and Amazon EMR

Lesson 7: Using Amazon Elastic MapReduce

Lesson 8: The Hadoop Ecosystem

Lesson 9: Using Hive for Advertising Analytics

Lesson 10: Using Streaming for Life Sciences Analytics

Lesson 11: Using Hue with Amazon EMR

Lesson 12: Running Pig Scripts with Hue on Amazon EMR

Lesson 13: Spark on Amazon EMR

Lesson 14: Running Spark and Spark SQL Interactively on Amazon EMR

Lesson 15: Using Spark and Spark SQL for In-Memory Analytics

Lesson 16: Managing Amazon EMR Costs

Lesson 17: Securing your Amazon EMR Deployments

Lesson 18: Data Warehouses and Columnar Datastores

Lesson 19: Introduction to Amazon Redshift

Lesson 20: Optimizing Your Amazon Redshift Environment

Lesson 21: The Big Data Ecosystem on AWS

Lesson 22: Visualizing and Orchestrating Big Data

Lesson 23: Using Tibco Spotfire to Visualize Big Data

Your Team has Unique Training Needs.

Your team deserves training as unique as they are.

Let us tailor the course to your needs at no extra cost.