Big Data Advanced
Equip with Big Data frameworks & industry use cases.Duration: 6-8 weeks
Course Description
This course is designed to equip participants with the best skill sets needed to accelerate in the world of Big Data. This course will be a mix of classroom lectures & hands-on practical sessions on Cloudera Hadoop distribution covering some advanced development frameworks of latest Big data technologies that include Apache Spark, Apache Kafka, Apache Flume, Advance Hive, Hbase etc. The course will cover a detailed explanation of each framework including self-assessments & a lot of discussions on industry use cases. The course would give you a steep edge in the Big Data world with a lot of career opportunities in the Analytics world with a lot of career opportunities.
Curriculum Structure
- Big Data Advanced
- Quick Recap of Hadoop Fundamentals
- Apache Hive Advanced
- Apache Kafka Setup & its Overview
- Spark Introduction & Need
- Working with RDDs in Spark
- Running Spark on a Cluster
- Parallel Programming with Spark
- Writing Spark Applications
- Spark Streaming & Connecting with Kafka
- Spark Ml
- Spark Best Design Patterns
- Flume
- HBase - another Storage Option
- Industry Case Studies & End to End Architecture
- Overview of Serverless Offerings by Cloud Providers
- Preparation & Discussion for Cloudera CCA Spark & Hadoop Developer Exam CCA-175