Big Data Analysis with Apache Spark Python PySpark

Big Data Analysis with Apache Spark Python PySpark
Learn to analyse batch, streaming data with Data Frame of Apache Spark Python and PySpark

**Spark can perform up to 100x faster than Hadoop MapReduce Data processing framework, **Which makes apache spark one of most demanded skills.

The top companies like **Google, Facebook, Microsoft, Amazon, Airbnb ** using **Apache Spark **to solve their big data problems!. Data analysis, on huge amount of data is one of the most valuable skills now a days and This course  will teach such kind of skills to complete in big data job market.

This course will teach

  • Introduction to big data and Apache spark
  • Getting started with databricks
  • Detailed installation step on ubuntu - linux machine
  • Python Refresh for newbie
  • Apache spark Dataframe API
  • Apache spark structured streaming with end to end example
  • Basics of Machine Learning and feature engineering with Apache spark.

This course is not complete, will be adding new content related to Spark ML.

Note : This course will teach only Spark 2.0 Dataframe based API only not RDD based API. As Dataframe based API is the future of spark.


Ankit Mistry

Who is the target audience?
  • Anyone who wants to learn advance big data skill
  • Anyone who knows Hadoop and wants to move ahead in faster data processing
  • Anyone wants to make career as data Engineer, Data analyst, Machine Learning Engineer
  • Interested in learning Apache spark and pyspark for big data analysis
  • Anyone wants learn cutting edge technology in Data processing


Spark and Python for Big Data with PySpark

Apache Spark 2.0 with Java -Learn Spark from a Big Data Guru

Big Data with Apache Spark and AWS

Hands on Big Data with Apache Hadoop, Python and HDInsight

Apache Spark 2.0 + Scala : DO Big Data Analytics & ML

Python for Data Analysis and Visualization - 32 HD Hours !