Learn to analyse batch, streaming data with Data Frame of Apache Spark Python and PySpark
**Spark can perform up to 100x faster than Hadoop MapReduce Data processing framework, **Which makes apache spark one of most demanded skills.
The top companies like **Google, Facebook, Microsoft, Amazon, Airbnb ** using **Apache Spark **to solve their big data problems!. Data analysis, on huge amount of data is one of the most valuable skills now a days and This course will teach such kind of skills to complete in big data job market.
This course will teach
- Introduction to big data and Apache spark
- Getting started with databricks
- Detailed installation step on ubuntu - linux machine
- Python Refresh for newbie
- Apache spark Dataframe API
- Apache spark structured streaming with end to end example
- Basics of Machine Learning and feature engineering with Apache spark.
This course is not complete, will be adding new content related to Spark ML.
Note : This course will teach only Spark 2.0 Dataframe based API only not RDD based API. As Dataframe based API is the future of spark.
Who is the target audience?
- Anyone who wants to learn advance big data skill
- Anyone who knows Hadoop and wants to move ahead in faster data processing
- Anyone wants to make career as data Engineer, Data analyst, Machine Learning Engineer
- Interested in learning Apache spark and pyspark for big data analysis
- Anyone wants learn cutting edge technology in Data processing
☞ Spark and Python for Big Data with PySpark
☞ Apache Spark 2.0 with Java -Learn Spark from a Big Data Guru
☞ Big Data with Apache Spark and AWS
☞ Hands on Big Data with Apache Hadoop, Python and HDInsight
☞ Apache Spark 2.0 + Scala : DO Big Data Analytics & ML
☞ Python for Data Analysis and Visualization - 32 HD Hours !