PySpark 3.5 Introduction & RDD Tutorial with Examples
Author: Naveen Nelamali (SparkByExamples.com)
What is PySpark?
PySpark Features & Advantages
PySpark Architecture
PySpark Ecosystem
Use Cases and Applications of PySpark
Install PySpark 3.5 on MacOS
Install PySpark 3.5 on Windows
Install Anaconda, PySpark 3.5 and Jupyter Notebook
What is Spark Session?
Creating SparkSession
SparkSession Most Used Methods
What is Spark Context?
What does SparkContext do?
SparkContext Most Used Methods
FAQ's or Interview Questions
RDD - Introduction
RDD - Create RDD from Parallelize
RDD - Collect Data from RDD
RDD - Read Text and CSV File
RDD - How to Parallelize RDD?
RDD - Transformations
RDD - Actions
RDD - Word Count Example
RDD - Repartition
RDD - Types of RDD
RDD - Cache and Persistence
Spark Persistence Levels
RDD - Broadcast Variables
RDD - Accumulator Variable
PySpark Next Steps
RDD Examples to Explore
The PySpark Tutorial explained concepts very well along with hands-on examples !
The PySpark Tutorial explained concepts very well along with hands-on examples !
Read LessGood for beginners into world of Pyspark, Pls add more courses for next levels
Good for beginners into world of Pyspark, Pls add more courses for next levels
Read LessAwesome introduction
Awesome introduction
Read LessVery good one. Thanks
Very good one. Thanks
Read Lessso far,i have been finding the texts easy to learn,understand and apply
so far,i have been finding the texts easy to learn,understand and apply
Read Less