Log In
Or create an account ->
Imperial Library
Home
About
News
Upload
Forum
Help
Login/SignUp
Index
Cover
Table of Contents
Apache Spark 2 for Beginners
Apache Spark 2 for Beginners
Credits
About the Author
About the Reviewer
www.PacktPub.com
Preface
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
1. Spark Fundamentals
Understanding Apache Spark
Installing Spark on your machines
References
Summary
2. Spark Programming Model
Understanding Spark RDD
Data transformations and actions with RDDs
Monitoring with Spark
The basics of programming with Spark
Creating RDDs from files
Understanding the Spark library stack
Reference
Summary
3. Spark SQL
Why Spark SQL?
Anatomy of Spark SQL
DataFrame programming
Understanding Aggregations in Spark SQL
Understanding multi-datasource joining with SparkSQL
Introducing datasets
Understanding Data Catalogs
References
Summary
4. Spark Programming with R
Basics of the R language
DataFrames in R and Spark
Spark DataFrame programming with R
Understanding aggregations in Spark R
Understanding multi-datasource joins with SparkR
References
Summary
5. Spark Data Analysis with Python
Setting up a dataset
Data analysis use cases
Charts and plots
References
Summary
6. Spark Stream Processing
Micro batch data processing
A log event processor
Windowed data processing
More processing options
Kafka stream processing
Spark Streaming jobs in production
References
Summary
7. Spark Machine Learning
Why Spark for machine learning?
Wine quality prediction
Model persistence
Wine classification
Spam filtering
Feature algorithms
Finding synonyms
References
Summary
8. Spark Graph Processing
The Spark GraphX library
Tennis tournament analysis
Applying the PageRank algorithm
Connected component algorithm
Understanding GraphFrames
Understanding GraphFrames queries
References
Summary
9. Designing Spark Applications
Microblogging with Lambda Architecture
Implementing Lambda Architecture
Working with Spark applications
Coding style
Setting up the source code
Understanding data ingestion
Generating purposed views and queries
Understanding custom data processes
References
Summary
← Prev
Back
Next →
← Prev
Back
Next →