Log In
Or create an account -> 
Imperial Library
  • Home
  • About
  • News
  • Upload
  • Forum
  • Help
  • Login/SignUp

Index
Cover Table of Contents Apache Spark 2 for Beginners Apache Spark 2 for Beginners Credits About the Author About the Reviewer www.PacktPub.com Preface What you need for this book Who this book is for Conventions Reader feedback Customer support 1. Spark Fundamentals Understanding Apache Spark Installing Spark on your machines References Summary 2. Spark Programming Model Understanding Spark RDD Data transformations and actions with RDDs Monitoring with Spark The basics of programming with Spark Creating RDDs from files Understanding the Spark library stack Reference Summary 3. Spark SQL Why Spark SQL? Anatomy of Spark SQL DataFrame programming Understanding Aggregations in Spark SQL Understanding multi-datasource joining with SparkSQL Introducing datasets Understanding Data Catalogs References Summary 4. Spark Programming with R Basics of the R language DataFrames in R and Spark Spark DataFrame programming with R Understanding aggregations in Spark R Understanding multi-datasource joins with SparkR References Summary 5. Spark Data Analysis with Python Setting up a dataset Data analysis use cases Charts and plots References Summary 6. Spark Stream Processing Micro batch data processing A log event processor Windowed data processing More processing options Kafka stream processing Spark Streaming jobs in production References Summary 7. Spark Machine Learning Why Spark for machine learning? Wine quality prediction Model persistence Wine classification Spam filtering Feature algorithms Finding synonyms References Summary 8. Spark Graph Processing The Spark GraphX library Tennis tournament analysis Applying the PageRank algorithm Connected component algorithm Understanding GraphFrames Understanding GraphFrames queries References Summary 9. Designing Spark Applications Microblogging with Lambda Architecture Implementing Lambda Architecture Working with Spark applications Coding style Setting up the source code Understanding data ingestion Generating purposed views and queries Understanding custom data processes References Summary
  • ← Prev
  • Back
  • Next →
  • ← Prev
  • Back
  • Next →

Chief Librarian: Las Zenow <zenow@riseup.net>
Fork the source code from gitlab
.

This is a mirror of the Tor onion service:
http://kx5thpx2olielkihfyo4jgjqfb7zx7wxr3sd4xzt26ochei4m6f7tayd.onion