This is a series of small exercises exploring various parts of Apache Spark. The focus is not on setting up Spark servers, but rather on how to use (some of) its features.
Please clone the repository from GitHub