Using Vagrant to test Apache Spark applications

Apache Spark is fast becoming the established platform for developing big data applications both in batch processing and, more recently, processing real-time data with the use of Spark streaming. For me, Apache Spark really shines in that it allows you to write applications to run on a Yarn Hadoop cluster and there is little to…