English | March 5, 2018 | ASIN: B07B8L93PC | 111 Pages | PDF | 3 MB
Mastering big data requires an aptitude at every step of information processing. Post-processing, one of the most important steps, is where you find Apache Spark frequently employed. Spark Succinctly, by Marko ?valjek, addresses Spark's use in the ultimate step in handling big data. This e-book, the third installment in ?valjek's IoT series, teaches the basics of using Spark and explores how to work with RDDs, Scala and Python tasks, JSON files, and Cassandra.
Many of the leading companies in the world today face the problem of big data. There are various definitions by various authors on what big data actually is. To be honest, it's a vague definition, and most of the authors in the field have their own. The scope of this book is too limited to go into discussions and explanations, but if you are a seasoned developer in the field, you probably have your own view of what big data is.