Cassandra and Spark
October 14th 2015 10:00 - 10:50
Apache Cassandra is a leading open-source distributed database capable of amazing feats of scale, but its data model requires a bit of up-front planning to perform well. Of course, ad-hoc data exploration and analysis require that we be able to ask questions we hadn’t planned on asking, and get answers fast. Enter Apache Spark. Spark is a distributed computation framework optimized for in-memory work and heavily influenced by concepts from functional programming languages.