[Part 1] Stream Preprocessing with Spark Streaming, Kafka, and Cassandra. [Part 2] Clustering and Analysis of Graph Databases with GraphX