Twitter sentiment analysis using Spark and Stanford CoreNLP and visualization using elasticsearch and kibana
A project on Spark Streaming to analyze Popular hashtags from live twitter data streams. Data is ingested from different input sources like Twitter source, Flume and Kafka and processed downstream using Spark Streaming.
The source folder is organized into 2 packages i.e. Kafka and Streaming. Each class in the Streaming package explores different approach to consume data from Twitter source. Below is the list of classes:
Discussed in blog —
Spark Streaming part 1: Real time twitter sentiment analysis
Discussed in blog —
Spark streaming part 2: Real time twitter sentiment analysis using Flume
Discussed in blog —
Spark streaming part 3: Real time twitter sentiment analysis using kafka
Discussed in blog —
Data guarantees in Spark Streaming with kafka integration
Discussed in blog —
Data guarantees in Spark Streaming with kafka integration