Big Data Beyond Hadoop
Real-Time Analytical Processing (RTAP) Using
Spark and Shark
Jason Dai
Engineering Director & Principal Engineer
Intel Software and Services Group
CCF YOCSEF
Shanghai
Agenda
Big Data beyond Hadoop
Introduction to Spark and Shark
Case study: real-time analytical processing (RTAP)
Big Data beyond Hadoop
Big Data today
• The elephant is in the room
Big Data beyond Hadoop
• Real-time analytical processing (RTAP)
– Discover and explore data iteratively and interactively for real-time insights
• Advanced machine learning and data mining (MLDM)
– Graph-parallel predictive analytics (non-SQL)
• Distributed in-memory analytics
– Exploit available main memory in the entire cluster for >100x speedup
RTAP: Real-Time Analytical Processing
Real-Time Analytical Processing (RTAP)
• Data ingested & processed in a streaming fashion
• Real-time data queried and presented in an online fashion
• Real-time and history
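
A minimal sketch of the RTAP bullets above, written in plain Python rather than Spark or Shark. The class name RtapStore, the "page" field, and the sample counts are hypothetical and chosen only for illustration. It also assumes that "real-time and history" means queries read both kinds of data together.

```python
from collections import defaultdict


class RtapStore:
    """Toy in-memory store: historical counts plus counts from a live stream."""

    def __init__(self, history):
        # history: key -> count loaded from an earlier batch job (assumed)
        self.history = dict(history)
        self.realtime = defaultdict(int)

    def ingest(self, event):
        # Streaming ingestion: update the aggregate as each event arrives.
        self.realtime[event["page"]] += 1

    def query(self, page):
        # Online query: combine historical and real-time counts.
        return self.history.get(page, 0) + self.realtime.get(page, 0)


store = RtapStore({"/home": 100, "/docs": 40})
for event in [{"page": "/home"}, {"page": "/docs"}, {"page": "/home"}]:
    store.ingest(event)
    # The result is queryable immediately after each event is ingested.
    print(event["page"], store.query(event["page"]))
```

The point of the sketch is that ingestion and querying interleave: no batch job has to finish before new events show up in query results.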