Beginner's Introduction to Text Mining: An App Store Reviews Exercise
This notebook is written as a workshop material for a session by Plug and Play Indonesia. The workshop is part of a series that serve as an introduction to data science and machine learning, and its intended audience are novices as well as junior professionals in the field of data science. Plug and Play is an accelerator for mobile startups, and as such, I have chosen to show application of text mining techniques and capabilities in processing app store reviews.
I work for the company HyperGrowth, and among other things, we develop automation tools for mobile app businesses. One of our product is GrowthBot, a free automation chatbot that deliver growth metrics, app reviews, and performance scorecards to mobile marketers. Our customers use GrowthBot to better analyze and manage (reply to) user reviews of their apps, and the advantages are plenty: accurate market feedback, faster product iteration, and an unparalled timeliness in responding to users’ needs and feedback. This notebook aims to unpack some of these concepts and show how we can utilize R and some simple text mining packages to:
jsonlite
- Convert JSON data into R objects tm
- A text mining package. Comprehensive documentation here SnowballC
- Word stemming syuzhet
- Extracting sentiments wordcloud
- Generate wordcloud (surprised?) ggplot2
- Data visualization based on the grammar of graphics. A ggplot2 cheatsheet, also by me Data endpoint: http://bot.hypergrowth.co:3005/reviews
Courtesy of the bot development team at HyperGrowth
The project is completed using R notebook, and exported to both HTML and PDF. All three files (Rmd, HTML and PDF) are available in this repo.
I have since left the team at HyperGrowth and GrowthBot (GrowthBot’s tech is sold to one of our investors and our customer-facing APIs / apps are no longer in service). My email is now inactive. I’m working at Algoritma as a course producer teaching an ever larger community about R programming, Python programming, NLP, Machine Learning and Visualization. Connect with me on Facebook or on my LinkedIn page instead.
Enjoy programming!