Project author: yanki-maker32

Project description:
Graph replication, data wrangling, data cleaning, sentiment analysis, and tokenization with R, using the Wikipedia data tables for Grammy-nominated songs from the 1980s to the 2010s
Primary language: HTML
Project URL: git://github.com/yanki-maker32/Sentiment-Analysis-Graph-Replication.git


Project Description:
1) Import the data tables from Wikipedia, selecting them by the same XPath
2) Clean the data and wrangle it into a new data frame
3) Create a Decade variable, using case_when or ifelse to map each year to the right date range
for example case_when(year %in% 1980:1989 ~ "1980s"), where year stands for the year column
4) Clean the sentiment table, recoding each sentiment string value as 1 or 0
for example positive ~ 1
5) Graph the results using ggplot
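
Steps 3 and 4 could be sketched in base R roughly as follows. This is a minimal illustration, not the project's actual code: the toy data frame, the column names (year, sentiment, decade, sentiment_score), and the choice to map every non-positive sentiment to 0 are all assumptions made for the example. It uses ifelse, one of the two options named in step 3.

```r
# Hypothetical toy data; the column names are assumptions, not the
# names used in the original project.
songs <- data.frame(
  year = c(1981, 1989, 1995, 2004, 2013),
  sentiment = c("positive", "negative", "positive", "negative", "positive"),
  stringsAsFactors = FALSE
)

# Step 3: map each year to its decade with nested ifelse() calls.
# Years outside 1980-2019 become NA.
songs$decade <- ifelse(songs$year %in% 1980:1989, "1980s",
                ifelse(songs$year %in% 1990:1999, "1990s",
                ifelse(songs$year %in% 2000:2009, "2000s",
                ifelse(songs$year %in% 2010:2019, "2010s", NA_character_))))

# Step 4: recode the sentiment strings as 1 (positive) or 0 (anything else).
# Treating every non-positive value as 0 is an assumption of this sketch.
songs$sentiment_score <- ifelse(songs$sentiment == "positive", 1, 0)

# Mean score per decade, i.e. the share of positive rows in each decade.
positive_share <- aggregate(sentiment_score ~ decade, data = songs, FUN = mean)
print(positive_share)
```

For step 5, a summary frame like positive_share could then be passed to ggplot2, for instance ggplot(positive_share, aes(decade, sentiment_score)) + geom_col(), although the plot type the project replicates may differ.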
Prereqs:
Data Cleaning
Data Renaming
Sentiments
ggplot2
Genius
RegEx