Project author: WajdiBenSaad

Project description:
Importing and analyzing Twitter data with R: connecting to Twitter, importing data, and cleaning and analyzing the tweets.
Language: R
Repository: git://github.com/WajdiBenSaad/R_Twitter.git
Created: 2019-03-18T12:50:28Z
Project community: https://github.com/WajdiBenSaad/R_Twitter

License:



~~ Project still in progress. Please check the project board to follow the upcoming updates. ~~

This work is about playing with Twitter data in R!

Steps covered in this project:

The first step is to configure and connect to the Twitter API. Since this is not a web scraping project, we need to connect directly to Twitter's data streams through their API.
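As a rough illustration of the connection step, the sketch below assumes the twitteR package and its `setup_twitter_oauth()` function; the project itself may use a different package. The helper name `connect_twitter` and the four environment variable names are hypothetical and chosen only for this example.

```r
# Sketch only: assumes the twitteR package is installed and that the four
# credentials are stored in environment variables (hypothetical names).
connect_twitter <- function() {
  keys <- Sys.getenv(c("TWITTER_CONSUMER_KEY", "TWITTER_CONSUMER_SECRET",
                       "TWITTER_ACCESS_TOKEN", "TWITTER_ACCESS_SECRET"))

  # Fail early with a readable message if any credential is missing.
  if (any(keys == "")) {
    stop("Missing Twitter credentials: ",
         paste(names(keys)[keys == ""], collapse = ", "))
  }

  # Register the OAuth credentials for the current R session.
  twitteR::setup_twitter_oauth(
    consumer_key    = keys[["TWITTER_CONSUMER_KEY"]],
    consumer_secret = keys[["TWITTER_CONSUMER_SECRET"]],
    access_token    = keys[["TWITTER_ACCESS_TOKEN"]],
    access_secret   = keys[["TWITTER_ACCESS_SECRET"]]
  )
}
```

Reading the keys from the environment rather than hard-coding them keeps credentials out of the repository.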

From here we extract the tweet text and the other information that the Twitter API returns.
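The extraction step might look like the following sketch, which assumes the twitteR package (`searchTwitter()` and `twListToDF()`) and an R session that is already authenticated against the API. The wrapper name `fetch_tweets` and the example query are hypothetical.

```r
# Sketch only: assumes twitteR and an already authenticated session.
# fetch_tweets is a hypothetical wrapper, not part of any package.
fetch_tweets <- function(query, n = 100, lang = "en") {
  # Search recent tweets matching the query string.
  tweets <- twitteR::searchTwitter(query, n = n, lang = lang)

  # Flatten the list of status objects into a data frame with one row
  # per tweet (text, screenName, created, retweetCount, ...).
  twitteR::twListToDF(tweets)
}

# Example call (requires valid credentials and network access):
# tweets_df <- fetch_tweets("#rstats", n = 200)
# head(tweets_df[, c("text", "screenName", "created", "retweetCount")])
```

Working with a data frame from this point on makes the later cleaning and counting steps ordinary column operations.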

To analyze the tweets, this type of data needs to be cleaned and stored in a consistent way. Punctuation, white space, stop words, and similar noise must be removed or handled explicitly.
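A minimal sketch of such a cleaning pass, using base R only, is shown here. The function name `clean_tweet`, the tiny stop-word list, and the decision to drop URLs, mentions, hashtags, and digits are all assumptions made for illustration; a real analysis would likely use a fuller stop-word list.

```r
# Hypothetical helper: a small hand-picked stop-word list stands in for a
# complete one.
clean_tweet <- function(text,
                        stop_words = c("a", "an", "and", "the", "is",
                                       "to", "of", "in")) {
  text <- tolower(text)
  text <- gsub("https?://[^[:space:]]+", " ", text)  # drop URLs
  text <- gsub("[@#][[:alnum:]_]+", " ", text)       # drop mentions, hashtags
  text <- gsub("[[:punct:]]+", " ", text)            # drop punctuation
  text <- gsub("[[:digit:]]+", " ", text)            # drop numbers

  # Split on white space, remove empty tokens and stop words, then rejoin
  # so that exactly one space separates the remaining words.
  words <- strsplit(text, "[[:space:]]+")
  vapply(words, function(w) {
    w <- w[nzchar(w) & !(w %in% stop_words)]
    paste(w, collapse = " ")
  }, character(1))
}

clean_tweet("Loving the new #rstats release!!  See https://example.com @someone")
# → "loving new release see"
```

The function is vectorized, so it can be applied directly to a whole column of tweet text.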

A document-term matrix or term-document matrix is a mathematical matrix that describes the frequency of terms that occur in a collection of documents.
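To make the definition concrete, the sketch below builds a small document-term matrix in base R, with one row per document and one column per term. The function name `build_dtm` and the three sample documents are hypothetical; it assumes the text has already been cleaned and lowercased.

```r
# Hypothetical helper: counts how often each term occurs in each document.
build_dtm <- function(docs) {
  # Tokenize on white space and discard empty tokens.
  tokens <- lapply(strsplit(docs, "[[:space:]]+"), function(w) w[nzchar(w)])
  vocab  <- sort(unique(unlist(tokens)))

  # Rows are documents, columns are terms, cells are term frequencies.
  dtm <- matrix(0L, nrow = length(docs), ncol = length(vocab),
                dimnames = list(paste0("doc", seq_along(docs)), vocab))
  for (i in seq_along(tokens)) {
    if (length(tokens[[i]]) > 0) {
      counts <- table(tokens[[i]])
      dtm[i, names(counts)] <- as.integer(counts)
    }
  }
  dtm
}

docs <- c("r is fun", "twitter data in r", "fun fun data")
dtm  <- build_dtm(docs)
dtm["doc3", "fun"]   # → 2
colSums(dtm)         # total frequency of each term across all documents
```

Transposing the result with `t(dtm)` gives the term-document form. For larger corpora, the tm package's `DocumentTermMatrix()` offers a sparse representation of the same idea.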