POTATo: a Python toolkit to analyze and visualize chat histories
Panda Online Text Analysis Tool
Find documentation and extensive guides here: https://alicebarbe.github.io/Pandas-Online-Text-Analysis-Tool/
This collection of Python visualizations and analyses of chat messaging
histories was born out of our desire to see the evolution of our relationships
through our messages. The visualizations currently provided are a small sample
of what could be done with the collected data and the conversation metrics
we examine, but we judged these to be the most interesting plots for now.
make_bin_plots generates what we call “binned” plots: the chat messaging
history is sliced into time intervals (for example, days), and one data point
is computed for each slice (for example, the number of words written that day).
The resulting series is then plotted as an area graph.
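To make the binning idea concrete, here is a minimal sketch that uses only the standard library. It is not the toolkit's own code: the `(timestamp, sender, text)` tuple layout, the sample messages, and the `words_per_day` helper are all assumptions made for illustration.

```python
from collections import Counter
from datetime import datetime

# Hypothetical sample data: (timestamp, sender, text) tuples.
messages = [
    (datetime(2020, 1, 1, 9, 30), "alice", "good morning to you"),
    (datetime(2020, 1, 1, 21, 5), "bob", "good night"),
    (datetime(2020, 1, 3, 12, 0), "alice", "lunch today"),
]


def words_per_day(messages):
    """Map each calendar date to the total number of words sent that day."""
    counts = Counter()
    for timestamp, _sender, text in messages:
        # One bin per calendar day; whitespace-separated tokens count as words.
        counts[timestamp.date()] += len(text.split())
    return counts


daily = words_per_day(messages)
for day in sorted(daily):
    print(day, daily[day])
```

Days without any messages are simply absent from the result, so a plotting step would likely want to fill them with zeros before drawing the area graph.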
make_aggregate_plots generates plots using the text messaging history as a
whole, ignoring time evolution. These consist of bar graphs of overall word
and text counts per person, a scatterplot of the number of words per
conversation, and a scatterplot of the number of words versus the duration of
each conversation.
make_heatmap_plots generates heatmaps with hours on the x-axis and days on the
y-axis (though the matrix can be changed to an arbitrary size).
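The data behind such a heatmap can be sketched as a matrix of message counts. The example below assumes that "days" means days of the week (7 rows) and that hours are the 24 hours of the day (24 columns); the `weekday_hour_matrix` helper and the sample timestamps are hypothetical and are not taken from the toolkit.

```python
from datetime import datetime


def weekday_hour_matrix(timestamps):
    """Count messages in a 7 x 24 grid: rows are weekdays (Monday = 0), columns are hours."""
    matrix = [[0] * 24 for _ in range(7)]
    for ts in timestamps:
        matrix[ts.weekday()][ts.hour] += 1
    return matrix


# Hypothetical sample timestamps.
timestamps = [
    datetime(2020, 1, 1, 9, 30),  # a Wednesday morning
    datetime(2020, 1, 8, 9, 45),  # the following Wednesday, same hour
    datetime(2020, 1, 6, 22, 0),  # a Monday evening
]

heat = weekday_hour_matrix(timestamps)
```

Each row of `heat` then corresponds to one line of the heatmap, with the cell value driving the color intensity.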
make_word_ratio_scatterplots generates a scatterplot with each point
representing a word, its position on the x-axis representing the ratio of its
use between two senders, and its position on the y-axis representing how many
times it occurs in the messaging history. This is the only plot that does not
work for more than two senders.
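As a rough illustration of the coordinates behind this plot, the sketch below computes one (x, y) pair per word. The exact ratio the toolkit uses is not stated here, so this example assumes x is the first sender's share of a word's total uses (a value between 0 and 1) and y is the total number of uses. The `word_ratios` helper and the sample messages are hypothetical.

```python
from collections import Counter


def word_ratios(messages, sender_a, sender_b):
    """Return {word: (share of uses by sender_a, total uses)} for two senders."""
    counts = {sender_a: Counter(), sender_b: Counter()}
    for sender, text in messages:
        if sender in counts:
            # Lowercase so that "Hello" and "hello" count as the same word.
            counts[sender].update(text.lower().split())

    points = {}
    for word in counts[sender_a].keys() | counts[sender_b].keys():
        a = counts[sender_a][word]
        b = counts[sender_b][word]
        total = a + b  # always at least 1, since the word came from one of the counters
        points[word] = (a / total, total)
    return points


# Hypothetical sample data: (sender, text) pairs.
messages = [
    ("alice", "hello there"),
    ("bob", "hello hello friend"),
]

points = word_ratios(messages, "alice", "bob")
```

Using a share rather than a raw quotient avoids dividing by zero for words that only one sender uses. It also shows why the plot is limited to two senders: a single x-axis can only place a word between two extremes.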
This toolkit was developed for Python 3.7 or later. Additionally, the following
packages are needed: