A novel framework for learning latent information diffusion mechanism in online discussion forum
A nested point process in modeling social activities on online social forums,
which is a a class that models the arrival in time of random events and their
interaction with the state of a system.
This repository contains three components:
This code is written in Python. To use it you will need:
It is recommended to use Anaconda since it includes all the Python-related dependencies
The data used in this project can be downloaded from this
link.
To produce the model-required Reddit data, one can also utilize Pushshift API to download Reddit threads and reply data in a given time range of different subforums.
To train the model, try to run the ntpp.py.
python ntpp.py
Note that the parameter value ranges are hyper-parameters, and different range
may result different performance in different dataset, be sure to tune
hyper-parameters carefully.