TrAnET: Tracking and Analyzing the Evolution of Topics in Information Networks

The dataset is available in the dataset folder and is encoded in the ArnetMiner v8 format.
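
For readers unfamiliar with the format, here is a minimal sketch of reading such records. It assumes the line prefixes commonly documented for the ArnetMiner citation dumps (`#*` title, `#@` authors, `#t` year, `#c` venue, `#index` paper id, `#%` one reference id per line, `#!` abstract) and blank lines between records. The function name, field names, and sample text are illustrative, not taken from this repository; check them against the extracted file before relying on them.

```python
# Assumed ArnetMiner-style prefixes mapped to illustrative field names.
FIELDS = {
    "#index": "id",
    "#*": "title",
    "#@": "authors",
    "#t": "year",
    "#c": "venue",
    "#!": "abstract",
}


def parse_records(lines):
    """Yield one dict per paper from an iterable of text lines."""
    record = {}
    for raw in lines:
        line = raw.strip()
        if not line:
            # A blank line closes the current record (assumption).
            if record:
                yield record
                record = {}
            continue
        if line.startswith("#%"):
            # References repeat, one id per line; skip empty ones.
            ref = line[2:].strip()
            if ref:
                record.setdefault("references", []).append(ref)
            continue
        for prefix, name in FIELDS.items():
            if line.startswith(prefix):
                record[name] = line[len(prefix):].strip()
                break
    if record:
        # Emit the last record when the input lacks a trailing blank line.
        yield record


# Invented sample text, only to show the expected shape of the input.
SAMPLE = """#*A Sample Title
#@Author One, Author Two
#t2001
#cSample Venue
#index1
#%0
#!Sample abstract.

#*Another Title
#t2005
#index2
"""

papers = list(parse_records(SAMPLE.splitlines()))
```

The author string is kept unsplit because the separator differs between dataset versions; to read the real file, pass an open file handle to `parse_records` instead of the sample lines.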

To reassemble the split archive and unzip it:

cat merged-dataset-v8-splitted.z* > merged-dataset-v8-splitted-complete.zip
unzip merged-dataset-v8-splitted-complete.zip

Install the dependencies:

pip install -r requirements.txt

Download the nltk data:

a) Open a Python shell:

python

b) Import nltk and download the lemmatizer (wordnet) and tokenizer (punkt) data:

import nltk
nltk.download('wordnet')
nltk.download('punkt')
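
The two downloads above can also be done in one step that skips packages already present. The following is a sketch, not part of this repository: the helper names are illustrative, and the lookup paths `corpora/wordnet` and `tokenizers/punkt` are assumed to be where NLTK stores these two packages.

```python
# Package name -> lookup path inside the NLTK data directory (assumed paths).
RESOURCES = {
    "wordnet": "corpora/wordnet",
    "punkt": "tokenizers/punkt",
}


def ensure_resources(resources, find, download):
    """Download each package whose path `find` cannot locate.

    `find` is expected to raise LookupError for a missing resource,
    as nltk.data.find does. Returns the names that were downloaded.
    """
    fetched = []
    for package, path in resources.items():
        try:
            find(path)
        except LookupError:
            download(package)
            fetched.append(package)
    return fetched


def main():
    # Imported here so the helper above can be used without nltk installed.
    import nltk

    fetched = ensure_resources(RESOURCES, nltk.data.find, nltk.download)
    print("downloaded:", ", ".join(fetched) if fetched else "nothing, all present")
```

Paste this into the Python shell and call `main()`; running it a second time should report that nothing needed downloading.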

Install python3-dev:

sudo apt-get install python3-dev
