
aherzog / dynamic_eigenvector_centralities


This project forked from mgglenn/dynamic_eigenvector_centralities


Implementation of DEC algorithm to detect emerging keywords/events from microblogging stream (Twitter).

Python 100.00%


Dynamic Eigenvector Centralities

The original code for this project was developed by Neela Avudaiappan. This version is a fork of the implementation by Grace Glenn, who fixed bugs and restructured the code.

Paper / Purpose

This code serves as an implementation of the work on calculating keywords and their emerging importance outlined in:

Neela Avudaiappan, Alexander Herzog, Sneha Kadam, Yuheng Du, Jason Thatcher, and Ilya Safro, "Detecting and summarizing emergent events in microblogs and social media streams by dynamic centralities", in Proceedings of the 2017 IEEE International Conference on Big Data, 2017.

Results

The results in the paper have been replicated on the Boston dataset using time intervals of 60 and 15 minutes; see boston_examples.

Requirements

All code is run in Python 3.6 (Anaconda 4.3.0)
Data to be processed should be stored in ordered text files, one per time interval (e.g., file1.txt, file2.txt, ..., fileN.txt for N intervals, or some other numbered format).
Each text file should contain one document (i.e., one tweet) per line.
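
The input layout described above can be produced in many ways. The following is a minimal sketch, not part of this repository, that assumes the tweets are already available in memory as (timestamp, text) pairs. The function name write_interval_files, its parameters, and the file1.txt naming pattern are illustrative choices; the repository's own break_files.py may work differently.

```python
import os
from collections import defaultdict
from datetime import datetime, timedelta


def write_interval_files(tweets, out_dir, minutes=60):
    """Hypothetical helper: group (timestamp, text) pairs into fixed-width
    intervals and write one numbered file per interval, one tweet per line."""
    if not tweets:
        return []
    os.makedirs(out_dir, exist_ok=True)
    start = min(ts for ts, _ in tweets)
    width = timedelta(minutes=minutes)

    buckets = defaultdict(list)
    for ts, text in sorted(tweets, key=lambda pair: pair[0]):
        index = int((ts - start) / width)  # 0-based interval number
        # Collapse internal whitespace so each tweet stays on a single line.
        buckets[index].append(" ".join(text.split()))

    paths = []
    # Number files consecutively; an interval with no tweets yields an empty file.
    for n in range(max(buckets) + 1):
        path = os.path.join(out_dir, f"file{n + 1}.txt")
        with open(path, "w", encoding="utf-8") as handle:
            for line in buckets.get(n, []):
                handle.write(line + "\n")
        paths.append(path)
    return paths
```

Names such as file10.txt sort before file2.txt alphabetically, so if the files are read in lexicographic order, a zero-padded pattern (file01.txt, file02.txt, ...) may be the safer choice. How dec_main.py orders its input files, and how it treats empty intervals, is not stated here and should be checked against the code.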

Usage

Ensure all requirements are satisfied. The program can be run as follows.

# after repo has been downloaded
cd dynamic_eigenvector_centralities
pip install -r requirements.txt
python dec_main.py --input_folder /home/username/time_series_data/ --P 6 --output_folder /home/username/dec_results/

Files

dec_main.py runs the full algorithm to compute the DEC values described in the paper
dec_graph.py contains code for the graph logic of the algorithm
dec_text.py contains code for preprocessing and cleaning the data
break_files.py a useful script for dividing time-series CSV data
