hashtag_recommender's Introduction

Twitter Hashtag Recommendation System

This Exploratory Project builds a hashtag recommendation system for Twitter.

Background

This project is part of the Exploratory Project included in semester IV of the 2015-16 academic session. It was carried out by Deepak Yadav and Faizan Khan, Department of Computer Science and Engineering, IIT(BHU), under the guidance of Associate Professor Dr. S.K.Singh. The primary aim of the project was to help students apply their skills to a real-world project and gain experience.

Details

The major details for the coding project are as follows:

Problem Definition

Hashtags give users a tagging mechanism to organize, group, and increase the visibility of their posts. The idea is simple, but choosing suitable hashtags can be challenging in practice, which leads to infrequent usage. Hashtag recommendation comes with numerous challenges of its own, including processing huge volumes of streaming data and working with content that is short and noisy. We use preprocessing methods to reduce noise in the data and look for an effective recommendation method that yields better suggestions.

Objective

To build a Twitter Hashtag Recommendation System. The system recommends other hashtags to the user based on the input they provide.

  • Domain : Twitter Tweets retrieved with the Twitter API
  • Purpose : Recommend similar hashtags to users
  • Personalization Level : Generic
  • Interfaces : Explicit Input, Recommended Output
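The README does not state which recommendation method the project uses. As one possible illustration of "explicit input, recommended output", the sketch below ranks hashtags by how often they co-occur with the input hashtag in the same tweet. The co-occurrence approach, the function names `build_cooccurrence` and `recommend`, and the sample data are all assumptions made for this example, not the project's actual implementation.

```python
from collections import Counter, defaultdict
from itertools import permutations


def build_cooccurrence(tweets_hashtags):
    """Count how often each ordered pair of hashtags appears in the same tweet."""
    cooc = defaultdict(Counter)
    for tags in tweets_hashtags:
        unique = {t.lower() for t in tags}  # ignore case and duplicates
        for a, b in permutations(unique, 2):
            cooc[a][b] += 1
    return cooc


def recommend(cooc, hashtag, k=3):
    """Return up to k hashtags that co-occur most often with the input."""
    counts = cooc.get(hashtag.lower(), {})
    # Sort by descending count, then alphabetically for a stable order.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [tag for tag, _ in ranked[:k]]


# Hypothetical sample: one list of hashtags per tweet.
sample = [
    ["python", "coding", "dev"],
    ["python", "coding"],
    ["python", "data"],
    ["music", "live"],
]
model = build_cooccurrence(sample)
print(recommend(model, "Python"))  # → ['coding', 'data', 'dev']
```

Because the personalization level is generic, a single shared co-occurrence table like this would serve every user; no per-user state is needed.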

Contributing to the Project

We will be more than happy to improve upon this basic project. Any advice is appreciated. Here is how you can contribute to the project:

  • The easiest way to contribute is to point out issues.
  • Send us PRs with what you think can be enhanced in the project.
  • Contact us (see below) about features you would like to see in the project and are willing to contribute to.

Requirements

The basic requirements of the project are as follows (Note: The project was developed with the following specifications and may also work with earlier versions):

Resources used in the project

Getting Data

The recommender uses the Twitter API to obtain the raw tweets for preprocessing. To obtain a large volume of data, the Streaming API is used. This tutorial provides a very basic introduction to getting tweets with a Python library called Python Twitter Tools, which connects to the Twitter API and downloads the data.
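Connecting to the live stream requires API credentials, so the sketch below covers only the step after download: turning streamed JSON lines into (text, hashtags) records. It assumes the classic v1.1-style tweet payload, where hashtags appear under `entities.hashtags[].text`; the function names and sample lines are hypothetical and not taken from the project.

```python
import json


def extract_hashtags(tweet):
    """Return lowercase hashtag texts from one tweet dict (v1.1-style payload)."""
    entities = tweet.get("entities") or {}
    return [h["text"].lower() for h in entities.get("hashtags", []) if "text" in h]


def parse_stream(lines):
    """Yield (text, hashtags) for each line that is a tweet carrying hashtags."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # streams may send blank keep-alive lines
        try:
            msg = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed lines
        if not isinstance(msg, dict) or "text" not in msg:
            continue  # skip non-tweet messages such as delete notices
        tags = extract_hashtags(msg)
        if tags:
            yield msg["text"], tags


# Hypothetical lines as they might arrive from a stream or a saved dump.
raw_lines = [
    '{"text": "Learning #Python today", "entities": {"hashtags": [{"text": "Python"}]}}',
    "",
    '{"delete": {"status": {"id": 1}}}',
    '{"text": "no tags here", "entities": {"hashtags": []}}',
    "not json",
]
records = list(parse_stream(raw_lines))
print(records)  # → [('Learning #Python today', ['python'])]
```

With Python Twitter Tools, the same `parse_stream` logic could be adapted to the dicts the library's stream iterator yields, skipping the `json.loads` step.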

Preprocessing

The tweets are cleaned in several steps before processing.
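The individual cleaning steps are not listed here. As a rough sketch of what tweet cleaning commonly involves, the example below lowercases the text, removes URLs and @mentions, strips punctuation while keeping the `#` of hashtags, and drops a few stopwords. The specific steps, the regular expressions, and the tiny stopword set are assumptions for illustration, not the project's actual pipeline.

```python
import re

URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
NON_WORD_RE = re.compile(r"[^a-z0-9#\s]")  # keep '#' so hashtags survive
# Deliberately tiny, hypothetical stopword list ('rt' marks retweets).
STOPWORDS = {"a", "an", "the", "is", "to", "and", "of", "in", "rt"}


def clean_tweet(text):
    """Return a list of cleaned tokens for one tweet."""
    text = text.lower()
    text = URL_RE.sub(" ", text)       # drop links
    text = MENTION_RE.sub(" ", text)   # drop @user mentions
    text = NON_WORD_RE.sub(" ", text)  # drop punctuation and symbols
    return [tok for tok in text.split() if tok not in STOPWORDS and tok != "#"]


print(clean_tweet("RT @user: Loving the new #Python release!! https://t.co/abc"))
# → ['loving', 'new', '#python', 'release']
```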

App GUI

The app uses a very basic interface developed using the Python Tkinter module. This basic tutorial may be helpful.
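A minimal Tkinter layout for this kind of app might look like the sketch below: one entry box for the input hashtag, a button, and a label for the output. The widget arrangement, the `build_app` and `format_recommendations` names, and the `recommend_fn` callback are hypothetical; the project's real interface may differ.

```python
def format_recommendations(tags):
    """Turn a list of hashtags into the text shown in the result label."""
    if not tags:
        return "No recommendations found."
    return "Try: " + " ".join("#" + t.lstrip("#") for t in tags)


def build_app(recommend_fn):
    """Build the window; recommend_fn maps an input string to a list of tags."""
    # Imported here so the formatting helper can be used without a display.
    import tkinter as tk

    root = tk.Tk()
    root.title("Hashtag Recommender")

    entry = tk.Entry(root, width=30)
    entry.pack(padx=10, pady=5)

    result = tk.StringVar(master=root, value="Enter a hashtag and press Recommend.")

    def on_click():
        # Read the explicit input and show the recommended output.
        result.set(format_recommendations(recommend_fn(entry.get().strip())))

    tk.Button(root, text="Recommend", command=on_click).pack(pady=5)
    tk.Label(root, textvariable=result).pack(padx=10, pady=5)
    return root


# To launch the window with a placeholder recommender:
# build_app(lambda tag: ["coding", "dev"] if tag else []).mainloop()
```

Keeping the formatting logic outside the GUI code makes it easy to check without opening a window.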

Contact details

Please contact us regarding any issues and/or suggestions.

Deepak Yadav

Department of Computer Science and Engineering IIT(BHU), Varanasi, India

Faizan Uddin Fahad Khan

Department of Computer Science and Engineering IIT(BHU), Varanasi, India
