Code Monkey home page Code Monkey logo

enron-email's Introduction

Exploration of Enron-Emails dataset

https://enrondetective.herokuapp.com/

Objective:

Data exploration of +500K emails to find describing insights of the company anditeration among its employees.

About the data

Enron-Email dataset

The Enron email dataset contains approximately 500,000 emails generated by employees of the Enron Corporation. It was obtained by the Federal Energy Regulatory Commission during its investigation of Enron's collapse

Version of dataset: May 7th, 2015 , as published at (https://www.cs.cmu.edu/~./enron/)

Techniques used to explore:

  • Natural Language Processing

Applied for data cleaning as well as tokenization to extract main content. network

  • Work in Pandas

Helpful to find first insights and stats from the data. For example, looking at the email traffic was possible to identify working hours, holidays and hiring seasons. network

  • Sentiment Analysis

Specific departments among the conversations were selected in order to evaluate and compare email content, sentiments defined by the content and its variation according with the hour of work.

  • Network-analysis

To visualy find the iterations between employees and identify "islands of information", botleneck in comunication and most relevant actors in the company.

network

Results

Available at : (https://enrondetective.herokuapp.com/)

Collaborators:

Chirag Sharma Wim Christiaansen

enron-email's People

Contributors

chiragsharma8 avatar nedraki avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.