Code Monkey home page Code Monkey logo

data-analysis's Introduction

math and data analysis functions

shingling

  • k-shingles generation
  • minhashing

jaccard similarity

  • jaccard similarity calculation
  • jaccard distance calculation
  • jaccard conditional comparaison

adwords problem

  • greedy_adwords
  • balance_adwords
  • generalized_balance_adwords

frequency problem

  • items frequency
  • the algorithm of savasere, omniescinski and navathe

graph problem

  • graph construction
  • shortest_path
  • longest path
  • centrality
  • independent graphs detection
  • clustering_coef
  • dijkstra
  • dijkstra with heap

recommendation problem

  • hamming distance
  • euclidean distance
  • pearson correlation
  • tanimoto score
  • euclidean similarity
  • pearson similarity
  • tanimoto similarity
  • top similars
  • top similar with map reduce
  • recommendation user filtred
  • recommendation item filtred

Radix tree

  • insert
  • remove
  • search
  • longest prefix

Decision tree

  • Divide data
  • Gini impurity
  • Entropy
  • Variance
  • Buil tree
  • Prune
  • Classify
  • Draw tree

Page Rank

A very simple version/implementation of the page rank algorithm.

  • Page rank
  • Advanced version of page rank, topic sensitive
  • spam farms
  • spam farms
  • trust rank
  • Hiperlink induced topic search
  • Map reduce to efficiently calculates the page rank
  • Jaccard simiarity to be found in data analysis repo

Map-Reduce

Implementation of map reduce, and some examples.

  • Map Reduce class
  • Estimation of pi number
  • Calculation of frequency of Items from multiple files

data-analysis's People

Contributors

mmourafiq avatar

Watchers

Qian Zhai avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.