Code Monkey home page Code Monkey logo

topnkof's Introduction

Experimental Code for KBS: Scalable KDE-based Top-n Local Outlier Detection over Large-Scale DataStreams

Overview

The detection of local outliers over high-volume data streams is critical for diverse real-time applications in the real world, where the distributions in different subsets of the data tend to be skewed. However, existing methods are not scalable to large-scale high-volume data streams owing to the high complexity of the re-detection of data updates. In this work, we propose a top-n local outlier detection method based on Kernel Density Estimation (KDE) over large-scale high-volume data streams.

Main Methods

The proposed method consists two versions: UKOF and LUKOF method.

Main Class for UKOF method: cellpruning.lof.pruning.ComputeTopNKDE

Main Class for LUKOF method: cellpruning.lof.pruning.ComputeTopNKDE_LazyUpdate

Environment

  • Eclipse

  • Build and Use the Software Artifact

1.Open Eclipse

2.Import the code named "TopNKOF"

3.Set parameters in "util.SQConfig", such as the number of nearest neighbors k, top outliers n, window size w and slide size s.

4.Run the corresponding main methods for UKOF and LUKOF, namely "cellpruning.lof.pruning.ComputeTopNKDE" and "cellpruning.lof.pruning.ComputeTopNKDE_LazyUpdate"

Dataset

extensive experiments are conducted on ten real-world and synthetic datasets. The real-world datasets are extracted from UCI Machine Learning Repository.

synthetic dataset: Interchanging RBF, Moving Squares, Mixture RBF

real-word datasets: Vowels, KDDCup, Subhttp, Smtp, ForestCover, Mobike, GeoLife

topnkof's People

Contributors

liufang0812 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.