Code Monkey home page Code Monkey logo

kdd_cup2017's Introduction

KDD_CUP2017

问题简要描述:

My Solution for task1 本质上是一个时间序列预测问题。预测在未来的一段时间,通过高速公路多条路径的平均时间(20min的时间窗) 给出了历史5个月的历史记录

数据预处理: 0. 数据聚合操作 ? 对数变换

  1. 缺失值处理 - 邻近插值 2. 异常值处理 - 箱线图

特征工程: 用滑动窗口的方法产生训练集 提取的特征主要有:

1.前两小时的平均时间 2.前两小时的车流量 3.节假日(one-hot representation) 4.上周8点到10点的特征

模型选择: 1.线性回归 ? 分为数回归 quantile regression 2.LSTM 3. randomforest

模型融合: stacking

最终成绩 182/3582 top 5%

kdd_cup2017's People

Contributors

johnson0722 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.