kuaishou2018's Introduction

2018 China Collegiate Computing Contest: Big Data Challenge

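Competition Overview

The task is to predict which users will be active over an upcoming period, based on desensitized and sampled data. Teams design their own algorithms for data analysis and processing. Results are evaluated and ranked on online evaluation data using the specified metric, and the team with the best score wins.

Competition link: https://www.kesci.com/home/competition/5ab8c36a8643e33f5138cba4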

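Code and Approach

1. ExtractFeatures.ipynb

Feature extraction notebook. In the semi-final round, features were extracted from three training windows, each shifted 7 days from the previous one:

Window 1: behavior data from days -6 to 9; following week (label period) is days 10 to 16

Window 2: behavior data from days 1 to 16; following week (label period) is days 17 to 23

Window 3: behavior data from days 8 to 23; following week (label period) is days 24 to 30

The following snippet sets the window parameters: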

===============================================================================

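# Training window 1 (Python ranges exclude the end value)
TRAIN_PREDICT_DAY = range(10,17)   # label period: days 10 to 16
TRAIN_REGISTER_DAT = range(1,10)   # registration days 1 to 9
TRAIN_ACT_DAT = range(-6,10)       # behavior days -6 to 9

# Test window
TEST_PREDICT_DAY = range(31,38)    # prediction period: days 31 to 37
TEST_REGISTER_DAT = range(1,31)    # registration days 1 to 30
TEST_ACT_DAT = range(15,31)        # behavior days 15 to 30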

===============================================================================
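The three training windows and the test window all follow the same pattern, shifted by a multiple of 7 days. The sketch below shows one way to generate them from a single offset. The `make_window` helper and its dictionary keys are hypothetical and do not come from the notebook. Starting the registration range at day 1 for windows 2 and 3 is an assumption extrapolated from the two parameter sets shown above.

```python
def make_window(offset, base_act_start=-6, base_act_end=9, horizon=7):
    """Return the day ranges for a window shifted by `offset` days.

    Hypothetical helper: the defaults reproduce window 1 from the README
    (behavior days -6 to 9, label days 10 to 16).
    """
    act_start = base_act_start + offset
    act_end = base_act_end + offset
    return {
        # Days whose behavior logs feed the features (end is exclusive).
        "act_days": range(act_start, act_end + 1),
        # Assumption: users registered from day 1 up to the last behavior day.
        "register_days": range(1, act_end + 1),
        # The following week, used as the label (or prediction) period.
        "predict_days": range(act_end + 1, act_end + 1 + horizon),
    }


# Three training windows, each shifted 7 days from the previous one.
train_windows = [make_window(7 * k) for k in range(3)]

# The test window is the same pattern shifted by 21 days.
test_window = make_window(21)

for i, w in enumerate(train_windows, start=1):
    print(f"window {i}: act {w['act_days']}, predict {w['predict_days']}")
```

Changing the multiplier `7` to 4, 5, or 6 would give differently spaced windows, though how the notebook derives those variants is not stated here.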

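2. train.ipynb

Model training notebook. In the semi-final round, the single LightGBM (lgb) model scored 0.9121 and the single CatBoost (cat) model scored 0.9120.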

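Ensembling Approach

1. Weighted averaging or blending of different models.

2. Ensembling over features built with different window spacings. The feature windows above are spaced 7 days apart, and features can be extracted the same way with spacings of 4, 5, and 6 days. A separate model is trained for each spacing and the models are then ensembled, with weights positively correlated with the spacing. In the semi-final round this makes fuller use of the training data and raised the score noticeably.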
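A minimal sketch of the second approach, assuming each spacing's model has already produced one probability per user in the same user order. The README only says the weights are positively correlated with the spacing; making them directly proportional to it is an assumption, and `blend_by_spacing` is a hypothetical helper, not code from the notebooks.

```python
def blend_by_spacing(preds_by_spacing):
    """Blend per-spacing predictions with weights proportional to the spacing.

    preds_by_spacing: dict mapping window spacing (e.g. 4, 5, 6, 7) to a list
    of predicted probabilities, one per user, in the same user order.
    """
    lengths = {len(p) for p in preds_by_spacing.values()}
    if len(lengths) != 1:
        raise ValueError("all prediction lists must have the same length")
    n_users = lengths.pop()

    # Assumption: weight = spacing / sum of spacings, so the weights sum to 1.
    total = sum(preds_by_spacing.keys())
    blended = [0.0] * n_users
    for spacing, preds in preds_by_spacing.items():
        weight = spacing / total
        for i, p in enumerate(preds):
            blended[i] += weight * p
    return blended


# Made-up probabilities for three users, purely for illustration.
example = {
    4: [0.20, 0.90, 0.50],
    5: [0.30, 0.80, 0.50],
    6: [0.25, 0.85, 0.50],
    7: [0.10, 0.95, 0.50],
}
blended = blend_by_spacing(example)
print(blended)
```

The same weighted average can be used for the first approach by keying the dictionary on hand-picked model weights instead of spacings.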

kuaishou2018's People

Contributors

zesenchen
