Code Monkey home page Code Monkey logo

rossman's Introduction

rossmann-kaggle-udacity


Kaggle Rossmann Store Sales 竞赛,我的深度神经网络模型,在Private LeaderBoard上评分~0.103, 能排到第二名。

Requirements

项目中我使用R语言进行数据探索、可视化和特征工程等工作,R的版本是x64 3.4.3。需要安装的R包有:

  • zoo
  • data.table
  • rlist
  • ggplot2
  • rhdf5

我使用Python3.6 构建模型,需要安装的Python库有:

  • numpy
  • pandas
  • matplotlib
  • tensorflow
  • keras
  • xgboost
  • lightgbm

Run

  1. 检查本项目应该包含以下子目录: googletrend, input, models, output, R, weather

  2. 提取特征(如果直接使用output目录下的all_data.h5,则可以省略该步骤):

    • 改变工作目录至R子目录
    • 运行脚本data.R,这个过程大约需花费3到5分钟,它将在output目录下生成名为all_data.h5的特征文件。
  3. 安装并配置Kaggle API, 参见 https://github.com/kaggle/kaggle-api

  4. 简单XGBoost模型: python ross_xgb.py

  5. Entity-Embedding模型:

    • 修改ross_main.py的设置
    MODEL = NN_Embedding_Base
    N_NETWORKS = 1
    EPOCHS = 20
    
    • python ross_main.py
  6. EE-Residual模型:

    • 修改ross_main.py的设置
    MODEL = NN_Embedding
    N_NETWORKS = 1
    EPOCHS = 25
    
    • python ross_main.py
  7. EE-tree模型: python ross_ee_tree.py

  8. 最终提交的融合模型:

    • 修改ross_main.py的设置
    MODEL = NN_Embedding
    N_NETWORKS = 10
    EPOCHS = 25
    
    • python ross_main.py

rossman's People

Contributors

jeansding avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.