

Janatahack-Customer-Segmentation

A weekend hackathon on customer segmentation, organised by Analytics Vidhya (AV).

Team: Last but not the least

Team members: Atif Hassan and Sayantan Basu

Public Leaderboard: 5

Dataset format

[Image: dataset format]

One of the main tricks in this hackathon was to identify the IDs common to Train.csv and Test.csv. Simply copying the target values for those common IDs from Train.csv to Test.csv raised the leaderboard score from 0.54 to 0.91.
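The ID-matching step described above can be sketched with pandas as follows. This is a minimal illustration, not the team's actual code: the column names `ID` and `Segmentation`, the helper name `copy_leaked_targets`, and the tiny example frames are all assumptions made for the example.

```python
import pandas as pd


def copy_leaked_targets(train, test, id_col="ID", target_col="Segmentation"):
    """Return a copy of `test` whose target is filled for IDs also present in `train`.

    `id_col` and `target_col` are assumed column names, not taken from the source.
    """
    # Build an ID -> target lookup; if an ID repeats in train, keep its first row.
    id_to_target = train.drop_duplicates(subset=id_col).set_index(id_col)[target_col]
    out = test.copy()
    # IDs that do not occur in train map to NaN and still need a model prediction.
    out[target_col] = out[id_col].map(id_to_target)
    return out


# Made-up data standing in for Train.csv and Test.csv.
train = pd.DataFrame(
    {"ID": [1, 2, 3], "Age": [22, 35, 41], "Segmentation": ["A", "B", "C"]}
)
test = pd.DataFrame({"ID": [2, 4, 3], "Age": [35, 50, 41]})

filled = copy_leaked_targets(train, test)
# Rows whose target is still missing are the ones left for the model.
needs_model = filled[filled["Segmentation"].isna()]
print(filled)
print(needs_model)
```

Rows in `needs_model` correspond to the "remaining part" of the test set, where no training row shares the ID.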

Once this leak was identified, the sole goal was to build a model that improved predictions on the remaining part of Test.csv. A simple XGBoost model gave us ~0.94 on the leaderboard.

Finally, with some more feature engineering and fine-tuning, we reached a score of ~0.96 on the public LB, which translated to ~0.95 on the private LB.

Train.csv and Test.csv are the original files provided in the competition. I combined these files and sorted the rows by ID to create a new file (say, Combined.csv). I then filled all the missing values in Combined.csv using ffill() and split the result back into New_train.csv and New_test.csv. These two files are therefore modified versions of the originals in which every missing value is filled from the previous row of the combined, ID-sorted data.
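The combine, sort, fill, and split procedure above might look roughly like the sketch below. It is an illustration under stated assumptions rather than the original script: the column names `ID` and `Segmentation`, the helper column `_source`, and the function name are hypothetical. The target column is deliberately left out of the forward fill here, which is also an assumption; the description does not say how the target was handled.

```python
import pandas as pd


def ffill_across_train_test(train, test, id_col="ID", target_col="Segmentation"):
    """Forward-fill missing feature values over the ID-sorted union of train and test."""
    # Tag each row with its origin so the two sets can be separated again later.
    combined = pd.concat(
        [train.assign(_source="train"), test.assign(_source="test")],
        ignore_index=True,
    )
    combined = combined.sort_values(id_col, kind="stable").reset_index(drop=True)

    # Fill features only; the ID, target, and origin tag are left untouched (assumption).
    feature_cols = [
        c for c in combined.columns if c not in (id_col, target_col, "_source")
    ]
    # A missing value in the very first row has no previous row and stays missing.
    combined[feature_cols] = combined[feature_cols].ffill()

    new_train = combined[combined["_source"] == "train"].drop(columns="_source")
    new_test = combined[combined["_source"] == "test"].drop(
        columns=["_source", target_col], errors="ignore"
    )
    return new_train, new_test


# Made-up data standing in for Train.csv and Test.csv.
train = pd.DataFrame(
    {"ID": [1, 3, 5], "Age": [22.0, None, 41.0], "Segmentation": ["A", "B", "C"]}
)
test = pd.DataFrame({"ID": [2, 4], "Age": [None, 50.0]})

new_train, new_test = ffill_across_train_test(train, test)
print(new_train)
print(new_test)
```

In practice the two inputs would come from `pd.read_csv("Train.csv")` and `pd.read_csv("Test.csv")`, and the outputs would be written with `to_csv(..., index=False)` as New_train.csv and New_test.csv.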

Link to the competition page with LB details
