Code Monkey home page Code Monkey logo

database_project's Introduction

CS150A Database Project

Logistics

Please type your Chinese name and ID.

  • Your Name:
  • Your ID:

If you are a team, please write names and IDs of both people.

Due date: 23:59, January, 8th, 2023.

You need to finish an entire machine learning system on provided dataset. You do not need to implement a machine learning algorithm from scratch, you are free to call any existing libraries for data science.

Submission

Submissions are done via Gradescope and 上科大云盘 CS150A Project: http://pan.shanghaitech.edu.cn/cloudservice/outerLink/decode?c3Vnb24xNjY5OTAxNDA3MDQ0c3Vnb24=

You need to submit three parts.

  • Submit the report to gradescope
    • To form a team, remember to select your teammate when you are submitting at gradescope.
  • Submit the completed test.csv to CS150A Project/test.csv
    • Name it as [Student1 Name]_[Student1 ID](+[Student2 Name]_[Student2 ID])_test.csv
  • Submit your codes with an introduction file to CS150A Project/Code
    • Name it as [Student1 Name]_[Student1 ID](+[Student2 Name]_[Student2 ID]).zip

Your score of this project will be evaluated based on these three parts.

Note: For those who don't obey our submission rules, we'll give 0 point. If you have any questions about this, feel free to post them on Piazza.

Report

  • A report at most 4-page to describle the entire pipeline of your work. You should use the provided the report template, follow the guideline and instructions given in the template and fill into the corresponding part.

Answers of the rest in testset

  • We'll only offer a subset of correct answers for test data. To submit your results, you should complete the missing values of Correct First Attempt in test.csv, which means replace NaN with the value your model predicts.
  • Then you need to submit your completed test.csv to CS150A Project/test.csv (Don't submit train.csv.)

Codes

  • You need to upload your codes with an introduction file about how you organize your code.
  • Name your codes as Student1-Name_Student1-ID(_Student2-Name_Student2-ID).zip
  • Submit it to CS150A Project/Code

We'll do duplicate checking for all the submitted codes, so don't copy other people's codes.

Bonus

We'll offer additional points for those using PySpark to implement the algorithms. To earn the bonus, state clearly in the report about your implementation.

database_project's People

Contributors

justinhzc avatar yuhehao avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.