Code Monkey home page Code Monkey logo

dsc-1-12-04-obtaining-our-data-lab-online-ds-sp-000's Introduction

Obtaining Our Data - Lab

Introduction

In this lab you'll practice your munging and transforming skills in order to load in your data to solve a regression problem.

Objectives

You will be able to:

  • Understand the ETL process and the steps it consists of
  • Understand the challenges of working with data from multiple sources

Task Description

Your boss gives you a general description of some of the datasets at your disposal for analyzing weekly store sales. They're eventually looking for you to build a model to help determine what factors impact sales, and model future sales forecasting for business planning.

Most of the properietary store data sits in the company sql database, accessible by all managers and above. The database is called Walmart.db Your boss provides you with the following basic schema:

She then tells you that she's put together a second dataset on general economy statistics for the various dates that she would also like you to incorporate in your analysis. That data, she says, is stored in a file economy_data.csv.

As a first step in creating your model for providing recommendations and projections, load and synthesize these disperate datasets into a singular unified DataFrame. Then save your results to a file Merged_Store_Data.csv.

Make sure you check the various data types and merge appropriately.

# Your code here

Summary

Nice work! You're working more and more independently through the workflow, and ensuring data integrity!

dsc-1-12-04-obtaining-our-data-lab-online-ds-sp-000's People

Contributors

mathymitchell avatar loredirick avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.