Code Monkey home page Code Monkey logo

m-clark / data-processing-and-visualization Goto Github PK

View Code? Open in Web Editor NEW
30.0 5.0 13.0 129.73 MB

This document forms the basis of several workshops/talks that get into everyday programming with R, but also includes mirrored code in Python as Jupyter notebooks.

Home Page: https://m-clark.github.io/data-processing-and-visualization

License: Other

R 6.67% Jupyter Notebook 93.30% TeX 0.02% Python 0.01%
visualization data-processing data-science r dplyr datatable ggplot2 htmlwidgets tidyverse workshop

data-processing-and-visualization's Introduction

Practical Data Science

The focus of this document is on using R for data processing, programming, modeling, visualization, and presentation of results. It contains exercises for additional practice, and most of the content has been translated to Python and is available via Jupyter notebooks.

link

Outline

Part 1: Information Processing

  • Understanding Basic R Approaches to Gathering and Processing Data

    • Overview of Data Structures
    • Getting data in and out
    • Indexing
  • Getting Acquainted with Other Approaches to Data Processing

    • Pipes, and how to use them
    • tidyverse
    • data.table
    • Misc.

Part 2: Programming Basics

  • Using R more fully

    • Dealing with objects
    • Iterative programming
    • Writing functions
  • Going further

    • Code style
    • Vectorization
    • Regular expressions

Part 3: Modeling

  • Model Exploration

    • Key concepts
    • Understanding and fitting models
    • Overview of extensions
  • Model Criticism

    • Model Assessment
    • Model Comparison
  • Machine Learning

    • Concepts
    • Demonstration of techniques

Part 4: Visualization

  • Thinking Visually

    • Visualizing Information
    • Color
    • Contrast
    • and more...
  • Using ggplot2

    • Aesthetics
    • Layers
    • Themes
    • and more...
  • Adding Interactivity

    • Package demos
    • Shiny

Part 5: Presentation

  • Building Better Data-Driven Products
    • Reproducibility concepts
  • Starting out with R markdown
    • Standard documents
  • Customization and more
    • Themes, CSS, etc.

data-processing-and-visualization's People

Contributors

imgbotapp avatar m-clark avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

data-processing-and-visualization's Issues

data.table chapter issues

  • Change 'basics' title to data.table basics or something so as not to conflict with programming basics link
  • remove data_frame from code

summaries

A couple chapters have a 'Summary' section which leads to link problems. They need to be renamed more specifically, e.g. Summary of...

Add all exercises and more content to Jupyter notebooks

The initial notebooks were just code snippets. Later sections have copied more of the text, so to be consistent, the earlier ones need to be updated, and we want to add the exercises to all notebooks insofar as it's possible.

  • Part I
  • Part II
  • Part III
  • Part IV
  • Part V

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.