Code Monkey home page Code Monkey logo

bigdata's Introduction

Big Data Analytics

About

Lecture Materials for the course Big Data Analytics

Prerequisites

Compilation depends on bookdown and knitr

install.packages("bookdown")
install.packages("knitr")

In order to get the intended RStudio/Merbivore-style syntax highlighting in the ioslides output, copy style/prettify.js and style/r-lang.j to the respective ioslides folder in the rmarkdown library of your R installation, overwriting the default files with the same names (in OSX with R 3.4, this is: /Library/Frameworks/R.framework/Versions/3.4/Resources/library/rmarkdown/rmd/ioslides/ioslides-13.5.1/js/prettify/).

Install the packages used in the code examples:

install.packages("qdapRegex")
install.packages("pacman")
# get a list of all rmd files (slides and notes)
notes_files <- list.files("materials/notes", pattern = "\\.Rmd", full.names =TRUE)
slides_files <- list.files("materials/slides", pattern = "\\.Rmd", full.names = TRUE)
all_files <- c(notes_files, slides_files)

# parse the rmds, extract a list of package dependencies
rmds <- lapply(all_files, readLines)
to_install <- lapply(rmds, qdapRegex::rm_between, 
                     left = c("library(", "require("),
                     right = c(")", ")"),
                     extract = TRUE)
to_install <-  unique(na.omit(unlist(to_install)))
to_install <- to_install[! to_install %in% c("PACKAGE-NAME",  "<PACKAGE NAME>")]

# install all missing packages
pacman::p_load(char = to_install)

Compile all material

Run the this in the terminal to compile all materials (also tests all the R code in the examples):

sh makeall_rintro.sh

Course materials

  • See materials/slides for the weekly lecture slides.
  • See materials/notes for the weekly lecture notes.
  • See materials/sourcecode for the code examples shown in the notes.

Work with Git/GitHub

NOTE: Depending on your operating system, you might have to install Git manually before using it with RStudio. You will find detailed instructions here.

Clone this course's repository

  1. In RStudio, navigate to a folder on your hard-disk where you want to have a local copy of this course's GitHub repository.
  2. In RStudio, switch to the Terminal, and type the following command.
git clone https://github.com/umatter/BigData.git

This creates a new directory BigData with all the course material in it. Whenever there are some updates to the course's repository on GitHub, you can update your local copy with:

git pull

(Make sure you are in the BigData folder when running git pull.)

Fork this course's repository

  1. Go to https://github.com/umatter/BigData, click on the 'Fork' button in the upper-right corner (follow the instructions).

  2. Clone the forked repository (see the cloning of a repository above for details). Assuming you called your forked repository BigData-forked, you run the following command in the terminal (replacing <yourgithubusername>:

git clone https://github.com/`<yourgithubusername>`/BigData-forked.git
  1. Switch into the newly created directory:
cd BigData-forked
  1. Set a remote connection to the original repository
git remote add upstream https://github.com/umatter/BigData.git

You can verify the remotes of your local clone of your forked repository as follows

git remote -v

You should see something like

origin	https://github.com/<yourgithubusername>/BigData-forked.git (fetch)
origin	https://github.com/<yourgithubusername>/BigData-forked.git (push)
upstream	https://github.com/umatter/BigData.git (fetch)
upstream	https://github.com/umatter/BigData.git (push)
  1. Fetch changes from the original repository. New material has been added to the original course repository and you want to merge it with your forked repository. In order to do so, you first fetch the changes from the original repository:
git fetch upstream
  1. Make sure you are on the master branch of your local repository:
git checkout master
  1. Merge the changes fetched from the original repo with the master of your (local clone of the) forked repo.
git merge upstream/master
  1. Push the changes to your forked repository on GitHub.
git push

Now your forked repo on GitHub also contains the commits (changes) in the original repository. If you make changes to the files in your forked repo. you can add, commit, and push them as in any repository. Example: open README.md in a text editor (e.g. RStudio), add # HELLO WORLD to the last line of README.md, and save the changes. Then:

git add README.md
git commit -m "hello world"
git push

bigdata's People

Contributors

umatter avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.