Code Monkey home page Code Monkey logo

biglm's Introduction

About Me

Hi, my name is ZJ. I love contributing to the open-source data science community. I mainly code in R and Julia with a dash of Python and a minimal amount of Scala thrown in.

Projects

Project Language Notes
{disk.frame} R {disk.frame} https://diskframe.com is the most popular larger-than-RAM data manipulation framework in R.
SortingLab.jl Julia Some of the fastest sorting algorithms in Julia including faster String sorting and sortperm (i.e. R's order) algorithms
JDF.jl Julia A fast DataFrames serialization format and package.
ShortStrings.jl Julia A package that uses Integer (bits types) to store strings more efficiently. Great for sorting and group-by operations. It's been handed over to the JuliaString org. Note: you probably should be using InlineStrings.jl instead
TableScraper.jl Julia A simple scraper for well-formed tables from webapges
PkgVersionHelper.jl Julia A one function, upcheck(), package for checking if your Project.toml contains the most up-to-date packages
DataConvenience.jl Julia Some convenience functions for data manipulation and data related tasks
CuCountMap Julia Fast CUDA.jl based countmap of small types e.g. UInt8
Parquet writer in Julia Julia I wrote the Parquet writer in pure Julia which was contributed back to Parquet.jl

Other contriutions: I have also contributed a fast countmap in StatsBase.jl for small type.

GitHub stats

Social Media & websites

Platform Handle/URL Notes
Twitter @evalparse
LinkedIn daizj
Youtube Data Science ZJ
Baduk Go Weiqi Ratings A ranking list of the professional Go player strengths estimated from the last 365 days of games
Baduk Go Weiqi Ratings2 A ranking list of the professional Go player strengths estimated from the last 365 days of games

Interesting Projects I can no longer find time to maintain

Project Language Notes
JLBoost.jl Julia A pure Julia implementation of XGBoost-like boosting trees
ShinySky R A collection of Shiny widget. One of the earliest "popular" Shiny packages.
FastGroupBy Julia A fast group by functionality. I shared the ideas I had with the main developer of DataFrames.jl and DataFrames.jl were onto the same kind of ideas anyway, so they ended up optimizing the group-by. This makes it unnecesssary to keep maintaining the package.

Awesome lists

Category Notes
awesome-eda Exploratory Data Analysis
awesome-ml-fraemworks Machine Learning Framework
awesome-data-science-notebook-engines Data Science notebooks like Jupyter
awesome-visual-flow-data-science
awesome-markdown-table-editors
awesome-big-medium-data-frameworks
awesome-feature-engineering
awesome-flow

biglm's People

Contributors

xiaodaigh avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

Forkers

msaidf levykort1

biglm's Issues

Mixed effects

Hello,

Is there any way to include random effects with biglm?
(I have 1000 repeated measures for each individual)

I'd like to fit a big model (30GB) and lme4 is not able to deal with databases so big.

Regards.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.