
Welcome to my Homepage!

Senior Data Scientist ✨ Machine Learning 🚀 Model Development 🔧 Tool Development 💻 Software Programmer 📦 R Software 📈 Leadership



About Me

I love to solve problems.

Often the problem can be understanding a complex biological process, but it can also be as simple as fixing something that's broken (e.g. a door that jams, a bicycle, or even machine learning software). In particular, I like to apply my data science skills to better understand, or even solve, the problems we face.

Over the past 12+ years I have combined my statistical knowledge with open-source software tools to solve complex problems in the high-dimensional proteomics space of the Life Sciences. In so doing, I have created a comprehensive R-based machine learning analysis ecosystem that standardizes and enables biomarker discovery and predictive model development.

Sometimes the problem is inconsistency across teams or analysts ... thus I promote adherence to "tidy" data principles and am a strong proponent of reproducible research and the use of bioinformatics pipelines.

Other times the problem can be sharing results across the organization ... thus I develop Application Programming Interface (API) infrastructure that enables anyone to access model results with ease.

With my teaching background, I find it important to mentor junior team members while simultaneously leading more senior members. This collaborative spirit is essential to building an effective team that delivers to stakeholders, fosters a sense of accomplishment, and drives revenue generation.

I am always open to discussing possible roles 🔭 and whether my skill set can solve problems in your space. Please reach out via:

How | Where
📫 | Gmail Badge
☎️ | 970.631.9838
🔗 | www.linkedin.com/in/stu-field-sr-data-sci

Skills

Machine Learning 🚀 | Statistics 📊 | Open-Source 💻 | Software Tools 🔧
Random Forest | Logistic regression | R | Linux 🐧, MacOS 🍎
Naive Bayes | Linear regression | C++ | Git, GitHub :octocat:
Lasso/ridge regression | GLMMs | Python 🐍 | BASH, GNU
k-Nearest neighbour | Mixed-effects models | LaTeX | BitBucket
PCA | Survival analysis | CI/CD | Slack
Ensemble methods | Multivariate statistics | Docker 🐋 | AWS
Maximum Likelihood | ANOVA | Kubernetes |

Additional Skills

  • Analysis of high-throughput, multiplex, high-dimensional proteomics assay data
  • Accomplished leader driving small group projects to completion
  • Proven record of accomplishment via publication in peer-reviewed, international journals
  • Project development and management, experimental design, and data analysis

Other Interests

  • 💬 Favorite food: 🐟 🌮
  • 📚 I am currently learning woodworking 🪵 ... I'm not very good, but I can make a lot of sawdust!
  • 💬 Ask me about: bikes and R ... I'll talk your 👂 off!
  • 🚴 I'm an avid cyclist: come say hi on

More Details

  • I maintain several R software libraries (📦) that implement statistical and machine learning techniques in biomarker discovery. Some of my popular published (CRAN) 📦 are:
  • These projects support analyses in the general health care (Life Sciences) space, generating proteomic-based clinical insights in areas such as:
    • cardiovascular disease
    • liver disease (NASH/NAFLD)
    • alcohol effects
    • biological aging
    • exercise status
    • metabolic disease
  • Favorite techniques:
    • logistic regression (ol' faithful)
    • random forest
    • naive Bayes
    • KKNN (nearest neighbor)
    • survival analyses
    • ensemble methods
  • I am a proponent of open-source software, conducting the majority of my research/analysis via Linux toolkits, R, and the RStudio IDE.
  • I promote adherence to so-called "tidy" data principles, a data science philosophy of sharing a common underlying data structure, grammar, and format, which facilitates reproducible analyses.

🔧 Tools & Languages

🔧 GitHub Commits


📈 GitHub Stats

Stu's GitHub Stats

Contributions


🔗 Links & Resources


Stu Field's Projects

tech-notes

Repository containing technical notes on various statistics, machine learning, computer science, and related topics

test_plumber

Just a test plumber API to show the glmnet warning on Rsconnect deployment

testpkg

Just a test package to debug devtools and builds

testthat

An R 📦 to make testing 😀

theme-dinky

The Dinky theme for GitHub pages, ported to Jekyll Bootstrap.

tibble

A modern re-imagining of the data frame

tidy-tools-2018

The tidy-tools workshop with Hadley Wickham; this is an "unofficial" fork of the original repository used for the course

tidyr

Easily tidy data with spread and gather functions.

usethis

Set up commonly used 📦 components

white-pine-blister-rust

Full set of functions, subroutines, and data objects for the high-elevation white pine blister rust (WPBR) project, a collaboration between the Colorado State University Dept. of Biology/Mathematics and the USDA Forest Service

wtf-2019-rsc

What They Forgot to Teach You About R, 2019 January 15/16 @ rstudio::conf
