Welcome to my Homapage!

Senior Data Scientist ✨ Machine Learning 🚀 Model Development 🔧 Tool Development 💻 Software Programmer 📦 R Software 📈 Leadership

About Me

I love to solve problems.

Often the problem can be understanding a complex biological process, but it can also be as simple as fixing something that's broken (e.g. a door that jams, a bicycle, or even machine learning software). In particular, I like to apply my data science skills to better understand, or even solve, the problems we face.

Over the past 12+ years I have combined my statistical knowledge and Open-Source Software tools to solve complex problems in the Life Sciences proteomics (high dimensional) space. In so doing, I have created a comprehensive R-based machine learning analysis ecosystem that standardizes and enables biomarker discovery and predictive model development.

Sometimes the problem is inconsistency across teams or analysts ... thus I promote adherence of "tidy" data principles and am a strong proponent reproducible research and use of bioinformatics pipelines.

Other times the problem can be sharing results across the organization ... thus developing an Application Program Interface (API) infrastructure that enables anyone to access model results with ease.

With my teaching background, I find it important to mentor junior team members while simultaneously leading more senior members. This collaborative spirit is essential to building and effective team that delivers to stakeholders, fosters a sense of accomplishment, and drives revenue generation.

I am always open to discuss possible roles 🔭 and whether my skill set can solve problems in your space. Please reach out via:

How	Where
📫
☎️	970.631.9838
🔗	www.linkedin.com/in/stu-field-sr-data-sci

Skills

Machine Learning 🚀	Statistics 📊	Open-Source 💻	Software Tools 🔧
Random Forest	Logistic regression	R	Linux🐧, MacOS 🍎
Naive Bayes	Linear regression	C++	Git, GitHub
Lasso/ridge regression	GLMMs	Python 🐍	BASH, GNU
k-Nearest neighbour	Mixed-effects models	LaTeX	BitBucket
PCA	Survival analysis	CI/CD	Slack
Ensemble methods	Multivariate statistics	Docker 🐋	AWS
Maximum Likelihood	ANOVA		Kubernetes

Additional Skills

Analysis of high-throughput, multi-plex, high-dimensional, proteomics assay data
Accomplished leader driving small group projects to completion
Proven record of accomplishment via publication in peer reviewed, international journals
Project development and management, experimental design, and data analysis

Other Interests

💬 Favorite food: 🐟 🌮
📚 I am currently learning woodworking🪵 ... I'm not very good, but I can make a lot of sawdust!
💬 Ask me about: bikes and R ... I'll talk your👂 off!
🚴 I'm an avid cyclist: come say hi on

More Details

I maintain several R software libraries (📦) that implement statistical and machine learning techniques in biomarker discovery. Some of my popular published (CRAN) 📦 are:
These projects support analyses in the general health care (Life Sciences) space to generate proteomic based clinical insights in health spaces such as:
- cardiovascular disease
- liver disease (NASH/NAFLD)
- alcohol effects
- biological aging
- exercise status
- metabolic disease
Favorite techniques:
- logistic regression (ol' faithful)
- random forest
- naive Bayes
- KKNN (nearest neighbor)
- survival analyses
- ensemble methods
I am a proponent of the open-source software, conducting the majority of my research/analysis via Linux toolkits, R, and the RStudio IDE.
I promote conforming to the adherence of so-called "tidy" data, a philosophy of data science designed to share underlying data structure, grammar, and format which facilitates the generation of reproducible analyses.

stufield / somascan.db Goto Github PK

somascan.db's Introduction

Welcome to my Homapage!

About Me

Skills

Additional Skills

Other Interests

More Details

🔧 Tools & Languages

🔧 GitHub Commits

📈 GitHub Stats

Contributions

🔗 Links & Resources

somascan.db's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent