Code Monkey home page Code Monkey logo

ggcorset's Introduction

Greetings

I'm Kyla (she/her), a self-proclaimed datalark! My journey as a data analyst has taken me on numerous adventures across various disciplines. My background is in sociology, and for the past 4+ years I have worked at PBCAR, working on projects related to mental health and substance use research.

An avid R user, I find it rewarding to implement new statistical methods and data visualization techniques. I enjoy documenting both data processing and visualization techniques to help others gain new skills. I also use my R skills as a volunteer with CCODWG, starting first with building a dashboard to disseminate data to the public and now helping to automate the data collection process.

ggcorset's People

Contributors

kbelisar avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

ggcorset's Issues

custom colour for reference lines

This is a great data viz tool! Well done.

I have a suggestion related to the (currently grey) reference lines...

It would be nice to be able to give them a custom colour, or even remove them entirely.

geom_line(data = data.long2, mapping = aes(group = group), colour = "#B3B3B3",

It would probably just be a case of a new option set to the current #B3B3B3...

I can make a PR if you like?

[Statistical issue; proposal] Changing the SEs to CIs or SDs

Dear @kbelisar
Thank you for making this package. These plots are very eye-catching. I like it.

But let me, please, kindly ask you to consider changing the SEs to either SD (for description) or CIs (for inference). SEs itself has no direct application in statistical inference, as no distribution of the test statistic is employed.

SE, measuring the precision of sampling, itself is just a semi-product, used to calculate the confidence intervals depending on the used distribution. The SE itself says nothing about the comparison as long as we don't pick the right distribution - and this distribution doesn't have to be even symmetric, like chi2, F, non-central T.

As a consequence - for the same SE we may have (very) different CIs, depending on the used test statistic. This may result in different result of testing for the same SE.

Unfortunately to science, researchers:

  1. routinely naively look for overlap of SEs to compare groups, which leads to wrong conclusions. SEs are just too short, so the lack of overlap doesn't necessarily mean statistical significance. Actually, comparing group CIs is also wrong (wrong SD is used) - oppositely they are too long so overlapping doesn't necessarily mean lack of stat. significance. The only CI that makes sense for the inference on comparing groups is the CI of the difference or ratio.

  2. even worse - they often "abuse" SEs by making use of the fact, that SEs are shorter than CIs (SE is "expanded" to a CI through the quantile of the test statistic distribution ), which is used to "show up" the effect and cheat that it is "better visible".

Given the above arguments, would you be willing to change SE to CI?

In the simplest case - by assuming the t-distribution.
Ideally - flexibly: by allowing the user to specify the distribution of interest or BCa bootstrap, by returning the by default the BCa (bias corrected and accelerated) CIs, which, if failed, are calculated as percentile CIs.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.