Code Monkey home page Code Monkey logo

dsc-data-ethics-ab-testing's Introduction

Data Ethics and AB Testing

Introduction

When using A/B testing as a method to optimize websites, ad campaigns and more, you may tread potentially dangerous waters when it comes to ethics. Often the purpose for A/B testing is to maximize revenue or optimize some other metric, but it's also important to consider user impact when building A/B tests.

Objectives

You will be able to:

  • Describe the ethical considerations of low-stakes and high-stakes A/B testing
  • List the three ethical areas of application in the Belmont Report

Ethical Concerns with a Classic Example

graphic of phones showing A and B

Image credit American Marketing Association

Some A/B tests are low-stakes, like the classic example of the color of a button. One might wonder how there could possibly be ethical concerns about this kind of test! However there are two relevant ethical considerations: usability and consent.

Usability

Often in the tech world, user interface design choices are determined by aesthetic preferences or trends, and these choices may or may not result in the most usable products. Companies with more resources might have in-house or contracted usability testing teams, but this will always be limited in the number of participants they are able to recruit prior to launching the product.

A/B testing can be ethically beneficial because it might flag usability issues that were not apparent during the initial testing phase. And concepts like statistical power can help teams to determine how many users should be in each test group.

Informed Consent

Human Subjects Research

Traditionally when you experimentally manipulate conditions in order to understand their impacts on people, that is considered to be human subjects research. Many research institutions have institutional review boards (IRBs) specifically dedicated to ensuring that ethical practices are followed when performing human subjects research, particularly medical research.

The Belmont Report

The ethical standards applied by IRBs to human subjects research are rooted in the Belmont Report of 1976. This report was produced by the Belmont Commission in the wake of the Tuskegee Syphilis Study, a notoriously unethical study that recruited African-American men with syphilis and withheld modern treatment in order to study the effects of the untreated disease.

The Belmont Report lists three practices that are necessary for ethical human subjects research, the first of which is informed consent. Informed consent means that three elements must be present:

  1. Research subjects (now more commonly referred to as participants) must be given information about the research procedure
  2. They must comprehend the information that they are given
  3. Their participation in the study must be voluntary

Because these standards were developed for medical research, they may seem excessive for an A/B testing context. In general following the recommendations in the Belmont Report is only mandatory for researchers working in academic or government institutions, not for data professionals working in industry.

But even in a low-stakes A/B test like changing the color of a button, consider how you might be able to get informed consent, and what negative consequences might arise from not doing so. You may be legally covered by mentioning A/B testing in a "Terms and Conditions" page, but newer European regulations are requiring explicit consent, and this is increasingly seen as an ethical norm.

Higher-Stakes Ethical Concerns

Perhaps the most high-profile A/B test of the 21st Century so far is the Facebook "emotional contagion" study published in 2014, formally titled Experimental evidence of massive-scale emotional contagion through social networks. This study used Facebook's A/B testing tools as well as natural language processing (NLP) techniques to manipulate the news feeds of over 600,000 Facebook users. Some users had posts with "negative emotional content filtered out, while others had posts with "positive emotional content" filtered out, and then researchers measured the impact of these changes on the "emotional content" of those users' future Facebook posts.

There was broad backlash to this study, in part because there had been no informed consent process (not even in the ToS). But possibly what angered the public the most was Facebook not following the two other ethical areas of application from the Belmont Report: assessment of risks and benefits, and ethical selection of subjects.

Assessment of Risks and Benefits

While the Belmont Report does not state that human subjects research must be risk-free, it does say that there should be a "systematic assessment of risks and benefits". Those benefits include "anticipated benefit to society in the form of knowledge to be gained from the research" while risks include "risks of psychological harm, physical harm, legal harm, social harm and economic harm".

The stated benefit of the "emotional contagion" study was to learn whether the emotional contagion observed in in-person studies also occurred in large online social networks. The main risk pointed out by critics was that Facebook was potentially causing depression in its users. Activists and politicians argued in response that this risk was not worth the stated benefit.

An alternative to A/B testing in a sensitive context like this could be a data mining approach, as in this paper, which collected and analyzed already-existing data from an online depression community rather than performing an experiment that manipulated people's online experiences. This may not meet the "gold standard" of a randomized controlled trial in terms of statistical strength but may provide comparable insights with much lower risk to users.

Ethical Selection of Subjects

The final application area listed in the Belmont Report is the ethical selection of subjects. Subjects must be ethically selected on a social and individual level, and researchers should consider distributive justice in choosing subjects due to "social, racial, sexual and cultural biases institutionalized in society". It also emphasizes that vulnerable subjects should receive additional protections "because they are easy to manipulate as a result of their illness or socioeconomic condition".

The "emotional contagion" study concerned activists because participants appeared to have been chosen randomly, and potentially-vulnerable people experiencing depression or under the age of 18 were apparently not excluded from the study.

In response to concerns about the study, Facebook CTO Mike Schroepfer posted this update to Facebook's research practices. It acknowledged that this particular study could have been done non-experimentally, and also established a new framework for research ethics:

Guidelines: we’ve given researchers clearer guidelines. If proposed work is focused on studying particular groups or populations (such as people of a certain age) or if it relates to content that may be considered deeply personal (such as emotions) it will go through an enhanced review process before research can begin. The guidelines also require further review if the work involves a collaboration with someone in the academic community.

Note how the examples of "particular groups or populations" corresponds to the ethical selection of subjects application area, and "content that may be considered deeply personal" corresponds to the assessment of risks and benefits application area. Facebook also established "a panel including our most senior subject-area researchers, along with people from our engineering, research, legal, privacy and policy teams, that will review projects falling within these guidelines" that resembles an IRB.

Even though following the Belmont Report is not mandatory for a company like Facebook, its basic principles have proven relevant in the 21st Century.

Additional Resources

Summary

Ethical concerns can arise from performing A/B tests as well as from failing to perform them. A/B tests can help to reveal usability issues that otherwise might have gone unnoticed. However they also can have negative ethical consequences if the Belmont Report application areas of informed consent, assessment of risks and benefits, and ethical selection of subjects are not applied. These negative consequences can be more severe for high-stakes A/B tests but also apply to low-stakes tests like changing the color of a button.

dsc-data-ethics-ab-testing's People

Contributors

hoffm386 avatar jessepisel avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.