Code Monkey home page Code Monkey logo

computational-use-of-data-agreement's Introduction

Computational Use of Data Agreement (C-UDA)

Goal

Sharing data can help address some of society’s biggest challenges and can help individuals and organizations be more innovative, efficient, and productive. The Computational Use of Data Agreement (C-UDA) is intended to define a specific data use scenario involving the use of data sets for AI training purposes, in a manner consistent with law. The C-UDA complements the Open Use of Data Agreement (O-UDA), an agreement intended for situations where data can be shared with minimal requirements.

Overview

The C-UDA is a simple agreement that allows the data holder to make data available to anyone for computational use purposes, such as artificial intelligence, machine learning, and text and data mining. In short:

  • It is intended for scenarios where the data distributor may desire, or is required, to restrict the use of data sets, including scenarios where the data set contains material not owned or controlled by the data distributor.
  • It addresses data that is assembled from lawfully accessed, publicly available sources to be used for computational analysis.
  • Redistribution of the Results from use of the data under the agreement—including results of analysis of the data or ML models trained with the data—carries no obligations.
  • Redistribution of data under the agreement—modified or unmodified—requires use of the C-UDA.
  • The redistribution obligations are designed to encourage sharing by limiting the liability of the data provider and ensuring that those downstream can identify where the data came from.

Contemplated use case

We envision that this agreement is suitable for situations where the original data provider owns or has lawfully acquired the material in the data set (because they have express permission to use the material), or where they have assembled materials from lawfully and publicly accessible sources and the data is appropriate for distribution for computational use purposes. Permission to redistribute this material is limited to computational analysis to remain compliant with legal precedent and statutory exceptions and to respect the legitimate interests of third party rights owners.

This agreement is not recommended where the data provider includes material in the data set that (i) was not lawfully accessed and is not appropriate for distribution for computational use purposes, (ii) is subject to a legally binding restriction that restricts its further distribution, or (iii) raises privacy concerns arising from its distribution. Data Providers may need to consider whether additional measures are appropriate to ensure that data is not made available for use beyond legally permissible computational uses.

While this agreement does not authorize uses beyond computational use, it does not restrict the use of any portions of a data set that are in the public domain or that can be used, modified, or distributed for any use permitted by any legal exception or limitation.

With this agreement, Microsoft is not giving legal advice. Please consider your own circumstances and seek your own legal counsel as needed.

The C-UDA does not meet the Open Data Definition

The C-UDA is not intended to be and should not be described as an open data license. Specifically, it does not permit use for any purpose as described in Section 2.1.1 and 2.1.8 of the Open Definition. The C-UDA is intended to address situations in which data cannot be shared under an open license, but it is possible for a data provider to permit computational use. For situations in which an open data license is appropriate, see the O-UDA and other open data licenses.

Why a "computation" only agreement?

We developed the C-UDA to address a gap among current public agreements. Data that is useful for computational analysis may often include copyrightable content, and global legal precedent and legislation have confirmed that copyrighted works may be used for computational use without express consent of the owner. However, continued perceived uncertainty over copyright law has caused many data providers to resort to limitations that significantly restrict who can use data, or how the data can be used, in ways that may be more restrictive than those permitted by applicable law or legislation. These restrictions may create uncertainly or cause confusion among users that greatly limits the usefulness and benefit of data sets containing copyrighted works in artificial intelligence activities, such as machine learning. The C-UDA does not restrict who can use such data, but it limits the use of data to computational analysis to be consistent with applicable law and legislation, and to respect the legitimate interest of rights holders.

Contributing

This project welcomes contributions and suggestions under CC0-1.0. To suggest edits, open a Pull Request or to start a discussion open an Issue. Or, if you prefer to submit comments via email, please submit them to [email protected]. If you wish your comments to remain anonymous, please submit them by email and say so in the first line of the email.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

For more information on Microsoft’s resources to Removing Barriers to Data Innovation, visit here.

Legal Notices

Microsoft and any contributors grant you a license to content in this repository under CC0-1.0, see the LICENSE file.

computational-use-of-data-agreement's People

Contributors

microsoft-github-policy-service[bot] avatar microsoftopensource avatar royaljust avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

computational-use-of-data-agreement's Issues

Icon?

Creative commons licenses have a great set of round icons which can be used to quickly signal the license of a work released into the public. These are included in icon packs such as FontAwesome.

Any chance you'll release a similar set of icons for C-UDA and O-UDA?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.