Code Monkey home page Code Monkey logo

aicert's Introduction


Logo

AICert

Website Blog

Making AI Traceable and Transparent

Table of Contents
  1. About the project
  2. Getting started
  3. Limitations
  4. Contact

πŸ”’ About The Project

πŸ› οΈ AICert aims to make AI traceable and transparent by enabling AI builders to create certificates with cryptographic proofs binding the weights to the training data and code. AI builders can be foundational model providers or companies that finetune the foundational models to their needs.

πŸ‘©β€πŸ’» End users are the final consumers of the AI builders’ models. They can then verify these AI certificates to have proof that the model they talk to comes from a specific training set and code, and therefore alleviates copyright, security and safety issues.


TPM We leverage Trusted Platform Modules (TPMs) in order to attest the whole stack used for producing the model, from the UEFI, all the way to the code and data, through the OS.

Measuring the software stack, training code and inputs and binding them to the final weights allows the derivation of certificates that contain irrefutable proof of model provenance.

βœ… Use cases

AICert addresses some of the most urgent concerns related to AI provenance. It allows AI builders to:

  • Prove their AI model was not trained on copyrighted, biased or non-consensual PII data

  • Provide an AI Bill of Material about the data and code used, which makes it harder to poison the model by injecting backdoors in the weights

  • Provide a strong audit trail with irrefutable proof for compliance and transparency

    ⚠️ WARNING: AICert is still under development. Do not use it in production! If you want to contribute to this project, do not hesitate to raise an issue.

πŸ” Features

  • AI model traceability: create AI model ID cards that provide cryptographic proof binding model weights to a specific training set and code
  • Non-forgeable proofs: leverage TPMs to ensure non-forgeable AI model ID cards
  • Flexible training: use your preferred tooling for training
  • No slowdown induced during training
  • Azure support

🎯 Coming soon

  • Benchmark linking: provide cryptographic binding of model weights to specific benchmarks that were run for this specific model
  • Multi-Cloud support with AWS and GCP coverage
  • Single and multi-GPU support

(back to top)

πŸš€ Getting started

(back to top)

⚠️ Limitations

While we provide traceability and ensure that a given set of weights comes from applying a specific training code on a specific dataset, there are still challenges to solve:

  • The training code and data have to be inspected. AICert does not audit the code or input data for threats, such as backdoors injected into a model by the code or poisonous data. It will simply allow us to prove model provenance. It is up to the AI community or end-user to inspect or prove the trustworthiness of the code and data.
  • AICert itself has to be inspected, all the way from the OS we choose to the HTTP server and the app we provide to run the code on the training data.

We are well aware that AICert is not a silver bullet, as to have a fully trustworthy process, it requires scrutiny of both our code and the code and data of the AI builder.

However, by combining both, we can have a solid foundation for the AI supply chain.

(back to top)

πŸ“‡ Contact

Contact us Twitter LinkedIn

(back to top)

aicert's People

Contributors

clauverjat avatar lyie28 avatar jofrost avatar pramit-d avatar

Forkers

pramit-d

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.