
AIFU

Demo

AI flub ups

AIFU aims to be a platform for researchers and anyone looking to learn about the repercussions of using AI blatantly, too soon, indiscriminately, or harmfully. It's an experiment in meta-analysis. What makes it interesting is that all of the information is "generated" and "retrieved" using language model(s). Currently, the design of the platform leans towards using causal language models as judge, jury, and executioner, if you will.

How to use the platform?

We intend for you to submit links to articles that you think document or report cases where harm may have been caused because of AI. Your submission will be recorded publicly in our GitHub repository as an issue, and the processing pipeline will start. You may have to wait from a few seconds to a few minutes for the workflow to finish; a short response with the outcome of the workflow run will be posted as a reply to the issue. For example -> #84

Steps -

  1. Go to the country the article/news concerns. If it does not pertain to any particular country, submit it under Global.
  2. Enter the headline, URL, media organization, and any other details you can discern (see the examples when you navigate to a country).
  3. Press Submit to start the workflow.

Research Question

AI is going to pervade software systems just as software systems have pervaded the social fabric. This work is timely because we have already seen how AI systems can cause widespread harm.

“Had a great conversation with Yoshua Bengio. Both of us agreed that a good step forward for AI risk is to articulate the concrete scenarios where AI can lead to significant harm.” - Andrew Ng

Through this platform, my aim is to monitor the utilization of AI by governmental entities. By conducting thorough analysis of the collected data, I want to find out whether we can provide "concrete course(s) of action" or policy recommendations to lawmakers, the general public, and other stakeholders regarding their engagement with AI-based services. Crucially, this initiative will enable us to gain insights into the preferences and purposes of government agencies in employing AI. By doing so, we can proactively address potential risks associated with AI-based implementations in specific domains such as child welfare and recidivism, while promoting fair and equitable utilization of AI.

Outline

The objective of this project is to establish a comprehensive archive or repository that serves as a resource for policymakers, researchers, students, and other stakeholders seeking to acquire knowledge regarding the potential adverse consequences arising from the indiscriminate, premature, or injudicious use of algorithms and AI.


Motivation and Necessity: Knowing about harmful events is crucial for several reasons -

  • By identifying the underlying factors that led to a harmful event, steps can be taken to prevent similar incidents from occurring in the future.
  • It is important for developing a deeper understanding of the risks and downsides associated with a particular technology, process, or behavior. Armed with this knowledge, we can take proactive steps to mitigate risks and prevent harm, improving the safety and wellbeing of ourselves and those around us.
  • Knowledge of the harmful consequences of AI can help drive the development of better, more responsible AI technology. By understanding the ways in which AI can be harmful, researchers and developers can work to address these issues and build AI systems that are safer, more transparent, and more trustworthy.
  • It is imperative for policymakers to be aware of the potential harmful consequences of using AI. Policymakers have a responsibility to regulate the use of AI technology to ensure that it is safe, ethical, and serves the public interest. To do this effectively, they need a comprehensive understanding of the risks and downsides associated with AI.
  • The general public needs to be informed about the potential harmful impacts of AI so that they can make informed decisions about how they interact with the technology. By being aware of the potential risks, individuals can take steps to protect their privacy, security, and well-being when using AI-powered tools and services.
  • To a certain extent, it also addresses issues of public health and therefore national security.

What makes this project interesting is that I plan to archive, extract, and collate relevant information from the articles using AI itself. Therefore this platform and project is an exercise in -

  • Collaborating with AI to examine the misuses of and harms from the tool in a meta-analysis
  • Working together with AI to explore its possible negative impacts in a self-reflective manner
  • Leveraging AI to gain insights into the potential risks and downsides of AI technology
  • Using AI to inform our understanding of the possible negative impacts and consequences associated with AI

In short, the causal language model is judge, jury, and executioner.

Related literature

Most of the existing databases and projects in this space are more about a general (deeper) understanding of AI, accountability, fairness, and transparency, where real-life examples are just case studies. They are very broad in their reach, which I believe dilutes their effectiveness. Some of the important resources I found in my literature review are -

The AI Index by The Stanford Institute for Human-Centered Artificial Intelligence

They do not gather information about the harms caused by AI; rather, they track the development, growth, trends, progress, and market of AI in various countries. They also track global AI legislation. Quoting them -

The AI Index Report tracks, collates, distills, and visualizes data related to artificial intelligence. Its mission is to provide unbiased, rigorously vetted, and globally sourced data for policymakers, researchers, executives, journalists, and the general public to develop intuitions about the complex field of AI.

Reports are available from 2017 onwards, with a gap in 2020.

The AI Now Institute

They focus on accountability and on policy recommendations. Their aim is not to create any database of harms. However, studying their reports and recommendations will be instrumental for the last step of this project, which is policy recommendations. Quoting them -

Founded in 2017, the AI Now Institute produces diagnosis and policy research to address the concentration of power in the tech industry. We develop policy strategy to redirect away from the current trajectory: unbridled commercial surveillance, consolidation of power in very few companies, and a lack of public accountability.

Data Harm Record by Data Justice Lab

This is not a database but a running document that has not been updated since August 2020. Its definition of harm is closer to the kind of cases I wish to have in my database, but the cases it has studied range from generic harms to specific ones. Since it is no longer maintained, my plan is to feed all the articles it references into the system I have designed. The idea is to integrate all those cases and to observe why any particular case gets rejected.

AI Incident Database

This is the closest database to what I am building. Quoting them -

The AI Incident Database is dedicated to indexing the collective history of harms or near harms realized in the real world by the deployment of artificial intelligence systems. Like similar databases in aviation and computer security, the AI Incident Database aims to learn from experience so we can prevent or mitigate bad outcomes.

They, however, collect a rather broad spectrum of cases - incidents where intelligent systems have caused safety, fairness, or other real-world problems. It has also changed a lot since the time I started working. A while ago you could only browse and read the content of a submitted incident's article(s). Since then they have added a multitude of features such as Named Entities, CSET Taxonomy Classifications, GMF Taxonomy Classifications, a timeline of reports, and a list of similar reports/variants. However, not all incidents have all this information associated with them, and it is not apparent on what basis incidents are chosen to be annotated with all the relevant information.

To the best of my knowledge, nothing like AIFU has been done before.

Data

How does the corpus help answer the research question? By citing possible solutions based on implementations in other countries, the latest developments in AI that successfully mitigate the shortcomings of previous scenarios/models, and the guardrails and regulations placed by other countries. With all the information and analysis, I want to find out if, in the end, we can "recommend" some "concrete" course(s) of action or policy decisions to lawmakers, the general public, and other stakeholders.

Approach

Once a user submits an article by providing its URI and headline, we first try to figure out whether the article is at all related to what we are looking for by analyzing the headline itself. The idea here is to reject irrelevant articles rather than accept relevant ones, because acceptance happens in the next step. However, it is important to understand that subsequent articles about an event may be ignored if the workflow depends too heavily on the headline itself. To mitigate that, we also use the content of the article, and we provide a method to "update" a case by submitting subsequent articles for that case.

After that, we extract the contents of the article from the provided URI. Then, using prompt engineering, we try to extract all the relevant content by asking questions. Finally, we also summarize the article. The outputs are then serialized to the database.
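
As a rough illustration, a minimal sketch of this extraction step might look like the following. The helper names, prompt text, model choice, and placeholder URI are assumptions for illustration, not the project's actual code; it assumes the requests, beautifulsoup4, and openai (v1 client) packages.

```python
import requests
from bs4 import BeautifulSoup
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def extract_article_text(uri: str) -> str:
    """Fetch the article and strip it down to plain paragraph text."""
    html = requests.get(uri, timeout=30).text
    soup = BeautifulSoup(html, "html.parser")
    return "\n".join(p.get_text(" ", strip=True) for p in soup.find_all("p"))

def ask(content: str, question: str) -> str:
    """Prompt the model with the article content and one extraction question."""
    response = client.chat.completions.create(
        model="gpt-4",  # hypothetical model choice
        messages=[
            {"role": "system", "content": "Answer strictly from the article text."},
            {"role": "user", "content": f"Article:\n{content}\n\nQuestion: {question}"},
        ],
    )
    return response.choices[0].message.content

content = extract_article_text("https://example.com/some-article")  # placeholder URI
summary = ask(content, "Summarize this article in three sentences.")
harm = ask(content, "Who was harmed by the AI system, and how?")
```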

Before actually storing the contents in the database, we also query the database for closely similar articles (maybe the same article was stored before, or maybe a different outlet reported on the same case; both are highly likely). If such an article is found, we try to merge the information extracted now with whatever we extracted earlier and update the database record. If the article and case are completely new, we simply create a new record in the database. However, near-duplicate detection, or identifying similar articles by different media, is a research field in itself. Currently we have observed that using the Universal Sentence Encoder as a baseline works for us. However, as the workflow becomes more complicated, streamlined, and accurate, we will certainly have to revisit this part of the automatic detection of duplicates and similar articles.
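
To make the baseline concrete, here is a minimal sketch of Universal Sentence Encoder-based duplicate detection, assuming tensorflow_hub and numpy are available. The similarity threshold is an illustrative assumption, not a tuned parameter from the project.

```python
import numpy as np
import tensorflow_hub as hub

# Load the Universal Sentence Encoder from TF Hub (downloads on first use).
use = hub.load("https://tfhub.dev/google/universal-sentence-encoder/4")

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def is_near_duplicate(new_text: str, stored_texts: list[str], threshold: float = 0.8) -> bool:
    """Embed the full article texts and compare with cosine similarity.

    threshold=0.8 is a hypothetical cutoff chosen for illustration.
    """
    embeddings = use([new_text] + stored_texts).numpy()
    new_vec, stored_vecs = embeddings[0], embeddings[1:]
    return any(cosine(new_vec, v) >= threshold for v in stored_vecs)
```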

API Endpoints

Endpoint     Request type   Comment
/records     GET            Provides the details of cases per country to the frontend.
/submit      POST           Creates an automatic GitHub issue/discussion whenever a user submits an article; also triggers the information extraction endpoint in the background (steps listed in Approach).
/heartbeat   GET            Used to test whether the server is alive.
/export      GET            Provides a JSON file of content exported from the database collection for a given country.

More endpoints may be added later on based on the needs of the project.
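
For illustration only, these endpoints could be served by a small Flask app along the following lines; the actual backend framework and handler logic are redacted from this repository, so everything below is an assumption.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

@app.get("/heartbeat")
def heartbeat():
    # Liveness probe: returns 200 if the server is up.
    return jsonify(status="alive")

@app.get("/records")
def records():
    country = request.args.get("country", "global")
    # Hypothetical lookup; the real app would query the database for the country's cases.
    return jsonify(country=country, cases=[])

@app.post("/submit")
def submit():
    payload = request.get_json(force=True)
    # The real handler files a GitHub issue and kicks off the extraction pipeline.
    return jsonify(received=payload.get("url")), 202
```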

Policy Recommendations

Literature review and starting point.

(Near) Duplicate Detection

This is a whole research area in itself. After trying a few approaches, I have settled on the Universal Sentence Encoder (USE) as a baseline. This part of the pipeline will be made more robust over time.

Methods that did not yield good results: TF-IDF and Jaccard distance (see the Jupyter notebook for the experiments). My first attempt used TF-IDF; to build the vocabulary and TF-IDF weights I used the latest news dataset from N24News, released as part of the research "N24News: A New Dataset for Multimodal News Classification".

Method deployed: Universal Sentence Encoder. USE can work on full text; the embeddings just get diluted.
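
For reference, the TF-IDF baseline that did not work well could be sketched roughly as follows with scikit-learn; the fitting corpus shown is a hypothetical stand-in for the N24News articles, and this is documented only as the comparison point, not the deployed method.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical corpus standing in for the N24News articles used to fit the weights.
corpus = ["news article one ...", "news article two ..."]

vectorizer = TfidfVectorizer(stop_words="english")
vectorizer.fit(corpus)

def tfidf_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity between the TF-IDF vectors of two articles."""
    vectors = vectorizer.transform([text_a, text_b])
    return float(cosine_similarity(vectors[0], vectors[1])[0, 0])
```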

Methods to explore for future work -

  1. Google has published research on deduplication for huge datasets - https://github.com/google-research/deduplicate-text-datasets
  2. Try USE variants based on other architectures - https://tfhub.dev/s?q=google%2Funiversal-sentence-encoder%2F4%20OR%20google%2Funiversal-sentence-encoder-large%2F5
  3. Try a multi-modal approach that combines topic-modelling similarity with the journalistic five W and one H (5W1H) questions from news articles: who did what, when, where, why, and how?

Definition of Success for the project

  • Data: the volume of new cases plateaus over time
  • Views: increase at first, then decrease and plateau
  • Information about removing bias and increasing fairness becomes commonplace and part of the development lifecycle itself
  • Ethicists become part of development teams

Technical Details and Workflow

Git/GitHub Branches

  • gatsby - the main branch for the UI. This is where the development code for the frontend resides. The frontend is built with Gatsby, React, and MUI.
  • gh-pages - built automatically by the GitHub Action that is triggered by a push to the gatsby branch. This branch holds the deployment code for the frontend.
  • backend - this branch contains the code for the backend server that runs on Azure. It also has Jupyter notebooks with experiments on NLP tasks. Note that the backend code is redacted and not all files are present, since some files contain private credentials for deploying to Azure or otherwise.

BACKEND SERVER

stateDiagram-v2
      
    Maintainer --> CommitBackend: update backend
    CommitBackend --> LoginToAzureContainerRegistry
    LoginToAzureContainerRegistry --> BuildAndTagDockerImage
    BuildAndTagDockerImage --> PushImageToRegistry
    PushImageToRegistry --> AzureContainerGroupDeploy: az container create --resource-group <"resource group name"> --file <"deploymentfile.yml">

FRONTEND UI

stateDiagram-v2

    Maintainer --> CommitFrontEnd: update frontend
    CommitFrontEnd --> PushToGitHub
    PushToGitHub --> RunGitHubAction : GitHub PAT, Secret Key
    RunGitHubAction --> Deployment
    state Deployment {
        UpdateDeploymentCodeOnBranch --> DeployToGHPages: build and deploy (yaml) |ubuntu, node, checkout, build, commit, push
    }

User Submission

flowchart TB
    SubmitArticle --> A[Detect Headline Language] --> |Language Supported| D
    A --> |Language Not Supported| C[Make GH Issue] --> Stop
    D[Classify Headline] --> |Accepted|E[Detect Content Language]
    D --> |Rejected|C 
    E --> |Language Not Supported| C
    E --> |Language Supported| F
    F[Make GH Issue] --> J[Information Retrieval Pipeline]
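
A minimal sketch of this submission gate might look like the following, assuming the langdetect package for language detection and the GitHub REST API via requests. The classify_headline helper, supported-language set, and issue messages are illustrative assumptions; in the real pipeline the relevance check is an LLM call.

```python
import requests
from langdetect import detect

SUPPORTED = {"en"}   # hypothetical set of supported languages
REPO = "armsp/aifu"  # repository that receives submission issues

def classify_headline(headline: str) -> bool:
    """Hypothetical stand-in for the LLM call that accepts or rejects a headline."""
    return "ai" in headline.lower()

def make_issue(title: str, body: str, token: str) -> int:
    """Record the submission as a GitHub issue; returns the new issue number."""
    response = requests.post(
        f"https://api.github.com/repos/{REPO}/issues",
        headers={"Authorization": f"Bearer {token}"},
        json={"title": title, "body": body},
        timeout=30,
    )
    response.raise_for_status()
    return response.json()["number"]

def handle_submission(headline: str, url: str, token: str) -> None:
    if detect(headline) not in SUPPORTED:
        make_issue(headline, f"Rejected: unsupported language. {url}", token)
    elif not classify_headline(headline):
        make_issue(headline, f"Rejected: headline looks unrelated. {url}", token)
    else:
        make_issue(headline, f"Accepted: starting extraction. {url}", token)
```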

Information Extraction Pipeline

flowchart TB
    A[Submit Link and Headline] --> B{"Does the headline suggest<br> it might be related to AI<br> and its potential harm?"}
    B --> |Yes| C[Extract text from article link]
    B --> |No| E[End]
    C --> D[Assess from content if it concerns actual harm caused to humans]
    D --> |No harm to humans| F[Reject, save in a different collection]
    D --> |Harm caused to humans| G[Extract Information based on provided Prompts]
    G --> H[Check if exactly similar article exists in the database]
    H --> |No| I
    H --> |Yes| J[Merge context and update extracted information]
    J --> I[Update information in Database] 
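
As a sketch of the final persistence step, the merge-or-insert logic could look like this with pymongo (the project uses MongoDB, per the changes log below); the connection string, collection names, field names, and merge policy are assumptions for illustration.

```python
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")  # placeholder connection string
cases = client["aifu"]["cases"]  # hypothetical database/collection names

def save_case(case_id: str, extracted: dict) -> None:
    """Merge newly extracted fields into an existing case, or create a new record."""
    existing = cases.find_one({"case_id": case_id})
    if existing:
        # Keep earlier answers; only fill in fields we did not have before.
        merged = {k: v for k, v in extracted.items() if not existing.get(k)}
        cases.update_one({"case_id": case_id}, {"$set": merged})
    else:
        cases.insert_one({"case_id": case_id, **extracted})
```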

Major Changes Log

  • Frontend tech stack - React, Gatsby, vega-lite, MUI v5
  • Cases are shown as tables whose rows can be expanded. This is the current approach until we face performance issues.
  • Set up a MongoDB database on Azure
  • Let GPT do everything (via LangChain): search the internet, summarize, answer questions, extract information, and decide on its own what goes in the DB and what does not; we don't want to link to paywalled articles
  • Refactored the server code for easier and quicker deployments and development in the production and development environments

Features to come

  • Analysis of text to extract the cause and the effect, plus a short summary
  • Graph representations of relations
  • Take snapshots of articles using GitHub Actions, because articles may expire or move (the latest Selenium for Chrome has a built-in option to do this)
  • Extracting more technical information from the links - we either do this using ChatGPT "API calls" or make our own Q&A model for:
      • Who developed the model/AI system?
      • How and where was it deployed?
      • What org/group of people used it, and how?
      • Other technical details
      • Its subsequent effects
      • Did the state/party accept their error?
      • Were there compensations made? Apologies?
      • Did they address the issue in a more tangible way through any legislation/bills and (or) regulation?
      • Were there any regulations/bills & legislation already in place that still failed to prevent this?
      • Did something similar happen again? Why?
      • Are there any legislations in the pipeline?
      • Were there any legislations that got blocked? Why? What were the arguments? Who blocked it?
  • Provide an API or the ability to download the whole dataset we are curating so that others could use it for other purposes
  • Export data
  • Entity extraction for an article
  • File and document upload for analysis, using FilePond? or SecureDrop?
  • ^ Use Azure Blob Storage and Azure Functions to upload small image, PDF, and txt files; exclude executables (see the sketch after this list)
  • Switch to a vector database? Perhaps it's not necessary right now since we don't have a lot of data
  • Use a bot instead of my own username when making GitHub API calls that reply to and create GitHub issues
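
For the upload item above, a minimal sketch with the azure-storage-blob package might look like this; the container name, size limit, and allowed extensions are assumptions for illustration.

```python
from pathlib import Path
from azure.storage.blob import BlobServiceClient

ALLOWED = {".png", ".jpg", ".pdf", ".txt"}  # hypothetical allow-list; executables excluded
MAX_BYTES = 5 * 1024 * 1024  # hypothetical 5 MB cap for "small" files

service = BlobServiceClient.from_connection_string("<connection string>")
container = service.get_container_client("submissions")  # hypothetical container name

def upload_submission(path: str) -> None:
    """Validate a small file and upload it to blob storage."""
    file = Path(path)
    if file.suffix.lower() not in ALLOWED:
        raise ValueError(f"{file.suffix} files are not accepted")
    if file.stat().st_size > MAX_BYTES:
        raise ValueError("file too large")
    with file.open("rb") as fh:
        container.upload_blob(name=file.name, data=fh, overwrite=True)
```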

Future Development Activities

  • Link https://aifu.shantam.io/about to README.md
  • Form Submission: add optional entries for [GitHub, Twitter, Other(scholarly sites)] usernames
  • CSS for countries
  • Fix Home page for number of articles
  • Finish footer
  • Grey/Disable countries for which there is no information
  • Make an automatic Last Updated tag
  • How can users request snapshots if the article link is down?
  • Caching of DB requests for countries as well as the whole dataset
  • Testimonial page: if you found this useful, if it helped you in your work, or if you used the information here to do something, then I would love to hear from you
  • Try https://github.com/fhamborg/Giveme5W1H
  • Change MUI CSS to Primer CSS

Decision Log and Tradeoffs

  • Using AI for meta-analysis. Using AI for everything!

  • The whole platform has AI generated data and information.

  • You input a link. The LLM decides whether the link is relevant or not. If it is relevant, it extracts information (answers to the questions above) from the text. It can ask for more info or gather more info from other articles about the same issue, and it can identify an article that talks about the same issue and update the existing information instead of adding a new record. The idea is not to provide exact text from the articles but "generated text" (asking the LM to write a "report", because we don't want to provide links to paywalled websites). The agent itself updates the database.

  • I don't expect submissions with content larger than 5,000 words. Therefore we don't need to split the content, make embeddings, find similar splits, and then do Q&A or information retrieval using prompts. The whole content can be embedded for detecting duplicates or articles that cover the same case but were written by different media houses.

  • Since the content fits the context of the LLM we are using, we can create and save the embedding of the whole content in a regular document database instead of using a specialized vector database, which would increase the complexity of the deployment infrastructure and the information-retrieval pipeline.

  • I think using LangChain made things unnecessarily complicated. I could have just used OpenAI's Python library along with Microsoft's guidance library for making LLM outputs stable.

  • Similarity, near-duplicate, and duplicate detection is a whole research area in itself. This needs to be tackled properly and given more time to implement well.

  • (Harm) scoring does not work because we do not have a reference. There is high noise because GPT-4 can't do direct scoring (assign a score to an input or response).

Clock - 95 hrs


