Problem the choice of y target in the adversarial model

Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods

This is the code for our paper, "Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods."

Read the paper.

Getting started

Setup virtual environment and install requirements:

conda create -n fooling_limeshap python=3.7
source activate fooling_limeshap
pip install -r requirements.txt

You should be able to run the code now!

We provide a short walk through on COMPAS in COMPAS_Example.ipynb. This is a nice place to get started to see how our method works. Applications of the attack on each data set can be found in compas_experiment.py, cc_experiment.py, and german_experiment.py.

References

Please consider citing our paper if you found this work useful!

@inproceedings{advlime:aies20,
  author = {Dylan Slack and Sophie Hilgard and Emily Jia and Sameer Singh and Himabindu Lakkaraju},
  title = {Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods},
  booktitle = {AAAI/ACM Conference on AI, Ethics, and Society (AIES)},
  year = {2020}
}

Contact

This code was developed by Dylan Slack, Sophie Hilgard, and Emily Jia. Reach out to us with any questions!

Our emails are: [email protected], [email protected], and [email protected].

	for _ in range(perturbation_multiplier):
	perturbed_xtrain = np.random.normal(0,self.perturbation_std,size=X.shape)
	p_train_x = np.vstack((X, X + perturbed_xtrain))
	p_train_y = np.concatenate((np.ones(X.shape[0]), np.zeros(X.shape[0])))

	all_x.append(p_train_x)
	all_y.append(p_train_y)

dylan-slack / fooling-lime-shap Goto Github PK

fooling-lime-shap's Introduction

Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods

Getting started

References

Contact

fooling-lime-shap's People

Contributors

Stargazers

Watchers

Forkers

fooling-lime-shap's Issues

Problem the choice of y target in the adversarial model

Strong dependence on using kmeans background samples for SHAP

Using different estimator

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent