Missing weighted_loss()

Paper

Distribution-Free, Risk-Controlling Prediction Sets

@article{bates-rcps,
  title={Distribution-Free, Risk-Controlling Prediction Sets},
  author={Bates, Stephen and Angelopoulos, Anastasios N and Lei, Lihua and Malik, Jitendra and Jordan, Michael I},
  journal={arXiv preprint arXiv:2101.02703},
  year={2020}
}

Basic Overview

For general information about RCPS, you can check our blog post. This GitHub contains the code we used for the experiments in the RCPS paper. Each experiment lives in a different, appropriately named folder. The directory core contains code common to all of our experiments, including the implementations of concentration bounds and choice of lambda hat. The repository is still a work in progress; we will be continually updating the code to make it more user-friendly and remove clutter from our development. If you have trouble reproducing our results, please email [email protected].

Getting Started

We store some large files in our git repo via git-lfs; you may need to install and configure it from here. After installing git-lfs, you can clone this repository. Then, you can create the rcps conda environment by running the following line:

conda create --name rcps --file ./requirements.txt

Each experiment requires different datasets. For the ./imagenet and ./hierarchical_imagenet experiments, you will need to point the scripts towards the val directory of your local copy of the Imagenet dataset. Similarly, for ./coco, you need to point the scripts towards your local copy of the 2017 version of MS COCO, available here. For the ./polyp and ./protein examples, a bit more work must be done.

Polyp data

We used data from five different datasets: HyperKvasir-SEG, CVC-ClinicDB, Kvasir-SEG, CVC-ColonDB, and ETIS-LaribPolypDB. Download each of these datasets and unzip them into the folder ./polyps/PraNet/data/TestDataset/{datasetname}. Then run the script ./polyps/PraNet/process_all_data.py, which should store the outputs of the tumor prediction model in the proper directory so you can run our experiments.

Protein data

For the AlphaFoldv1 experiments in ./proteins, you can point the scripts to the alphafold CASP-13 test set, available here.

License

MIT License

	def WSR_mu_plus(x, delta, maxiters): # this one is different.
	n = x.shape[0]
	muhat = (np.cumsum(x) + 0.5) / (1 + np.array(range(1,n+1)))
	sigma2hat = (np.cumsum((x - muhat)**2) + 0.25) / (1 + np.array(range(1,n+1)))
	sigma2hat[1:] = sigma2hat[:-1]
	sigma2hat[0] = 0.25
	nu = np.minimum(np.sqrt(2 * np.log( 1 / delta ) / n / sigma2hat), 1)
	def _Kn(mu):
	return np.max(np.cumsum(np.log(1 - nu * (x - mu)))) + np.log(delta)
	if _Kn(1) < 0:
	return 1
	return brentq(_Kn, 1e-10, 1-1e-10, maxiter=maxiters)

aangelopoulos / rcps Goto Github PK

rcps's Introduction

Paper

Basic Overview

Getting Started

Polyp data

Protein data

License

rcps's People

Contributors

Stargazers

Watchers

Forkers

rcps's Issues

Missing weighted_loss()

Applications to financial industry or NLP

Some questions about the implementation of the Waudby-Smith–Ramdas (WSR) bound

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent