This is the official repository for the paper "From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty." Our research investigates undesirable behaviors in large language models (LLMs), such as hallucinations and sequence repetitions, identifying them as fallback mechanisms under uncertainty. We found that as models become more advanced, they shift towards more complex fallback behaviors.
For details about our experiments and findings using Llama 2 and 3, OLMo, and Pythia models across various settings, please see our paper.
🚨 All details about the code, datasets, and instructions for reproducing our experiments will be added soon.
We will upload the setup instructions soon.
The data for the different experiments is available in the data directory; it will also be uploaded to the HuggingFace Hub soon.
We will provide instructions on how to generate and analyze outputs from the models soon.
We will provide instructions on how to create the plots soon.
If you find our work useful, please cite our paper as follows:
@article{ivgi2024fallbacks,
title={From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty},
author={Maor Ivgi and Ori Yoran and Jonathan Berant and Mor Geva},
year={2024},
journal={arXiv preprint arXiv:2407.06071},
url={https://arxiv.org/abs/2407.06071},
}