Code Monkey home page Code Monkey logo

l2p4dst's Introduction

Prompt Pool based Class-Incremental Continual Learning for Dialog State Tracking

This is the official code for "Prompt Pool based Class-Incremental Continual Learning for Dialog State Tracking" (ASRU2023). This code is based on CPT4DST. The package requirements and the dataset processing is the same as CPT4DST.

Experiments

The following script will sequentially execute model training and evaluation.

python prompt_pool_tuning.py \
    --train data/sgd_train.json \
    --dev data/sgd_dev.json \
    --test data/sgd_test.json \
    --schema data/all_schema.json \
    --select_method bert-encoder \
    --epochs 3\
    --learning_rate 2e-5 \
    --batch_size 8 \
    --gpu_id 0 \
    --dataset_order 1 \ # The order of training tasks
    --top_n 10 \ # the number of selected prompts for each task
    --pool_size 150\ # the number of prompts in the prompt pool
    --penalty \ # whether to add penalty items
    --M 50 \ # the rehearsal buffer size
    --dis_method euclidean\ # the method of calculating distance
    --model_name t5-small \ # the backbone, can be chosen from 'google/mt5-small', 't5-small', 't5-base', 't5-large'
    --dataset sgd \ # the dataset, sgd or CM-Pickup (not released currently)
    --memory_type fix_size \ # the storage method for rehearsal buffer, fix_size (maintain a fixed buffer) or incre_size (store the same number of samples for each task)

l2p4dst's People

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.