Code Monkey home page Code Monkey logo

author-profiling-pan2023's Introduction

Leveraging Large Language Models with Multiple Loss Learners for Few-Shot Author Profiling

This repository contains the code and data for the paper "Leveraging Large Language Models with Multiple Loss Learners for Few-Shot Author Profiling" by Hamed Babaei Giglou, Mostafa Rahgouy, Jennifer D’Souza, Milad Molazadeh Oskuee, Hadi Bayrami Asl Tekanlou, and Cheryl D Seals. The paper was presented at the 14th International Conference of the CLEF Association (CLEF 2023).

The objective of author profiling (AP) is to study the characteristics of authors through the analysis of how language is exchanged among people. Studying these attributes sometimes is challenging due to the lack of annotated data. This indicates the significance of studying AP from a low-resource perspective. This year at AP@PAN 2023 the major interest raised in profiling cryptocurrency influencers with a few-shot learning technique to analyze the effectiveness of advanced approaches in dealing with new tasks from a low-resource perspective.

Architecture of Proposed Framework

Architecture of Proposed Framework

How to use this repository

Directories

  • dataset/: Contains the datasets used in the paper.
  • assets/: Contains the model checkpoints used in the paper.
  • visualization/: Contains the code for the visualizing experiments in the paper.
  • results/: Contains the results of the experiments.

Requirements

  • Python 3.9 or higher
  • PyTorch 1.9.x or higher
  • Transformers 4.3.x or higher

Usage

  1. Clone the repository:
https://github.com/HamedBabaei/author-profiling-pan2023
cd author-profiling-pan2023
  1. Install the required packages:
pip install -r requirements.txt
  1. Run the experiments:
    1. Inference fsl
    bash inference_fsl.sh 
    1. Inference fsl
    bash inference_fsl_biencoder.sh 
    1. Baseline (random)
    bash random_baseline.sh 
    1. Baseline (Zero Shot)
    bash zero_shot_baseline.sh
    1. Train & Test SBERT
    bash train_test_runner_sbert.sh
    1. Train & Test flanT5
    bash train_test_runner_flan_t5.sh 

Citation

If you use this code in your research, please cite the following paper:

@InProceedings{giglou:2023,
  author =                   {Hamed Babaei Giglou, Mostafa Rahgouy, Jennifer D’Souza, Milad Molazadeh Oskuee , Hadi Bayrami Asl Tekanlou and Cheryl D Seals},
  booktitle =                {{CLEF 2023 Labs and Workshops, Notebook Papers}},
  month =                    sep,
  publisher =                {CEUR-WS.org},
  title =                    {{Leveraging Large Language Models with Multiple Loss Learners for Few-Shot Author Profiling}},
  year =                     2023
}

author-profiling-pan2023's People

Contributors

hadibayrami avatar hamedbabaei avatar miladmolazadeh avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

Forkers

hadibayrami

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.