Code Monkey home page Code Monkey logo

2024_siop_machine_learning_competition's Introduction

SIOP 2024 Machine Learning Competition

The repository holds competition data, winning solutions / code, and presentations.

Visit the competition portal to learn how the competition process works and view the leaderboard.

Competition Overview

This year, we decided to focus on large language models (LLMs). For more information on LLMs, please visit our LLM primer.

We chose to focus on LLMs because they have demonstrated impressive abilities in NLP (and NLU/NLG). By training on massive text datasets, LLMs can generate human-like text and excel at diverse linguistic tasks. However, thoughtfully harnessing the potential of LLMs for the field of I-O Psychology requires rigorous design and evaluation. This year's competition focused on developing best practices for applying LLMs to I-O tasks. Competitors were required to develop LLM workflows through techniques like prompt engineering, few-shot learning, and fine-tuning using standardized datasets relevant to I-O Psychology. The goal is to benchmark techniques that unlock LLMs' potential as aids for I-O Psychologists through careful design and experimentation. Our goal was to organize a competition that reveals the current abilities of LLMs to assist with workflows in I-O Psychology using public benchmark datasets. Participants report reproducible prompts, results, and analyses to advance best practices for thoughtfully eliciting the strengths of LLMs for professional applications.

Benchmark Datasets

  • Predicting Empathy: Job candidates were asked to provide empathetic responses to a difficult workplace situation. Your task is to classify whether empathy was demonstrated or not in each simulated response.
  • Generating Interview Responses: Job candidates responded to 5 common interview questions. You will be given the text of 4 question and response pairs. Your task is to generate a likely text response for the 5th question based on the previous responses.
  • Rating Item Clarity: Respondents rated the clarity of personality test items using a 7-point scale from 1 = extremely unclear to 7 = extremely clear. Your task is to predict the average clarity rating for each item based on the responses.
  • Identifying Fairness Perceptions: Respondents compared two organizational policies and voted on which was fairest. Your task is to identify which policy received the majority vote as the fairer option.

Winning Solutions

Please visit the competition slide deck for an overview of this year's competition and winners.

1st place: PAID Team

  • Zihao Jia
  • Mina Son
  • Philseok Lee

Final score = .666

PAID Team's solution

2nd place: Akben&AAron&Elon

  • Mustafa Akben
  • Aaron Satko

Final score = .652

Akben&AAron&Elon's solution

3rd place: Hungry Llama

  • Jennifer Gibson
  • Shane Halder
  • Blake Hoffman
  • Hannah Johnson
  • Joseph Nicolas Luchman
  • Nick McCann
  • Selena Tran

Final score = .643

Hungry Llama's solution

4th place: Wonderlic ML

  • Guglielmo Menchetti (Wonderlic)
  • Lea Cleary (Wonderlic)
  • Annie Brinza (Wonderlic)

Final score = .630

Wonderlic ML's solution

2024_siop_machine_learning_competition's People

Contributors

sebastianmarinc avatar izk8 avatar

Stargazers

Kendall Ruber avatar  avatar Karim Badr avatar Ashleigh Wilson avatar Ian Lee avatar  avatar Demetrius K. Green avatar Gian Zlupko  avatar Brian Costello avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.