Code Monkey home page Code Monkey logo

nlp-uoregon / okapi Goto Github PK

View Code? Open in Web Editor NEW
82.0 82.0 2.0 268.44 MB

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

License: Apache License 2.0

Python 93.96% Shell 6.04%
bloom chatbot dataset instruction-tuning language-model large-language-models llama multilingual natural-language-processing nlp question-answering reinforcement-learning reinforcement-learning-from-human-feedback rlhf

okapi's People

Contributors

anoperson avatar chiennv2000 avatar nlp-uoregon avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

okapi's Issues

Request for Inclusion of Missing English Language Datasets

Issue Description:

I've encountered an issue while downloading the English language data. Specifically, the dataset appears to include only the 52K English instructions, omitting the multilingual-ranking-data-42k and multilingual-rl-tuning-64k datasets in English. Interestingly, these datasets are available in other languages.

I would appreciate guidance on where to access the missing English datasets. Your assistance in resolving this matter is highly valued.

Thank you.

Request for English models

I wanted to check if you trained RLHF models for English as well. If so, could you share those models as well?

Instruction finetuning for Multilingual Tasks

Hi!
Thank you for your awesome work!
I had a few doubts:

  • I understand you have finetuned on all languages separately: https://huggingface.co/uonlp. I was curious if you had attempted to finetune for all languages simultaneously for improved multi-lingual understanding. And if you had a single model which would work across all the high, medium and low resource languages.
  • For the base LLM, you seem to have used BLOOM and LLama-7B on which you are applying an Instruction fine-tuning techniques (like Supervised fine-tuning (SFT)) on the 3 datasets - ARC, HellaSwag and MMLU. Did not use Llama-2-chat which is already fine-tuned via SFT for a bigger corpus of (instruction, input, output) pairs.

Please do let me know the same if there is a gap in my understanding! Thanks again

Vikrant Dewangan

Datasets not available

I tried using git lfs pull to fetch the dataset, but it seems to have been deleted. Can you possibly share it on another storage location like huggingface datasets?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.