nlp-uoregon / okapi Goto Github PK

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

License: Apache License 2.0

Python 93.96% Shell 6.04%

bloom chatbot dataset instruction-tuning language-model large-language-models llama multilingual natural-language-processing nlp question-answering reinforcement-learning reinforcement-learning-from-human-feedback rlhf

okapi's People

Contributors

Stargazers

Watchers

Forkers

wannaphong vhientran

okapi's Issues

Request for Inclusion of Missing English Language Datasets

Issue Description:

I've encountered an issue while downloading the English language data. Specifically, the dataset appears to include only the 52K English instructions, omitting the multilingual-ranking-data-42k and multilingual-rl-tuning-64k datasets in English. Interestingly, these datasets are available in other languages.

I would appreciate guidance on where to access the missing English datasets. Your assistance in resolving this matter is highly valued.

Thank you.

Request for English models

I wanted to check if you trained RLHF models for English as well. If so, could you share those models as well?

Instruction finetuning for Multilingual Tasks

Hi!
Thank you for your awesome work!
I had a few doubts:

I understand you have finetuned on all languages separately: https://huggingface.co/uonlp. I was curious if you had attempted to finetune for all languages simultaneously for improved multi-lingual understanding. And if you had a single model which would work across all the high, medium and low resource languages.
For the base LLM, you seem to have used BLOOM and LLama-7B on which you are applying an Instruction fine-tuning techniques (like Supervised fine-tuning (SFT)) on the 3 datasets - ARC, HellaSwag and MMLU. Did not use Llama-2-chat which is already fine-tuned via SFT for a bigger corpus of (instruction, input, output) pairs.

Please do let me know the same if there is a gap in my understanding! Thanks again

Vikrant Dewangan

Datasets not available

I tried using git lfs pull to fetch the dataset, but it seems to have been deleted. Can you possibly share it on another storage location like huggingface datasets?

nlp-uoregon / okapi Goto Github PK

okapi's People

Contributors

Stargazers

Watchers

Forkers

okapi's Issues

Request for Inclusion of Missing English Language Datasets

Request for English models

Instruction finetuning for Multilingual Tasks

Datasets not available

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent