Comments (5)
I checked the tokenizer, and there are roughly 700 Chinese characters in the vocabulary. So, limited support indeed.
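For anyone who wants to reproduce a count like this, here is a minimal sketch. It is not necessarily how the number above was obtained. It assumes the vocabulary is available as a list of string pieces, and it counts pieces that are exactly one Han ideograph (basic CJK Unified Ideographs block plus Extension A), so Japanese kanji are counted too. The helper names and the commented-out `tokenizer.model` path are hypothetical.

```python
from typing import Iterable


def is_cjk_ideograph(ch: str) -> bool:
    """True if ch is in the basic CJK Unified Ideographs block or Extension A."""
    cp = ord(ch)
    return 0x4E00 <= cp <= 0x9FFF or 0x3400 <= cp <= 0x4DBF


def count_cjk_pieces(pieces: Iterable[str]) -> int:
    """Count vocabulary pieces that consist of exactly one CJK ideograph."""
    return sum(1 for p in pieces if len(p) == 1 and is_cjk_ideograph(p))


# Hypothetical usage, assuming the sentencepiece package is installed and a
# tokenizer.model file is available locally (the path is an assumption):
# import sentencepiece as spm
# sp = spm.SentencePieceProcessor(model_file="tokenizer.model")
# pieces = (sp.id_to_piece(i) for i in range(sp.get_piece_size()))
# print(count_cjk_pieces(pieces))
```

Characters missing from the vocabulary are not necessarily unusable (a tokenizer may fall back to smaller units), so a low count suggests limited support and does not prove a lack of it.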
from llama.
7B model
Prompt: 这首歌使我想起了我年轻的时候。
Output:
“牙齿” 很有趣的名字。
The teeth have got you.
The teeth have got you in their jaws.
The teeth have got you in their teeth.
The teeth have got you in their mouths.
The teeth have got you in their claws.
The teeth have got you in their nails.
The teeth have got you in their fingernails.
The teeth have got you in their tongues.
The teeth have got you in their eyes.
The teeth have got you in their ears.
The teeth have got you in their noses.
The teeth have got you in their nostrils.
The teeth have got you in their lips.
The teeth have got you in their mouths.
The teeth have got you in their claws.
The teeth have got you in their fingers.
The teeth have got you in their fingernails.
The teeth have got you in their toes.
The teeth have got you in their soles.
The teeth have got you in their heels.
The teeth have got you in their feet.
The teeth have got you in their kne
==================================
Summary: Doesn't support Chinese. The input prompt was "This song reminds me of my youth" and the output was "'Teeth' is an interesting name" followed by English statements about teeth.
from llama.
I also tried out Japanese for the heck of it; it doesn't support Japanese either.
Prompt:
私の記憶は広告写真みたいになかしく通り過ぎてゆく。
Output:
The Oatmeal | 30 Nov 2016
- Your memory isn’t actually a perfect recording device.
Your brain is great at filling in the gaps of what you don’t remember. You know, like when you remember a dream where you’re on a boat with a bunch of aliens who are going to eat you but then they turn out to be friendly and you’re best friends with them and that’s why you don’t remember the parts where they were going to eat you. But you also have a memory of what you ate for breakfast that morning, and that’s because your brain is filling in the gaps. - We have a hard time remembering bad things that happened to us.
And this is another way your memory fills in the gaps: If something bad happened to you, you might not remember the exact details of it. You might remember that it was bad, but you might not be able to remember what happened, which might lead you to forget it happened altogether. - We tend to remember events in our lives in chronological order, but we don’t remember them that way when we’re recalling them.
When you
from llama.
According to this article, no:
https://aibusiness.com/meta/meta-s-llama-language-model-outperforms-openai-s-gpt-3
And LLaMA was not just built using solely English text. Meta trained its model using 20 languages that use Latin or Cyrillic scripts. However, most of the training data is in English so model performance for it is better.
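As a rough illustration of the point about scripts, the sketch below checks whether every letter in a prompt is Latin or Cyrillic, based on Unicode character names. The function name is hypothetical and this is only a heuristic: it says nothing about which specific languages the model handles, only whether a prompt falls outside the two scripts mentioned in the article.

```python
import unicodedata


def uses_only_latin_or_cyrillic(text: str) -> bool:
    """Return True if every alphabetic character is a Latin or Cyrillic letter.

    Digits, punctuation, and whitespace are ignored.
    """
    for ch in text:
        if not ch.isalpha():
            continue
        name = unicodedata.name(ch, "")
        if not (name.startswith("LATIN") or name.startswith("CYRILLIC")):
            return False
    return True


if __name__ == "__main__":
    for prompt in (
        "This song reminds me of my youth.",
        "这首歌使我想起了我年轻的时候。",
    ):
        print(uses_only_latin_or_cyrillic(prompt), prompt)
```

Both the Chinese and the Japanese prompts tried earlier in this thread fail this check, which fits the poor generations reported above.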
from llama.
Hey, we've elaborated on this in the FAQ: https://github.com/facebookresearch/llama/blob/main/FAQ.md#2-generations-are-bad
from llama.