anz_llm_bootcamp's Issues
Llama-2 models need authentication
Can you please add the capacity to pass in the current user's Hugging Face token to the use_auth_token
parameter during the initialization of tokenizer, model_config, and model in the 0.1_Huggins_Face_basics, Utils, etc. notebooks? It is required to download the model from the Meta website via Hugging Face.
Refactor and extend the advanced chunking tutorial notebook
Add more context info to prompting notebook and improve examples
Difficulty setting up `max_length` in 1_Single_QnA_example notebook
When I load a Llama-2 model and try to get a value back from the query test:
# Test Query 1
query = "What is this document about?"
result = qa.run(query)
print(result)
The response reads:
Input length of input_ids is 2764, but `max_length` is set to 20. This can lead to unexpected behavior. You should consider increasing `max_new_tokens`.
This
And that's it ~ apart from the warning about max_length being set to 20, I get the single word "This".
I have examined the pipe.model.generation_config
and it shows max_length
as 4096 (below), and I cannot figure out what the message is referring to, but it doesn't seem to be this max_length
!
GenerationConfig {
"_from_model_config": true,
"bos_token_id": 1,
"eos_token_id": 2,
"max_length": 4096,
"pad_token_id": 0,
"temperature": 0.9,
"top_p": 0.6,
"transformers_version": "4.29.2"
}
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.