Comments (1)
Thanks for the request! If you're referring to the logits generated by the causal LM, there is currently no way to access them through the Generator
interface. We'll look into exposing this in a future release.
from curated-transformers.
Related Issues (20)
- scipy through bitsandbytes HOT 1
- Add Low-Rank Adapters injection into base models HOT 3
- Loading a model saved locally HOT 2
- Support for loading from an AbstractFileSystem HOT 5
- Optimal Qlora settings HOT 1
- Unable to load fine-tuned causalLM MPT model HOT 2
- Pretrain from scratch and fine tune HOT 1
- Support for Encoder-Decoder-style architectures HOT 2
- Add support for Mistral
- Option to only return the last hidden layer output from models
- Convert QKV projection splitting methods into Torch modules
- Make `QkvMode` ADT-like
- Expose more outputs through the `Generator` interface
- Register models/tokenizers through `catalogue`, so that the auto classes pick them up HOT 2
- Add a an extras/contrib package
- Support DeBERTa v2/3 HOT 1
- Add support for attention sinks
- Move the old Falcon architecuture to the extras/addons pacakage
- Add suggested PyTorch LLM optimizations
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from curated-transformers.