Comments (3)
CUDA Version: 12.0
from pet.
升级torch和torchvision版本:
torch 1.10.1
torchvision 0.11.2
运行显示:
Evaluating: 0%| | 0/15000 [00:00<?, ?it/s]
Traceback (most recent call last):
File "cli.py", line 284, in
main()
File "cli.py", line 265, in main
no_distillation=args.no_distillation, seed=args.seed)
File "/home/123456/projects/prompt/pet-master/pet/modeling.py", line 249, in train_pet
save_unlabeled_logits=not no_distillation, seed=seed)
File "/home/123456/projects/prompt/pet-master/pet/modeling.py", line 355, in train_pet_ensemble
unlabeled_data=unlabeled_data))
File "/home/123456/projects/prompt/pet-master/pet/modeling.py", line 434, in train_single_model
results_dict['train_set_before_training'] = evaluate(model, train_data, eval_config)['scores']['acc']
File "/home/123456/projects/prompt/pet-master/pet/modeling.py", line 490, in evaluate
n_gpu=config.n_gpu, decoding_strategy=config.decoding_strategy, priming=config.priming)
File "/home/123456/projects/prompt/pet-master/pet/wrapper.py", line 376, in eval
logits = EVALUATION_STEP_FUNCTIONSself.config.wrapper_type(batch)
File "/home/123456/projects/prompt/pet-master/pet/wrapper.py", line 524, in mlm_eval_step
outputs = self.model(**inputs)
File "/home/123456/.conda/envs/python36/lib/python3.6/site-packages/torch/nn/modules/module.py", line 1102, in _call_impl
return forward_call(*input, **kwargs)
File "/home/123456/.conda/envs/python36/lib/python3.6/site-packages/transformers/modeling_albert.py", line 814, in forward
output_hidden_states=output_hidden_states,
File "/home/123456/.conda/envs/python36/lib/python3.6/site-packages/torch/nn/modules/module.py", line 1102, in _call_impl
return forward_call(*input, **kwargs)
File "/home/123456/.conda/envs/python36/lib/python3.6/site-packages/transformers/modeling_albert.py", line 548, in forward
token_type_ids = torch.zeros(input_shape, dtype=torch.long, device=device)
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
from pet.
解决方法:
conda install pytorch torchvision cudatoolkit=11.3 -c pytorch
from pet.
Related Issues (20)
- Training Time Issue HOT 4
- GenPET commands with GPT-2 HOT 2
- RuntimeError on eval method HOT 2
- Token indices sequence length is longer than the specified maximum sequence length for this model HOT 2
- PET and iPET parameters
- Random seed parameter for iterations
- Data format for few-shot text classification
- Roberta-large using BPE tokenizer generates multi tokens. HOT 6
- API Usage
- How much test data do you use in your experiments?
- How to training PET model uses xlm-roberta with byte-level Byte-Pair-Encoding?
- How to reproduce results of the paper? HOT 1
- Clarification on how to interpret PET's results HOT 2
- A question about “Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification”
- OSError: Model name 'clue/albert_chinese_tiny' was not found in tokenizers model name list HOT 2
- TypeError: expected str, bytes or os.PathLike object, not NoneType HOT 1
- ZeroDivisionError when reduction is set to 'wmean' while training iPET HOT 1
- Stuck at 'There are 0 Examples for Label [my_label]' During iPETs Example Selection for Next Generation
- There is no softmax
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pet.