Comments (6)
@fxmarty Thanks for the prompt response. I can see that torch 2.0.1 helps reduce the variance. However, I still see a decent variance in the results when comparing the models in fp16 mode (by calling model.half()).
@fxmarty
Adding model.half() to the above code:

from transformers import AutoModelForSequenceClassification, AutoTokenizer
from optimum.bettertransformer import BetterTransformer

tokenizer = AutoTokenizer.from_pretrained("BAAI/bge-reranker-large")
original_model = AutoModelForSequenceClassification.from_pretrained("BAAI/bge-reranker-large").to('cuda:0')
transformed_model = BetterTransformer.transform(original_model, keep_original_model=True).to('cuda:0')

# Cast both models to fp16 after loading
original_model.half()
transformed_model.half()

sentences_batch = [['do you like fox cookies', 'fox big brown fox']]
inputs = tokenizer(sentences_batch, padding=True, truncation=True, return_tensors="pt", max_length=512).to('cuda:0')

better_transformer_scores = transformed_model(**inputs, return_dict=True).logits.view(-1).float()
print(f"BetterTransformer output: {better_transformer_scores.detach().cpu().numpy().tolist()}")
vanilla_model_scores = original_model(**inputs, return_dict=True).logits.view(-1).float()
print(f"Vanilla model output: {vanilla_model_scores.detach().cpu().numpy().tolist()}")
produces the output:

BetterTransformer output: [-7.35546875]
Vanilla model output: [-7.3515625]
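As a side note on expected fp16 precision, the single-pair difference above is exactly one fp16 unit in the last place. A minimal check (plain Python, not from the thread; the score values are copied from the outputs above):

```python
import math

# For magnitudes in [4, 8), fp16 (10 mantissa bits) has a spacing of
# 2**(2 - 10) = 0.00390625 between representable values.
bt, vanilla = -7.35546875, -7.3515625
diff = abs(bt - vanilla)
ulp = 2.0 ** (math.floor(math.log2(abs(vanilla))) - 10)
print(diff, ulp)  # 0.00390625 0.00390625
```

So a one-ULP gap on the single example is within normal fp16 rounding behavior.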
I've also observed a higher degree of variability with batch sizes larger than 1. For instance, with

sentences_batch = [
    ['do you like fox cookies', 'fox big brown fox'],
    ['do you like fox cookies', 'fox big big brown fox'],
    ['do you like fox cookies', 'fox small tiny brown fox'],
    ['n the middl just loading a mookies', 'fox small tiny brown fox happen in the middl just loading a monthly rollup table'],
    ['do you like fox cookies', 'fox big hello world from the Since most of these loads happen in the middl just loading a monthly rollup table when the regular table load happens. I chose a replace into option, brown fox'],
]
I see the output below, where the relative difference for the 4th item is high (around 0.23%):

BetterTransformer output: [-7.35546875, -7.51171875, -8.21875, -1.27734375, -9.4765625]
Vanilla model output: [-7.35546875, -7.515625, -8.2265625, -1.2802734375, -9.4765625]
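The per-item relative differences can be recomputed directly from the two output lists quoted above (plain Python, no torch needed):

```python
# Score lists copied from the BetterTransformer vs. vanilla outputs above
bt = [-7.35546875, -7.51171875, -8.21875, -1.27734375, -9.4765625]
vanilla = [-7.35546875, -7.515625, -8.2265625, -1.2802734375, -9.4765625]

# Relative difference of each item, taking the vanilla score as reference
for i, (a, b) in enumerate(zip(bt, vanilla), start=1):
    rel = abs(a - b) / abs(b)
    print(f"item {i}: relative difference = {rel:.4%}")
# the 4th item shows the largest gap, roughly 0.23%
```

The 4th item stands out because its score has the smallest magnitude, so the same absolute fp16 rounding error translates into a larger relative error.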
Hi @kapilsingh93, thank you, I can reproduce (only on a CUDA device though). This is not expected, sorry for the issue. Let me fix it shortly.
@kapilsingh93 Interestingly, downgrading to torch 2.0.1 fixes the issue... It may be a torch regression. I hit the issue even with torch.backends.cuda.sdp_kernel(enable_flash=False, enable_math=True, enable_mem_efficient=False), and only on a CUDA device.
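For reference, a minimal sketch of pinning scaled dot-product attention to the math (reference) backend using the context-manager form of that API (available in torch 2.0–2.2; later releases deprecate it in favor of torch.nn.attention.sdpa_kernel). The tensor shapes here are arbitrary, not from the thread:

```python
import torch
import torch.nn.functional as F

# Arbitrary (batch, heads, seq_len, head_dim) tensors just to exercise SDPA
q = torch.randn(1, 8, 16, 64)
k = torch.randn(1, 8, 16, 64)
v = torch.randn(1, 8, 16, 64)

# Disable the flash and memory-efficient kernels, leaving only the math backend
with torch.backends.cuda.sdp_kernel(enable_flash=False, enable_math=True, enable_mem_efficient=False):
    out = F.scaled_dot_product_attention(q, k, v)

print(out.shape)  # torch.Size([1, 8, 16, 64])
```

If the mismatch persists even with the math backend forced, the divergence likely comes from something other than the fused attention kernels, which is consistent with it being a torch regression.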
@kapilsingh93 It would help to debug if you could confirm whether using torch 2.0.1 brings back equal outputs.
@kapilsingh93 Can you share a reproduction in fp16?