
Comments (3)

Junpliu commented on August 24, 2024

utterances = ' '.join(["Jane: Hello",
"Vegano Resto: Hello, how may I help you today?",
"Jane: I would like to make a reservation.",
"Jane: For 6 people, tonight around 20:00",
"Vegano Resto: Let me just check.",
"Vegano Resto: Ah, I'm afraid that there is no room at 20:00.",
"Vegano Resto: However, I could offer you a table for six at 18:30 or at 21:00",
"Vegano Resto: Would either of those times suit you?",
"Jane: Oh dear.",
"Jane: Let me just ask my friends.",
"Vegano Resto: No problem.",
"Jane: 21:00 will be ok.",
"Vegano Resto: Perfect. So tonight at 21:00 for six people under your name.",
"Jane: great, thank you!"])

from bertviz.

Junpliu commented on August 24, 2024

I ran the code and the program crashed. However, the attention weights were displayed as expected.
(screenshot of the attention visualization attached)


jessevig commented on August 24, 2024

Hi @Junpliu, the visualization may fail for longer inputs like the one in your example; see https://github.com/jessevig/bertviz#%EF%B8%8F-limitations. In a future version I will add a warning message for these cases.
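Until bertviz warns about over-long inputs itself, a rough pre-check before rendering can save a crashed notebook cell. The helper below is hypothetical (not part of bertviz), and both the 1.3 words-to-subwords ratio and the 256-token threshold are assumptions for illustration:

```python
def estimate_tokens(text, ratio=1.3):
    # Rough estimate: BPE usually yields somewhat more tokens than words.
    # The 1.3 ratio is an assumed heuristic, not a measured constant.
    return int(len(text.split()) * ratio)

def safe_to_visualize(text, max_tokens=256):
    # max_tokens=256 is an arbitrary illustrative threshold.
    est = estimate_tokens(text)
    if est > max_tokens:
        print(f"Warning: ~{est} tokens; model_view may fail to render this input.")
        return False
    return True

print(safe_to_visualize("test"))         # True
print(safe_to_visualize("word " * 300))  # False, after printing a warning
```

For a precise count, tokenize the text first and check `len(input_ids)` instead of estimating.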

I was able to get this to work with a shorter input as a test; does it work for you?

from transformers import AutoTokenizer, AutoModel, utils
from bertviz import model_view

utils.logging.set_verbosity_error()  # Remove line to see warnings

# Initialize tokenizer and model. Be sure to set output_attentions=True.
# Load BART fine-tuned for summarization on CNN/Daily Mail dataset
model_name = "facebook/bart-large-cnn"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name, output_attentions=True)

# Get encoder input ids for the source text
utterances = "test"
encoder_input_ids = tokenizer(utterances, return_tensors="pt", add_special_tokens=True).input_ids

# Get decoder input ids from the target summary
decoder_input_ids = tokenizer("Jane made a 9 PM reservation for 6 people tonight at Vegano Resto .", return_tensors="pt", add_special_tokens=True).input_ids

outputs = model(input_ids=encoder_input_ids, decoder_input_ids=decoder_input_ids)

encoder_text = tokenizer.convert_ids_to_tokens(encoder_input_ids[0])
decoder_text = tokenizer.convert_ids_to_tokens(decoder_input_ids[0])

model_view(
    encoder_attention=outputs.encoder_attentions,
    decoder_attention=outputs.decoder_attentions,
    cross_attention=outputs.cross_attentions,
    encoder_tokens=encoder_text,
    decoder_tokens=decoder_text
)
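One way to catch mismatches before calling model_view is to sanity-check the attention shapes the model returned. The helper below is a hypothetical sketch (not part of bertviz or transformers); it only assumes each layer's attention exposes a `.shape` of (batch, num_heads, query_len, key_len), which is what the tensors returned by a transformers model provide:

```python
def check_attention_shapes(attentions, query_len, key_len):
    # Each element of the model's attention tuple is one layer's tensor
    # with shape (batch, num_heads, query_len, key_len).
    for i, layer in enumerate(attentions):
        _batch, _heads, q, k = layer.shape
        if (q, k) != (query_len, key_len):
            raise ValueError(
                f"layer {i}: expected ({query_len}, {key_len}), got ({q}, {k})")
    return len(attentions)

# Dummy stand-ins for the real tensors, to keep the demo self-contained:
class _Dummy:
    def __init__(self, shape):
        self.shape = shape

cross = tuple(_Dummy((1, 16, 5, 7)) for _ in range(12))
print(check_attention_shapes(cross, 5, 7))  # 12: all layers pass
```

For the cross-attention above, `query_len` is the decoder token count and `key_len` is the encoder token count; the encoder and decoder self-attentions use their own length for both.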

