Hi, I wonder if rnn.forward_step changes the order of (batch_size*self.k) dimensio

TopKDecoder about pytorch-seq2seq HOT 6 OPEN

Hongzl1996 commented on June 12, 2024

TopKDecoder

from pytorch-seq2seq.

Comments (6)

GZJAS commented on June 12, 2024

I have been studying the code these past few days, and I think you can use torch.repeat_interleave, as follows:
hidden = tuple([torch.repeat_interleave(h, self.k, dim=1) for h in encoder_hidden])
inflated_encoder_outputs = torch.repeat_interleave(encoder_outputs, self.k, dim=0)
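
To illustrate why the choice of function can matter here, the sketch below compares torch.repeat_interleave with Tensor.repeat on a toy hidden state. The shapes and the names batch_size, k and hidden_size are made up for the illustration, and the comparison with Tensor.repeat is an assumption about what the alternative would be, not something checked against the repository. The point is only that the two calls lay out the (batch_size*k) dimension in different orders.

```python
import torch

batch_size, k, hidden_size = 2, 3, 4  # illustrative sizes, not from the repo

# One hidden vector per example: example 0 is all 0s, example 1 is all 1s.
# Shape: (num_layers=1, batch_size, hidden_size)
h = torch.arange(batch_size, dtype=torch.float).view(1, batch_size, 1)
h = h.repeat(1, 1, hidden_size)

# Tiling: the whole batch is copied k times back to back.
tiled = h.repeat(1, k, 1)
# Interleaving: each example is copied k times in place.
interleaved = torch.repeat_interleave(h, k, dim=1)

# Read back which example each row of the inflated batch came from.
tiled_order = tiled[0, :, 0].long().tolist()
interleaved_order = interleaved[0, :, 0].long().tolist()

print(tiled_order)        # [0, 1, 0, 1, 0, 1]
print(interleaved_order)  # [0, 0, 0, 1, 1, 1]
```

If the rest of the beam search assumes that the k hypotheses of one example sit next to each other (for instance when reshaping to (batch_size, k, ...)), only the interleaved layout satisfies that.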

KwanWaiChung commented on June 12, 2024

Hi, I am studying the code and have similar doubts. However, can you clarify what you mean by decoder_output? Do you actually mean log_softmax_output?

Hongzl1996 commented on June 12, 2024

@JojoFisherman Yeah, I mean the output probabilities of the decoder, i.e. log_softmax_output.

KwanWaiChung commented on June 12, 2024

I have the same question. It surprises me that no one has answered this. If there's really something wrong in the beam search, surely it would output some weird sequences. Did you reach any conclusion about this?

Hongzl1996 commented on June 12, 2024

It seems some other issues have reported that beam search doesn't work correctly. Unfortunately, this repo may no longer be actively maintained. Currently, I use fairseq (PyTorch version) to run some related experiments.

muncok commented on June 12, 2024

I have been studying the code these past few days, and I think you can use torch.repeat_interleave, as follows:
hidden = tuple([torch.repeat_interleave(h, self.k, dim=1) for h in encoder_hidden])
inflated_encoder_outputs = torch.repeat_interleave(encoder_outputs, self.k, dim=0)

I had the problem with batch_size > 1, but after applying the change from this comment, it works now.

Thank you!!
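
One possible reason the problem shows up only with batch_size > 1 is sketched below. It assumes the unpatched code inflates the batch by tiling (something like Tensor.repeat), which is an assumption and has not been verified against the repository; layouts_match is a hypothetical helper written for this illustration. With a single example, tiling and interleaving produce the same row order, so any ordering bug would be invisible.

```python
import torch


def layouts_match(batch_size: int, k: int) -> bool:
    """Return True if tiling and interleaving give the same row order.

    Hypothetical helper for illustration; not part of pytorch-seq2seq.
    """
    ids = torch.arange(batch_size)          # one id per example in the batch
    tiled = ids.repeat(k)                   # e.g. [0, 1, 0, 1, 0, 1]
    interleaved = torch.repeat_interleave(ids, k)  # e.g. [0, 0, 0, 1, 1, 1]
    return torch.equal(tiled, interleaved)


print(layouts_match(1, 3))  # True
print(layouts_match(2, 3))  # False
```

So a beam search that works for batch_size = 1 but fails for larger batches would be consistent with a mismatch between the two layouts.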
