I am trying to the code. But I face problem when I execute below lin

Maybe, I think, this line is responsible for the issue: <a href="htt

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Again Device issue about docformer HOT 3 CLOSED

shabie commented on July 20, 2024

Again Device issue

from docformer.

Comments (3)

uakarsh commented on July 20, 2024

Maybe, I think, this line is responsible for the issue:

self.scale = torch.sqrt(torch.FloatTensor([embed_dim]))

And, which the below lines are having problem of .to(device), especially when the device is cuda. Would shortly modify it and let you know, if the problem still persists. If possible, can you do try to change the above line from:
self.scale = torch.sqrt(torch.FloatTensor([embed_dim])) to self.scale = embed_dim**0.5, and remove all the .to(device) parts in the below set of lines and let us know?

I would try from my end as well and would let you know. Thanks for pointing this issue out.

Maybe, this couldn't be the case, and there could be something else, but I would let you know soon. And, can you do let us know, the whole part of the code, since the above mentioned line of code, won't help me recreate the bug

from docformer.

uakarsh commented on July 20, 2024

Can you do let me know, if the issue has been resolved or not? If not resolved, can you help me with reproducing the error on Google Colab, since then I can definitely try to solve th bug, and update the same in this repo

Regards,
Akarsh

from docformer.

uakarsh commented on July 20, 2024

Hi @kmr2017, sorry for the late reply, but I faced this issue just now, and I think I have managed to solve it. You can just clone: https://github.com/uakarsh/docformer, and this would do the thing (as far as I know). And, do let me know if that solves the issue or not. I would shortly include the update in the main branch as well.

Regards,

from docformer.

Recommend Projects

Again Device issue about docformer HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent