I had already trained the model for 8 epochs and then the training stopped as my pc cr

Update function doesn't work correctly. (drqa issue, closed)

hitvoice commented on July 20, 2024

Update function doesn't work correctly.

Comments (2)

hitvoice commented on July 20, 2024

Hi MixalisV,

Sorry for the late reply. I was unable to reproduce this. Here's what I've got when training for the first time:

02/14/2018 01:40:01 epoch [ 8] updates[ 21639] train loss[3.57525] remaining[0:00:00]
02/14/2018 01:40:21 dev EM: 66.05487228003784 F1: 75.85399816131314
02/14/2018 01:40:23 [new best model saved.]
02/14/2018 01:40:23 Epoch 9
02/14/2018 01:40:24 epoch [ 9] updates[ 21641] train loss[3.57521] remaining[0:04:54]
02/14/2018 01:40:24 epoch [ 9] updates[ 21644] train loss[3.57518] remaining[0:05:47]
02/14/2018 01:40:24 epoch [ 9] updates[ 21647] train loss[3.57509] remaining[0:05:57]

After resuming:

02/14/2018 02:14:23 [Data loaded.]
02/14/2018 02:14:23 [loading previous model...]
02/14/2018 02:14:45 [dev EM: 66.05487228003784 F1: 75.85399816131314]
02/14/2018 02:14:45 Epoch 9
02/14/2018 02:14:45 epoch [ 9] updates[ 21641] train loss[2.78311] remaining[0:04:47]
02/14/2018 02:14:46 epoch [ 9] updates[ 21644] train loss[3.14489] remaining[0:05:03]
02/14/2018 02:14:46 epoch [ 9] updates[ 21647] train loss[3.07743] remaining[0:05:27]
02/14/2018 02:14:46 epoch [ 9] updates[ 21650] train loss[2.89589] remaining[0:05:33]

The printed loss drops sharply because it is the running average of all losses seen so far, and after resuming that average starts again from an empty history.
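To illustrate the effect, here is a minimal sketch, assuming the logged value is a plain running mean over all batches. The class name `AverageMeter` and its `state_dict`/`load_state_dict` methods are hypothetical names chosen for illustration, not necessarily what the repository uses, and the loss values are made up.

```python
class AverageMeter:
    """Running mean of every value passed to update()."""

    def __init__(self):
        self.total = 0.0
        self.count = 0

    def update(self, value, n=1):
        # Accumulate the sum and the number of samples seen so far.
        self.total += value * n
        self.count += n

    @property
    def avg(self):
        return self.total / self.count if self.count else 0.0

    def state_dict(self):
        # Hypothetical helper: what a checkpoint would need to store.
        return {"total": self.total, "count": self.count}

    def load_state_dict(self, state):
        self.total = state["total"]
        self.count = state["count"]


# Illustrative losses only, not taken from the logs above.
meter = AverageMeter()
for loss in [6.0, 5.0, 4.0, 3.0]:
    meter.update(loss)
saved = meter.state_dict()

# Resuming with an empty history: the average reflects only the new batch.
fresh = AverageMeter()
fresh.update(2.0)

# Resuming with the saved history: the average continues smoothly.
restored = AverageMeter()
restored.load_state_dict(saved)
restored.update(2.0)

print(fresh.avg, restored.avg)
```

With an empty history the first printed value equals the latest batch loss alone, which is why it looks much lower than the long-run average. Storing the meter state in the checkpoint is one way to keep the display continuous.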

Please check whether you are using the latest version of this repository (run git log -1) and let me know which PyTorch version you have installed.

hitvoice commented on July 20, 2024

The random states are now fully restored on resume, and the loss display is consistent as well. Check out the latest version! (51a7ce0)
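For readers wondering what restoring the random state involves, here is a rough sketch using only Python's standard `random` module. It is not the code from the commit mentioned above. The `make_checkpoint` and `resume` helpers are hypothetical. A PyTorch training script would presumably also need to save and restore PyTorch's own generator state (for example via `torch.get_rng_state` and `torch.set_rng_state`), which this sketch leaves out.

```python
import random


def make_checkpoint(epoch):
    # Hypothetical helper: store the RNG state next to the epoch counter.
    return {"epoch": epoch, "random_state": random.getstate()}


def resume(checkpoint):
    # Hypothetical helper: put the RNG back exactly where it was at save time.
    random.setstate(checkpoint["random_state"])
    return checkpoint["epoch"]


# Original run: consume some random numbers, then checkpoint.
random.seed(1234)
_ = [random.random() for _ in range(10)]
ckpt = make_checkpoint(epoch=8)
expected = [random.random() for _ in range(3)]  # what the run would draw next

# Simulated restart: the process begins with an unrelated RNG state.
random.seed(999)
last_epoch = resume(ckpt)
replayed = [random.random() for _ in range(3)]

print(last_epoch, replayed == expected)
```

After `resume`, the generator produces the same sequence the interrupted run would have produced, so anything driven by it (such as shuffling) continues as if training had never stopped.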
