Comments (7)
@ilovin 训练集精度上去了,测试集精度为0,能帮忙说一下原因吗?而且训练的时候验证集每次用的都是一样的数字。也不是我验证集里面的数据。
from lstm_ctc_ocr.
测试的时候输出是啥
from lstm_ctc_ocr.
4_31,759.jpg 15,2
6_3.jpg 78
7_996.jpg 7899
9_732.jpg 7,9
12_40,986.jpg 180,8
16_38,854.jpg 1,8606
26_0.jpg 740
36_0.jpg 780
37_0.jpg 780
43_367.jpg 7,6
45_7.jpg 74
47_10.jpg 780
49_0.jpg 70
55_14,950.jpg 149,8
56_0.jpg 7,0
60_0.jpg 7,0
65_0.jpg 747
67_0.jpg 746
这是部分输出结果,下划线后面是标签,我训练的是十个数字加一个逗号,共11个字符,图片尺寸(160,60)时一直不收敛(迭代十几万次),调整到(120,45)很快收敛了,信息如下:
iter: 100 / 1000000, total loss: 13.0400743, lr: 0.0001000
speed: 0.102s / iter
iter: 200 / 1000000, total loss: 10.5619059, lr: 0.0001000
speed: 0.117s / iter
iter: 300 / 1000000, total loss: 7.7125740, lr: 0.0001000
speed: 0.119s / iter
iter: 400 / 1000000, total loss: 2.0175159, lr: 0.0001000
speed: 0.111s / iter
iter: 500 / 1000000, total loss: 0.5158803, lr: 0.0001000
speed: 0.118s / iter
iter: 600 / 1000000, total loss: 0.3201181, lr: 0.0001000
speed: 0.115s / iter
iter: 700 / 1000000, total loss: 0.1470777, lr: 0.0001000
speed: 0.113s / iter
iter: 800 / 1000000, total loss: 0.4072588, lr: 0.0001000
speed: 0.115s / iter
iter: 900 / 1000000, total loss: 0.2619937, lr: 0.0001000
speed: 0.117s / iter
seq 0: origin: [8, 1, 6, 11] decoded:[8, 1, 6, 11]
seq 1: origin: [8, 7, 1, 10, 3, 4, 10, 5] decoded:[8, 7, 1, 10, 3, 4, 10, 5]
seq 2: origin: [2, 2, 4, 2, 3] decoded:[2, 2, 4, 2, 3]
seq 3: origin: [4, 11] decoded:[4, 11]
seq 4: origin: [3] decoded:[3]
accuracy: 0.98438
iter: 1000 / 1000000, total loss: 0.0657427, lr: 0.0001000
speed: 0.127s / iter
iter: 1100 / 1000000, total loss: 0.0259138, lr: 0.0001000
speed: 0.119s / iter
iter: 1200 / 1000000, total loss: 0.0610742, lr: 0.0001000
speed: 0.113s / iter
iter: 1300 / 1000000, total loss: 0.0725027, lr: 0.0001000
speed: 0.109s / iter
('loss: ', 0.014755567)
Wrote snapshot to: /home/liu/lstm_ctc_ocr-beta/output/lstm_ctc/lstm_ctc_iter_2.ckpt
seq 0: origin: [8, 1, 6, 11] decoded:[8, 1, 6, 11]
seq 1: origin: [8, 7, 1, 10, 3, 4, 10, 5] decoded:[8, 7, 1, 10, 3, 4, 10, 5]
seq 2: origin: [2, 2, 4, 2, 3] decoded:[2, 2, 4, 2, 3]
seq 3: origin: [4, 11] decoded:[4, 11]
seq 4: origin: [3] decoded:[3]
accuracy: 1.00000
iter: 1400 / 1000000, total loss: 0.1878222, lr: 0.0001000
speed: 0.113s / iter
iter: 1500 / 1000000, total loss: 0.0301566, lr: 0.0001000
speed: 0.117s / iter
('loss: ', 0.0140728075)
Wrote snapshot to: /home/liu/lstm_ctc_ocr-beta/output/lstm_ctc/lstm_ctc_iter_2.ckpt
seq 0: origin: [8, 1, 6, 11] decoded:[8, 1, 6, 11]
seq 1: origin: [8, 7, 1, 10, 3, 4, 10, 5] decoded:[8, 7, 1, 10, 3, 4, 10, 5]
seq 2: origin: [2, 2, 4, 2, 3] decoded:[2, 2, 4, 2, 3]
seq 3: origin: [4, 11] decoded:[4, 11]
seq 4: origin: [3] decoded:[3]
accuracy: 1.00000
iter: 1600 / 1000000, total loss: 0.2451135, lr: 0.0001000
speed: 0.113s / iter
('loss: ', 0.01386846)
但是测试结果就如上所示了,所以麻烦问一下是什么情况,万分感谢。
PS:
seq 0: origin: [8, 1, 6, 11] decoded:[8, 1, 6, 11]
seq 1: origin: [8, 7, 1, 10, 3, 4, 10, 5] decoded:[8, 7, 1, 10, 3, 4, 10, 5]
seq 2: origin: [2, 2, 4, 2, 3] decoded:[2, 2, 4, 2, 3]
seq 3: origin: [4, 11] decoded:[4, 11]
seq 4: origin: [3] decoded:[3]
这些数据不是我val里面的数据标签,不知道读的哪里的。
from lstm_ctc_ocr.
你测试数据和训练数据是同分布吗,都是reptcha生成的?
from lstm_ctc_ocr.
数据不是生成的,是我自己的数据,一批数据随机挑出2000测试集,剩下30000多是训练集和验证集。
from lstm_ctc_ocr.
我觉得有可能是你测试的config没够改 测试时的图像和训练没有同分布
from lstm_ctc_ocr.
测试集数据跟训练集分布是一样的,用另外的代码跑这些数据集都正常。我觉得还是图片尺寸和网络之间的问题,(160,60)迭代十几万次都不收敛,改成(120,45)的迭代几轮就收敛了可能不是正常收敛,我再仔细研究一下,找到错误原因后再来给大家分享吧。有遇到类似情况的朋友希望分享下解决方案。
from lstm_ctc_ocr.
Related Issues (20)
- got AttributeError when training HOT 3
- Loss 小于0......
- Is this a scene text recoginizer? HOT 1
- 一个小问题 HOT 2
- ImportError: No module named utils.data_util HOT 3
- undefined symbol: _ZTIN10tensorflow8OpKernelE
- IOError: unknown file format
- why there comes out negative loss? HOT 2
- i can't make the warpCTC tensorflow_binding environment
- 运行代码时崩溃:段错误(核心已转储)
- ctc
- ctc和tensorflow绑定以后运行出了问题, undefined symbol: _ZNK10tensorflow14TensorShapeRep11DebugStringB5cxx11Ev HOT 2
- why num classes is __C.NCLASSES = len(__C.CHARSET)+2 HOT 2
- 请问训练数据字符个数长度不一致的时候有没有什么好的训练方式? HOT 5
- 字符的长度应该是63个吧 HOT 2
- 关于图像变长字符识别的问题 HOT 3
- BN HOT 1
- test.sh出错
- time_steps.append(nw//cfg.POOL_SCALE+cfg.OFFSET_TIME_STEP) HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from lstm_ctc_ocr.