wutianyirosun / cgnet Goto Github PK

View Code? Open in Web Editor NEW

255.0 255.0 51.0 1.99 MB

CGNet: A Light-weight Context Guided Network for Semantic Segmentation [IEEE Transactions on Image Processing 2020]

License: MIT License

Python 100.00%

camvid cityscapes pytorch semantic-segmentation

cgnet's People

Contributors

Stargazers

Watchers

Forkers

jiaobingle xiaoxiaojiayou fengweijie pchank mini-shark winwinjjiang mrlinning phygod mahlermozart nemonameless leeesangwon ms-krajesh jdc08161063 shiyongde lijiunderstand cccvt ceo1207 ddeeppnneett lixiyun98 xiaoketongxue aihekukafeidexiaoafei nnu-gisa 980380446 soulempty karenz17 zihaodong jasonlee020 xiaochengcike lzb863 fishman2008 hellosher louyanyang 1273545169 lxtgh chenqingya yakuzeng liaojiacai px-jpg saqibmamoon shining-love kazusaw1999 srdg cyanymore dh0000000001 sabughazal xrq0312 secrul manojll yangnengqun masoutgh xcfghv

cgnet's Issues

*gtFine_labelTrainIds.png

I can't find where is the "*gtFine_labelTrainIds.png" file in the list file. The dataset I downloaded from CityScapes only contains "*gtFine_labelIds.png". Where can I get the "*gtFine_labelTrainIds.png" files? Thanks.

ContextNet: 66.1% accuracy, 0.8 million params and 41.9 fps

Congratulations for nice work.

Have you also seen BMVC 2018 paper, ContextNet with 0.8 million parameter yield 66.1% accuracy and runs in 41.9 fps in the oldest Titan X gpu?

GPU utilization problrm

Hello. The num_worker is 1 to default, then the GPU utilization is around 25%. When the num_worker is 8, that is around 60%. However, a new problem occurs:
=====> epoch[0/300] iter: (369/371) cur_lr: 0.000997 loss: 1.175 time:0.23
=====> epoch[0/300] iter: (370/371) cur_lr: 0.000997 loss: 0.740 time:0.23
cityscapes_train.py:42: UserWarning: volatile was removed and now has no effect. Use with torch.no_grad(): instead.
input_var = Variable(input, volatile=True).cuda()
[0/500] time: 0.80
[1/500] time: 0.02
[2/500] time: 0.02
[3/500] time: 0.02
[4/500] time: 0.02
[5/500] time: 0.02
[6/500] time: 0.02
[7/500] time: 0.02
[8/500] time: 0.02
[9/500] time: 0.02
[10/500] time: 0.02
[11/500] time: 0.02
[12/500] time: 0.02
[13/500] time: 0.02
[14/500] time: 0.02
[15/500] time: 0.02
[16/500] time: 0.02
[17/500] time: 0.02
[18/500] time: 0.02
[19/500] time: 0.02
[20/500] time: 0.02
[21/500] time: 0.02
[22/500] time: 0.02
[23/500] time: 0.02
[24/500] time: 0.02
[25/500] time: 0.02
[26/500] time: 0.02
[27/500] time: 0.02
[28/500] time: 0.02
[29/500] time: 0.02
[30/500] time: 0.02
[31/500] time: 0.02
[32/500] time: 0.02
[33/500] time: 0.02
[34/500] time: 0.02
[35/500] time: 0.02
[36/500] time: 0.02
[37/500] time: 0.02
[38/500] time: 0.02
[39/500] time: 0.02
[40/500] time: 0.02
[41/500] time: 0.02
[42/500] time: 0.02
[43/500] time: 0.02
[44/500] time: 0.02
[45/500] time: 0.02
[46/500] time: 0.02
[47/500] time: 0.02
[48/500] time: 0.02
[49/500] time: 0.02
[50/500] time: 0.02
[51/500] time: 0.02
[52/500] time: 0.02
[53/500] time: 0.02
[54/500] time: 0.02
[55/500] time: 0.02
[56/500] time: 0.02
[57/500] time: 0.02
[58/500] time: 0.02
[59/500] time: 0.02
[60/500] time: 0.02
[61/500] time: 0.02
[62/500] time: 0.02
[63/500] time: 0.02
[64/500] time: 0.02
[65/500] time: 0.02
[66/500] time: 0.02
[67/500] time: 0.02
[68/500] time: 0.02
[69/500] time: 0.02
[70/500] time: 0.02
[71/500] time: 0.02
[72/500] time: 0.02
[73/500] time: 0.02
[74/500] time: 0.02
[75/500] time: 0.02

I do not know what this means. Could you tell me that,please?
Thank you.

“camvid_trainval_list.txt”——What does' camvid_trainval_list. txt 'contain

how to train dataset of mine

hi:

 how to get the response "cityscapes_inform.pkl" file when trainning with  my dataset.

thanks

FileNotFoundError: [Errno 2] No such file or directory: './dataset/wtfile/camvid_inform.pkl'

hello,could you give some suggestions to this problem,please?

TypeError: conv2d(): argument 'input' (position 1) must be Tensor, not builtin_function_or_method

License

What is the license of this repository?

No effective change in fps even after reducing input image size while training.

Nice work on CGNet results are fantastic for image size 640x480 I am getting 69% mIoU for citysacpe with 10 classes after training on GTX 1080 with fps of 43.

I though if I reduce the input image size for training i should get around atleast 2x fps gain.

However I was wrong. I got more or less same performance i.e 50 fps.

Could you guide how to tune it for speed ? I am ok with small reduction in mIoU. ?

No such file or directory: './dataset/wtfile/camvid_inform.pkl'

How can I get this file?

gtFine/train/cologne/cologne_000000_000019_gtFine_labelTrainIds.png

gtFine/train/cologne/cologne_000000_000019_gtFine_labelTrainIds.png
文本cityscapes_train_list.txt的文件和数据集不对应啊,数据集中图片是对的但是标签没有对应上

CUDA run time error for the python cityscape train_code

Getting error while training on Cityscape Data set
this my training configuration

python cityscapes_train.py --gpus "3,4" --data_dir ~/data/Cityscape_2017/ --dataset cityscapes --train_type ontrainval --train_data_list ~/data/Cityscape_2017/cityscapes_trainval_list.txt --max_epochs 350 --cuda True --scaleIn 1 --batch_size 4

code ran and printed
=====> use gpu id: '3,4'
====> Random Seed: 457
=====> current architeture: CGNet
=====> computing network parameters
the number of params: 0.50 M
the number of parameters: 496306
data['classWeights']: [ 1.4705521 9.505282 10.492059 10.492059 10.492059 10.492059
10.492059 10.492059 10.492059 10.492059 10.492059 10.492059
10.492059 10.492059 10.492059 10.492059 10.492059 10.492059
5.131664 ]
=====> Dataset statistics
mean and std: [72.3924 82.90902 73.158325] [45.319206 46.15292 44.91484 ]
torch.cuda.device_count()= 2
Got the GPU count
length of dataset is : 3475
length of dataset: 500
=====> no checkpoint found at './checkpoint/cityscapes/CGNet_M3N21bs16gpu2_ontrainval/model_1.pth'
=====> beginning training
=====> the number of iterations per epoch: 868
torch.Size([4, 3, 680, 680])
torch.Size([4, 680, 680])
/home/nithish/my_install/miniconda3/envs/CGNet/lib/python3.6/site-packages/torch/nn/functional.py:2351: UserWarning: nn.functional.upsample is deprecated. Use nn.functional.interpolate instead.
warnings.warn("nn.functional.upsample is deprecated. Use nn.functional.interpolate instead.")
torch.Size([4, 19, 680, 680])

/opt/conda/conda-bld/pytorch_1544174967633/work/aten/src/THCUNN/SpatialClassNLLCriterion.cu:99: void cunn_SpatialClassNLLCriterion_updateOutput_kernel(T *, T *, T *, long *, T *, int, int, int, int, int, long) [with T = float, AccumT = float]: block: [10,0,0], thread: [223,0,0] Assertion t >= 0 && t < n_classes failed.
Traceback (most recent call last):
File "cityscapes_train.py", line 291, in
train_model(args)
File "cityscapes_train.py", line 228, in train_model
lossTr, per_class_iu_tr, mIOU_tr, lr = train(args, trainLoader, model, criteria, optimizer, epoch)
File "cityscapes_train.py", line 100, in train
loss.backward()
File "/home/nithish/my_install/miniconda3/envs/CGNet/lib/python3.6/site-packages/torch/tensor.py", line 102, in backward
torch.autograd.backward(self, gradient, retain_graph, create_graph)
File "/home/nithish/my_install/miniconda3/envs/CGNet/lib/python3.6/site-packages/torch/autograd/init.py", line 90, in backward
allow_unreachable=True) # allow_unreachable flag
RuntimeError: cuDNN error: CUDNN_STATUS_INTERNAL_ERROR

about cityscapes_inform.pkl

Thanks for sharing your codes!
But I have a question! I can't find "cityscapes_inform.pkl" in the folder, and also the folder "./dataset/wtfile" is not exist. How can I get this file ?

Unable to download model files from baidu's site

Can anyone help me in downloading the weights from baidu's site ?

Untrack pycache

The __pycache__ directory should never be tracked in and added to git. It is to be ignored from git using a .gitignore file. Please add such a file and untrack all added instances of __pycache__.

[ 0 1 2 4 5 6 7 8 10 11 13 255] Some problem with labels. Please check image file: ./city/gtFine/train/cologne/cologne_000000_000019_gtFine_labelTrainIds.png

Hello, could you please teach me to solve this problem?
Thank you!

Training codes for other models?

Hi i'd like to run performance tests on various GPUs to compare CGNet and BiSeNet. Could you please share the training and inference code for BiSeNet?

Speed Question(Wrong Speed Test)

Hi!! Thanks for sharing your codes.
I have seen your result in your paper about bi-seg. I tried to reproduce the result of bi-seg, however I only got 71%IOU（single scale）, have you successfully got 74% IOU results on that? May be he use ms test.
what is your advantages compared with bi-seg? More Light (Less memory cost)?

dataset

I noted that in the code:
image = image[:, :, ::-1]
image -= self.mean
but,cv2.imread's image format is BGR,and self.mean also is BGR ,why need to convert the format image[:, :, ::-1].

what is the format of ''cityscapes_train_list.txt''?

classweights

I find the code used classweights is defferent from OCNet.
https://github.com/PkuRainBow/OCNet.pytorch/blob/master/utils/loss.py

can i do inference using CPU

Getting error while running cityscapes_evl.py on Cityscape Data set

when i run with "--gpu 0" , it shows "RuntimeError: CUDA error: out of memory", because of my 2GB GPU. and then, i run without "--cuda " or "--gpu" ,it also looks like "RuntimeError: CUDA error: out of memory",so how can i do test or eval using CPU, thx!

Camvid

For Camvid dataset, there are 0-11, 12 classes, isn't it? If the number of label is less, the confusion matrix seems not to be correct.

What's the difference between channel-wise convolution and depth-wise convolution?

Hi，i am confused with the channel-wise convolution operator. Could you give some suggestions about how to distinguish this?
In your source code, i think it is more similar to depth-conv which is used in MobileNets.

class ChannelWiseConv(nn.Module):
    def __init__(self, nIn, nOut, kSize, stride=1):
        """
        Args:
            nIn: number of input channels
            nOut: number of output channels, default (nIn == nOut)
            kSize: kernel size
            stride: optional stride rate for down-sampling
        """
        super().__init__()
        padding = int((kSize - 1)/2)
        self.conv = nn.Conv2d(nIn, nOut, (kSize, kSize), stride=stride, padding=(padding, padding), groups=nIn, bias=False)

And i found this paper, "ChannelNets: Compact and Efficient ConvolutionalNeural Networks via Channel-Wise Convolutions", which give a definition of "Channel-wise convolution"。https://arxiv.org/abs/1809.01330

What kind of openator is used in CGNet indeed?

关于cityscape数据集重新训练效果

我尝试使用您的代码重新训练了cityscape多次，在测试集上的结果只能达到54%，也尝试过使用您所训练好的模型在验证集上测试，如文章中所说可以取得64%miou。我在阅读您所提供的代码中发现以下问题：
1.你所使用的种子为随机种子。
2.你的dataloader中最终提供的数据类型为numpy，并没有将其转换为tensor格式。（你在cityscape_train文件中声明了transform，但是并没有使用）。
您所提供的代码并非您最终的实验代码。在确定CGNet结构的形式之后，我并没有跑出这个结果的原因主要是因为什么？

Adding Link to Pytorch-Deeplab Repo

Thanks for sharing! I found this project use some codes of Pytorch-Deeplab repository. Could you please consider adding a link to the Pytorch-Deeplab Repo probably in the acknowledgment of the readme? I appreciate that. https://github.com/speedinghzl/Pytorch-Deeplab.