dsgiitr / d2l-pytorch
This project reproduces the book Dive Into Deep Learning (https://d2l.ai/), adapting the code from MXNet into PyTorch.
License: Apache License 2.0
In CH_4_Data_manipulation, we have:
We can transform the result into a scalar in Python using the asscalar function of numpy. In the following example, the
$\ell_2$ norm of x yields a single element tensor. The final result is transformed into a scalar.
In[20]: np.asscalar(x.norm())
Out[20]: 22.494443893432617
Why not just use x.norm().item() to get the scalar, which seems much easier? Are there any differences?
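For what it's worth, the two calls give the same Python float for a one-element tensor, and np.asscalar was deprecated in NumPy 1.16 and later removed, so .item() is the more future-proof spelling. A quick check with a made-up x whose norm is exactly 13:

```python
import torch

x = torch.tensor([3.0, 4.0, 12.0])
n = x.norm()          # single-element tensor: sqrt(9 + 16 + 144) = 13.0

scalar = n.item()     # extracts the Python float directly, no NumPy needed
print(scalar)
```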
Hi Team,
In Ch06_Multilayer_Perceptrons/Multilayer_Perceptron.ipynb, I found that the y-axis label for the gradient of relu() is mistakenly written as "grad of sigmoid".
Can I find the answers to the practice problems somewhere?
I will cover sections 12.3, 12.6, and 12.10 in an upcoming PR if nobody is working on them.
Chapter 10.6, Concise Implementation of Recurrent Neural Networks, is missing.
Hope to see this part, and thanks for your open-source code.
This is the original code:

def resnet_block(input_channels, num_channels, num_residuals, first_block=False):
    blk = []
    for i in range(num_residuals):
        if i == 0 and not first_block:
            blk.append(Residual(input_channels, num_channels, use_1x1conv=True, strides=2))
        else:
            blk.append(Residual(num_channels, num_channels))
    return blk
I tried running the code in Colab, but there was a channel error, so I added one line:

in_channels = out_channels

The new code looks like this, and it runs OK:
def resnet_block(in_channels, out_channels, num_residuals, first_block=False):
    blk = []
    for i in range(num_residuals):
        if i == 0 and not first_block:
            blk.append(Residual(in_channels, out_channels, use_1x1conv=True, strides=2))
        else:
            blk.append(Residual(out_channels, out_channels))
        in_channels = out_channels
    return blk
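For anyone wanting to check this locally, here is a self-contained sketch. The Residual class follows the book's definition and is an assumption about what the notebook actually contains; the shape check confirms the block wiring works end to end.

```python
import torch
from torch import nn
import torch.nn.functional as F

class Residual(nn.Module):
    """Residual block in the spirit of the book's definition (an assumption
    here; the notebook's own class may differ in detail)."""
    def __init__(self, input_channels, num_channels, use_1x1conv=False, strides=1):
        super().__init__()
        self.conv1 = nn.Conv2d(input_channels, num_channels,
                               kernel_size=3, padding=1, stride=strides)
        self.conv2 = nn.Conv2d(num_channels, num_channels,
                               kernel_size=3, padding=1)
        # Optional 1x1 conv to match channels/stride on the shortcut path.
        self.conv3 = (nn.Conv2d(input_channels, num_channels,
                                kernel_size=1, stride=strides)
                      if use_1x1conv else None)
        self.bn1 = nn.BatchNorm2d(num_channels)
        self.bn2 = nn.BatchNorm2d(num_channels)

    def forward(self, X):
        Y = F.relu(self.bn1(self.conv1(X)))
        Y = self.bn2(self.conv2(Y))
        if self.conv3 is not None:
            X = self.conv3(X)
        return F.relu(Y + X)

def resnet_block(in_channels, out_channels, num_residuals, first_block=False):
    blk = []
    for i in range(num_residuals):
        if i == 0 and not first_block:
            blk.append(Residual(in_channels, out_channels,
                                use_1x1conv=True, strides=2))
        else:
            blk.append(Residual(out_channels, out_channels))
        in_channels = out_channels  # later residuals see the new width
    return blk

# Sanity check: 64 -> 128 channels, spatial dims halved by the strided block.
net = nn.Sequential(*resnet_block(64, 128, 2))
out = net(torch.rand(1, 64, 8, 8))
print(out.shape)
```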
We can implement functions like tensor rotation, scaling, and translation (both general and channel-wise), which are currently not possible with direct use of the PyTorch library, as no such functions are implemented. One idea was to convert the tensor to a PIL image, apply the transformations, and then convert back to a tensor. The idea seems legitimate, but there appears to be a bug in the library that returns an incorrect result after conversion back to a tensor, so such custom functions could be very helpful.
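One caveat: for the special cases of 90-degree rotation and whole-pixel translation, PyTorch does already offer tensor-native ops (torch.rot90 and torch.roll); it is the arbitrary-angle and scaling cases that need torch.nn.functional.affine_grid/grid_sample or a PIL round-trip. A small sketch of the tensor-native cases:

```python
import torch

# A toy (C, H, W) image with known values.
img = torch.arange(2 * 3 * 3, dtype=torch.float32).reshape(2, 3, 3)

# 90-degree rotation works directly on tensors, no PIL round-trip needed.
rot = torch.rot90(img, k=1, dims=(1, 2))   # rotate each channel counter-clockwise

# Whole-pixel translation via torch.roll (note: it wraps around the edges;
# pad-and-crop is needed for a true shift that discards pixels).
shifted = torch.roll(img, shifts=(1, 0), dims=(1, 2))
```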
Hi, I was following your tutorial on https://d2l.ai/chapter_computer-vision/bounding-box.html
How can I save a figure with the bounding boxes drawn on it?
In Ch09_Modern_Convolutional_Networks/Network_in_Network(NiN).ipynb, the NiN model is defined with:

# Global Average Pooling can be achieved by AdaptiveMaxPool2d with output size = (1,1)
self.avg1 = nn.AdaptiveMaxPool2d((1,1))

Is this a typo for nn.AdaptiveAvgPool2d, or is there a deeper reason behind it? I ask because using Avg trains a lot slower and gets bad train/val accuracy, around 0.460 at epoch 5.
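For reference, the two layers differ only in the reduction they apply, as a toy input makes concrete (global average pooling, as the comment in the notebook describes, corresponds to AdaptiveAvgPool2d):

```python
import torch
from torch import nn

x = torch.arange(16, dtype=torch.float32).reshape(1, 1, 4, 4)

avg_out = nn.AdaptiveAvgPool2d((1, 1))(x)  # mean over each channel -> 7.5
max_out = nn.AdaptiveMaxPool2d((1, 1))(x)  # max over each channel  -> 15.0
print(avg_out.item(), max_out.item())
```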
Two questions about the FocalLoss implementation below:
1. alpha_t is computed but never used.
2. Should pt instead be computed as pt = torch.log(p) * target.float() + torch.log(1.0 - p) * (1 - target).float()?
class FocalLoss(nn.Module):
    def __init__(self, alpha=0.25, gamma=2, device="cuda:0", eps=1e-10):
        super().__init__()
        self.alpha = alpha
        self.gamma = gamma
        self.device = device
        self.eps = eps

    def forward(self, input, target):
        p = torch.sigmoid(input)
        pt = p * target.float() + (1.0 - p) * (1 - target).float()
        alpha_t = (1.0 - self.alpha) * target.float() + self.alpha * (1 - target).float()
        loss = -1.0 * torch.pow((1 - pt), self.gamma) * torch.log(pt + self.eps)
        return loss.sum()
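If the intent is the focal loss of Lin et al., a version that actually uses alpha_t might look like the sketch below. Note two assumptions on my part: I follow the paper's convention in which alpha weights the positive class (which flips the snippet's alpha_t assignment), and I drop the unused device argument.

```python
import torch
from torch import nn

class FocalLoss(nn.Module):
    """Binary focal loss sketch; alpha_t now actually scales the loss term."""
    def __init__(self, alpha=0.25, gamma=2, eps=1e-10):
        super().__init__()
        self.alpha, self.gamma, self.eps = alpha, gamma, eps

    def forward(self, input, target):
        p = torch.sigmoid(input)
        t = target.float()
        pt = p * t + (1.0 - p) * (1 - t)              # prob of the true class
        # Paper convention: alpha for positives, (1 - alpha) for negatives.
        alpha_t = self.alpha * t + (1.0 - self.alpha) * (1 - t)
        loss = -alpha_t * torch.pow(1 - pt, self.gamma) * torch.log(pt + self.eps)
        return loss.sum()

# With logits 0, p = pt = 0.5 for both examples, so the loss is easy to check.
loss_value = FocalLoss()(torch.zeros(2), torch.tensor([1.0, 0.0]))
print(loss_value.item())
```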
Hey, really great work on porting the book's code over to PyTorch.
It's one of the nicest books I've read (at least the few chapters I did), and Zack & team are great writers.
We want to get this port some attention via the PyTorch Twitter account, and want to make sure the book is in a state you're happy with -- and that we aren't surprising you by pre-announcing.
Let us know.
I would like to contribute to this topic with a notebook using PyTorch. Kindly assign this to me.
Why not implement “Image Classification (CIFAR-10) on Kaggle”?
RuntimeError                              Traceback (most recent call last)
<ipython-input> in <module>
      1 state = init_rnn_state(X.shape[0], num_hiddens, ctx)
      2 inputs = to_onehot(X.to(ctx), len(vocab))
----> 3 params = get_params()
      4 outputs, state_new = rnn(inputs, state, params)
      5 len(outputs), outputs[0].shape, state_new[0].shape

<ipython-input> in get_params()
      9
     10     # Hidden layer parameters
---> 11     W_xh = _one((num_inputs, num_hiddens))
     12     W_hh = _one((num_hiddens, num_hiddens))
     13     b_h = torch.zeros(num_hiddens, device=ctx)

<ipython-input> in _one(shape)
      6 def get_params():
      7     def _one(shape):
----> 8         return torch.Tensor(size=shape, device=ctx).normal_(std=0.01)
      9
     10     # Hidden layer parameters

RuntimeError: legacy constructor for device type: cpu was passed device type: cuda, but device type must be: cpu
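The error comes from the legacy torch.Tensor constructor, which cannot take a CUDA device. Allocating with torch.zeros (or torch.empty) and filling in place with .normal_ sidesteps it. A sketch, with ctx chosen so it also runs on CPU-only machines:

```python
import torch

ctx = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

def _one(shape):
    # torch.Tensor(...) is a legacy constructor and rejects CUDA devices;
    # allocate with torch.zeros and fill in place instead.
    return torch.zeros(shape, device=ctx).normal_(std=0.01)

W = _one((4, 8))
print(W.shape, W.device)
```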
In Chp14_Computer_Vision/Single_Shot_Multibox_Detection.ipynb, in the implementation under "Define Loss and Evaluation Functions", there is a missing .long() on line 61; a TypeError is raised: "Expected Long but got Float".
I solved it by changing class_true_i[0, j] to class_true_i[0, j].long(), because class_target is a tensor with dtype long, as shown on line 54: class_target = torch.zeros(class_hat_i.shape[0]).long().to(self.device)
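The underlying rule is that tensors used as indices (and integer class targets) must have dtype long. A minimal illustration with made-up values, unrelated to the notebook's variables:

```python
import torch

scores = torch.tensor([0.1, 0.9, 0.3])
j = torch.tensor(1.0)         # a float tensor cannot be used as an index

picked = scores[j.long()]     # casting with .long() makes it a valid index
print(picked.item())
```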
Hi authors,
it seems that the download_and_preprocess_data() method does not work. Can you check?
Thanks
In Section "Head gradients and the chain rule"
z.backward(head_gradient)
should be modified to
y.backward(head_gradient)
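The distinction matters because backward on a non-scalar tensor needs the head gradient (the upstream dL/dy) passed to that same tensor. A tiny sketch:

```python
import torch

x = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
y = 2 * x                      # non-scalar output

# For a non-scalar y, backward needs a head gradient of the same shape.
head_gradient = torch.tensor([1.0, 0.1, 0.01])
y.backward(head_gradient)

print(x.grad)                  # dy/dx = 2, scaled by the head gradient
```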
To exactly replicate the book, the function d2l.train_ch3() would be a necessary addition. It would be called during training from the implementations in other chapters as well, which is more convenient than writing the training loop each time.
In fact, all the training loops required in the book could be added to train.py, similar to
similar to https://github.com/d2l-ai/d2l-en/blob/8e53fce19c6c744cf4994a896c43e382567e6fbc/d2l/train.py#L1
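As a starting point, such a shared loop might look like the sketch below; the signature, printed metrics, and return value are my assumptions, not the repo's actual API.

```python
import torch
from torch import nn

def train_ch3(net, train_iter, loss, num_epochs, optimizer):
    """Sketch of a reusable training loop in the spirit of d2l.train_ch3."""
    for epoch in range(num_epochs):
        total_loss, total_correct, n = 0.0, 0, 0
        for X, y in train_iter:
            y_hat = net(X)
            l = loss(y_hat, y)
            optimizer.zero_grad()
            l.backward()
            optimizer.step()
            total_loss += l.item() * y.shape[0]   # undo the mean reduction
            total_correct += (y_hat.argmax(dim=1) == y).sum().item()
            n += y.shape[0]
        print(f'epoch {epoch + 1}, loss {total_loss / n:.4f}, '
              f'train acc {total_correct / n:.3f}')
    return total_loss / n

# Tiny smoke test on random data.
net = nn.Linear(4, 3)
data = [(torch.randn(8, 4), torch.randint(0, 3, (8,))) for _ in range(3)]
final_loss = train_ch3(net, data, nn.CrossEntropyLoss(), 2,
                       torch.optim.SGD(net.parameters(), lr=0.1))
```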
Line 20 in 7090309
Replace with: 'cuda:' + str(i)