bighuang624 / andrew-ng-deep-learning-notes Goto Github PK

View Code? Open in Web Editor NEW

994.0 41.0 320.0 135.44 MB

吴恩达《深度学习》系列课程笔记及代码 | Notes in Chinese for Andrew Ng Deep Learning Course

Home Page: https://kyonhuang.top/Andrew-Ng-Deep-Learning-notes/

Jupyter Notebook 98.37% Python 1.63%

deep-learning neural-network andrew-ng-course andrew-ng coursera

andrew-ng-deep-learning-notes's Introduction

吴恩达《深度学习》系列课程笔记

笔记

Github Markdown 对 LaTeX 的支持较差，推荐在线阅览笔记。

本笔记 html 版本由 docsify 动态生成文档网站。由于笔记中图片链接来自 Github，有时不太稳定，推荐科学上网后阅览笔记。

代码

本仓库暂时只包含本人已完成的各课练习代码，请在 assignments 文件夹下查找，没有就是尚未完成。你可以通过在 github 上搜索 deeplearning.ai 找到很多含有未完成的练习代码的库。

另推荐一份已完成的各课练习代码。

课程地址

deeplearning.ai - 主页 - 网易云课堂

本项目地址

项目地址

须知

本项目只是本人在学习过程中所作的笔记，不能保证包含课程中所有内容，也不能替代实际课程学习。
本人也尚在学习中，水平有限。如果确认内容中存在谬误，欢迎提出 issue 或者 pull request。
如果您觉得有帮助，不妨点一个 star 以资鼓励。

参考资料

在整理笔记的过程中，我也参考了很多相同课程的笔记，感谢他们的帮助。在这里将其中一些比较连贯的系列笔记推荐给大家。

LICENSE

本项目所有文章采用 CC BY-NC-SA 3.0 许可协议，转载请注明出处。

andrew-ng-deep-learning-notes's People

Contributors

Stargazers

Watchers

Forkers

neilkizz jfkkf123 noinil juntingz techeye220 chaosfu gaoxuesong little1tow xiayuanxiang wolfworld6 chuanzhidong zhaozhengcoder jinglearnai ltcxjtu lishengfever abbcccyuan chenying99 yifeiyang210 baudzhou xiangruipuzhao cccc0der luricheng chenjun19950315 huangjip jason8kang bacterous xjturobocon xiaopingzeng chsl noticeable stchau4work mehrdad-shokri tzheng zengpeng21 yupeng08 wang80890 996169450 1987618girl wizardzhh zhangangus patrick0507joyce turpure lfcten dugzzuli yuanhaitao williamwhe qingqingsun vivianangela hekeh promissingfan moolighty willsonn asdncg xianglizuel lucky-starh happyyolanda lsy563193 qq839006841 bossrobin zhoupeng110 shanewfx dongdongbai pangxiejijin aenjon niuyechangge wzheng21 workhard-cumtb sericwong zjucsxxd luxiaolingfei yufengjinsong poseidon1214 emrekemal jzbcoding agrutkarsh gaotojunjie haoweilin hedgeliu cgz0703 lht1949 zdfccdanfeng qinzhenyi1314 wenwenwenwenshei bionllo brandacy yikerainbow cl2227619761 sywangs zouqian2468 hikkikuma gaoss2522 huakaibanmu airopti wangz2012 jimliu0327 yangmiemie whs1111 zuovision tcchriszhao wurentidai

andrew-ng-deep-learning-notes's Issues

神经网络和深度学习-浅层神经网络 - 计算神经网络的输出，关于W的shape的问题

原文链接
Z1 = W1.T * a0 + b1
因为
input layer = 3
hidden layer = 4
output layer = 1
所以，a0的shape是(3,1), 那么W1.T的shape推得即为(4,3)，所以W1的shape是(3,4)，同理W2的shape也应该反过来才是

补充:看到后面的[神经网络的梯度下降法]中forward propagation中Z1 = W1 * X + b，这里没有转置，建议前后统一，我担心有对向量化不熟的童鞋，这里会晕掉

感谢作者，但数学公式都无法显示

无论在github中还是html在线阅读，数学公式都无法正常显示，十分影响学习和阅读！不知能够改进？

assignment1-2 2_2

做预测时模型参数应该是训练得到的参数
predict(w,b,X) 改为
predict(params['w'],params['b'],X)

我下载你在github上这个任务里在做第一课的第三次作业的时候遇到一些问题，不知道怎么处理，能帮我看一下么
问题1
以下截图1是运行到前面可视化数据的时候出现的问题

这个问题我再网易课程的论坛中也找到了解决方法，将其中的代码c=Y修改成c=Y[0,]既可以看到那个花型图案，但是我在csdn以及github的网站上看到运行代码这里都不需要修改还是能够运行出花型图案，包括下载到您的代码我看到运行结果在没有修改代码的时候也运行出结果了，所以我想问下这里是我的环境有什么问题么？

问题2

运行这个cell会有下面红色的一行字体，您的运行结果也是有的，但是看到其他一些人的结果并没有这些红色字体，所以这里是为什么呢？不影响我们后续的运行么

问题3

这个运行又是为什么，我在网上搜到所有人都能正常的运行出结果，但是我这里却总是报这个错误，不知道为什么出错？？？

希望能帮忙看一下，初次介入deeplearning领域，也是刚刚学着使用github，周围没有可以参考的人，冒昧的打扰您，希望能帮忙处理一下

在线浏览网站内容出错?

所有公式都Math Processing Error

博客图片好像失效了？

学习笔记图片

Is there any pdf file of this notebook?

week 2 编程作业

我用的是jupyter notebook.在第二周的编程作业中，我在加载数据的时候会报出打开不了.h5这个文件，如下：
#Loading the data (cat/non-cat)
train_set_x_orig, train_set_y, test_set_x_orig, test_set_y, classes = load_dataset()

OSError Traceback (most recent call last)
in ()
1 # Loading the data (cat/non-cat)
----> 2 train_set_x_orig, train_set_y, test_set_x_orig, test_set_y, classes = load_dataset()

~\lr_utils.py in load_dataset()
4
5 def load_dataset():
----> 6 train_dataset = h5py.File('train_catvnoncat.h5', "r")
7 train_set_x_orig = np.array(train_dataset["train_set_x"][:]) # your train set features
8 train_set_y_orig = np.array(train_dataset["train_set_y"][:]) # your train set labels

d:\asoftware\python-3.5.4\benti\lib\site-packages\h5py_hl\files.py in init(self, name, mode, driver, libver, userblock_size, swmr, **kwds)
267 with phil:
268 fapl = make_fapl(driver, libver, **kwds)
--> 269 fid = make_fid(name, mode, userblock_size, fapl, swmr=swmr)
270
271 if swmr_support:

d:\asoftware\python-3.5.4\benti\lib\site-packages\h5py_hl\files.py in make_fid(name, mode, userblock_size, fapl, fcpl, swmr)
97 if swmr and swmr_support:
98 flags |= h5f.ACC_SWMR_READ
---> 99 fid = h5f.open(name, flags, fapl=fapl)
100 elif mode == 'r+':
101 fid = h5f.open(name, h5f.ACC_RDWR, fapl=fapl)

h5py_objects.pyx in h5py._objects.with_phil.wrapper()

h5py\h5f.pyx in h5py.h5f.open()

OSError: Unable to open file (unable to open file: name = 'datasets/train_catvnoncat.h5', errno = 2, error message = 'No such file or directory', flags = 0, o_flags = 0)

国内访问问题

请问，能否在gitee上部署，或者使用dnspod给域名解析。国内访问好慢。。。。

验证集，训练集，测试集分布

这里我记得应该是训练集和测试集建议来自同一个分部。"What you are targeting on should match your goal."

Xavier initialization处有误

WL = np.random.randn(WL.shape[0], WL.shape[1]) * np.sqrt(1/n)
这样，激活函数的输入 x 近似设置成均值为 0，标准方差为 1，

这里np.random.randn是均值0，方差1的分布，乘上sqrt(1/n)，根据方差公式推导，方差=1/n，对应的标准差=sqrt(1/n)

RMSProp 算法的一点建议

RMSProp 算法写的和老师讲的有一点出入

dw那组方程变小 db组方程变大

才能实现纵向变小横向变快

比写同时变大会更好一点吧

mistakes in assignments/assignment2-1/3.Gradient+Checking.ipynb

What you wrote is

There is a mistake in the backward propagation! difference = 0.17344196893145933
和预期有差别。暂不知道原因

Your codes:

dA2 = np.dot(W3.T, dZ3)
    dZ2 = np.multiply(dA2, np.int64(A2 > 0))
    dW2 = 1./m * np.dot(dZ2, A1.T) * 2 #It was wrong
    db2 = 1./m * np.sum(dZ2, axis=1, keepdims = True)
    
    dA1 = np.dot(W2.T, dZ2)
    dZ1 = np.multiply(dA1, np.int64(A1 > 0))
    dW1 = 1./m * np.dot(dZ1, X.T)
    db1 = 1./m * np.sum(dZ1, axis=1, keepdims = True)

This line dW2 = 1./m * np.dot(dZ2, A1.T) * 2 was wrong.
In this exercise,this piece of code was originally like this(maybe you have chenged some codes,but you did not konw)

dA2 = np.dot(W3.T, dZ3)
    dZ2 = np.multiply(dA2, np.int64(A2 > 0))
    dW2 = 1./m * np.dot(dZ2, A1.T) * 2 #wrong
    db2 = 1./m * np.sum(dZ2, axis=1, keepdims = True)
    
    dA1 = np.dot(W2.T, dZ2)
    dZ1 = np.multiply(dA1, np.int64(A1 > 0))
    dW1 = 1./m * np.dot(dZ1, X.T)
    db1 = 4./m * np.sum(dZ1, axis=1, keepdims = True) #wrong

There were errors in the backward_propagation_n code.
The code should be modified to:

dA2 = np.dot(W3.T, dZ3)
    dZ2 = np.multiply(dA2, np.int64(A2 > 0))
    dW2 = 1./m * np.dot(dZ2, A1.T) 
    db2 = 1./m * np.sum(dZ2, axis=1, keepdims = True)
    
    dA1 = np.dot(W2.T, dZ2)
    dZ1 = np.multiply(dA1, np.int64(A1 > 0))
    dW1 = 1./m * np.dot(dZ1, X.T)
    db1 = 1./m * np.sum(dZ1, axis=1, keepdims = True)