cs231n's Introduction

Stanford cs231n'18

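Stanford cs231n is a course that introduces the basic concepts of CNNs. The first half of the course focuses on the modules that make up a CNN, including FC, Conv, ReLU, BN, and so on. In assignments 1 and 2 in particular, the author carefully leads beginners, step by step, through implementing each module by hand. As a result, in day-to-day work we can not only stack these modules layer by layer, like building blocks, to form a CNN, but, more importantly, understand more deeply how data flows through these modules and how gradients backpropagate through them. With that in mind, these notes record what I learned from the cs231n course, including:

  1. The derivation of the backpropagation formulas for each module, and how the problems encountered while writing the code were solved. This matters because in later work, whether we use TensorFlow or PyTorch, we only need to think about the forward pass when building a CNN; the framework handles gradient backpropagation automatically. So we know that a CNN backpropagates, but rarely get the chance to learn how it does so. This course lets us work through the derivation and the implementation of backpropagation for every CNN module, which will help when dealing with vanishing or exploding gradients later on.
  2. The author's approach and workflow for debugging CNN modules, including sanity checks on the initial values, gradient check, minibatch overfitting, and so on. Following on from item 1, even if we know how a CNN's gradients are computed and how a finished CNN is validated, implementing all of this in code is a separate problem with many details that deserve attention. For this reason, I follow the order of the assignments and write every function that appears in them by hand, to understand in depth how the author builds and debugs a CNN.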
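As a rough illustration of the two items above, here is a minimal NumPy sketch of a hand-written backward pass for a fully connected (FC) layer, checked against a numerical gradient. The function names (`affine_forward`, `affine_backward`, `numerical_gradient`, `rel_error`, `run_gradient_check`), the array shapes, and the step size `h` are assumptions chosen for this example and are not taken from the course code.

```python
import numpy as np


def affine_forward(x, w, b):
    """Forward pass of an FC layer: x is (N, D), w is (D, M), b is (M,)."""
    return x @ w + b


def affine_backward(dout, x, w):
    """Analytic gradients of out = x @ w + b, given upstream gradient dout (N, M)."""
    dx = dout @ w.T          # (N, D)
    dw = x.T @ dout          # (D, M)
    db = dout.sum(axis=0)    # (M,)
    return dx, dw, db


def numerical_gradient(f, x, dout, h=1e-5):
    """Centered-difference estimate of d(sum(f(x) * dout)) / dx.

    x is perturbed in place one entry at a time and restored afterwards.
    """
    grad = np.zeros_like(x)
    for idx in np.ndindex(*x.shape):
        old = x[idx]
        x[idx] = old + h
        pos = f(x)
        x[idx] = old - h
        neg = f(x)
        x[idx] = old
        grad[idx] = np.sum((pos - neg) * dout) / (2 * h)
    return grad


def rel_error(a, b):
    """Maximum elementwise relative error between two arrays."""
    return np.max(np.abs(a - b) / np.maximum(1e-8, np.abs(a) + np.abs(b)))


def run_gradient_check(seed=0):
    """Compare analytic and numerical gradients on small random inputs."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((4, 5))
    w = rng.standard_normal((5, 3))
    b = rng.standard_normal(3)
    dout = rng.standard_normal((4, 3))

    dx, dw, db = affine_backward(dout, x, w)
    dx_num = numerical_gradient(lambda v: affine_forward(v, w, b), x, dout)
    dw_num = numerical_gradient(lambda v: affine_forward(x, v, b), w, dout)
    db_num = numerical_gradient(lambda v: affine_forward(x, w, v), b, dout)

    return {
        "dx": rel_error(dx, dx_num),
        "dw": rel_error(dw, dw_num),
        "db": rel_error(db, db_num),
    }


if __name__ == "__main__":
    for name, err in run_gradient_check().items():
        print(f"{name} relative error: {err:.2e}")
```

If the analytic backward pass is correct, all three relative errors should be very small; a large error usually points to a mistake in the derivation or a shape mix-up in the implementation.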

Lecture 1

cs231n's People

Contributors

fortileizhang
