neural network and deep learning筆記（1）

阿新 • • 發佈：2018-03-24

.cn arc AD puts ont release 深入 rem hang

neural network and deep learning 這本書看了陸陸續續看了好幾遍了，但每次都會有不一樣的收獲。

DL領域的paper日新月異。每天都會有非常多新的idea出來，我想。深入閱讀經典書籍和paper，一定能夠從中發現remian open的問題。從而有不一樣的視角。

PS：blog主要摘取書中重要內容簡述。

摘要部分

Neural networks, a beautiful biologically-inspired programming paradigm which enables a computer to learn from observational data.

Deep learning, a powerful set of techniques for learning in neural networks.

CHAPTER 1 Using neural nets to recognize handwritten digits
the neural network uses the examples to automatically infer rules for recognizing handwritten digits.

#

The exact form of active function isn’t so important - what really matters is the shape of the function when plotted.

#

4.The architecture of neural networks

The design of the input and output layers of a neural network is often straightforward, there can be quite an art to the design of the hidden layers. But researchers have developed many design heuristics for the hidden layers, which help people get the behaviour they want out of their nets.
Learning with gradient descent
1. The aim of our training algorithm will be to minimize the cost C as a function of the weights and biases. We’ll do that using an algorithm known as gradient descent.
2. Why introduce the quadratic cost? It’s a smooth function of the weights and biases in the network and it turns out to be easy to figure out how to make small changes in the weights and biases so as to get an improvement in the cost.
3. MSE cost function isn’t the only cost function used in neural network.
6. Mini batch: SGD randomly picking out a small number m of randomly chosen training inputs;epoch : randomly choose mini-batch and training until we’ve exhausted the training inputs.
Thinking about hyper-parameter choosing
”If we were coming to this problem for the first time then there wouldn’t be much in the output to guide us on what to do. We might worry not only about the learning rate, but about every other aspect of our neural network. We might wonder if we’ve initialized the weights and biases in a way that makes it hard for the network to learn? Or maybe we don’t have enough training data to get meaningful learning? Perhaps we haven’t run for enough epochs? Or maybe it’s impossible for a neural network with this architecture to learn to recognize handwritten digits?
Maybe the learning rate is too low? Or, maybe, the learning rate is too high?
When you’re coming to a problem for the first time, you’re not always sure.
The lesson to take away from this is that debugging a neural network is not trivial, and, just as for ordinary programming, there is an art to it. You need to learn that art of debugging in order to get good results from neural networks. More generally, we need to develop heuristics for choosing good hyper-parameters and a good architecture.”
Inspiration from Face detection:
“The end result is a network which breaks down a very complicated question - does this image show a face or not - into very simple questions answerable at the level of single pixels. It does this through a series of many layers, with early layers answering very simple and specific questions about the input image, and later layers building up a hierarchy of ever more complex and abstract concepts. Networks with this kind of many-layer structure - two or more hidden layers - are called deep neural networks.”

CHAPTER 2 How the backpropagation algorithm works

Backpropagation（BP）： a fast algorithm for computing the gradient of the cost function.
For backpropagation to work we need to make two main assumptions about the form of the cost function.
1. Since what BP let us do is compute the partial derivatives for a single training example,so we need that the cost function can be written as an average over all individual example.
2. It can be written as a function of the outputs from the neural network.Since y is not something which the neural network learns.
The four fundamental equations behind backpropagation
What’s clever about BP is that it enables us to simultaneously compute all the partial derivatives using just one forward pass through the network, followed by one backward pass through the network.
What indeed the BP do and how someone could ever have discovered BP?
1. A small perturbations will cause a change in the activation,then next and so on all the way through to causing a change in the final layer,and then the cost function.
  
  A clever way of keeping track of small perturbations to the weights (and biases) as they propagate through the network, reach the output, and then affect the cost.
2. （未完待續……）

neural network and deep learning筆記（1）

.cn arc AD puts ont release 深入 rem hang neural network and deep learning 這本書看了陸陸續續看

Neural Networks and Deep Learning 整理（三）

公式太麻煩，沒寫公式。交叉熵函式作為代價函式用求導推理說明了這樣比二次代價函式（方差的形式）要更好一些，即導數和（y-a）成正比。一開始期望值和輸出的差別越大，下降的速度

Neural Networks and Deep Learning 整理（二）

反向傳播（backpropagation）權重矩陣偏置向量帶權輸入z

Note——Neural Network and Deep Learning （1）[神經網路與深度學習學習筆記（1）]

一、初學神經網路的體會正如書中作者說的神經網路可以被稱作最美的程式設計正規化之一，神經網路將我們需要解決的複雜問題，比如手寫字型分類，簡化成一個個簡單的步驟，而本人無需瞭解內部的具體結構引數變化等。關於神經網路已經有很多實用的庫，使用這些庫可以很快的解決問題。但是不滿

Neural Networks and Deep Learning 筆記

目錄 1 Introduction to Deep Learning 1.1 結構化資料/非結構化資料 1.2 為什麼深度學習會興起 2 Neural Networks Basics 2.1 二分類問題 2.2 邏輯迴歸 2.3 損失函式

《neural network and deep learning》題解——ch03 過度擬合&規範化&權重初始化

問題一正如上面討論的那樣，一種擴充套件 MNIST 訓練資料的方式是用一些小的旋轉。如果我們允許過大的旋轉，則會出現什麼狀況呢？如果我們允許過大的旋轉，會使得模型不能很好的學習到數字的特

《neural network and deep learning》題解——ch01 神經網路

1.2 S 型神經元問題 1 假設我們把一個感知器網路中的所有權重和偏置乘以一個正的常數,c > 0。證明網路的行為並沒有改變。證： σ(cw,cb)=11+e−∑jcwjxj−cb=11+e−cz 當c>0時，

《neural network and deep learning》題解——ch03 再看手寫識別問題題解與原始碼分析

交叉熵代價函式 class QuadraticCost(object): @staticmethod def fn(a, y): return 0.5 * np.linalg.norm(a - y) ** 2 @s

Deep Learning 系列（1）：RBM（受限波爾茲曼機）和 DBN（深信度神經網路）

前言：Deep Learning （DL深度學習）是近幾年來最火的一種機器學習方法，由Hinton（多倫多大學）提出。主要有兩分支：Geoffery Hinton和Joshua Bengio這一支用RBM組成deep architecture的研究。另一支是以Yann

課程一(Neural Networks and Deep Learning)，第二週（Basics of Neural Network programming）—— 1、10個測驗題（Neural N

--------------------------------------------------中文翻譯-------

Neural Networks and Deep Learning學習筆記ch1 - 神經網絡

1.4 true ole 輸出使用 .org ptr easy isp 近期開始看一些深度學習的資料。想學習一下深度學習的基礎知識。找到了一個比較好的tutorial，Neural Networks and Deep Learning，認真看完了之後覺

【DeepLearning學習筆記】Coursera課程《Neural Networks and Deep Learning》——Week1 Introduction to deep learning課堂筆記

決定如同樣本理解你是水平包含 rod spa Coursera課程《Neural Networks and Deep Learning》 deeplearning.ai Week1 Introduction to deep learning What is a

【DeepLearning學習筆記】Coursera課程《Neural Networks and Deep Learning》——Week2 Neural Networks Basics課堂筆記

樣本數目 and 編程多次之間優化我們 round 符號 Coursera課程《Neural Networks and Deep Learning》 deeplearning.ai Week2 Neural Networks Basics 2.1 Logistic

課程一(Neural Networks and Deep Learning)，第一週（Introduction to Deep Learning）—— 0、學習目標

1. Understand the major trends driving the rise of deep learning. 2. Be able to explain how deep learning is applied to supervised learning. 3. Unde

課程一(Neural Networks and Deep Learning)，第一週（Introduction to Deep Learning）—— 2、10個測驗題

1、What does the analogy “AI is the new electricity” refer to? (B) A. Through the “smart grid”, AI is delivering a new wave of electricity.

sp1.1-1.2 Neural Networks and Deep Learning

Relu這影象也叫線性流動函式不再用sigmoid函式當啟用函式相當於max(0,x)函式比較0和當前值哪個大可以把隱藏層看作前面整合

sp1.3-1.4 Neural Networks and Deep Learning

交叉熵定義了兩個概率分佈之間的距離，因為是概率分佈所以又引入softmax變為概率的形式相加還是1 3 shallow neural network 神經網路輸入層不算上

python Deep learning 學習筆記（1）

Python深度學習筆記 -- 偏重實驗 Python 的 Keras 庫來學習手寫數字分類，將手寫數字的灰度影象(28 畫素 ×28 畫素)劃分到 10 個類別中(0~9) 神經網路的核心元件是層(layer),它是一種資料處理模組，它從輸入資料中提取表示，緊接著的一個例子中，將含有兩個Dense 層,它

(Stanford CS224d) Deep Learning and NLP課程筆記（三）：GloVe與模型的評估

本節課繼續講授word2vec模型的演算法細節，並介紹了一種新的基於共現矩陣的詞向量模型——GloVe模型。最後，本節課重點介紹了word2vec模型評估的兩種方式。 Skip-gram模型上節課，我們介紹了一個十分簡單的word2vec模型。模型的目標是預測word \(o\)出現在另一個word \(c

(Stanford CS224d) Deep Learning and NLP課程筆記（一）：Deep NLP

Stanford大學在2015年開設了一門Deep Learning for Natural Language Processing的課程，廣受好評。並在2016年春季再次開課。我將開始這門課程的學習，並做好每節課的課程筆記放在部落格上。爭取做到每週一更吧。本文是第一篇。 NLP簡介 NLP，全名Natu

neural network and deep learning筆記（1）

#

#

相關推薦