Where does the error come from

阿新 • • 發佈：2018-12-24

//李巨集毅視訊官網：http://speech.ee.ntu.edu.tw/~tlkagk/courses.html 點選此處返回總目錄

//邱錫鵬《神經網路與深度學習》官網：https://nndl.github.io

我們上一次講到，使用不同的model，在testing data上會得到不同的error。而且越複雜的model不一定會得到越低的error。

今天我們要討論的問題是，error來自什麼地方。

其實error有兩個來源，一個是"bias"，一個是“variance”。瞭解error的來源是重要的，因為你常常做一下machine learning，做完就得到一個error，接下來你要怎麼improve你的model呢。如果沒有什麼方向，毫無頭緒的亂做，你就沒有效率。如果你可以診斷你的error的來源，你就可以挑選適當的方法來improve你的model。

-------------------------------------------------------------------------------------------------------------------------------

上一節的時候，我們要預測寶可夢進化後的CP值，也就說要找一個function，這個function input一隻寶可夢，output就是進化後的CP值。這個function理論上有一個最佳的function，我們寫成f^。但是這個理論上最佳的function我們是不知道的，只有Niantic是知道的，Niantic就是做寶可夢的公司。f^是我們不知道的，我們能做的事情就是，實際去抓一些寶可夢，根據training data，去學到的最好的function，f*。f*並不會真的等於f^,因為並不知道f^是什麼樣子，f*可能不等於f^。f*就好像是f^的估測值一樣。

就想成，是在打靶。f^是靶的中心，收集到一些data，做training以後，你找到一個你覺得最好的function f*，這個f*不等於f^，它是在靶紙上的另外一個位置。這個f*與f^中間有一段距離，這個距離呢，來自於兩件事：它可能來自於bias，也可能來自於variance。

-------------------------------------------------------------------------------------------------------------------------------

bias和variance是什麼呢？我們先舉一個概率裡面的例子，概率與統計學過。

假設有一個變數x,想要估計它的mean，怎麼做呢？假設x的mean是，variance是。

要估測怎麼做呢？首先sample N個點，再把這N個點算平均值，得到m。

N個點算平均值m會跟一樣麼？其實不會。

假設紅點為的value，現在做一次sample，算出來的m可能不會跟一樣。再做一次實驗m2,不一樣。m3,m4,m5,m6可能都不一樣，可能沒有辦法算出來的m exactly等於。

但是，如果今天把m的期望值算出來的話：

得到的值就是。每一個m雖然都不一定跟exactly一樣，但是如果找很多m，他們的期望值呢會正好等於。所以用m來estimate ，是unbiased。就好像是說，在打靶的時候，他的準心呢是瞄準的，但是由於種種，比如機械故障，或者受到風俗干擾等等，你會散落在你本來瞄準的位置的周圍。

那散步在周圍會散的多開呢？取決於m的variance。

variance的值呢depends on samples的個數。如果N比較多的話，就會比較集中。如果N比較少，就會分散地比較開。

要估測variance，即，怎麼辦呢。首先計算m,然後計算。

可以拿來估測。這個估測地怎麼樣呢？每次都不等於，散佈在的周圍。但是這個估計是有偏的：

即，求期望並等於。而是N-1/N的倍數，所以普遍而言，是比要小的。小的次數比較多。如果increase N的話，估測的差距就會變小。

李巨集毅機器學習筆記——02.Where does the error come from ?

傳送門：在上節課講到，如果選擇不同的function set就是選擇不同的model 在testing data上會得到不同的error，而且越複雜的model不見得會給你越低的error，我們要討論的問題就是error來自什麼地方？ error有兩個來源，偏

Where does the error come from

//李巨集毅視訊官網：http://speech.ee.ntu.edu.tw/~tlkagk/courses.html

李巨集毅機器學習（2017full）-Lecture 2: Where does the error come from?

Where does the error come from? ML Lecture 2 Error的來源：Bias，Varience f^f^是計算pokemon真正的函式，只有Niantic公司知道從訓練集上，我們得出的一個估計f∗f∗ 故像射擊

李巨集毅老師機器學習課程筆記_ML Lecture 2: Where does the error come from?

####引言：最近開始學習“機器學習”，早就聽說祖國寶島的李巨集毅老師的大名，一直沒有時間看他的系列課程。今天聽了一課，感覺非常棒，通俗易懂，而又能夠抓住重點，中間還能加上一些很有趣的例子加深學生的印象。視訊連結（bilibili）：[李巨集毅機器學習(2017)](https://www.bilibil

Where do Data Scientists Come From?

Here we see some interesting patterns: data scientists, machine learning engineers, and software engineers are more likely to start straight out of academi

Error: The specified query does not exist\nResponse from attempted peer comms was an error

出現這個錯誤是因為在hyperledger composer playground 上面你的查詢檔名可以是query.qry，但是在真正部署到網路上時候，是會把模型檔案，邏輯檔案，訪問控制檔案，以及查詢檔案都整合到.bna的一個二進位制檔案當中，所以這個查詢檔案的名稱固定為queries

Where did the least-square come from?

Where did the least-square come from?What would you say in a machine learning interview, if asked about the mathematical basis of the least-square loss fun

Where does sand come from?

Sand is, indeed, just a bunch of tiny rocks. It is also one phase of the endlessly churning rock cycle that has been shaping the surface of our earth for t

docker: Error response from daemon: Conflict. The container name "/mysql" is already in use by conta

docker: Error response from daemon: Conflict. The container name “/mysql” is already in use by container “27e9834dce87b6cac674945d7

啟動docker容器提示"docker: Error response from daemon: Container command not found or does not exist"的原因

docker容器匯入匯出有兩種方法：一種是使用save和load命令使用例子如下： docker save ubuntu:load>/root/ubuntu.tar docker load<ubuntu.tar 一種是使用export和import命令使用

Eclipse:Some sites could not be found. See the error log for more detail.解決的方法

span pda more .net sof 分析 clas csdn war 今天遇到了一個奇葩的問題。我把我的sdk tools的版本號升級到23後。我在eclipse中嘗試升級ADT，發現了這麽一個問題，以下分析下原因：當我在eclipse中選擇Help--&g

排序與檢索【UVa10474】Where is the Marble?

素數指數 ive test posit muc not ria str Where is the

uva 10474 Where is the Marble?（簡單題）

content mil stdlib.h std lib [0 數據 main pre 我非常奇怪為什麽要把它歸類到回溯上，明明就是簡單排序，查找就OK了。wa了兩次，我還非常不解的懷疑了為什麽會 wa，原來是我居然把要找的數字也排序了，當時僅僅是想著能快一點查找。所以

POJ - 2387 Til the Cows Come Home

unique uic ring star eterm and string app lang Bessie is out in the field and wants to get back to the barn to get as much sleep as possi

POJ 2387 Til the Cows Come Home

tail from nal rail pst cows clas c代碼 == 題目連接： http://poj.org/problem?id=2387 Description Bessie is out in the field and wants to get back

docker端口映射或啟動容器時報錯Error response from daemon: driver failed programming external connectivity on endpoint quirky_allen

prot 服務 sina des ram pla sys from localhost 現象： [[email protected] ~]# docker run -d -p 9000:80 centos:httpd /bin/sh -c /usr/local/

docker報Error response from daemon: client is newer than server (client API version: 1.24, server API version: 1.19)

amd64 client export mit als 大堆 server rim rime docker version Client: Version: 17.05.0-ce API version: 1.24 (downgraded from 1.29)

Docker Where are the Docker daemon logs?

docker log Troubleshoot the daemonYou can enable debugging on the daemon to learn about the runtime activity of the daemon and to aid in troubleshootin

Why does the memory usage increase when I redeploy a web application?

man weakref solution read cannot erro try cto tag That is because your web application has a memory leak. A common issue are "PermGen"

Where have the cheapest adidas Yeezy Boost 350 V2

enc alt als while ply lease sel red -h For fans of the adidas Yeezy 350 for sale, this Holiday season we have a large lineup set to take