Frequently Asked Questions
I call learning the math and theory for machine learning first the “bottom-up” approach to machine learning.
It is the approach taught by universities and used in textbooks.
It requires that you learn the mathematical prerequsites, then the general theories of the field, then the equations and their derivations for each algorithm.
- It is much slower.
- It is much harder.
- It is great for training academics (not practitioners).
A final problem is, that is where the bottom-up approach ends.
I teach an alternative approach that inverts the process called “top-down” machine learning.
We start by learning the process of how to work through predictive modeling problems end to end, from defining the problem to making predictions. Then we practice this process and get good at it. We start by learning how to deliver results and add value.
Later we circle back to the math and theory, but only in the context of the process. Meaning, only the theory and math that helps us deliver better results faster is considered.
You can learn more about the contrast between these two approaches here:
You can learn how to get started with this approach here:
But it’s Dangerous!
I have seen this criticism a lot.
It is dangerous for beginners to use algorithms they don’t understand to make predictions that the business depends upon.
I agree.
- I agree for the same reason that I think a student learning to drive should not drive the school bus.
- I agree for the same reason that I think a student learning to code should not put their hello world code into production.
But,
- The student driver can practice and get good enough to drive the school bus eventually.
- The student coder can practice and get good enough to put code into production.
Trust is earned in machine learning, just like with any other profession or skill.
Does knowing how the math of an algorithm works give you that trust?
Maybe, but probably not.
- Does knowing how a combustion engine works give you trust enough to drive?
- Does knowing how a compiler works give you trust enough to push code to production?
I write more about this here:
But Math is Required!
It is, just not first.
Learning how algorithms work and about machine learning theory can make you a better machine learning practitioner.
But, it can come later, and it can come progressively.
You can iteratively dip into textbooks and papers, as needed, with a specific focus of learning a specific thing that will make you better, faster or more productive.
Knowing how an algorithm works is important, but it cannot tell you much about when to use it.
In supervised machine learning, we are using data to build a model to approximate an unknown and noisy mapping function. If we knew enough about this function in order to correctly choose the right algorithm, we probably don’t need machine learning (e.g. we could use statistics and descriptive modeling of already understood relationships).
The badly kept secret in machine learning is that you can use machine learning algorithms like black boxes, at least initially, because the hard part is actually figuring out how to best frame the problem, prepare the data and figure out which of one thousand methods might perform well.
You can learn more about this here:
The math does not have to come first. It can, if you prefer to learn that way, but perhaps this site is not the best place for you to start.
相關推薦
[翻譯] TensorFlow Programmer's Guide之Frequently Asked Questions(問得頻率最多的幾個問題)
_for file 語言 ons docs locking 內存數據 code mage 目錄: 特點和兼容性(Features and Compatibility) 建立一個TensorFlow圖(Building a TensorFlow graph) 運行一個T
Frequently Asked Questions About CC
Frequently Asked Questions Wed Aug 29 23:24:20 BST 2018 About CC (注;點選問題,答案自動彈) What is Creative Commons and what do you do? Is Creative Com
Relinking Oracle Home FAQ ( Frequently Asked Questions) (Doc ID 1467060.1)
In this Document Purpose Questions and Answers 1) What
Frequently Asked Questions about Machine Learning
A: Our machine learning process is based on academic research and has been rigorously tested. Working with AMS as an external third party, we can handle th
Frequently asked Questions Answers for Java Developers
If you are a Java developer, working in Spring framework and thinking to become a certified Spring professional but couldn't to do it in the past due to e
Frequently Asked Questions
I call learning the math and theory for machine learning first the “bottom-up” approach to machine learning. It is the approach taught by universities and
AWS Direct Connect Frequently Asked Questions
Q. What is Direct Connect Gateway? Direct Connect Gateway is a grouping of Virtual Private Gateways (VGWs) and Private Virtual Interface
Amazon RDS Frequently Asked Questions (FAQs)
Q: How do I control if and when the engine version of my DB instance is upgraded to new supported versions? Amazon RDS strives to keep y
Amazon ECS Frequently Asked Questions
Q: Does Amazon ECS support applications and services? Yes. The Amazon ECS Service scheduler can manage long-running applications
Go at I/O: Frequently Asked Questions
27 May 2010 Among the high-profile product launches at Google I/O last week, our small team gave presentatio
Amazon SES Frequently Asked Questions
Q: Can Amazon SES send emails with attachments? Amazon SES supports many popular content formats, including documents, images, aud
AWS Snowmobile Frequently Asked Questions and Answers
Q: Who should use a Snowmobile? Snowmobile enables customers to quickly migrate exabyte-scale datasets from on-premises to AWS in
面試時最經常被問到的問題(Frenquently asked interview questions)(II)
面試時最經常被問到的問題(Frenquently asked interview questions)之Analytical, puzzles, and brain-teasers篇 Analytical, puzzles, and brain-teasers Questio
The 30 Most Important Interview Questions TO ASK(shared from Glassdoor)
basic start door sea end plan rate about deep Exploring the Role Get beyond the basic job description and ask questions that probe deeper
(狀壓dp)UVA - 1252 Twenty Questions
ace code return uva typedef clu http 狀壓 .net 題目地址 讀入二進制數及轉換的方法。 e.g. bitset<16> x; cin>>x; cout<<x.to_ulong()<&l
Coursera Algorithms week1 Interview Questions: 3Sum in quadratic time
import 排好序 .get 部分 ase prop 計算 pan 出了 題目要求: Design an algorithm for the 3-SUM problem that takes time proportional to n2 in the worst cas
Coursera Algorithms week2 基礎排序 Interview Questions: 1 Intersection of two sets
number style arr div void length contain 簡單 oca 題目原文: Given two arrays a[] and b[], each containing n distinct 2D points in the plane, de
Coursera Algorithms week2 棧和隊列 Interview Questions: Queue with two stacks
item queue 實現 隊列 empty implement asn boolean out 題目原文: Implement a queue with two stacks so that each queue operations takes a constant a
Questions
bsp 需要 模板 區別 .... 它的 思路 mvp 規範 1.第3章中 ” 軟件開發流程不光指團隊流程,還包括個人開發流程,因為軟件團隊是由個人組成的.....因此,個人在團隊中也有獨立的流程” —— IC在軟件團隊中的意義,需要遵守哪些流程,若不遵守會給團隊帶來怎樣
https://stackoverflow.com/questions/16130292/java-lang-outofmemoryerror-permgen-space-java-reflection
ges flat con chang ng- nat lang quest using https://stackoverflow.com/questions/16130292/java-lang-outofmemoryerror-permgen-space-java-re