Learn: A silver bullet for basic machine learning

阿新 • • 發佈：2018-12-28

Let’s start a machine learning project workflow here. The intention of this workflow is not to improve the accuracy or f1 score of the classification problem but to touch on all the necessary modules to complete the classification problem efficiently using scikit-learn. Most of the classification examples start with iris dataset, so let’s pick another dataset within scikit-learn for this workflow. We will primarily work with Wisconsin breast cancer dataset. The objective is to classify diagnosis (cancer diagnosis: true or false) based on the patient’s clinical observation parameters. The dataset contains 569 observations and 30 continuous numeric features. class distribution of 212 — Malignant, 357 — Benign.

Datasets and generators: Unlike unsupervised learning tasks, the supervised tasks (i.e., classification) require labeled datasets, and the package comes with multiple datasets and dataset generators to get started with machine learning

Broadly split into two types

a. Static/toy datasets: datasets are dictionaries with feature data (numpy ndarray), dataset description, feature names, target (numpy array and ndarray for multilabel) and target name (i.e., fetch_20newsgroups contains text input, and grouped into 20 different newsgroups like sport, politics, finance, etc., ). These datasets only have a finite number of observations and target classes or prediction ranges. i.e., The famous iris dataset has only 150 observation and 3 target classes. I have written a function to convert the inbuild dataset which is in dictionary format to a pandas dataframe for visualization and exploration propose

Learn: A silver bullet for basic machine learning

Learn: A silver bullet for basic machine learning

Learn: A silver bullet for basic machine learning | AITopics

斯坦福大學公開課機器學習： advice for applying machine learning - evaluatin a phpothesis（怎麽評估學習算法得到的假設以及如何防止過擬合或欠擬合）

【文獻閱讀】Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

斯坦福大學公開課機器學習： advice for applying machine learning | regularization and bais/variance（機器學習中方差和偏差如何相互影響、以及和算法的正則化之間的相互關系）

斯坦福大學公開課機器學習：advice for applying machine learning | learning curves （改進學習算法：高偏差和高方差與學習曲線的關系）

斯坦福大學公開課機器學習： advice for applying machine learning | deciding what to try next(revisited)（針對高偏差、高方差問題的解決方法以及隱藏層數的選擇）

A Gentle Introduction to Applied Machine Learning as a Search Problem (譯文)

【原】Coursera—Andrew Ng機器學習—課程筆記 Lecture 10—Advice for applying machine learning

Machine Learning-Andrew Ng 課程第六週——Advice for Applying Machine Learning

Learn How to Code and Deploy Machine Learning Models on Spark Structured Streaming

5 Types of Regressions for your Machine Learning Toolbox

Applitools Recognized as a Top Artificial Intelligence and Machine Learning Solution in DevOps

Is automation testing a silver bullet?

A Tour of the Weka Machine Learning Workbench

Python is the Growing Platform for Applied Machine Learning

6 Practical Books for Beginning Machine Learning

Quick and Dirty Data Analysis for your Machine Learning Problem

Coursera Machine Learning 第六週 quiz Advice for Applying Machine Learning

Xcode匯入證書提示Your account already has a signing certificate for this machine but...錯誤的解決方法

Learn: A silver bullet for basic machine learning

相關推薦