5 Benefits of Competitive Machine Learning

Jeremy Howard, formerly of Kaggle, gave a presentation at the University of San Francisco in mid-2013. In that presentation he touched on some of the broader benefits of machine learning competitions like those held on Kaggle.

In this post you will discover 5 points I extracted from this talk that will motivate you to start participating in machine learning competitions.

Competitive Machine Learning is a Meritocracy
Photo by PaulBarber, some rights reserved

Big Data at USF

The talk presented by Howard was titled “Jeremy Howard of Kaggle speaks about Big Data at the University of San Francisco”. The title is a misnomer. The talk focused on Howard’s background and how he came to machine learning, and touched briefly on Kaggle.

Howard has a background in start-ups and this talk gives a good summary of that background and the lessons he has to pass on from that journey.

Toward the end of the talk Howard touches on Kaggle and its mission, which is what inspired these 5 points. They are:

  1. Meritocracy: Status is based solely on ability.
  2. Role Models: Best performers and their origin stories become role models.
  3. Push Limits: Leaderboards push the capabilities of you and the group.
  4. Innovation: Competitions result in technological innovation.
  5. Communities: Like minds find each other and share ideas.

1. Meritocracy

Data Science or Machine Learning competitions are a meritocracy. This means that rank is determined solely based on merit.

The analogy given is that of sports where the only thing that matters is the result achieved by the athlete. It does not matter where you come from, your gender or where you went to school. All that matters is what results you can achieve.

Such systems are fair: biases like those that exist in the workplace do not influence the result. The system is also transparent: everyone has access to the same source material (training data) and the same evaluation of performance (leaderboard).

2. Role Models

Competitions create role models.

The results of the competitions show that it is generally not the academics that do well, but those people with an adaptive engineering mindset that use what works to get the best result. People with diverse and interesting backgrounds are ranking in the top 10 or top 100 of all data scientists on the platform.

This has the effect of creating role models. Their stories are varied, such as having first encountered machine learning only one year earlier in the free Coursera course. These interesting stories draw you in: “if he can do it, I can do it”.

You also see that when a “known” data scientist joins a competition, like a star from the Netflix Prize, it draws a lot more attention: “I want to beat the person who did well in the Netflix Prize”.

3. Push Limits

Like sports, a leaderboard can push the limits of what you and the group are capable of.

Just knowing that one person knows something that you don't, even after you have given it your all, can push you to search for that one piece of additional information.

The real-time feedback of the leaderboard has a psychological effect on the results that can be achieved. This may cut both ways, as it did with the four-minute mile: the barrier held until Roger Bannister broke it, proving that it could be done.

4. Innovation

Competitions result in technological innovation.

The state-of-the-art benchmarks are broken every time. This most likely occurs because the problems are well specified for machine learning and because the participants are not limited to the methods used in a given domain or field of study. Anything goes.

This opens up different ways of tackling problems, which can be leveraged both in the field and in future similar competitions, accelerating advancements across the board.

5. Communities

Communities spring up around competitions.

There is a balance between sharing information and not sharing so much that you lose ground in the competition. Sharing benefits you and the group and seems to happen automatically around each competition.

Like minds find each other and team up, exploiting the best parts of each other's ideas and pushing beyond what they are capable of independently.

Community and information flow are crucial ingredients in good competitions. They help beginners get started, intermediates advance and innovation occur.

Summary

In this post you have discovered five benefits of competitive machine learning. They were: meritocracy, role models, pushing the limits, innovation and communities.

This is not new in machine learning: competitions have existed in collaboration with academic conferences for nearly 20 years. What is new is the scale of participation and the low barrier to entry. It’s an exciting and opportune time to get into applied machine learning, regardless of your background.
