Must-Read Papers for Getting Started with Recommender Systems
阿新 • Published: 2018-12-31
《Item-Based Collaborative Filtering Recommendation Algorithms》
https://blog.csdn.net/BTUJACK/article/details/84674967
《Factorization Meets the Neighborhood: a Multifaceted Collaborative Filtering Model》
https://blog.csdn.net/fangqingan_java/article/details/50762296
《Matrix factorization techniques for recommender systems》 (a minimal code sketch follows this list)
https://zhuanlan.zhihu.com/p/28577447?group_id=881547532893851649
《Factorization Machines with libFM》
From item-based CF to SVD to RBM: a range of collaborative filtering algorithms, from theory to implementation
https://blog.csdn.net/Dark_Scope/article/details/17228643
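To make the matrix-factorization entries above concrete, here is a minimal FunkSVD-style sketch, not taken from any of the listed papers: the toy ratings matrix, the hyperparameters, and the function name matrix_factorization are all illustrative. It factors the observed entries of a user-item matrix R into low-rank factors P and Q by stochastic gradient descent, then predicts the missing (zero) entries with P·Qᵀ.

import numpy as np

def matrix_factorization(R, k=2, steps=2000, lr=0.01, reg=0.02):
    """Factor R (0 = missing) into user factors P and item factors Q by SGD."""
    num_users, num_items = R.shape
    P = np.random.rand(num_users, k)
    Q = np.random.rand(num_items, k)
    rows, cols = R.nonzero()                 # indices of observed ratings only
    for _ in range(steps):
        for u, i in zip(rows, cols):
            err = R[u, i] - P[u] @ Q[i]      # prediction error on one rating
            pu = P[u].copy()                 # keep old user factors for Q's update
            P[u] += lr * (err * Q[i] - reg * P[u])
            Q[i] += lr * (err * pu - reg * Q[i])
    return P, Q

R = np.array([[5, 3, 0, 1],
              [4, 0, 0, 1],
              [1, 1, 0, 5],
              [0, 1, 5, 4]], dtype=float)
P, Q = matrix_factorization(R)
print(np.round(P @ Q.T, 2))  # reconstructed matrix; zeros are now predicted ratings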
The following script compares six bandit strategies (random, greedy, ε-greedy, ε-greedy with decay, UCB, and Thompson sampling) by the percentage of pulls that go to the true best arm:

import numpy as np
import matplotlib.pyplot as plt
import math

number_of_bandits = 10
number_of_arms = 10
number_of_pulls = 10000
epsilon = 0.3
min_temp = 0.1
decay_rate = 0.999

def pick_arm(q_values, counts, strategy, success, failure):
    global epsilon
    if strategy == "random":
        return np.random.randint(0, len(q_values))
    if strategy == "greedy":
        best_arms_value = np.max(q_values)
        best_arms = np.argwhere(q_values == best_arms_value).flatten()
        return best_arms[np.random.randint(0, len(best_arms))]
    if strategy == "egreedy" or strategy == "egreedy_decay":
        if strategy == "egreedy_decay":
            epsilon = max(epsilon * decay_rate, min_temp)
        if np.random.random() > epsilon:
            best_arms_value = np.max(q_values)
            best_arms = np.argwhere(q_values == best_arms_value).flatten()
            return best_arms[np.random.randint(0, len(best_arms))]
        else:
            return np.random.randint(0, len(q_values))
    if strategy == "ucb":
        total_counts = np.sum(counts)
        q_values_ucb = q_values + np.sqrt(np.reciprocal(counts + 0.001) * 2 * math.log(total_counts + 1.0))
        best_arms_value = np.max(q_values_ucb)
        best_arms = np.argwhere(q_values_ucb == best_arms_value).flatten()
        return best_arms[np.random.randint(0, len(best_arms))]
    if strategy == "thompson":
        sample_means = np.zeros(len(counts))
        for i in range(len(counts)):
            sample_means[i] = np.random.beta(success[i] + 1, failure[i] + 1)
        return np.argmax(sample_means)

fig = plt.figure()
ax = fig.add_subplot(111)

for st in ["greedy", "random", "egreedy", "egreedy_decay", "ucb", "thompson"]:
    best_arm_counts = np.zeros((number_of_bandits, number_of_pulls))
    for i in range(number_of_bandits):
        arm_means = np.random.rand(number_of_arms)
        best_arm = np.argmax(arm_means)
        q_values = np.zeros(number_of_arms)
        counts = np.zeros(number_of_arms)
        success = np.zeros(number_of_arms)
        failure = np.zeros(number_of_arms)
        for j in range(number_of_pulls):
            a = pick_arm(q_values, counts, st, success, failure)
            reward = np.random.binomial(1, arm_means[a])
            counts[a] += 1.0
            q_values[a] += (reward - q_values[a]) / counts[a]  # incremental mean update
            success[a] += reward
            failure[a] += (1 - reward)
            best_arm_counts[i][j] = counts[best_arm] * 100.0 / (j + 1)
        epsilon = 0.3  # reset epsilon between bandits so egreedy_decay starts fresh
    ys = np.mean(best_arm_counts, axis=0)
    xs = range(len(ys))
    ax.plot(xs, ys, label=st)

plt.xlabel('Steps')
plt.ylabel('Optimal pulls (%)')
plt.tight_layout()
plt.legend()
plt.ylim((0, 110))
plt.show()
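Each curve averages ten random bandit problems. Typically UCB and Thompson sampling lock onto the optimal arm fastest, while fixed ε-greedy plateaus because it keeps exploring at rate ε forever.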
Related code:
# -*- coding: utf-8 -*-
import numpy as np
import matplotlib.pyplot as plt
#from mpltools import style # uncomment for prettier plots
#style.use(['ggplot'])
'''
function definitions
'''
# generate all bernoulli rewards ahead of time
def generate_bernoulli_bandit_data(num_samples,K):
    CTRs_that_generated_data = np.tile(np.random.rand(K),(num_samples,1))
    true_rewards = np.random.rand(num_samples,K) < CTRs_that_generated_data
    return true_rewards,CTRs_that_generated_data
# totally random
def random(estimated_beta_params):
    return np.random.randint(0,len(estimated_beta_params))
# the naive algorithm
def naive(estimated_beta_params,number_to_explore=100):
    totals = estimated_beta_params.sum(1) # total pulls per arm
    if np.any(totals < number_to_explore): # if any arm has been explored less than specified
        least_explored = np.argmin(totals) # return the one least explored
        return least_explored
    else: # return the best mean forever
        successes = estimated_beta_params[:,0] # successes
        estimated_means = successes/totals # the current means
        best_mean = np.argmax(estimated_means) # the best mean
        return best_mean
# the epsilon greedy algorithm
def epsilon_greedy(estimated_beta_params,epsilon=0.01):
    totals = estimated_beta_params.sum(1) # totals
    successes = estimated_beta_params[:,0] # successes
    estimated_means = successes/totals # the current means
    best_mean = np.argmax(estimated_means) # the best mean
    be_exploratory = np.random.rand() < epsilon # should we explore?
    if be_exploratory: # totally random, excluding the best_mean
        other_choice = np.random.randint(0,len(estimated_beta_params))
        while other_choice == best_mean:
            other_choice = np.random.randint(0,len(estimated_beta_params))
        return other_choice
    else: # take the best mean
        return best_mean
# the UCB algorithm using
# a (1 - 1/t) confidence interval (Chernoff-Hoeffding bound)
# for details of this particular confidence bound, see the UCB1-TUNED section, slide 18, of:
# http://lane.compbio.cmu.edu/courses/slides_ucb.pdf
def UCB(estimated_beta_params):
    t = float(estimated_beta_params.sum()) # total number of rounds so far
    totals = estimated_beta_params.sum(1)
    successes = estimated_beta_params[:,0]
    estimated_means = successes/totals # sample mean
    estimated_variances = estimated_means - estimated_means**2
    UCB = estimated_means + np.sqrt( np.minimum( estimated_variances + np.sqrt(2*np.log(t)/totals), 0.25 ) * np.log(t)/totals )
    return np.argmax(UCB)
# the UCB algorithm - using fixed 95% confidence intervals
# see slide 8 for details:
# http://dept.stat.lsa.umich.edu/~kshedden/Courses/Stat485/Notes/binomial_confidence_intervals.pdf
def UCB_bernoulli(estimated_beta_params):
    totals = estimated_beta_params.sum(1) # totals
    successes = estimated_beta_params[:,0] # successes
    estimated_means = successes/totals # sample mean
    estimated_variances = estimated_means - estimated_means**2
    UCB = estimated_means + 1.96*np.sqrt(estimated_variances/totals)
    return np.argmax(UCB)
# the bandit algorithm
def run_bandit_dynamic_alg(true_rewards,CTRs_that_generated_data,choice_func):
    num_samples,K = true_rewards.shape
    # seed the estimated params with a Beta(1,1) prior (avoids division by zero)
    prior_a = 1. # aka successes
    prior_b = 1. # aka failures
    estimated_beta_params = np.zeros((K,2))
    estimated_beta_params[:,0] += prior_a # allocating the initial conditions
    estimated_beta_params[:,1] += prior_b
    regret = np.zeros(num_samples) # expected regret for each round
    for i in range(0,num_samples):
        # pulling a lever & updating estimated_beta_params
        this_choice = choice_func(estimated_beta_params)
        # update parameters
        if true_rewards[i,this_choice] == 1:
            update_ind = 0
        else:
            update_ind = 1
        estimated_beta_params[this_choice,update_ind] += 1
        # update expected regret
        regret[i] = np.max(CTRs_that_generated_data[i,:]) - CTRs_that_generated_data[i,this_choice]
    cum_regret = np.cumsum(regret)
    return cum_regret
if __name__ == '__main__':
    '''
    main code
    '''
    # define number of samples and number of choices
    num_samples = 10000
    K = 5 # number of arms
    number_experiments = 100
    regret_accumulator = np.zeros((num_samples,5)) # one column per algorithm
    for i in range(number_experiments):
        print("Running experiment:", i+1)
        true_rewards,CTRs_that_generated_data = generate_bernoulli_bandit_data(num_samples,K)
        regret_accumulator[:,0] += run_bandit_dynamic_alg(true_rewards,CTRs_that_generated_data,random)
        regret_accumulator[:,1] += run_bandit_dynamic_alg(true_rewards,CTRs_that_generated_data,naive)
        regret_accumulator[:,2] += run_bandit_dynamic_alg(true_rewards,CTRs_that_generated_data,epsilon_greedy)
        regret_accumulator[:,3] += run_bandit_dynamic_alg(true_rewards,CTRs_that_generated_data,UCB)
        regret_accumulator[:,4] += run_bandit_dynamic_alg(true_rewards,CTRs_that_generated_data,UCB_bernoulli)
    plt.semilogy(regret_accumulator/number_experiments)
    plt.title('Simulated Bandit Performance for K = 5')
    plt.ylabel('Cumulative Expected Regret')
    plt.xlabel('Round Index')
    plt.legend(('Random','Naive','Epsilon-Greedy','(1 - 1/t) UCB','95% UCB'),loc='lower right')
    plt.show()
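The first script above includes Thompson sampling, but this second one stops at UCB. As a minimal sketch (the function name thompson_sampling is mine, not from the original post), a compatible choice function only needs the Beta counts that run_bandit_dynamic_alg already maintains:

# Thompson sampling as a sixth choice function for run_bandit_dynamic_alg:
# estimated_beta_params[:,0] holds successes and [:,1] failures, both seeded
# with the Beta(1,1) prior, so they can be used directly as Beta parameters.
def thompson_sampling(estimated_beta_params):
    samples = np.random.beta(estimated_beta_params[:,0],
                             estimated_beta_params[:,1])  # one draw per arm
    return np.argmax(samples)

To include it in the comparison, widen regret_accumulator to six columns, add one more run_bandit_dynamic_alg call with thompson_sampling, and extend the legend accordingly.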