GloVe: Global Vectors for Word Representation

阿新 • • 發佈：2018-12-19

學習詞的向量空間表示可以很好捕獲語法和語義規則資訊，但是這些規則的起源並不透明。我們分析和闡明模型需要的這些規則。這是logbilinear regression模型，集合了全域性矩陣分解和本地視窗大小的方法。模型訓練在詞和詞的共現矩陣中，而不是整個語料庫的稀疏矩陣。

1 Introduction

語言的語義向量空間模型把每個詞表示為一個數值向量，這些向量是特徵，可以使用在資訊檢索，文件分類，問答，命名實體識別和語法分析。

大部分詞向量依賴於詞向量對的距離和角度來估計這些向量的質量。最近的估計方法是詞的相似度，而且還有不同維度的不同。比如king-queen=man-woman。

學習詞向量有兩大方法：1）全域性矩陣分解方法，比如LSA，2）本地文字視窗，比如skip-gram模型。這些方法都有缺點，LSA可以很好獲得統計資訊，但對於詞的相似度任務比較差，skip-gram對於相似度任務很好，但對於使用語料的統計資訊比較差，這是因為他們訓練在區域性上下文視窗而不是全域性共現對。

2 Related Work

Matrix Factorization Methods.

矩陣分解的方法可以追溯到LSA，這些方法使用低秩的矩陣分解大的矩陣，在LSA，矩陣是‘term-document’，比如行是詞，列是不同的文件。

Shallow Window-Based Methods.

另一個方法是在區域性上下文視窗內進行預測，比如CBOW和skip-gram模型。

不像矩陣分解方法，基於視窗的模型無法使用語料的共現資訊。

3 The GloVe Model

語料庫中共現詞的資訊可以由非監督學習方法獲得，但現在已有這些方法了，但是語義如何從這些統計資訊獲得還是問題。我們的模型叫GloVe（global vector），因為整個語料的統計資訊由模型直接獲得。

首先定義一些概念。

模型的效能對於臨界值的依賴很少，所以把xmax=100 ，並且α=3/4 比α=1 好

3.1 Relationship to Other Models

3.2 Complexity of the model

4 Experiments

4.1 Evaluation methods

Word analogies

Word similarity

Named entity recognition

4.2 Corpora and training details

4.3 Results

4.4 Model Analysis: Vector Length and Context Size

4.5 Model Analysis: Corpus Size

4.6 Model Analysis: Run-time

4.7 Model Analysis: Comparison with word2vec

5 Conclusion

Glove:Global Vectors for Word Representation.

related work 1）global matric factorization 例如LSA（latent semantic analysis）雖然利用了statistics of the corp

GloVe: Global Vectors for Word Representation

學習詞的向量空間表示可以很好捕獲語法和語義規則資訊，但是這些規則的起源並不透明。我們分析和闡明模型需要的這些規則。這是logbilinear regression模型，集合了全域性矩陣分解和本地視窗大小的方法。模型訓練在詞和詞的共現矩陣中，而不是整個語料庫的稀疏矩陣。 1 Introductio

Stanford 224N- GloVe: Global Vectors for word representations

Window based method (direct predict method): skip gram and CBOW Skip-gram Model: take one window at a time, predict probability of surrrounding words（wi

【論文閱讀】《GloVe: Global Vectors forWord Representation》

GloVe model 單詞表示模型：GloVe，用於全域性向量，全域性語料的統計資訊直接由模型獲得。符號 X X X：詞共現矩陣 Xij

How we serve 25M API calls from 10 scalable global endpoints for $150 a month

I woke up on Black Friday last year to a barrage of emails from users reporting 503 errors from the ipdata API.Our users typically call our API on each pag

.NET Core Global Tools for AWS

One of the exciting new features in .NET Core 2.1 are Global Tools. Global Tools provide the ability to distribute command line tools via a NuGet

You searched for word embedding

Develop a Deep Learning Model to Automatically Translate from German to English in Python with Keras, Step-by-Step. Machine translation is a challen

Building a Global Network for Genomic Data – DNAnexus, an Advanced APN Technology Partner

Today’s announcement of the precisionFDA platform is significant for the genomics research community for a number of reasons. With this pilot plat

New – AWS Global Accelerator for Availability and Performance

Having

intersect for multiple vectors in R

con span osi library tar other and pos intersect Say you have a <- c(1,3,5,7,9) b <- c(3,6,8,9,10) c <- c(2,3,4,5,7,9) A stra

Local Generic Representation for Face Recognition with Single Sample per Person (ACCV, 2014)

任務 strac iat 挑戰 dataset 進行通用 trac present Abstract: 1. 每個類別單個樣本的人臉識別（face recognition with single sample per person, SSPP）是一個非常有挑戰性的任務，因

Learning Structured Representation for Text Classification via Reinforcement Learning 學習筆記

ctu recursive fec 註釋 css 進攻 imp column converge Representation learning ：表征學習，端到端的學習 pre-specified 預先指定的 demonstrate 論證;證明，證實;顯示

Six golden A Global Leader in Industrial IoT rules for creating the ideal German cover letter and r

www.inhandnetworks.de Applying for jobs is never simple but it can feel even more difficult in a foreign country when you’re unfamiliar with the l

Shangbang Long_ECCV2018_TextSnake_A Flexible Representation for Detecting Text of Arbitrary Shapes

rectangle 分類 ask 超出 des 出了 effect orien 步驟 Shangbang Long_ECCV2018_TextSnake_A Flexible Representation for Detecting Text of Arbitrary Sh

Optical Flow Guided Feature A Fast and Robust Motion Representation for Video Action Recognition論文解讀

Optical Flow Guided Feature A Fast and Robust Motion Representation for Video Action Recognition論文解讀 1. Abstract 2. 論文解讀 3

Learning Invariant Deep Representation for NIR-VIS Face Recognition

查詢異質影象匹配的過程中，發現幾篇某組的論文，都是關於NIR-VIS的識別問題，提到了許多處理異質影象的處理方法，網路結構和idea都很不錯，記錄其中一篇。摘要 VIS-NIR（可見光與近紅外）面部識別仍然是異質影象識別中的挑戰。本文只用一個網路來對映NIR和VIS影象至一個緊湊的歐式空間。網路的低階層

A Light CNN for Deep Face Representation with Noisy Labels

清晰深度 html spa sca 數據由於圖像測試數據承接上一篇博客。該論文思路清晰，實驗充分，這裏大致寫一些比較不錯的idea。從標題就能看出本文的主要貢獻：輕量、魯棒。利用一個輕量CNN從大規模數據且含大量噪聲中來學習一個深度面部表征。直接談談貢獻：本

《End-to-End Learning of Motion Representation for Video Understanding》論文閱讀

CVPR 2018 | 騰訊AI Lab、MIT等機構提出TVNet：可端到端學習視訊的運動表徵動機儘管端到端的特徵學習已經取得了重要的進展，但是人工設計的光流特徵仍然被廣泛用於各類視訊分析任務中。為了彌補這個不足而提出；以前的方法：

[jnhs]使用netbeans生成的webapp釋出到tomcat是需要改名字的,不然就是404Description The origin server did not find a current representation for the target resource or is not

第一次使用tomcat釋出webapp 遇到404錯誤 Description The origin server did not find a current representation for the target resource or is not will

Complex Network Analysis for Characterizing Global Value Chains in Equipment Manufacturing

什麼是全球價值鏈以所謂“外包”，“分散生產”和“任務交易”為特點的全球價值鏈（全球價值鏈）的興起一直被認為是最重要的21世紀的貿易現象。研究的問題是什麼由於國際生產網路日益複雜和複雜，特別是在裝備製造業，傳統貿易方面統計數字和相應的貿易指標可能會給我們一個扭曲的貿易圖景。

GloVe: Global Vectors for Word Representation

1 Introduction

2 Related Work

3 The GloVe Model

3.1 Relationship to Other Models

3.2 Complexity of the model

4 Experiments

4.1 Evaluation methods

4.2 Corpora and training details

4.3 Results

4.4 Model Analysis: Vector Length and Context Size

4.5 Model Analysis: Corpus Size

4.6 Model Analysis: Run-time

4.7 Model Analysis: Comparison with word2vec

5 Conclusion

相關推薦