“什麼是Word Embedding（詞嵌入）”的個人理解

阿新 • • 發佈：2018-12-15

首先貼上一下Wiki英文的定義：

Word embedding is the collective name for a set of language modeling and feature learning techniques in natural language processing (NLP) where words or phrases from the vocabulary are mapped to vectors of real numbers. Conceptually it involves a mathematical embedding from a space with one dimension per word to a continuous

vector space with a much lower dimension.

它的意思是說，Word Embedding是一系列語言NLP中語言模型和特徵模型的總稱，在數學上牽涉到將每個單詞一個維度的高維向量對映到一個低維連續向量的過程。

之所以叫Embedding（“嵌入”），是因為Embedding在數學上的定義：

In mathematics, an embedding (or imbedding[1]) is one instance of some mathematical structure contained within another instance, such as a

group that is a subgroup.

When some object X is said to be embedded in another object Y, the embedding is given by some injective and structure-preserving map f : X → Y. The precise meaning of "structure-preserving" depends on the kind of mathematical structure of which X and Y are instances. In the terminology of

category theory, a structure-preserving map is called a morphism.

主要表徵一個結構通過對映而包含到另一個結構中，比如，我們可以把整數“嵌入”進有理數之中。顯然，整數是一個集合，同時它又是有理數的一個子集。整數集合中的每個整數，在有理數集合中都能找到一個唯一的對應（其實就是它本身）。同時，整數集合中的每個整數所具有的性質，在有理數中同樣得到了保持。同理，我們也可以把有理數“嵌入”到實數中去。

參考連結：

英文維基

最後一段，Embedding的例子的來源

“什麼是Word Embedding（詞嵌入）”的個人理解

參考連結：

“什麼是Word Embedding（詞嵌入）”的個人理解

Image Embedding(圖片嵌入）/ Feature Embedding（特徵嵌入）

leetCode 79.Word Search （詞搜尋）解題思路和方法

[機器學習入門] 李巨集毅機器學習筆記-15 （Unsupervised Learning: Word Embedding；無監督學習：詞嵌入）

LinkedList（Java8）個人理解

ArrayList（Java8）個人理解

推薦系統初學者系列（8）-- Graph Embedding（網路嵌入表示）做Top-K推薦

（新手入門）個人對redis的理解

beanstalkd協議解讀（中文翻譯加個人理解）

機房收費系統（VB.NET）個人版總結

LeetCode 290 Word Pattern（單詞模式）（istringstream、vector、map）（*）

【LeetCode-面試算法經典-Java實現】【139-Word Break（單詞拆分）】

對Java Serializable（序列化）的理解和總結

Java 檢查異常（checked exception）和未檢查異常（unchecked exception）區別理解

Java Serializable（序列化）的理解和總結

（4.19）深入理解SQLSERVER的日誌鏈

K-L散度（相對熵）的理解

關於對比損失（contrasive loss）的理解（相似度越大越相似的情況）：

談談對Spring IOC（控制反轉）的理解

916. Word Subsets（python+cpp）

“什麼是Word Embedding（詞嵌入）”的個人理解

參考連結：

相關推薦