2、Sentence-BERT：使用 Siamese BERT-Networks 的句子嵌入

阿新 • • 發佈：2021-08-04

1、摘要

BERT (Devlin et al., 2018) and RoBERTa (Liu et al., 2019) has set a new state-of-the-art performance on sentence-pair regression tasks like semantic textual similarity (STS). However, it requires that both sentences are fed into the network, which causes a massive computational overhead: Finding the most similar pair in a collection of 10,000 sentences requires about 50 million inference computations (~65 hours) with BERT. The construction of BERT makes it unsuitable for semantic similarity search as well as for unsupervised tasks like clustering.

BERT (Devlin et al., 2018) 和 RoBERTa (Liu et al., 2019) 在語義文字相似性 (STS) 等句子對迴歸任務上取得了新的最先進的效能。然而，它需要將兩個句子都輸入到網路中，這會導致大量的計算開銷：在 10,000 個句子的集合中找到最相似的一對需要使用 BERT 進行大約 5000 萬次推理計算（約 65 小時）。 BERT 的構建使其不適用於語義相似性搜尋以及聚類等無監督任務。

In this publication, we present Sentence-BERT (SBERT), a modification of the pretrained BERT network that use siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine-similarity. This reduces the effort for finding the most similar pair from 65 hours with BERT / RoBERTa to about 5 seconds with SBERT, while maintaining the accuracy from BERT.

在本出版物中，我們介紹了 Sentence-BERT (SBERT)，這是對預訓練 BERT 網路的一種修改，該網路使用 siamese 和三元組網路結構來推導語義上有意義的句子嵌入，可以使用餘弦相似度進行比較。這將尋找最相似對的工作量從使用 BERT / RoBERTa 的 65 小時減少到使用 SBERT 的大約 5 秒，同時保持了 BERT 的準確性。

We evaluate SBERT and SRoBERTa on common STS tasks and transfer learning tasks, where it outperforms other state-of-the-art sentence embeddings methods.

我們在常見的 STS 任務和遷移學習任務上評估 SBERT 和 SRoBERTa，它優於其他最先進的句子嵌入方法。

2、Sentence-BERT：使用 Siamese BERT-Networks 的句子嵌入

1、摘要

2、Sentence-BERT：使用 Siamese BERT-Networks 的句子嵌入

Sagit.Framework For IOS 自動佈局教程：2、主介面：相對父窗體UIView佈局。

全棧測試二 | 介面自動化：2、requests模組的基礎和使用

作業系統之I/O管理：2、I/O軟體層次結構

計組之儲存系統：2、SRAM(區別、柵極電容、雙穩態觸發器、DRAM重新整理、地址複用)和DRAM(MROM、PROM、EPROM、EEPROM)

大資料叢集完全分散式部署實操篇：HDFS2.9.2、HBASE2.2.6、YARN2.9.2、SPARK2.4.7，ZOOKEEPER3.6.2

springboot @value啟動報錯_SpringBoot系列：2、配置

資料庫問題：com.microsoft.sqlserver.jdbc.SQLServerException: 索引 1（或2、3）超出範圍

Linux裝置模型：2、基本物件 Kobject、Kset、Ktype

任務一： 2、JDBC

JFoenix中文教程：2、JFXButton按鈕元件

2、idea 啟動專案JDK、JRE報錯：Class JavaLaunchHelper ...One of the two will be used. Which one is undefined.

蔚來 NIO OS 2.9.0 釋出：視覺融合全自動泊車、車輛近距召喚、App 遠端控制 FOTA 升級

蘋果 macOS Big Sur 11.2 正式版更新：修復藍芽、黑屏等問題

【C語言】求方程式 ax^2+bx+c=0 的根，分別考慮： 1、有兩個不等的實根 2、有兩個相等的實根

2、React：React的 JSX 語法規則介紹

微信電腦版釋出 3.2.1 測試版：可觀看、發起視訊號直播，提供手機直播工具

高通驍龍 7c Gen 2 晶片正式釋出：效能提升 10%，支援 Win10 PC、Chromebook

JavaWeb1.3.2【基礎加強：自定義註解（格式、本質、屬性、元註解）】

鴻蒙 HarmonyOS 開發必備工具，華為 DevEco Studio 2.2 Beta 1 釋出：支援低程式碼開發、遠端真機

2、Sentence-BERT：使用 Siamese BERT-Networks 的句子嵌入

1、摘要

相關推薦