About This Document
- sl:arxiv_author : Nils Reimers, Iryna Gurevych
- sl:arxiv_firstAuthor : Nils Reimers
- sl:arxiv_num : 1908.10084
- sl:arxiv_published : 2019-08-27T08:50:17Z
- sl:arxiv_summary : BERT (Devlin et al., 2018) and RoBERTa (Liu et al., 2019) have set a new
state-of-the-art performance on sentence-pair regression tasks like semantic
textual similarity (STS). However, they require that both sentences be fed into
the network, which causes a massive computational overhead: finding the most
similar pair in a collection of 10,000 sentences requires about 50 million
inference computations (~65 hours) with BERT. The construction of BERT makes it
unsuitable for semantic similarity search as well as for unsupervised tasks
like clustering.
In this publication, we present Sentence-BERT (SBERT), a modification of the
pretrained BERT network that uses siamese and triplet network structures to
derive semantically meaningful sentence embeddings that can be compared using
cosine similarity. This reduces the effort of finding the most similar pair
from 65 hours with BERT / RoBERTa to about 5 seconds with SBERT, while
maintaining the accuracy of BERT.
We evaluate SBERT and SRoBERTa on common STS tasks and transfer learning
tasks, where they outperform other state-of-the-art sentence embedding methods.@en
- sl:arxiv_title : Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks@en
- sl:arxiv_updated : 2019-08-27T08:50:17Z
- sl:bookmarkOf : https://arxiv.org/abs/1908.10084
- sl:creationDate : 2019-08-28
- sl:creationTime : 2019-08-28T22:41:55Z
- sl:relatedDoc :
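The summary above describes SBERT's core idea: encode each sentence once into a fixed-size embedding, then compare embeddings with cosine similarity instead of running a cross-encoder over every sentence pair. Below is a minimal sketch of that workflow, assuming the sentence-transformers Python package that accompanies the paper; the checkpoint name and example sentences are illustrative choices, not taken from this record.

```python
# Minimal sketch: encode once, then find most-similar pairs via cosine similarity.
# Assumes the sentence-transformers package (pip install sentence-transformers);
# the model name below is an illustrative SBERT checkpoint, not prescribed here.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("bert-base-nli-mean-tokens")

sentences = [
    "A man is eating food.",
    "A man is eating a piece of bread.",
    "The girl is carrying a baby.",
]

# One forward pass per sentence (n passes total), instead of one
# cross-encoder pass per pair (n*(n-1)/2 passes with plain BERT).
emb = model.encode(sentences)                           # (n, d) numpy array
emb = emb / np.linalg.norm(emb, axis=1, keepdims=True)  # L2-normalise rows
sim = emb @ emb.T                                       # cosine similarity matrix

np.fill_diagonal(sim, -1.0)                             # ignore self-similarity
for i, j in enumerate(sim.argmax(axis=1)):
    print(f"{sentences[i]!r} ~ {sentences[j]!r} (cos = {sim[i, j]:.3f})")
```

This is what yields the speed-up the abstract cites: for 10,000 sentences, the pairwise search becomes 10,000 encoder passes plus one matrix product, rather than ~50 million cross-encoder inferences.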