About This Document
- sl:arxiv_author : Niklas Muennighoff
- sl:arxiv_firstAuthor : Niklas Muennighoff
- sl:arxiv_num : 2202.08904
- sl:arxiv_published : 2022-02-17T21:35:56Z
- sl:arxiv_summary : Decoder transformers have continued increasing in scale, reaching hundreds of
billions of parameters. Due to their scale, the same decoder sets
state-of-the-art results on various language tasks via prompting or
fine-tuning. Yet these large foundation models remain unusable for the related
fields of semantic search and sentence embeddings. This prevents possibly new
state-of-the-art results and forces organizations to train and maintain
separate models. To this end, we propose SGPT, which uses decoders for sentence
embeddings and semantic search via prompting or fine-tuning. At 5.8 billion
parameters, SGPT improves on the previously best sentence embeddings by a margin
of 7% and outperforms a concurrent method with 175 billion parameters, as
measured on the BEIR search benchmark. Code, models, and result files are freely
available at https://github.com/Muennighoff/sgpt.
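
The abstract's core idea, producing sentence embeddings from a decoder-only GPT model and comparing them for semantic search, can be illustrated with a minimal sketch. This is not the paper's exact pipeline (the paper covers prompting and fine-tuned variants): it assumes a checkpoint published in the linked repository and uses plain attention-masked mean pooling as an illustrative choice.

```python
# Minimal sketch: sentence embeddings from a GPT-style decoder via mean pooling,
# compared with cosine similarity. Model name and pooling are assumptions, not
# necessarily the authors' exact method.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "Muennighoff/SGPT-125M-weightedmean-nli-bitfit"  # assumed checkpoint from the linked repo
tokenizer = AutoTokenizer.from_pretrained(model_name)
if tokenizer.pad_token is None:  # GPT tokenizers often lack a pad token
    tokenizer.pad_token = tokenizer.eos_token
model = AutoModel.from_pretrained(model_name)
model.eval()

def embed(sentences):
    batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state          # (batch, seq, dim)
    mask = batch["attention_mask"].unsqueeze(-1).float()   # zero out padding positions
    return (hidden * mask).sum(dim=1) / mask.sum(dim=1)    # mean over real tokens

query, doc = embed(["semantic search with GPT", "GPT models for retrieval"])
print(torch.nn.functional.cosine_similarity(query, doc, dim=0))
```

A higher similarity score indicates the query and document are semantically closer, which is the ranking signal used in embedding-based search.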
- sl:arxiv_title : SGPT: GPT Sentence Embeddings for Semantic Search
- sl:arxiv_updated : 2022-08-05T09:33:10Z
- sl:bookmarkOf : https://arxiv.org/abs/2202.08904
- sl:creationDate : 2023-04-25
- sl:creationTime : 2023-04-25T00:02:46Z
- sl:relatedDoc : http://www.semanlink.net/doc/2022/09/2106_10199_bitfit_simple_par