About This Document
- sl:arxiv_author : Elad Ben Zaken, Shauli Ravfogel, Yoav Goldberg
- sl:arxiv_firstAuthor : Elad Ben Zaken
- sl:arxiv_num : 2106.10199
- sl:arxiv_published : 2021-06-18T16:09:21Z
- sl:arxiv_summary : We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset of them) are being modified. We show that with small-to-medium training data, applying BitFit on pre-trained BERT models is competitive with (and sometimes better than) fine-tuning the entire model. For larger data, the method is competitive with other sparse fine-tuning methods. Besides their practical utility, these findings are relevant for the question of understanding the commonly-used process of finetuning: they support the hypothesis that finetuning is mainly about exposing knowledge induced by language-modeling training, rather than learning new task-specific linguistic knowledge.
- sl:arxiv_title : BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
- sl:arxiv_updated : 2022-03-19T09:52:20Z
- sl:bookmarkOf : https://arxiv.org/abs/2106.10199
- sl:creationDate : 2022-09-01
- sl:creationTime : 2022-09-01T17:20:28Z
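The abstract above describes BitFit's core idea: keep the pre-trained weights frozen and update only the bias terms (plus the randomly initialized task head). Below is a minimal sketch of that idea using PyTorch and Hugging Face Transformers; the checkpoint name, learning rate, and classifier-head handling are illustrative assumptions, not details taken from the paper's own code.

```python
# Minimal sketch of bias-only (BitFit-style) fine-tuning.
# Assumptions: a BERT checkpoint from Hugging Face and a 2-class sequence
# classification head; hyperparameters here are placeholders.
import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-cased", num_labels=2
)

# Freeze everything except bias terms and the new classifier head
# (the head has no pre-trained weights to preserve).
for name, param in model.named_parameters():
    param.requires_grad = name.endswith(".bias") or name.startswith("classifier")

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable parameters: {trainable} / {total} ({100 * trainable / total:.2f}%)")

# The optimizer only receives the unfrozen parameters.
optimizer = torch.optim.AdamW(
    (p for p in model.parameters() if p.requires_grad), lr=1e-3
)
```

Only a fraction of a percent of BERT's parameters remain trainable under this scheme, which is what makes the method "parameter-efficient" while staying competitive with full fine-tuning on small-to-medium training sets.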