About This Document
- sl:arxiv_author : Peter Izsak
- sl:arxiv_firstAuthor : Peter Izsak
- sl:arxiv_num : 1910.06294
- sl:arxiv_published : 2019-10-14T17:22:37Z
- sl:arxiv_summary : Training models on low-resource named entity recognition tasks has been
shown to be a challenge, especially in industrial applications where deploying updated models
is a continuous effort and crucial for business operations. In such cases there is often an
abundance of unlabeled data, while labeled data is scarce or unavailable. Pre-trained language
models trained to extract contextual features from text have been shown to improve many
natural language processing (NLP) tasks, including scarcely labeled tasks, by leveraging
transfer learning. However, such models impose a heavy memory and computational burden, making
them challenging to train and deploy for inference. In this work in progress, we combine the
effectiveness of transfer learning provided by pre-trained masked language models with a
semi-supervised approach to train a fast and compact model using both labeled and unlabeled
examples. Preliminary evaluations show that the compact models can achieve competitive
accuracy at a 36x compression rate compared with a state-of-the-art pre-trained language
model, and run significantly faster at inference, allowing deployment of such models in
production environments or on edge devices.@en
- sl:arxiv_title : Training Compact Models for Low Resource Entity Tagging using Pre-trained Language Models@en
- sl:arxiv_updated : 2019-10-17T08:07:19Z
- sl:bookmarkOf : https://arxiv.org/abs/1910.06294
- sl:creationDate : 2022-03-31
- sl:creationTime : 2022-03-31T21:06:23Z
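
The abstract above describes a semi-supervised, teacher-student setup: a large pre-trained masked language model tags the abundant unlabeled text, and a compact tagger is trained on the gold labels plus those pseudo-labels. The sketch below illustrates that general recipe only; the model classes, sizes, and confidence threshold are illustrative assumptions, not the paper's actual implementation.

```python
# A minimal sketch (not the authors' code) of the semi-supervised recipe in the
# abstract: a large pre-trained teacher tags unlabeled text, and a compact
# student is trained on gold labels plus those pseudo-labels. Model classes,
# sizes, and the confidence threshold below are illustrative assumptions.

import torch
import torch.nn as nn


class CompactTagger(nn.Module):
    """Small student tagger: embedding + BiLSTM + per-token classifier."""

    def __init__(self, vocab_size: int, num_tags: int, dim: int = 128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.encoder = nn.LSTM(dim, dim, batch_first=True, bidirectional=True)
        self.classifier = nn.Linear(2 * dim, num_tags)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        hidden, _ = self.encoder(self.embed(token_ids))
        return self.classifier(hidden)  # (batch, seq_len, num_tags)


def pseudo_label(teacher: nn.Module, token_ids: torch.Tensor,
                 threshold: float = 0.9) -> torch.Tensor:
    """Tag unlabeled tokens with the teacher; mark low-confidence tokens -100."""
    with torch.no_grad():
        probs = teacher(token_ids).softmax(dim=-1)
        conf, labels = probs.max(dim=-1)
    return torch.where(conf >= threshold, labels, torch.full_like(labels, -100))


def train_student(student, teacher, labeled_batches, unlabeled_batches,
                  epochs: int = 3, lr: float = 1e-3):
    """Train the compact student on gold labels plus teacher pseudo-labels."""
    teacher.eval()
    optim = torch.optim.Adam(student.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss(ignore_index=-100)  # -100 = skip this token
    for _ in range(epochs):
        for tokens, gold in labeled_batches:           # supervised pass
            loss = loss_fn(student(tokens).flatten(0, 1), gold.flatten())
            optim.zero_grad()
            loss.backward()
            optim.step()
        for tokens in unlabeled_batches:               # semi-supervised pass
            silver = pseudo_label(teacher, tokens)
            loss = loss_fn(student(tokens).flatten(0, 1), silver.flatten())
            optim.zero_grad()
            loss.backward()
            optim.step()
    return student
```

In the paper's setting, the teacher would be a large pre-trained model fine-tuned for entity tagging and the student a much smaller network, which is where the reported 36x compression and faster inference would come from.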