A language model is a probability distribution over sequences of words. Statistical language models try to learn the probability of the next word given the words that precede it.
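In symbols (a standard formulation added here for reference, not a quote from any of the sources below), this means factorizing the joint probability of a sequence $w_1, \dots, w_T$ into next-word conditionals:

$$P(w_1, \dots, w_T) = \prod_{t=1}^{T} P(w_t \mid w_1, \dots, w_{t-1})$$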
> Models rely on an auto-regressive factorization of the joint probability of a corpus using different approaches, from n-gram models to RNNs (SOTA as of 2018-01) ([source](https://arxiv.org/abs/1801.06146))
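As a toy instance of the n-gram end of that spectrum, here is a minimal count-based bigram model in Python; the corpus and the add-alpha smoothing are illustrative assumptions, not something taken from the quoted sources.

```python
# Toy count-based bigram language model: estimate P(next | prev) from counts.
from collections import Counter, defaultdict

corpus = "the cat sat on the mat . the dog sat on the rug .".split()

unigram_counts = Counter(corpus)
bigram_counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigram_counts[prev][nxt] += 1

def p_next(prev, nxt, alpha=1.0):
    """P(nxt | prev) with add-alpha smoothing over the toy vocabulary."""
    vocab_size = len(unigram_counts)
    return (bigram_counts[prev][nxt] + alpha) / (unigram_counts[prev] + alpha * vocab_size)

print(p_next("the", "cat"))   # seen bigram: higher probability
print(p_next("cat", "rug"))   # unseen bigram: only the smoothing mass
```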
Better Language Models and Their Implications(About) Our model, called GPT-2 (a successor to GPT), was trained simply to predict the next word in 40GB of Internet text. Due to our concerns about malicious applications of the technology, we are not releasing the trained model.
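The training objective is plain next-word prediction. A minimal sketch of probing that objective with a publicly available GPT-2 checkpoint, assuming the Hugging Face `transformers` and `torch` packages (neither is part of the announcement):

```python
# Score candidate next tokens for a prefix with a small GPT-2 checkpoint.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

prompt = "The model was trained simply to predict the next"
input_ids = tokenizer.encode(prompt, return_tensors="pt")

with torch.no_grad():
    logits = model(input_ids).logits          # (1, seq_len, vocab_size)
next_token_probs = torch.softmax(logits[0, -1], dim=-1)

# Show the five most likely continuations of the prompt.
top = torch.topk(next_token_probs, k=5)
for prob, idx in zip(top.values, top.indices):
    print(f"{tokenizer.decode([idx.item()])!r}: {prob.item():.3f}")
```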
NLP's ImageNet moment has arrived(About) Pretrained word embeddings have a major limitation: they only incorporate previous knowledge in the first layer of the model---the rest of the network still needs to be trained from scratch.
> The long reign of word vectors as NLP’s core representation technique has seen an exciting new line of challengers emerge: ELMo, ULMFiT, and the OpenAI transformer. These works made headlines by demonstrating that pretrained language models can be used to achieve state-of-the-art results on a wide range of NLP tasks.
> it only seems to be a question of time until pretrained word embeddings will be dethroned and replaced by pretrained language models in the toolbox of every NLP practitioner. This will likely open many new applications for NLP in settings with limited amounts of labeled data.
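A minimal PyTorch sketch (illustrative, not taken from the article) of the limitation mentioned above: pretrained word vectors only initialize the first, embedding layer, while every layer stacked on top still starts from random weights and must be trained from scratch.

```python
# Pretrained knowledge enters only through the embedding layer; the encoder and
# classifier above it are randomly initialized.
import torch
import torch.nn as nn

VOCAB, EMB_DIM, HIDDEN, NUM_CLASSES = 5000, 300, 128, 2
pretrained_vectors = torch.randn(VOCAB, EMB_DIM)   # stand-in for GloVe/word2vec vectors

embedding = nn.Embedding.from_pretrained(pretrained_vectors, freeze=False)
encoder = nn.LSTM(EMB_DIM, HIDDEN, batch_first=True)   # trained from scratch
classifier = nn.Linear(HIDDEN, NUM_CLASSES)            # trained from scratch

tokens = torch.randint(0, VOCAB, (4, 12))              # toy batch of token ids
_, (h_n, _) = encoder(embedding(tokens))
logits = classifier(h_n[-1])
print(logits.shape)                                    # torch.Size([4, 2])
```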
Deep learning : background and application to natural language processing(About)
- Neural Nets: Basics
  - Introduction to multi-layered neural networks
  - Optimization via back-propagation
  - Regularization and Dropout
  - The vanishing gradient issue
- Advanced Architectures with NLP applications
  - n-gram language models
  - Neural Machine Translation (overview)
  - Character-based models for sequence tagging
Improving Language Understanding with Unsupervised Learning(About) > can we develop one model, train it in an unsupervised way on a large amount of data, and then fine-tune the model to achieve good performance on many different tasks? Our results indicate that this approach works surprisingly well; the same core model can be fine-tuned for very different tasks with minimal adaptation.
- a scalable, task-agnostic system based on a combination of two existing ideas: transformers and unsupervised pre-training
- unsupervised generative pre-training of language models followed by discriminative fine-tuning
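A compressed sketch of that two-stage recipe with a toy backbone and random data (this is not the OpenAI implementation, which uses a Transformer decoder; only the pretrain-then-fine-tune structure is illustrated):

```python
# Stage 1: unsupervised generative pre-training (next-token prediction).
# Stage 2: discriminative fine-tuning of the same backbone with a small task head.
import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB, DIM, NUM_CLASSES = 1000, 64, 2

class Backbone(nn.Module):
    """Shared sequence encoder reused in both stages."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, DIM)
        self.rnn = nn.GRU(DIM, DIM, batch_first=True)

    def forward(self, tokens):                 # (B, T) -> (B, T, DIM)
        hidden, _ = self.rnn(self.embed(tokens))
        return hidden

backbone = Backbone()
lm_head = nn.Linear(DIM, VOCAB)                # stage 1 head: next-token logits
clf_head = nn.Linear(DIM, NUM_CLASSES)         # stage 2 head: task label logits

unlabeled = torch.randint(0, VOCAB, (8, 20))   # toy unlabeled corpus batch
labeled = torch.randint(0, VOCAB, (4, 20))     # toy labeled task batch
labels = torch.randint(0, NUM_CLASSES, (4,))

# Stage 1: predict token t+1 from tokens up to t, over unlabeled text.
hidden = backbone(unlabeled)
lm_loss = F.cross_entropy(lm_head(hidden[:, :-1]).reshape(-1, VOCAB),
                          unlabeled[:, 1:].reshape(-1))

# Stage 2: reuse the pretrained backbone, add only a small task head on top
# of the final hidden state, and train discriminatively on labeled examples.
clf_loss = F.cross_entropy(clf_head(backbone(labeled)[:, -1]), labels)
print(lm_loss.item(), clf_loss.item())
```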
SRILM - The SRI Language Modeling Toolkit(About) SRILM is a toolkit for building and applying statistical language models (LMs), primarily for use in speech recognition, statistical tagging and segmentation, and machine translation.