Contrastive Unsupervised Learning of Semantic Representations: A Theoretical Framework – Off the convex path [paper](/doc/?uri=https%3A%2F%2Farxiv.org%2Fabs%2F1902.09229).
Why do objectives similar to the one used by word2vec succeed in such diverse settings? ("Contrastive Unsupervised Representation Learning")
> In contrastive learning the objective used at test time is very different from the training objective: generalization error is not the right way to think about this.
-> a framework that formalizes the notion of semantic similarity that is implicitly used by these algorithms
> **if the unsupervised loss happens to be small at the end of contrastive learning, then the resulting representations perform well on downstream classification**
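The unsupervised loss in question is a logistic contrastive loss: an anchor x and a similar point x+ (drawn from the same latent class) should score higher under the learned representation f than a dissimilar point x-. A minimal numpy sketch with one negative per anchor (my illustration, not the authors' code):

```python
import numpy as np

def logistic_contrastive_loss(f_x, f_pos, f_neg):
    """f_x, f_pos, f_neg: (batch, dim) representations of an anchor x,
    a similar point x+, and a dissimilar point x-."""
    pos = np.sum(f_x * f_pos, axis=1)    # <f(x), f(x+)>
    neg = np.sum(f_x * f_neg, axis=1)    # <f(x), f(x-)>
    # log(1 + exp(neg - pos)): small when the positive outscores the negative
    return float(np.mean(np.logaddexp(0.0, neg - pos)))
```

It is this quantity being small at the end of training that the guarantee above connects to downstream linear classification.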
Word Embeddings: Explaining their properties – Off the convex path; the second part of [this post](/doc/?uri=http%3A%2F%2Fwww.offconvex.org%2F2015%2F12%2F12%2Fword-embeddings-1%2F)
>- What properties of natural languages cause these low-dimensional embeddings to exist?
>- Why do low-dimensional embeddings work better at analogy solving than high dimensional embeddings?
A Latent Variable Model Approach to PMI-based Word Embeddings (2016) [Related YouTube video](/doc/?uri=https%3A%2F%2Fwww.youtube.com%2Fwatch%3Fv%3DKR46z_V0BVw)
Based on a generative model (a random walk over words driven by a latent discourse vector), the paper gives a rigorous justification for models such as word2vec and GloVe, including the hyperparameter choices for the latter, and a mathematical explanation for why these word embeddings allow analogies to be solved using linear algebra.
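Concretely, the linear structure means an analogy like man:king :: woman:? can be solved by a nearest-neighbor search around king - man + woman. A sketch of that procedure, assuming a dict `vec` of word vectors (my own illustration):

```python
import numpy as np

def solve_analogy(vec, a, b, c):
    """Return argmax_w cos(vec[w], vec[b] - vec[a] + vec[c]),
    excluding the query words, e.g. a="man", b="king", c="woman"."""
    target = vec[b] - vec[a] + vec[c]
    target = target / np.linalg.norm(target)
    best, best_sim = None, -np.inf
    for w, v in vec.items():
        if w in (a, b, c):
            continue
        sim = (v @ target) / np.linalg.norm(v)
        if sim > best_sim:
            best, best_sim = w, sim
    return best  # ideally "queen"
```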
[1601.03764] Linear Algebraic Structure of Word Senses, with Applications to Polysemy (2016 - revised 2018)
> Here it is shown that multiple word senses reside in linear superposition within the word embedding and simple sparse coding can recover vectors that approximately capture the senses.
> Each extracted word sense is accompanied by one of about 2000 “discourse atoms” that gives a succinct description of which other words co-occur with that word sense.
> The success of the approach is mathematically explained using a variant of the random walk on discourses model
("random walk": a generative model for language). Under the assumptions of this model, there exists a linear relationship between the vector of a word w and the vectors of the words in its contexts. (The vector of w is not simply the average of its context word vectors, but for a given corpus the matrix of the linear relationship does not depend on w. It can therefore be estimated, and the embedding of a word can then be computed from the contexts it appears in; see the sketches below.)
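A minimal sketch of how that shared matrix could be estimated by least squares, assuming arrays `V` (word vectors) and `C` (each word's averaged context vectors) have already been built from the corpus; function names are my own:

```python
import numpy as np

def fit_context_map(V, C):
    """V: (n_words, d) word vectors; C: (n_words, d) averaged context
    vectors for the same words. Solves min_A ||C @ A.T - V||_F^2, i.e.
    v_w ~ A @ c_w with one matrix A shared by all words."""
    A_T, *_ = np.linalg.lstsq(C, V, rcond=None)
    return A_T.T

def induce_vector(A, context_vectors):
    """Embed a word (or word sense) from the vectors of its context words."""
    return A @ np.mean(context_vectors, axis=0)
```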
[Related blog post](/doc/?uri=https%3A%2F%2Fwww.offconvex.org%2F2016%2F07%2F10%2Fembeddingspolysemy%2F)
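The sparse-coding step quoted above can be sketched as follows: each word vector is approximated as a sparse combination of a few of roughly 2000 shared atoms. Here sklearn's DictionaryLearning is a stand-in solver, not the paper's pipeline:

```python
import numpy as np
from sklearn.decomposition import DictionaryLearning

def discourse_atoms(word_vectors, n_atoms=2000, n_nonzero=5):
    """word_vectors: (n_words, d). Returns (atoms, codes) such that
    word_vectors ~ codes @ atoms, with at most n_nonzero atoms per word.
    (Use toy sizes when experimenting: fitting 2000 atoms this way is slow.)"""
    dl = DictionaryLearning(n_components=n_atoms,
                            transform_algorithm="omp",
                            transform_n_nonzero_coefs=n_nonzero)
    codes = dl.fit_transform(word_vectors)   # sparse coefficients per word
    atoms = dl.components_                   # (n_atoms, d) discourse atoms
    return atoms, codes
```

The nonzero entries in a word's code row then pick out the handful of atoms, i.e. candidate senses, that its vector superposes.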
Sanjeev Arora on "A theoretical approach to semantic representations" - YouTube (2016) Why do low-dimensional word vectors exist?
> a text corpus is imagined as being generated by a random walk in a latent variable space, and the word production is via a loglinear distribution. This model is shown to imply several empirically discovered past methods for word embedding like word2vec, GloVe, PMI etc
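A toy simulation of that generative model (dimensions and step size are made up) makes the two ingredients concrete: a slowly drifting discourse vector c_t, and words emitted log-linearly, P(w | c_t) proportional to exp(<c_t, v_w>):

```python
import numpy as np

rng = np.random.default_rng(0)
n_words, dim, n_steps = 1000, 50, 20
V = rng.normal(size=(n_words, dim)) / np.sqrt(dim)  # word vectors v_w
c = rng.normal(size=dim)
c /= np.linalg.norm(c)                              # discourse vector c_0

corpus = []
for _ in range(n_steps):
    logits = V @ c                       # log-linear production: <c_t, v_w>
    p = np.exp(logits - logits.max())
    p /= p.sum()
    corpus.append(int(rng.choice(n_words, p=p)))  # emit one word id
    c += 0.1 * rng.normal(size=dim)      # small random-walk step
    c /= np.linalg.norm(c)               # keep the discourse on the sphere
```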
A Simple but Tough-to-Beat Baseline for Sentence Embeddings (2017)
> Use word embeddings computed using one of the popular methods on unlabeled corpus like Wikipedia, represent the sentence by a weighted average of the word vectors, and then modify them a bit using PCA/SVD
See also [youtube: Sanjeev Arora on "A theoretical approach to semantic representations"](https://www.youtube.com/watch?v=KR46z_V0BVw)
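A compact sketch of that recipe (the paper's SIF weighting a/(a + p(w)) followed by removal of the first singular vector); `vec` and `word_freq` are assumed lookup tables built from pretrained embeddings and corpus unigram frequencies:

```python
import numpy as np

def sif_embeddings(sentences, vec, word_freq, a=1e-3):
    """sentences: list of token lists; vec: word -> (d,) embedding;
    word_freq: word -> unigram probability p(w); a: smoothing constant."""
    # weighted average: down-weight frequent words by a / (a + p(w))
    X = np.stack([
        np.mean([a / (a + word_freq[w]) * vec[w] for w in sent], axis=0)
        for sent in sentences
    ])
    # "modify them a bit using PCA/SVD": remove the common component,
    # i.e. the projection of each row onto the first singular vector of X
    u = np.linalg.svd(X, full_matrices=False)[2][0]
    return X - np.outer(X @ u, u)
```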