<?xml version='1.0' encoding='UTF-8'  ?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:dc="http://purl.org/dc/elements/1.1/">	<channel rdf:about="http://www.semanlink.net/tag/twitter_thread">		<title>Twitter thread</title>		<link>http://www.semanlink.net/tag/twitter_thread</link>		<description>Documents tagged with Twitter thread</description>		<items>			<rdf:Seq>							<rdf:li resource="http://www.semanlink.net/doc/2024/12/jeremy_howard_sur_x_i_ll_get"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/10/philipp_schmid_sur_x_can_we_"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/10/so_yeon_tiffany_min_sur_x_"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/04/jeremy_howard_sur_x_today_at"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/04/zeyuan_allen_zhu_sur_x_resul"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/03/raphaelsrty_sur_x_my_persona"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/03/matteo_pagliardini_sur_x_we_"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/03/benjamin_clavie_sur_x_docume"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/03/akshay_%F0%9F%9A%80_sur_x_let_s_build_"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/03/krista_opsahl_ong_sur_x_got_"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/03/hrishi_sur_x_bookmarked_pape"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/03/jerry_liu_sur_x_to_better_au"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/02/omar_khattab_sur_x_a_thread_"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/02/craig_macdonald_sur_x_colber"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/01/jo_kristian_bergum_sur_x_i%E2%80%99m"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/01/llamaindex_%F0%9F%A6%99_sur_x_use_rag_"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/01/jerry_liu_sur_x_%F0%9F%AA%9C_4_levels_"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/01/rachit_bansal_sur_x_extendin"/>				<rdf:li resource="http://www.semanlink.net/doc/2024/01/omar_khattab_sur_x_a_cool_th"/>				<rdf:li resource="http://www.semanlink.net/doc/2023/12/omar_khattab_sur_x_a%F0%9F%A7%B5on_bea"/>			</rdf:Seq>		</items>	</channel>		<item rdf:about="http://www.semanlink.net/doc/2024/12/jeremy_howard_sur_x_i_ll_get">		<title>Jeremy Howard sur X : &quot;We trained 2 new models. Like BERT, but modern. ModernBERT. Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc...&quot;</title>		<link>http://www.semanlink.net/doc/2024/12/jeremy_howard_sur_x_i_ll_get</link>		<description>&lt;https://x.com/LightOnIO/status/1869785737832366306&gt;		</description>		<dc:date>2024-12-21T17:13:36Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/10/philipp_schmid_sur_x_can_we_">		<title>Philipp Schmid sur X : &quot;Can we improve retrieval for RAG by learning from neighboring contexts? Contextual Document Embedding ...&quot;</title>		<link>http://www.semanlink.net/doc/2024/10/philipp_schmid_sur_x_can_we_</link>		<description>&gt; There is at least one notable benefit of statistical approaches that is lost by neural models. Statistical models can easily incorporate prior corpus statistics such as inverse document frequency (IDF), into their representation. This prior term imparts context-dependence onto the model, since it can be updated based on information specific to retrieval in a given domain at test time. We contrast this contextual formulation with neural document encoders that are by definition a function of the document itself. For example consider the following document:

[Tweet by author&#93;(https://x.com/jxmnop/status/1842236045074498026): &quot;best BERT-sized text embedding model in the world&quot;, &quot;a paradigm shift for text retrieval&quot;

&gt; a new contextual embedding architecture.
&gt; 
&gt; neighboring document information, during training and encoding, can create &quot;context-aware&quot; embeddings
&gt;
&gt; this requires changes to both the training and evaluation pipeline to incorporate *contextual tokens*

1. Cluster similar documents to identify neighboring documents for each one.
2. Extend Encoder to include information from these neighboring documents when generating embeddings.
3. Train the model using a contrastive learning objective that incorporates neighboring documents into the loss function.

(Point 1 reminds me of Raphaël&apos;s [raphaelsty/neural-tree: Tree-based indexes for neural-search&#93;(doc:2024/02/raphaelsty_neural_tree_tree_ba))		</description>		<dc:date>2024-10-11T00:38:04Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/10/so_yeon_tiffany_min_sur_x_">		<title>So Yeon (Tiffany) Min sur X : &quot;Embodied-RAG, a General Non-Parametric Method for Retrieval and Generation...&quot;</title>		<link>http://www.semanlink.net/doc/2024/10/so_yeon_tiffany_min_sur_x_</link>		<description>&gt; A new framework that equips embodied agents with a non-parametric memory capable of autonomously constructing hierarchical knowledge for navigation and language generation.
([Ruslan Salakhutdinov&#93;(tag:ruslan_salakhutdinov) [tweet&#93;(https://x.com/rsalakhu/status/1842694504387916073))

&gt; Hi robot, I&apos;m dehydrated, can you take me somewhere?

&gt; How to apply non-parametric memory to every day experiences?

&gt; key challenges in building embodied memory
&gt; - Dense memory that logs everything is memory inefficient.
&gt; - Space is continuous, and locations are spatially correlated, in contrast to independent documents in the text domain.

&gt; During the retrieval/generation phase, we select K &quot;chains&quot; (a leaf node all the way up to the root node), that are closest to the query.		</description>		<dc:date>2024-10-06T10:18:51Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/04/jeremy_howard_sur_x_today_at">		<title>Jeremy Howard sur X : &quot;FSDP/QDoRA with Llama3 : I believe that this combination is likely to create better task-specific models than anything else at any cost.&quot;</title>		<link>http://www.semanlink.net/doc/2024/04/jeremy_howard_sur_x_today_at</link>		<dc:date>2024-04-23T22:22:44Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/04/zeyuan_allen_zhu_sur_x_resul">		<title>Zeyuan Allen-Zhu sur X : &quot; surprisingly, when pre-training good data (e.g., Wiki) together with &quot;junks&quot; (e.g., Common Crawl), LLM&apos;s capacity on good data may decrease by 20x times!&quot;</title>		<link>http://www.semanlink.net/doc/2024/04/zeyuan_allen_zhu_sur_x_resul</link>		<description>&gt;  A simple fix: add domain tokens to your data; LLMs can auto-detect domains rich in knowledge and prioritize.&quot;		</description>		<dc:date>2024-04-10T18:33:05Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/03/raphaelsrty_sur_x_my_persona">		<title>raphaelsrty sur X : &quot;My personal Knowledge Base made to the front page of HackerNews today...&quot;</title>		<link>http://www.semanlink.net/doc/2024/03/raphaelsrty_sur_x_my_persona</link>		<dc:date>2024-03-23T11:53:35Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/03/matteo_pagliardini_sur_x_we_">		<title>Matteo Pagliardini sur X : &quot;#DenseFormer, a simple modification that performs—after each transformer block—a weighted average of past representations.&quot;</title>		<link>http://www.semanlink.net/doc/2024/03/matteo_pagliardini_sur_x_we_</link>		<dc:date>2024-03-23T11:44:52Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/03/benjamin_clavie_sur_x_docume">		<title>Benjamin Clavié sur X : &quot;Introducing rerankers: a lightweight library to provide a unified way to use various reranking methods&quot;</title>		<link>http://www.semanlink.net/doc/2024/03/benjamin_clavie_sur_x_docume</link>		<dc:date>2024-03-16T10:28:38Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/03/akshay_%F0%9F%9A%80_sur_x_let_s_build_">		<title>Akshay 🚀 sur X : &quot;Let&apos;s build a &quot;Chat with your code&quot; RAG application, step-by-step&quot;</title>		<link>http://www.semanlink.net/doc/2024/03/akshay_%F0%9F%9A%80_sur_x_let_s_build_</link>		<dc:date>2024-03-09T11:55:54Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/03/krista_opsahl_ong_sur_x_got_">		<title>Krista Opsahl-Ong sur X : &quot;Got a pipeline with **multiple prompts**, like a DSPy program? ... Introducing MIPRO, a Multi-prompt Instruction Proposal Optimizer....&quot;</title>		<link>http://www.semanlink.net/doc/2024/03/krista_opsahl_ong_sur_x_got_</link>		<dc:date>2024-03-09T11:37:47Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/03/hrishi_sur_x_bookmarked_pape">		<title>Hrishi sur X : &quot;RAPTOR is...  one of the very few [RAG architectures&#93; that actively presumes and uses the structure in a document....&quot;</title>		<link>http://www.semanlink.net/doc/2024/03/hrishi_sur_x_bookmarked_pape</link>		<description>(thread by the person of [WalkingRAG&#93;(tag:walkingrag))

&gt; The similarities between WalkingRAG and RAPTOR are that both attempt to capture relationships in the data into a higher structure using LLMs... This is a tree in RAPTOR&apos;s case, with WalkingRAG it&apos;s a graph. 		</description>		<dc:date>2024-03-09T11:30:15Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/03/jerry_liu_sur_x_to_better_au">		<title>Jerry Liu sur X : &quot;To better augment LLMs with context, it makes a lot of sense to organize context not just as a flat list of text chunks, but as a hierarchy of high-level to low-level details. RAPTOR...&quot;</title>		<link>http://www.semanlink.net/doc/2024/03/jerry_liu_sur_x_to_better_au</link>		<description>&gt; To better augment LLMs with context, it makes a lot of sense to organize context not just as a flat list of text chunks, but as a hierarchy of high-level to low-level details. 
&gt;
&gt; RAPTOR is a super simple but neat idea towards this direction. Hierarchically cluster and summarize the text into a tree (the clustering is important, allows semantically related concepts to be grouped together and doesn&apos;t purely rely on spatial positioning!). During query-time dynamically retrieve the most relevant context to the question.		</description>		<dc:date>2024-03-03T10:14:19Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/02/omar_khattab_sur_x_a_thread_">		<title>Omar Khattab sur X : &quot;A thread on late interaction, how it works efficiently, and why/where it&apos;s been shown to improve quality&quot;</title>		<link>http://www.semanlink.net/doc/2024/02/omar_khattab_sur_x_a_thread_</link>		<dc:date>2024-02-05T22:47:40Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/02/craig_macdonald_sur_x_colber">		<title>Craig Macdonald sur X : &quot;a thread of our main ColBERT research findings&quot;</title>		<link>http://www.semanlink.net/doc/2024/02/craig_macdonald_sur_x_colber</link>		<dc:date>2024-02-01T08:34:16Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/01/jo_kristian_bergum_sur_x_i%E2%80%99m">		<title>Jo Kristian Bergum sur X :  (on &quot;why using ColBERT?&quot;)</title>		<link>http://www.semanlink.net/doc/2024/01/jo_kristian_bergum_sur_x_i%E2%80%99m</link>		<description>&gt; The idea that you can accurately boil down the nuances of ~256 tokens (2/3rds of a page) into a single vector is a pretty wild proposition

&gt; Inspired by 
@lateinteraction
, we hacked into the ColBERT model&apos;s contextualized late-interaction similarities to produce [interpretable snippets&#93;(https://x.com/jobergum/status/1750282246072746178?s=20)!		</description>		<dc:date>2024-01-28T10:53:02Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/01/llamaindex_%F0%9F%A6%99_sur_x_use_rag_">		<title>LlamaIndex 🦙 sur X : &quot;Use RAG to build advanced text-to-SQL...&quot;</title>		<link>http://www.semanlink.net/doc/2024/01/llamaindex_%F0%9F%A6%99_sur_x_use_rag_</link>		<dc:date>2024-01-24T22:31:35Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/01/jerry_liu_sur_x_%F0%9F%AA%9C_4_levels_">		<title>Jerry Liu sur X : &quot;4 Levels of Agents for RAG...&quot;</title>		<link>http://www.semanlink.net/doc/2024/01/jerry_liu_sur_x_%F0%9F%AA%9C_4_levels_</link>		<dc:date>2024-01-23T20:35:24Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/01/rachit_bansal_sur_x_extendin">		<title>Rachit Bansal sur X : &quot;An LLM can be efficiently *composed* with specialized (L)LMs to enable new tasks&quot;</title>		<link>http://www.semanlink.net/doc/2024/01/rachit_bansal_sur_x_extendin</link>		<description>[[2401.02412&#93; LLM Augmented LLMs: Expanding Capabilities through Composition&#93;(doc:2024/01/2401_02412_llm_augmented_llms)

&gt; CALM—Composition to Augment Language Models:
&gt; 1. Scales up LLMs on new tasks by *re-using* existing (L)LMs w/ very few new parameters &amp; data,
&gt; 2. Keeps existing model weights intact, hence **preserves original capabilities**,
&gt; 3. Applies to diverse domains and settings.

&gt; Rather than a shallow combination, CALM introduces a small set of cross-attention parameters over models’ layer representations.

Use-case example, Multilinguality:

&gt; We reuse an LM trained on a bunch of low-resource languages (LRLs)
w/ an LLM that has never seen some of these LRLs.
		</description>		<dc:date>2024-01-06T12:07:15Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2024/01/omar_khattab_sur_x_a_cool_th">		<title>Omar Khattab sur X : &quot;...Let&apos;s use 30 lines of DSPy—without any hand-written prompts or any calls to OpenAI ($0)—to teach...&quot;</title>		<link>http://www.semanlink.net/doc/2024/01/omar_khattab_sur_x_a_cool_th</link>		<dc:date>2024-01-01T11:01:32Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2023/12/omar_khattab_sur_x_a%F0%9F%A7%B5on_bea">		<title>Omar Khattab sur X : &quot;A🧵on beating the hardware lottery for retrieval: the internals of the late interaction stack. ColBERT...&quot;</title>		<link>http://www.semanlink.net/doc/2023/12/omar_khattab_sur_x_a%F0%9F%A7%B5on_bea</link>		<dc:date>2023-12-29T11:40:34Z</dc:date>	</item></rdf:RDF>