<?xml version='1.0' encoding='UTF-8'  ?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:dc="http://purl.org/dc/elements/1.1/">	<channel rdf:about="http://www.semanlink.net/tag/reasoning_models_math_evals">		<title>Reasoning models: math evals</title>		<link>http://www.semanlink.net/tag/reasoning_models_math_evals</link>		<description>Documents tagged with Reasoning models: math evals</description>		<items>			<rdf:Seq>							<rdf:li resource="http://www.semanlink.net/doc/2025/03/2501_19393_s1_simple_test_ti"/>				<rdf:li resource="http://www.semanlink.net/doc/2025/02/diffuse_one"/>			</rdf:Seq>		</items>	</channel>		<item rdf:about="http://www.semanlink.net/doc/2025/03/2501_19393_s1_simple_test_ti">		<title>[2501.19393&#93; s1: Simple test-time scaling</title>		<link>http://www.semanlink.net/doc/2025/03/2501_19393_s1_simple_test_ti</link>		<description>&quot;Researchers created an open rival to OpenAI’s o1 ‘reasoning’ model for under $50&quot; [techcrunch.com&#93;(https://techcrunch.com/2025/02/05/researchers-created-an-open-rival-to-openais-o1-reasoning-model-for-under-50/)		</description>		<dc:date>2025-03-03T09:04:57Z</dc:date>	</item>	<item rdf:about="http://www.semanlink.net/doc/2025/02/diffuse_one">		<title>diffuse.one/reasoning_update_0</title>		<link>http://www.semanlink.net/doc/2025/02/diffuse_one</link>		<description>&gt; There is an emerging pattern of fine-tuning a small language model followed by reinforcement learning.

&gt; A reasoning model is a large language model that is trained to output both a chain of thought and a response. The chain of thought should be relatively long (
&gt; 1,000 tokens) and the reasoning should improve its performance relative to a similar-sized non-reasoning models. This is sometimes called &quot;test-time&quot; or &quot;inference-time&quot; scaling because reasoning models emit more tokens per completion and gain some performance as a result.		</description>		<dc:date>2025-02-24T13:21:09Z</dc:date>	</item></rdf:RDF>