<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Machine Learning on Rahul Bhati</title>
    <link>https://therahulbhati.github.io/tags/machine-learning/</link>
    <description>Recent content in Machine Learning on Rahul Bhati</description>
    <generator>Hugo -- 0.151.0</generator>
    <language>en-us</language>
    <lastBuildDate>Tue, 30 Jun 2026 10:00:00 +0530</lastBuildDate>
    <atom:link href="https://therahulbhati.github.io/tags/machine-learning/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Four Neural Network Ideas, Tested</title>
      <link>https://therahulbhati.github.io/posts/four-nn-ideas-tested/</link>
      <pubDate>Tue, 30 Jun 2026 10:00:00 +0530</pubDate>
      <guid>https://therahulbhati.github.io/posts/four-nn-ideas-tested/</guid>
      <description>&lt;p&gt;Most neural-network explanations start with math. That&amp;rsquo;s honest. But the ideas stick when you&amp;rsquo;ve actually broken something. Each section below is a live demo: click Run, watch it train, change a control, run it again.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;a href=&#34;#1-activations-exist-for-a-reason&#34;&gt;Activations exist for a reason&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#2-depth-without-nonlinearity-is-a-lie&#34;&gt;Depth without nonlinearity is a lie&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#3-embeddings-learn-similarity-from-next-token-alone&#34;&gt;Embeddings learn similarity from next-token alone&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href=&#34;#4-memorization-vs-generalization&#34;&gt;Memorization vs generalization&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;script&gt;
  window.addEventListener(&#39;message&#39;, function(e) {
    if (!e.data || e.data.type !== &#39;resize&#39;) return;
    var frames = document.querySelectorAll(&#39;iframe[data-autoresize]&#39;);
    for (var i = 0; i &lt; frames.length; i++) {
      if (frames[i].contentWindow === e.source) {
        frames[i].style.height = (e.data.h) + &#39;px&#39;;
        break;
      }
    }
  });
&lt;/script&gt;

&lt;hr&gt;
&lt;h2 id=&#34;1-activations-exist-for-a-reason&#34;&gt;1. Activations exist for a reason&lt;/h2&gt;
&lt;p&gt;A linear model and a ReLU model, side by side, trying to separate a red ring from a blue one. The linear model can&amp;rsquo;t draw a curved boundary no matter how long it trains. Without a nonlinear activation, a stack of layers collapses to one matrix multiply. The curved shape you need is impossible to express.&lt;/p&gt;</description>
    </item>
  </channel>
</rss>
