<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet href="https://rss.buzzsprout.com/styles.xsl" type="text/xsl"?>
<rss version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:podcast="https://podcastindex.org/namespace/1.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:psc="http://podlove.org/simple-chapters" xmlns:atom="http://www.w3.org/2005/Atom">
<channel>
  <atom:link href="https://feeds.buzzsprout.com/1933861.rss" rel="self" type="application/rss+xml" />
  <atom:link href="https://pubsubhubbub.appspot.com/" rel="hub" xmlns="http://www.w3.org/2005/Atom" />
  <title>Argmax</title>

  <lastBuildDate>Wed, 04 Mar 2026 18:10:33 -0500</lastBuildDate>
  <link>https://www.argmax.fm</link>
  <language>en-us</language>
  <copyright>© 2026 Argmax</copyright>
  <podcast:locked>yes</podcast:locked>
    <podcast:guid>e7012088-1726-5d8c-bbf8-780ccc4ea547</podcast:guid>
  <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
  <itunes:type>episodic</itunes:type>
  <itunes:explicit>false</itunes:explicit>
  <description><![CDATA[<p>A show where three machine learning enthusiasts talk about recent papers and developments in machine learning. Watch our video on YouTube https://www.youtube.com/@argmaxfm</p>]]></description>
  <generator>Buzzsprout (https://www.buzzsprout.com)</generator>
  <itunes:owner>
    <itunes:name>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:name>
  </itunes:owner>
  <image>
     <url>https://storage.buzzsprout.com/76azp67hw9mw8w2g12c46225evxj?.jpg</url>
     <title>Argmax</title>
     <link></link>
  </image>
  <itunes:image href="https://storage.buzzsprout.com/76azp67hw9mw8w2g12c46225evxj?.jpg" />
  <itunes:category text="Science">
    <itunes:category text="Mathematics" />
  </itunes:category>
  <item>
    <itunes:title>Mixture of Experts</itunes:title>
    <title>Mixture of Experts</title>
    <itunes:summary><![CDATA[In this episode we talk about the paper "Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean. ]]></itunes:summary>
    <description><![CDATA[<p>In this episode we talk about the paper &quot;Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer&quot; by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean.</p>]]></description>
    <content:encoded><![CDATA[<p>In this episode we talk about the paper &quot;Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer&quot; by Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/15891978-mixture-of-experts.mp3" length="39462291" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-15891978</guid>
    <pubDate>Tue, 08 Oct 2024 12:00:00 -0700</pubDate>
    <itunes:duration>3286</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>LoRA</itunes:title>
    <title>LoRA</title>
    <itunes:summary><![CDATA[We talk about Low Rank Approximation for fine tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y ]]></itunes:summary>
    <description><![CDATA[<p>We talk about Low Rank Approximation for fine tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y</p>]]></description>
    <content:encoded><![CDATA[<p>We talk about Low Rank Approximation for fine tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/13517489-lora.mp3" length="45347445" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-13517489</guid>
    <pubDate>Sat, 02 Sep 2023 14:00:00 -0700</pubDate>
    <itunes:duration>3776</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>2</itunes:season>
    <itunes:episode>1</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>15: InstructGPT</itunes:title>
    <title>15: InstructGPT</title>
    <itunes:summary><![CDATA[In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT. ]]></itunes:summary>
    <description><![CDATA[<p>In this episode we discuss the paper &quot;Training language models to follow instructions with human feedback&quot; by Ouyang et al (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT.</p>]]></description>
    <content:encoded><![CDATA[<p>In this episode we discuss the paper &quot;Training language models to follow instructions with human feedback&quot; by Ouyang et al (2022). We discuss the RLHF paradigm and how important RL is to tuning GPT.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/12534281-15-instructgpt.mp3" length="41398000" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-12534281</guid>
    <pubDate>Tue, 28 Mar 2023 09:00:00 -0700</pubDate>
    <itunes:duration>3447</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>15</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>14: Whisper</itunes:title>
    <title>14: Whisper</title>
    <itunes:summary><![CDATA[This week we talk about Whisper. It is a weakly supervised speech recognition model.    ]]></itunes:summary>
    <description><![CDATA[<div>This week we talk about Whisper. It is a weakly supervised speech recognition model.<br/><br/><br/><br/></div>]]></description>
    <content:encoded><![CDATA[<div>This week we talk about Whisper. It is a weakly supervised speech recognition model.<br/><br/><br/><br/></div>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/12457660-14-whisper.mp3" length="35476555" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-12457660</guid>
    <pubDate>Fri, 17 Mar 2023 08:00:00 -0700</pubDate>
    <itunes:duration>2954</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>14</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>13: AlphaTensor</itunes:title>
    <title>13: AlphaTensor</title>
    <itunes:summary><![CDATA[We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication. ]]></itunes:summary>
    <description><![CDATA[<p>We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.</p>]]></description>
    <content:encoded><![CDATA[<p>We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/12417434-13-alphatensor.mp3" length="35369356" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-12417434</guid>
    <pubDate>Fri, 10 Mar 2023 19:00:00 -0800</pubDate>
    <itunes:duration>2945</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>13</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>12: SIRENs</itunes:title>
    <title>12: SIRENs</title>
    <itunes:summary><![CDATA[In this episode we talked about "Implicit Neural Representations with Periodic Activation Functions" and the strength of periodic non-linearities. ]]></itunes:summary>
    <description><![CDATA[<p>In this episode we talked about &quot;Implicit Neural Representations with Periodic Activation Functions&quot; and the strength of periodic non-linearities.</p>]]></description>
    <content:encoded><![CDATA[<p>In this episode we talked about &quot;Implicit Neural Representations with Periodic Activation Functions&quot; and the strength of periodic non-linearities.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/11562095-12-sirens.mp3" length="39114052" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-11562095</guid>
    <pubDate>Mon, 24 Oct 2022 18:00:00 -0700</pubDate>
    <itunes:duration>3257</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>12</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>true</itunes:explicit>
  </item>
  <item>
    <itunes:title>11: CVPR Workshop on Autonomous Driving Keynote by Ashok Elluswamy, a Tesla engineer</itunes:title>
    <title>11: CVPR Workshop on Autonomous Driving Keynote by Ashok Elluswamy, a Tesla engineer</title>
    <itunes:summary><![CDATA[In this episode we discuss this video: https://youtu.be/jPCV4GKX9Dw  How Tesla approaches collision detection with novel methods. ]]></itunes:summary>
    <description><![CDATA[<p>In this episode we discuss this video: https://youtu.be/jPCV4GKX9Dw<br/><br/>How Tesla approaches collision detection with novel methods.</p>]]></description>
    <content:encoded><![CDATA[<p>In this episode we discuss this video: https://youtu.be/jPCV4GKX9Dw<br/><br/>How Tesla approaches collision detection with novel methods.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/11417019-11-cvpr-workshop-on-autonomous-driving-keynote-by-ashok-elluswamy-a-tesla-engineer.mp3" length="35196459" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-11417019</guid>
    <pubDate>Fri, 30 Sep 2022 16:00:00 -0700</pubDate>
    <itunes:duration>2931</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>11</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>10: Outracing champion Gran Turismo drivers with deep reinforcement learning</itunes:title>
    <title>10: Outracing champion Gran Turismo drivers with deep reinforcement learning</title>
    <itunes:summary><![CDATA[We discuss Sony AI's accomplishment of creating a novel AI agent that can beat professional racers in Gran Turismo. Some topics include: - The crafting of rewards to make the agent behave nicely - What is QR-SAC? - How to deal with "rare" experiences in the replay buffer  Link to paper: https://www.nature.com/articles/s41586-021-04357-7 ]]></itunes:summary>
    <description><![CDATA[<p>We discuss Sony AI&apos;s accomplishment of creating a novel AI agent that can beat professional racers in Gran Turismo. Some topics include:<br/>- The crafting of rewards to make the agent behave nicely<br/>- What is QR-SAC?<br/>- How to deal with &quot;rare&quot; experiences in the replay buffer<br/><br/>Link to paper: https://www.nature.com/articles/s41586-021-04357-7</p>]]></description>
    <content:encoded><![CDATA[<p>We discuss Sony AI&apos;s accomplishment of creating a novel AI agent that can beat professional racers in Gran Turismo. Some topics include:<br/>- The crafting of rewards to make the agent behave nicely<br/>- What is QR-SAC?<br/>- How to deal with &quot;rare&quot; experiences in the replay buffer<br/><br/>Link to paper: https://www.nature.com/articles/s41586-021-04357-7</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/11182813-10-outracing-champion-gran-turismo-drivers-with-deep-reinforcement-learning.mp3" length="39506334" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-11182813</guid>
    <pubDate>Mon, 22 Aug 2022 17:00:00 -0700</pubDate>
    <itunes:duration>3290</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>10</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>9: Heads-Up Limit Hold&#39;em Poker Is Solved</itunes:title>
    <title>9: Heads-Up Limit Hold&#39;em Poker Is Solved</title>
    <itunes:summary><![CDATA[Today we talk about recent AI advances in Poker; specifically the use of counterfactual regret minimization to solve the game of 2-player Limit Texas Hold'em. ]]></itunes:summary>
    <description><![CDATA[<p>Today we talk about recent AI advances in Poker; specifically the use of counterfactual regret minimization to solve the game of 2-player Limit Texas Hold&apos;em.</p>]]></description>
    <content:encoded><![CDATA[<p>Today we talk about recent AI advances in Poker; specifically the use of counterfactual regret minimization to solve the game of 2-player Limit Texas Hold&apos;em.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/11051132-9-heads-up-limit-hold-em-poker-is-solved.mp3" length="34524608" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-11051132</guid>
    <pubDate>Fri, 29 Jul 2022 16:00:00 -0700</pubDate>
    <itunes:duration>2875</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>8: GATO (A Generalist Agent)</itunes:title>
    <title>8: GATO (A Generalist Agent)</title>
    <itunes:summary><![CDATA[Today we talk about GATO, a multi-modal, multi-task, multi-embodiment generalist agent. ]]></itunes:summary>
    <description><![CDATA[<p>Today we talk about GATO, a multi-modal, multi-task, multi-embodiment generalist agent.</p>]]></description>
    <content:encoded><![CDATA[<p>Today we talk about GATO, a multi-modal, multi-task, multi-embodiment generalist agent.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/11051122-8-gato-a-generalist-agent.mp3" length="32323087" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-11051122</guid>
    <pubDate>Fri, 29 Jul 2022 16:00:00 -0700</pubDate>
    <itunes:duration>2691</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>8</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>7: Deep Unsupervised Learning Using Nonequilibrium Thermodynamics (Diffusion Models)</itunes:title>
    <title>7: Deep Unsupervised Learning Using Nonequilibrium Thermodynamics (Diffusion Models)</title>
    <itunes:summary><![CDATA[We start talking about diffusion models as a technique for generative deep learning. ]]></itunes:summary>
    <description><![CDATA[<p>We start talking about diffusion models as a technique for generative deep learning.</p>]]></description>
    <content:encoded><![CDATA[<p>We start talking about diffusion models as a technique for generative deep learning.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/10788104-7-deep-unsupervised-learning-using-nonequilibrium-thermodynamics-diffusion-models.mp3" length="22285596" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-10788104</guid>
    <pubDate>Mon, 13 Jun 2022 18:00:00 -0700</pubDate>
    <itunes:duration>1855</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>7</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>true</itunes:explicit>
  </item>
  <item>
    <itunes:title>6: Deep Reinforcement Learning at the Edge of the Statistical Precipice</itunes:title>
    <title>6: Deep Reinforcement Learning at the Edge of the Statistical Precipice</title>
    <itunes:summary><![CDATA[We discuss NeurIPS outstanding paper award winning paper, talking about important topics surrounding metrics and reproducibility. ]]></itunes:summary>
    <description><![CDATA[<p>We discuss NeurIPS outstanding paper award winning paper, talking about important topics surrounding metrics and reproducibility.</p>]]></description>
    <content:encoded><![CDATA[<p>We discuss NeurIPS outstanding paper award winning paper, talking about important topics surrounding metrics and reproducibility.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/10748486-6-deep-reinforcement-learning-at-the-edge-of-the-statistical-precipice.mp3" length="44042853" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-10748486</guid>
    <pubDate>Mon, 06 Jun 2022 16:00:00 -0700</pubDate>
    <itunes:duration>3668</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>6</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>5: QMIX</itunes:title>
    <title>5: QMIX</title>
    <itunes:summary><![CDATA[We talk about QMIX https://arxiv.org/abs/1803.11485 as an example of Deep Multi-agent RL. ]]></itunes:summary>
    <description><![CDATA[<p>We talk about QMIX https://arxiv.org/abs/1803.11485 as an example of Deep Multi-agent RL.</p>]]></description>
    <content:encoded><![CDATA[<p>We talk about QMIX https://arxiv.org/abs/1803.11485 as an example of Deep Multi-agent RL.</p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/10504620-5-qmix.mp3" length="30336276" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-10504620</guid>
    <pubDate>Mon, 25 Apr 2022 20:00:00 -0700</pubDate>
    <itunes:duration>2526</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>5</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>4: Can Neural Nets Learn the Same Model Twice?</itunes:title>
    <title>4: Can Neural Nets Learn the Same Model Twice?</title>
    <itunes:summary><![CDATA[Todays paper: Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective (https://arxiv.org/pdf/2203.08124.pdf)  Summary: A discussion of reproducibility and double descent through visualizations of decision boundaries.  Highlights of the discussion: Relationship between model performance and reproducibilityWhich models are robust and reproducibleHow they calculate the various scores   ]]></itunes:summary>
    <description><![CDATA[<p>Todays paper: Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility<br/>and Double Descent from the Decision Boundary Perspective (https://arxiv.org/pdf/2203.08124.pdf)<br/><br/>Summary:<br/>A discussion of reproducibility and double descent through visualizations of decision boundaries.<br/><br/>Highlights of the discussion:</p><ul><li>Relationship between model performance and reproducibility</li><li>Which models are robust and reproducible</li><li>How they calculate the various scores</li></ul><p><br/><br/></p>]]></description>
    <content:encoded><![CDATA[<p>Todays paper: Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility<br/>and Double Descent from the Decision Boundary Perspective (https://arxiv.org/pdf/2203.08124.pdf)<br/><br/>Summary:<br/>A discussion of reproducibility and double descent through visualizations of decision boundaries.<br/><br/>Highlights of the discussion:</p><ul><li>Relationship between model performance and reproducibility</li><li>Which models are robust and reproducible</li><li>How they calculate the various scores</li></ul><p><br/><br/></p>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/10386978-4-can-neural-nets-learn-the-same-model-twice.mp3" length="39904066" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-10386978</guid>
    <pubDate>Tue, 05 Apr 2022 19:00:00 -0700</pubDate>
    <itunes:duration>3323</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>4</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>3: VICReg</itunes:title>
    <title>3: VICReg</title>
    <itunes:summary><![CDATA[Todays paper: VICReg (https://arxiv.org/abs/2105.04906)  Summary of the paper VICReg prevents representation collapse using a mixture of variance, invariance and covariance when calculating the loss. It does not require negative samples and achieves great performance on downstream tasks.  Highlights of discussion The VICReg architecture (Figure 1)Sensitivity to hyperparameters (Table 7)Top 5 metric usefulness]]></itunes:summary>
    <description><![CDATA[<p>Todays paper: VICReg (<a href='https://arxiv.org/abs/2105.04906'>https://arxiv.org/abs/2105.04906</a>)<br/><br/><b>Summary of the paper</b><br/>VICReg prevents representation collapse using a mixture of variance, invariance and covariance when calculating the loss. It does not require negative samples and achieves great performance on downstream tasks.<br/><br/><b>Highlights of discussion</b></p><ul><li>The VICReg architecture (Figure 1)</li><li>Sensitivity to hyperparameters (Table 7)</li><li>Top 5 metric usefulness</li></ul>]]></description>
    <content:encoded><![CDATA[<p>Todays paper: VICReg (<a href='https://arxiv.org/abs/2105.04906'>https://arxiv.org/abs/2105.04906</a>)<br/><br/><b>Summary of the paper</b><br/>VICReg prevents representation collapse using a mixture of variance, invariance and covariance when calculating the loss. It does not require negative samples and achieves great performance on downstream tasks.<br/><br/><b>Highlights of discussion</b></p><ul><li>The VICReg architecture (Figure 1)</li><li>Sensitivity to hyperparameters (Table 7)</li><li>Top 5 metric usefulness</li></ul>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/10292930-3-vicreg.mp3" length="32255966" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-10292930</guid>
    <pubDate>Mon, 21 Mar 2022 16:00:00 -0700</pubDate>
    <itunes:duration>2686</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>3</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>2: data2vec</itunes:title>
    <title>2: data2vec</title>
    <itunes:summary><![CDATA[Todays paper: data2vec (https://arxiv.org/abs/2202.03555)  Summary of the paper A multimodal SSL algorithm that predicts latent representation of different types of input. Highlights of discussion What are the motivations of SSL and multimodalHow does the student teacher learning work?What are similarities and differences between ViT, BYOL, and Reinforcement Learning algorithms.]]></itunes:summary>
    <description><![CDATA[<p>Todays paper: data2vec (https://arxiv.org/abs/2202.03555)<br/><br/><b>Summary of the paper</b><br/>A multimodal SSL algorithm that predicts latent representation of different types of input.</p><p><b>Highlights of discussion</b></p><ul><li>What are the motivations of SSL and multimodal</li><li>How does the student teacher learning work?</li><li>What are similarities and differences between ViT, BYOL, and Reinforcement Learning algorithms.</li></ul>]]></description>
    <content:encoded><![CDATA[<p>Todays paper: data2vec (https://arxiv.org/abs/2202.03555)<br/><br/><b>Summary of the paper</b><br/>A multimodal SSL algorithm that predicts latent representation of different types of input.</p><p><b>Highlights of discussion</b></p><ul><li>What are the motivations of SSL and multimodal</li><li>How does the student teacher learning work?</li><li>What are similarities and differences between ViT, BYOL, and Reinforcement Learning algorithms.</li></ul>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/10205015-2-data2vec.mp3" length="38465799" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-10205015</guid>
    <pubDate>Mon, 07 Mar 2022 11:00:00 -0800</pubDate>
    <itunes:duration>3203</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>2</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
  <item>
    <itunes:title>1: Reward is Enough</itunes:title>
    <title>1: Reward is Enough</title>
    <itunes:summary><![CDATA[This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.  Todays paper: Reward is Enough   Summary of the paper The authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment. Highlights of discussion High level overview of Reinforcement LearningHow evolution can be encoded as a reward m...]]></itunes:summary>
    <description><![CDATA[<p>This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.<br/><br/>Todays paper: <a href='https://www.sciencedirect.com/science/article/pii/S0004370221000862'>Reward is Enough</a> <br/><br/><b>Summary of the paper</b><br/>The authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment.</p><p><b>Highlights of discussion</b></p><ul><li>High level overview of Reinforcement Learning</li><li>How evolution can be encoded as a reward maximization problem</li><li>What is the one reward signal we are trying to optimize?</li></ul>]]></description>
    <content:encoded><![CDATA[<p>This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.<br/><br/>Todays paper: <a href='https://www.sciencedirect.com/science/article/pii/S0004370221000862'>Reward is Enough</a> <br/><br/><b>Summary of the paper</b><br/>The authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment.</p><p><b>Highlights of discussion</b></p><ul><li>High level overview of Reinforcement Learning</li><li>How evolution can be encoded as a reward maximization problem</li><li>What is the one reward signal we are trying to optimize?</li></ul>]]></content:encoded>
    <enclosure url="https://www.buzzsprout.com/1933861/episodes/10116664-1-reward-is-enough.mp3" length="39337296" type="audio/mpeg" />
    <itunes:author>Vahe Hagopian, Taka Hasegawa, Farrukh Rahman</itunes:author>
    <guid isPermaLink="false">Buzzsprout-10116664</guid>
    <pubDate>Mon, 21 Feb 2022 11:00:00 -0800</pubDate>
    <itunes:duration>3276</itunes:duration>
    <itunes:keywords></itunes:keywords>
    <itunes:season>1</itunes:season>
    <itunes:episode>1</itunes:episode>
    <itunes:episodeType>full</itunes:episodeType>
    <itunes:explicit>false</itunes:explicit>
  </item>
</channel>
</rss>
