<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>LLM Timeline — Model Releases</title>
    <link>https://llm-timeline.duyet.net</link>
    <description>Chronological index of Large Language Model releases from 2017 to present. Updated weekly.</description>
    <language>en-us</language>
    <lastBuildDate>Wed, 25 Mar 2026 00:00:00 GMT</lastBuildDate>
    <atom:link href="https://llm-timeline.duyet.net/rss.xml" rel="self" type="application/rss+xml"/>
    <image>
      <url>https://llm-timeline.duyet.net/favicon.svg</url>
      <title>LLM Timeline — Model Releases</title>
      <link>https://llm-timeline.duyet.net</link>
    </image>
  <item>
    <title>Nemotron-Cascade-2-30B-A3B (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Nemotron-Cascade-2-30B-A3B-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: Nemotron-Cascade-2-30B-A3B
License: open
Type: model
30BA3B. Gold medal performance in both the 2025 IMO and the IOI. HLE=no tools.</description>
  </item>
  <item>
    <title>MiMo-V2-Pro (Xiaomi)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiMo-V2-Pro-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>Xiaomi</category>
    <category>open</category>
    <description>Parameters: MiMo-V2-Pro
License: open
Type: model
1T42B. Over 1T total parameters (42B active). Uses a 7:1 Hybrid Attention mechanism and supports a 1M-token context window.</description>
  </item>
  <item>
    <title>Mamba-3 (CMU)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Mamba-3-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>CMU</category>
    <category>open</category>
    <description>Parameters: Mamba-3
License: open
Type: model
&quot;with architectural refinements, our Mamba-3 model achieves significant gains across retrieval, state-tracking, and downstream language modeling tasks.&quot;</description>
  </item>
  <item>
    <title>MiniMax-M2.5 (MiniMax)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiniMax-M2.5-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>MiniMax</category>
    <category>open</category>
    <description>Parameters: MiniMax-M2.5
License: open
Type: model
230B-A10B. Early RSI. &quot;M2.7 is our first model deeply participating in its own evolution…&quot; https://lifearchitect.ai/asi/</description>
  </item>
  <item>
    <title>Holotron-12B (H Company)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Holotron-12B-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>H Company</category>
    <category>open</category>
    <description>Parameters: Holotron-12B
License: open
Type: model
&quot;Holotron-12B is a high-throughput, multimodal Vision-Language Model (VLM) designed specifically as a policy model for computer-use agents.&quot;</description>
  </item>
  <item>
    <title>Mistral Small 4 (Mistral)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Mistral%20Small%204-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>Mistral</category>
    <category>open</category>
    <description>Parameters: Mistral Small 4
License: open
Type: model
119BA6.5B. &quot;unifies the capabilities of three different model families—Instruct, Reasoning (previously called Magistral), and Devstral—into a single, unified model.&quot;</description>
  </item>
  <item>
    <title>MiroThinker-H1 (MiroMindAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiroThinker-H1-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>MiroMindAI</category>
    <category>open</category>
    <description>Parameters: MiroThinker-H1
License: open
Type: model
&quot;Our proprietary agent, MiroThinker-H1 provides promising evidence for long-chain verifiable reasoning [based on new model, MiroThinker-1.7&quot;</description>
  </item>
  <item>
    <title>Covenant-72B (1Covenant)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Covenant-72B-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>1Covenant</category>
    <category>open</category>
    <description>Parameters: Covenant-72B
License: open
Type: model
&quot;largest permissionless collaboratively trained language model&quot; ~20 distinct peers, each running 8xB200 GPUs.</description>
  </item>
  <item>
    <title>Nemotron 3 Super (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Nemotron%203%20Super-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: Nemotron 3 Super
License: open
Type: model
120B-A12B. Announce: https://blogs.nvidia.com/blog/nemotron-3-super-agentic-ai/</description>
  </item>
  <item>
    <title>Sarvam 105B (Sarvam.ai)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Sarvam%20105B-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>Sarvam.ai</category>
    <category>open</category>
    <description>Parameters: Sarvam 105B
License: open
Type: model
105BA10.3B. &quot;22 Indian languages&quot;</description>
  </item>
  <item>
    <title>GPT-5.4 (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.4-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>open</category>
    <description>Parameters: GPT-5.4
License: open
Type: model
&quot;most capable and efficient frontier model for professional work.&quot; Announce: https://openai.com/index/introducing-gpt-5-4/</description>
  </item>
  <item>
    <title>Yuan3.0-Ultra (YuanLabAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Yuan3.0-Ultra-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>YuanLabAI</category>
    <category>open</category>
    <description>Parameters: Yuan3.0-Ultra
License: open
Type: model
1515BA68.8B. Poor performance due to low training data/ratio.</description>
  </item>
  <item>
    <title>GPT-5.3 Instant (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.3%20Instant-2026-03-01</guid>
    <pubDate>Sun, 01 Mar 2026 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>open</category>
    <description>Parameters: GPT-5.3 Instant
License: open
Type: model
Announce: https://openai.com/index/gpt-5-3-instant/</description>
  </item>
  <item>
    <title>STATIC (Google)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#STATIC-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Google</category>
    <category>open</category>
    <description>Parameters: STATIC
License: open
Type: model
YouTube (Google). STATIC (Sparse Transition Matrix-Accelerated Trie Index for Constrained Decoding). &quot;The model is a Gemini-based generative retrieval model similar to PLUM [8], served with a batch size of 2 (per chip) and a beam size of 𝑀 = 70. The model is based on a non-Mixture-of-Experts (MoE) architecture with 3 billion dense parameters. All benchmark experiments are conducted on Google TPU v6e accelerators.&quot;</description>
  </item>
  <item>
    <title>Arrow 1.0 (Quiver)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Arrow%201.0-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Quiver</category>
    <category>open</category>
    <description>Parameters: Arrow 1.0
License: open
Type: model
&quot;A first of it&apos;s kind SVG AI model.&quot; Announce: https://x.com/QuiverAI/status/2026792057893708072</description>
  </item>
  <item>
    <title>Qwen3.5-27B (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Qwen3.5-27B-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>open</category>
    <description>Parameters: Qwen3.5-27B
License: open
Type: model
&quot;Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility&quot;</description>
  </item>
  <item>
    <title>LFM2-24B-A2B (Liquid AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#LFM2-24B-A2B-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Liquid AI</category>
    <category>open</category>
    <description>Parameters: LFM2-24B-A2B
License: open
Type: model
&quot;a traditional instruct model without reasoning traces.&quot;</description>
  </item>
  <item>
    <title>Mercury 2 (Inception)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Mercury%202-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Inception</category>
    <category>open</category>
    <description>Parameters: Mercury 2
License: open
Type: model
Diffusion large language model (dLLM).</description>
  </item>
  <item>
    <title>Gemini 3.1 Pro (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Gemini%203.1%20Pro-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>open</category>
    <description>Parameters: Gemini 3.1 Pro
License: open
Type: model
Knowledge cutoff still=January 2025. Announce: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/</description>
  </item>
  <item>
    <title>ZUNA (Zyphra)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#ZUNA-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Zyphra</category>
    <category>open</category>
    <description>Parameters: ZUNA
License: open
Type: model
For BCI, &apos;thought-to-text&apos;. Training dataset calcs: (2M hours * 3,600 seconds/hour * 256 samples/second ) / 32 samples/token = 57.6B tokens (refined to 45.1B after rigorous filtering ); 150,000 steps * 2.16M tokens/batch = 324B total tokens seen during training. Announce: https://www.zyphra.com/post/zuna</description>
  </item>
  <item>
    <title>Grok 4.2 (xAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Grok%204.2-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>xAI</category>
    <category>open</category>
    <description>Parameters: Grok 4.2
License: open
Type: model
No details provided. Announce: https://x.com/elonmusk/status/2023829664318583105</description>
  </item>
  <item>
    <title>INTELLECT-3.1 (Prime Intellect)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#INTELLECT-3.1-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Prime Intellect</category>
    <category>open</category>
    <description>Parameters: INTELLECT-3.1
License: open
Type: model
Base: GLM-4.5-Air-Base, INTELLECT-3 model. 106BA12B.</description>
  </item>
  <item>
    <title>Claude Sonnet 4.6 (Anthropic)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Claude%20Sonnet%204.6-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Anthropic</category>
    <category>open</category>
    <description>Parameters: Claude Sonnet 4.6
License: open
Type: model
1M context. Announce: https://www.anthropic.com/news/claude-sonnet-4-6 Showing GMMLU (Global MMLU by Cohere).</description>
  </item>
  <item>
    <title>Tiny Aya (Cohere)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Tiny%20Aya-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Cohere</category>
    <category>open</category>
    <description>Parameters: Tiny Aya
License: open
Type: model
70+ languages. Showing GMMLU (Global MMLU by Cohere).</description>
  </item>
  <item>
    <title>Qwen3.5-397B-A17B (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Qwen3.5-397B-A17B-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>open</category>
    <description>Parameters: Qwen3.5-397B-A17B
License: open
Type: model
&quot;Qwen3.5 represents a significant leap forward, integrating breakthroughs in multimodal learning, architectural efficiency, reinforcement learning scale, and global accessibility&quot;</description>
  </item>
  <item>
    <title>JoyAI-LLM Flash (JD Open Source)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#JoyAI-LLM%20Flash-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>JD Open Source</category>
    <category>open</category>
    <description>Parameters: JoyAI-LLM Flash
License: open
Type: model
48B-A3B.</description>
  </item>
  <item>
    <title>MiniMax-M2.5 (MiniMax)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiniMax-M2.5-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>MiniMax</category>
    <category>open</category>
    <description>Parameters: MiniMax-M2.5
License: open
Type: model
230B-A10B. HLE showing without tools.</description>
  </item>
  <item>
    <title>GLM-5 (Z.AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GLM-5-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Z.AI</category>
    <category>open</category>
    <description>Parameters: GLM-5
License: open
Type: model
744B-A40B. Announce: https://z.ai/blog/glm-5</description>
  </item>
  <item>
    <title>Nanbeige4.1-3B (Nanbeige)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Nanbeige4.1-3B-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Nanbeige</category>
    <category>open</category>
    <description>Parameters: Nanbeige4.1-3B
License: open
Type: model
SOTA for size (3B)</description>
  </item>
  <item>
    <title>RynnBrain-30B-A3B (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#RynnBrain-30B-A3B-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>open</category>
    <description>Parameters: RynnBrain-30B-A3B
License: open
Type: model
Base: Qwen3-VL-30B-A3B-Instruct. &quot;an embodied foundation model grounded in physical reality.&quot;</description>
  </item>
  <item>
    <title>Claude Opus 4.6 (Anthropic)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Claude%20Opus%204.6-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Anthropic</category>
    <category>open</category>
    <description>Parameters: Claude Opus 4.6
License: open
Type: model</description>
  </item>
  <item>
    <title>Intern-S1-Pro (Shanghai AI Laboratory/SenseTime)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Intern-S1-Pro-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>Shanghai AI Laboratory/SenseTime</category>
    <category>open</category>
    <description>Parameters: Intern-S1-Pro
License: open
Type: model
1000TA22B. Assumes base model of Qwen3. &quot;Built upon a 235B MoE language model and a 6B Vision encoder, Intern-S1 has been further pretrained on 5 trillion tokens of multimodal data&quot;</description>
  </item>
  <item>
    <title>Step 3.5 Flash (StepFun)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Step%203.5%20Flash-2026-02-01</guid>
    <pubDate>Sun, 01 Feb 2026 00:00:00 GMT</pubDate>
    <category>StepFun</category>
    <category>open</category>
    <description>Parameters: Step 3.5 Flash
License: open
Type: model
196B-A11B.</description>
  </item>
  <item>
    <title>Assistant_Pepe_8B (Independent)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Assistant_Pepe_8B-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Independent</category>
    <category>open</category>
    <description>Parameters: Assistant_Pepe_8B
License: open
Type: model
Warning for inappropriate content. Base: Llama-3.1-Nemotron-8B. &quot;trained it on an extended 4chan dataset&quot; &quot;the original, gpt4chan (by Yannic Kilcher) scored especially high in truthfulness (that was b4 benchmaxxing)... outperformed the base tune (the unabliterated one), it also changed its political alignment... People were initially joking about the &quot;alignment tax&quot;, I think there&apos;s a none trivial substance in all of this. It seems to me just above a marginal error or statistical noise.&quot;</description>
  </item>
  <item>
    <title>Trinity-Large (Arcee AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Trinity-Large-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Arcee AI</category>
    <category>open</category>
    <description>Parameters: Trinity-Large
License: open
Type: model
400BA13B. &quot;we worked closely with Prime Intellect. They not only served the H100 clusters Datology used to generate synthetic data, they have been deeply involved in helping scale our training setup to the GPU footprint required for a fully frontier sized model, including the current 2048 B300 GPU configuration for Trinity Large.&quot;</description>
  </item>
  <item>
    <title>SERA (Allen AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#SERA-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Allen AI</category>
    <category>open</category>
    <description>Parameters: SERA
License: open
Type: model
Base: Qwen3-32B. SERA=Soft-verified Efficient Repository Agents. &quot;SERA was built largely by a single Ai2 researcher.&quot; https://allenai.org/blog/open-coding-agents &quot;SERA-32B was trained using Soft Verified Generation (SVG), a simple and efficient method that is 26x cheaper than reinforcement learning and 57x cheaper than previous synthetic data methods to reach equivalent performance. The total cost for data generation and training is approximately $2,000 (40 GPU-days).&quot;</description>
  </item>
  <item>
    <title>Kimi K2.5 (Moonshot AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Kimi%20K2.5-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Moonshot AI</category>
    <category>open</category>
    <description>Parameters: Kimi K2.5
License: open
Type: model
1TA32B. 1T parameters and 384 experts. Open source SOTA. &quot;Kimi K2.5 builds on Kimi K2 [15.5T tokens] with continued pretraining over approximately 15T mixed visual and text tokens. [+ 15T=30.5T]&quot;</description>
  </item>
  <item>
    <title>GLM-4.7-Flash (Z.AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GLM-4.7-Flash-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Z.AI</category>
    <category>open</category>
    <description>Parameters: GLM-4.7-Flash
License: open
Type: model
30B-A3B.</description>
  </item>
  <item>
    <title>MedGemma 1.5 4B (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MedGemma%201.5%204B-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>open</category>
    <description>Parameters: MedGemma 1.5 4B
License: open
Type: model
Lower MMLU score compared to previous MedGemma 1 27B (67.2 v 87). Announce: https://research.google/blog/next-generation-medical-image-interpretation-with-medgemma-15-and-medical-speech-to-text-with-medasr/</description>
  </item>
  <item>
    <title>FrogBoss (Microsoft)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#FrogBoss-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Microsoft</category>
    <category>open</category>
    <description>Parameters: FrogBoss
License: open
Type: model
Base: Qwen3-32B.</description>
  </item>
  <item>
    <title>EDEN (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#EDEN-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>closed</category>
    <description>Parameters: EDEN
License: closed
Type: model
&quot;EDEN (environmentally-derived evolutionary network) family of metagenomic foundation models, including a 28 billion parameter model trained on 9.7 trillion nucleotide tokens from BaseData1 . This dataset, at the time of training, contained more than 10 billion novel genes from over 1 million new species, and is intentionally enriched for environmental and host-associated metagenomes, phage sequences, and mobile genetic elements, enabling the model to learn from diverse and novel cross-species evolutionary mechanisms and apply them to key challenges in human health.&quot;</description>
  </item>
  <item>
    <title>Baichuan-M3 (Baichuan)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Baichuan-M3-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Baichuan</category>
    <category>open</category>
    <description>Parameters: Baichuan-M3
License: open
Type: model
&quot;new-generation medical-enhanced large language model&quot;</description>
  </item>
  <item>
    <title>Engram (DeepSeek-AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Engram-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>DeepSeek-AI</category>
    <category>partial</category>
    <description>Parameters: Engram
License: partial
Type: model
39.5BA3.8B. &quot;we explore conditional memory as a complementary sparsity axis, instantiated via Engram, a module that modernizes classic N -gram embeddings for O ( 1 ) lookup.&quot;</description>
  </item>
  <item>
    <title>SleepFM (Stanford)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#SleepFM-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Stanford</category>
    <category>open</category>
    <description>Parameters: SleepFM
License: open
Type: model
Uses a leave-one-out contrastive learning approach to align brain activity (EEG), heart activity (ECG), and respiratory signals. 130+ disease categories and 19–20+ clinical PSG channels. Dataset ~12.63B (Calculated based on 585,000 hours of data across 3 modality groups using 5-second window tokens) x 10 epochs.</description>
  </item>
  <item>
    <title>TimeCapsuleLLM-v2-1800-1875 (Independent)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#TimeCapsuleLLM-v2-1800-1875-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Independent</category>
    <category>open</category>
    <description>Parameters: TimeCapsuleLLM-v2-1800-1875
License: open
Type: model
112GB dataset=30B tokens x 0.5 epochs = 15B tokens.</description>
  </item>
  <item>
    <title>Jamba2 (AI21)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Jamba2-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>AI21</category>
    <category>open</category>
    <description>Parameters: Jamba2
License: open
Type: model
52B-A12B. Pre-training tokens from Jamba=1.2T + 500B mid.</description>
  </item>
  <item>
    <title>LFM2.5 (Liquid AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#LFM2.5-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>Liquid AI</category>
    <category>open</category>
    <description>Parameters: LFM2.5
License: open
Type: model
For on-device agentic applications. &quot;Extended pre-training from 10T to 28T tokens and large-scale multi-stage reinforcement learning.&quot;</description>
  </item>
  <item>
    <title>MiroThinker v1.5 (MiroMindAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiroThinker%20v1.5-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>MiroMindAI</category>
    <category>open</category>
    <description>Parameters: MiroThinker v1.5
License: open
Type: model
Base: Qwen3 235B-A22B. Official demo: https://dr.miromind.ai</description>
  </item>
  <item>
    <title>Falcon-H1R (TII)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Falcon-H1R-2026-01-01</guid>
    <pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate>
    <category>TII</category>
    <category>open</category>
    <description>Parameters: Falcon-H1R
License: open
Type: model
Base model: Falcon-H1 (May/2025). Announce: https://huggingface.co/blog/tiiuae/falcon-h1r-7b</description>
  </item>
  <item>
    <title>Solar Open 100B (Upstage)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Solar%20Open%20100B-2025-12-31</guid>
    <pubDate>Wed, 31 Dec 2025 00:00:00 GMT</pubDate>
    <category>Upstage</category>
    <category>closed</category>
    <description>Parameters: 102B
License: closed
Type: model
AI model by Upstage</description>
  </item>
  <item>
    <title>K-EXAONE (LG AI Research)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#K-EXAONE-2025-12-31</guid>
    <pubDate>Wed, 31 Dec 2025 00:00:00 GMT</pubDate>
    <category>LG AI Research</category>
    <category>closed</category>
    <description>Parameters: 236B
License: closed
Type: model
AI model by LG AI Research</description>
  </item>
  <item>
    <title>VAETKI (NC AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#VAETKI-2025-12-30</guid>
    <pubDate>Tue, 30 Dec 2025 00:00:00 GMT</pubDate>
    <category>NC AI</category>
    <category>open</category>
    <description>Parameters: 100B
License: open
Type: model
AI model by NC AI</description>
  </item>
  <item>
    <title>A.X K1 (SK Telecom)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#A.X%20K1-2025-12-30</guid>
    <pubDate>Tue, 30 Dec 2025 00:00:00 GMT</pubDate>
    <category>SK Telecom</category>
    <category>closed</category>
    <description>Parameters: 519B
License: closed
Type: model
AI model by SK Telecom</description>
  </item>
  <item>
    <title>HyperCLOVA X SEED 32B Think (NAVER)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#HyperCLOVA%20X%20SEED%2032B%20Think-2025-12-29</guid>
    <pubDate>Mon, 29 Dec 2025 00:00:00 GMT</pubDate>
    <category>NAVER</category>
    <category>closed</category>
    <description>Parameters: 32B
License: closed
Type: model
AI model by NAVER</description>
  </item>
  <item>
    <title>MiniMax-M2.1 (MiniMax)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiniMax-M2.1-2025-12-23</guid>
    <pubDate>Tue, 23 Dec 2025 00:00:00 GMT</pubDate>
    <category>MiniMax</category>
    <category>open</category>
    <description>Parameters: 229B
License: open
Type: model
AI model by MiniMax</description>
  </item>
  <item>
    <title>GLM-4.7 (Z.ai (Zhipu AI))</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GLM-4.7-2025-12-22</guid>
    <pubDate>Mon, 22 Dec 2025 00:00:00 GMT</pubDate>
    <category>Z.ai (Zhipu AI)</category>
    <category>open</category>
    <description>Parameters: 358B
License: open
Type: model
AI model by Z.ai (Zhipu AI)</description>
  </item>
  <item>
    <title>GPT-5.2 Codex (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.2%20Codex-2025-12-18</guid>
    <pubDate>Thu, 18 Dec 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by OpenAI</description>
  </item>
  <item>
    <title>Nomos 1 (Nous Research)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Nomos%201-2025-12-11</guid>
    <pubDate>Thu, 11 Dec 2025 00:00:00 GMT</pubDate>
    <category>Nous Research</category>
    <category>open</category>
    <description>Parameters: 30B
License: open
Type: model
AI model by Nous Research</description>
  </item>
  <item>
    <title>Nova 2 (Amazon Web Services (AWS))</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Nova%202-2025-12-02</guid>
    <pubDate>Tue, 02 Dec 2025 00:00:00 GMT</pubDate>
    <category>Amazon Web Services (AWS)</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Amazon Web Services (AWS)</description>
  </item>
  <item>
    <title>mHC 27B (DeepSeek-AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#mHC%2027B-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>DeepSeek-AI</category>
    <category>closed</category>
    <description>Parameters: mHC 27B
License: closed
Type: model
27BA4.14B. Scaling tested with 3B MoE on 1T tokens=334:1. &quot;Manifold-Constrained Hyper-Connections (mHC), a general framework that projects the residual connection space of HC onto a specific manifold to restore the identity mapping property, while incorporating rigorous infrastructure optimization to ensure efficiency. Empirical experiments demonstrate that mHC is effective for training at scale, offering tangible performance improvements and superior scalability.&quot;</description>
  </item>
  <item>
    <title>IQuest-Coder-V1 (IQuestLab)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#IQuest-Coder-V1-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>IQuestLab</category>
    <category>open</category>
    <description>Parameters: IQuest-Coder-V1
License: open
Type: model
&quot;IQuest-Coder-V1 captures the dynamic evolution of software logic, delivering state-of-the-art performance across critical dimensions&quot; https://github.com/IQuestLab/IQuest-Coder-V1</description>
  </item>
  <item>
    <title>A.X K1 (SK Hynix)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#A.X%20K1-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>SK Hynix</category>
    <category>open</category>
    <description>Parameters: A.X K1
License: open
Type: model
519BA33B.</description>
  </item>
  <item>
    <title>K-EXAONE (LG)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#K-EXAONE-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>LG</category>
    <category>open</category>
    <description>Parameters: K-EXAONE
License: open
Type: model
236BA23B. “EXAONE”=“EXpert AI for EveryONE”.</description>
  </item>
  <item>
    <title>Ranke-4B (UZH)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Ranke-4B-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>UZH</category>
    <category>closed</category>
    <description>Parameters: Ranke-4B
License: closed
Type: model
Base Model: Qwen 3. 600B tokens of pre-(1913, 1929, 1933, 1939, 1946) data only.</description>
  </item>
  <item>
    <title>WeDLM (Tencent)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#WeDLM-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Tencent</category>
    <category>open</category>
    <description>Parameters: WeDLM
License: open
Type: model
Project page: https://wedlm.github.io/ &quot;WeDLM, a diffusion decoding framework built entirely on standard causal attention to make parallel generation prefix-cache friendly. The core idea is to let each masked position condition on all currently observed tokens while keeping a strict causal mask, achieved by Topological Reordering that moves observed tokens to the physical prefix while preserving their logical positions.. We instantiate WeDLM on both Qwen2.5-7B and Qwen3-8B, utilizing 100B tokens for continued training and 10B tokens for SFT.&quot;</description>
  </item>
  <item>
    <title>SOLAR Open (Upstage AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#SOLAR%20Open-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Upstage AI</category>
    <category>open</category>
    <description>Parameters: SOLAR Open
License: open
Type: model
South Korean. 102BA12B. Releasing 31/Dec.</description>
  </item>
  <item>
    <title>GLM-4.7 (Z.AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GLM-4.7-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Z.AI</category>
    <category>open</category>
    <description>Parameters: GLM-4.7
License: open
Type: model
355B-A32B. &quot;context window has been expanded from 128K to 200K tokens&quot;</description>
  </item>
  <item>
    <title>NitroGen (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#NitroGen-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: NitroGen
License: open
Type: model
&quot;NitroGen is a unified vision-to-action model designed to play video games directly from raw frames. It takes video game footage as input and outputs gamepad actions... trained on 40,000 hours of gameplay videos across more than 1,000 games.&quot;</description>
  </item>
  <item>
    <title>MiMo-V2-Flash (Xiaomi)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiMo-V2-Flash-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Xiaomi</category>
    <category>open</category>
    <description>Parameters: MiMo-V2-Flash
License: open
Type: model
309BA15B.</description>
  </item>
  <item>
    <title>FunctionGemma (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#FunctionGemma-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>open</category>
    <description>Parameters: FunctionGemma
License: open
Type: model
&quot;FunctionGemma, a specialized version of our Gemma 3 270M model tuned for function calling. It is designed as a strong base for further training into custom, fast, private, local agents that translate natural language into executable API actions.&quot;</description>
  </item>
  <item>
    <title>T5Gemma 2 (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#T5Gemma%202-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>open</category>
    <description>Parameters: T5Gemma 2
License: open
Type: model
Base model: Gemma 3. Dataset: Gemma 3 4B checkpoint (4T) + pretraining (2T)=6T.</description>
  </item>
  <item>
    <title>Gemini 3 Flash (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Gemini%203%20Flash-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>open</category>
    <description>Parameters: Gemini 3 Flash
License: open
Type: model
Announce: https://deepmind.google/models/gemini/flash/</description>
  </item>
  <item>
    <title>NVIDIA-Nemotron-3-Nano-30B-A3B (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#NVIDIA-Nemotron-3-Nano-30B-A3B-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: NVIDIA-Nemotron-3-Nano-30B-A3B
License: open
Type: model
Knowledge cutoff November 28, 2025 (post).</description>
  </item>
  <item>
    <title>Bolmo (Allen AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Bolmo-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Allen AI</category>
    <category>open</category>
    <description>Parameters: Bolmo
License: open
Type: model
Base Model: Olmo 3 7B. Announce: https://allenai.org/blog/bolmo</description>
  </item>
  <item>
    <title>EuroLLM-22B (Consortium)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#EuroLLM-22B-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Consortium</category>
    <category>open</category>
    <description>Parameters: EuroLLM-22B
License: open
Type: model
A fully open language model developed in Europe.</description>
  </item>
  <item>
    <title>LLaDA2.0 Flash (Inclusion AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#LLaDA2.0%20Flash-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Inclusion AI</category>
    <category>open</category>
    <description>Parameters: LLaDA2.0 Flash
License: open
Type: model
Base Model: Ling-flash-2.0: 103B total parameters with 6.1B activated. &quot;largest diffusion language model to date&quot;</description>
  </item>
  <item>
    <title>GPT-5.2 (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.2-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>open</category>
    <description>Parameters: GPT-5.2
License: open
Type: model
&quot;GPT‑5.2 sets a new state of the art across many benchmarks, including GDPval, where it outperforms industry professionals at well-specified knowledge work tasks spanning 44 occupations.&quot; Announce: https://openai.com/index/introducing-gpt-5-2/ MMLU is for Spanish.</description>
  </item>
  <item>
    <title>Apriel-1.6-15B-Thinker (ServiceNow)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Apriel-1.6-15B-Thinker-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>ServiceNow</category>
    <category>open</category>
    <description>Parameters: Apriel-1.6-15B-Thinker
License: open
Type: model</description>
  </item>
  <item>
    <title>Motif 2 12.7B (Motif-Technologies)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Motif%202%2012.7B-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Motif-Technologies</category>
    <category>open</category>
    <description>Parameters: Motif 2 12.7B
License: open
Type: model</description>
  </item>
  <item>
    <title>Devstral 2 (Mistral)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Devstral%202-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Mistral</category>
    <category>open</category>
    <description>Parameters: Devstral 2
License: open
Type: model
SWE-bench Verified=72.2%.</description>
  </item>
  <item>
    <title>Nanbeige4-3B-Base (Nanbeige4-3B-Base)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Nanbeige4-3B-Base-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Nanbeige4-3B-Base</category>
    <category>open</category>
    <description>Parameters: Nanbeige4-3B-Base
License: open
Type: model</description>
  </item>
  <item>
    <title>HY 2.0 (Tencent)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#HY%202.0-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Tencent</category>
    <category>open</category>
    <description>Parameters: HY 2.0
License: open
Type: model
406BA32B.</description>
  </item>
  <item>
    <title>K2-V2 (MBZUAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#K2-V2-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>MBZUAI</category>
    <category>open</category>
    <description>Parameters: K2-V2
License: open
Type: model
8.5x more tokens trained than K2 (1.4T v 12T). Project page: https://ifm.ai/k2/</description>
  </item>
  <item>
    <title>Trinity-Mini (Arcee AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Trinity-Mini-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Arcee AI</category>
    <category>open</category>
    <description>Parameters: Trinity-Mini
License: open
Type: model
26BA3B. &quot;we worked closely with Prime Intellect. They not only served the H100 clusters Datology used to generate synthetic data, they have been deeply involved in helping scale our training setup to the GPU footprint required for a fully frontier sized model, including the current 2048 B300 GPU configuration for Trinity Large.&quot;</description>
  </item>
  <item>
    <title>Nova 2 Pro (Amazon)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Nova%202%20Pro-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Amazon</category>
    <category>open</category>
    <description>Parameters: Nova 2 Pro
License: open
Type: model
&quot;Nova 2 Pro is Amazon&apos;s most intelligent reasoning model that can process text, images, video, and speech to generate text.&quot;</description>
  </item>
  <item>
    <title>Mistral Large 3 (Mistral)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Mistral%20Large%203-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>Mistral</category>
    <category>open</category>
    <description>Parameters: Mistral Large 3
License: open
Type: model
675BA41B. &quot;Mistral Large 3 joins the ranks of frontier instruction-fine-tuned open-source models.&quot; EU tech doc: https://legal.cms.mistral.ai/assets/1e37fffd-7ea5-469b-822f-05dcfbb43623</description>
  </item>
  <item>
    <title>DeepSeek-V3.2-Speciale (DeepSeek-AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#DeepSeek-V3.2-Speciale-2025-12-01</guid>
    <pubDate>Mon, 01 Dec 2025 00:00:00 GMT</pubDate>
    <category>DeepSeek-AI</category>
    <category>open</category>
    <description>Parameters: DeepSeek-V3.2-Speciale
License: open
Type: model
The word &apos;Speciale&apos; may be a reference to Ferrari. &quot;It shows gold-medal performance in the IOI 2025, ICPC World Final 2025, IMO 2025, and CMO 2025.&quot; API: https://api-docs.deepseek.com/news/news251201</description>
  </item>
  <item>
    <title>DeepSeekMath-V2 (DeepSeek)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#DeepSeekMath-V2-2025-11-27</guid>
    <pubDate>Thu, 27 Nov 2025 00:00:00 GMT</pubDate>
    <category>DeepSeek</category>
    <category>open</category>
    <description>Parameters: 685B
License: open
Type: model
AI model by DeepSeek</description>
  </item>
  <item>
    <title>Claude Opus 4.5 (Anthropic)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Claude%20Opus%204.5-2025-11-24</guid>
    <pubDate>Mon, 24 Nov 2025 00:00:00 GMT</pubDate>
    <category>Anthropic</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Anthropic</description>
  </item>
  <item>
    <title>Olmo 3 (Allen Institute for AI (Ai2))</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Olmo%203-2025-11-20</guid>
    <pubDate>Thu, 20 Nov 2025 00:00:00 GMT</pubDate>
    <category>Allen Institute for AI (Ai2)</category>
    <category>open</category>
    <description>Parameters: 32B
License: open
Type: model
AI model by Allen Institute for AI (Ai2)</description>
  </item>
  <item>
    <title>Grok 4.1 Fast (xAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Grok%204.1%20Fast-2025-11-19</guid>
    <pubDate>Wed, 19 Nov 2025 00:00:00 GMT</pubDate>
    <category>xAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by xAI</description>
  </item>
  <item>
    <title>GPT-5.1-Codex-Max (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.1-Codex-Max-2025-11-19</guid>
    <pubDate>Wed, 19 Nov 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by OpenAI</description>
  </item>
  <item>
    <title>Gemini 3 Pro (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Gemini%203%20Pro-2025-11-18</guid>
    <pubDate>Tue, 18 Nov 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Google DeepMind</description>
  </item>
  <item>
    <title>Grok 4.1 (xAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Grok%204.1-2025-11-17</guid>
    <pubDate>Mon, 17 Nov 2025 00:00:00 GMT</pubDate>
    <category>xAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by xAI</description>
  </item>
  <item>
    <title>GPT-5.1 (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.1-2025-11-13</guid>
    <pubDate>Thu, 13 Nov 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by OpenAI</description>
  </item>
  <item>
    <title>GPT-5.1 Instant (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.1%20Instant-2025-11-13</guid>
    <pubDate>Thu, 13 Nov 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by OpenAI</description>
  </item>
  <item>
    <title>GPT-5.1-Codex (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.1-Codex-2025-11-12</guid>
    <pubDate>Wed, 12 Nov 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by OpenAI</description>
  </item>
  <item>
    <title>Kimi K2 Thinking (Moonshot)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Kimi%20K2%20Thinking-2025-11-06</guid>
    <pubDate>Thu, 06 Nov 2025 00:00:00 GMT</pubDate>
    <category>Moonshot</category>
    <category>open</category>
    <description>Parameters: 1T
License: open
Type: model
AI model by Moonshot</description>
  </item>
  <item>
    <title>Gen-0 (Generalist)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Gen-0-2025-11-04</guid>
    <pubDate>Tue, 04 Nov 2025 00:00:00 GMT</pubDate>
    <category>Generalist</category>
    <category>closed</category>
    <description>Parameters: 10B
License: closed
Type: model
AI model by Generalist</description>
  </item>
  <item>
    <title>DeepSeek-Math-V2 (DeepSeek-AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#DeepSeek-Math-V2-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>DeepSeek-AI</category>
    <category>open</category>
    <description>Parameters: DeepSeek-Math-V2
License: open
Type: model
&quot;DeepSeekMath-V2, demonstrates strong theorem-proving capabilities, achieving gold-level scores on IMO 2025 and CMO 2024 and a near-perfect 118/120 on Putnam 2024 with scaled testtime compute. &quot;</description>
  </item>
  <item>
    <title>Orchestrator-8B (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Orchestrator-8B-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: Orchestrator-8B
License: open
Type: model
Base Model: Qwen3-8B</description>
  </item>
  <item>
    <title>INTELLECT-3 (Prime Intellect)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#INTELLECT-3-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Prime Intellect</category>
    <category>open</category>
    <description>Parameters: INTELLECT-3
License: open
Type: model
Base: GLM-4.5-Air-Base model. 106BA12B. Announce: https://www.primeintellect.ai/blog/intellect-3</description>
  </item>
  <item>
    <title>Fara-7B (Microsoft)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Fara-7B-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Microsoft</category>
    <category>open</category>
    <description>Parameters: Fara-7B
License: open
Type: model
&quot;Fara-7B is Microsoft&apos;s first agentic small language model (SLM) designed specifically for computer use. With only 7 billion parameters, Fara-7B is an ultra-compact Computer Use Agent (CUA)...Current production baselines leverage Qwen 2.5-VL (7B).&quot;</description>
  </item>
  <item>
    <title>Claude Opus 4.5 (Anthropic)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Claude%20Opus%204.5-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Anthropic</category>
    <category>open</category>
    <description>Parameters: Claude Opus 4.5
License: open
Type: model
&quot;the best model in the world for coding, agents, and computer use.&quot; Announce: https://www.anthropic.com/news/claude-opus-4-5</description>
  </item>
  <item>
    <title>Nemotron Elastic (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Nemotron%20Elastic-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: Nemotron Elastic
License: open
Type: model
&quot;Nemotron Elastic, a framework for building reasoning-oriented LLMs, including hybrid Mamba-Attention architectures, that embed multiple nested submodels within a single parent model, each optimized for different deployment configurations and budgets. Each of these submodels shares weights with the parent model and can be extracted zero-shot during deployment without additional training or fine-tuning...We apply Nemotron Elastic to the Nemotron Nano V2 12B model, simultaneously producing a 9B and a 6B model using only 110B training tokens&quot;</description>
  </item>
  <item>
    <title>GeoVista (Tencent)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GeoVista-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Tencent</category>
    <category>open</category>
    <description>Parameters: GeoVista
License: open
Type: model
Base model: Qwen2.5-VL-7B-Instruct. &quot;GeoVista, an agentic model that seamlessly integrates tool invocation within the reasoning loop, including an image-zoom-in tool to magnify regions of interest and a web-search tool to retrieve related web information. &quot; Project page: https://ekonwang.github.io/geo-vista/</description>
  </item>
  <item>
    <title>OLMo 3 (Allen AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#OLMo%203-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Allen AI</category>
    <category>open</category>
    <description>Parameters: OLMo 3
License: open
Type: model
Announce: https://allenai.org/blog/olmo3</description>
  </item>
  <item>
    <title>Gemini 3 Pro (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Gemini%203%20Pro-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>open</category>
    <description>Parameters: Gemini 3 Pro
License: open
Type: model
&quot;The knowledge cutoff date for Gemini 3 Pro was January 2025.&quot;</description>
  </item>
  <item>
    <title>Grok 4.1 (xAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Grok%204.1-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>xAI</category>
    <category>open</category>
    <description>Parameters: Grok 4.1
License: open
Type: model</description>
  </item>
  <item>
    <title>Baguettotron (PleIAs)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Baguettotron-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>PleIAs</category>
    <category>open</category>
    <description>Parameters: Baguettotron
License: open
Type: model
&quot;The name is both a nod to French origins and to the unusual shape of the model: with 80 layers, Baguettotron is currently the deepest SLM in its size range.&quot;</description>
  </item>
  <item>
    <title>ERNIE-5.0-Preview-1022 (Baidu)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#ERNIE-5.0-Preview-1022-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Baidu</category>
    <category>open</category>
    <description>Parameters: ERNIE-5.0-Preview-1022
License: open
Type: model
Very low performance on ALPrompt. 2.4T params confirmed: https://global.chinadaily.com.cn/a/202511/13/WS691571bda310d6866eb29500.html</description>
  </item>
  <item>
    <title>GPT-5.1 (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5.1-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>open</category>
    <description>Parameters: GPT-5.1
License: open
Type: model
Personality change via fine-tuning. GPQA (no tools) increased from GPT-5=85.7 to GPT-5.1=88.1. MMLU is for Spanish.</description>
  </item>
  <item>
    <title>TiDAR (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#TiDAR-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: TiDAR
License: open
Type: model
Base model: Qwen3-8B (36T) + 150B continual training. &quot;TiDAR, a sequence-level hybrid architecture that drafts tokens (Thinking) in Diffusion and samples final outputs (Talking) AutoRegressively - all within a single forward pass using specially designed structured attention masks&quot;</description>
  </item>
  <item>
    <title>SONIC (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#SONIC-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: SONIC
License: open
Type: model
Supersizing mOtion tracking for Natural humanoId Control (SONIC). Training dataset calcs: (700 hours * 3,600 seconds/hour * 50 frames/second ) / 1 frame/token = 126M tokens (refined to 100M+ after rigorous filtering ); 150,000 steps * 6.67M tokens/batch = 1.0T total tokens seen during training.</description>
  </item>
  <item>
    <title>JustRL-Nemotron-1.5B (Tsinghua)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#JustRL-Nemotron-1.5B-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Tsinghua</category>
    <category>open</category>
    <description>Parameters: JustRL-Nemotron-1.5B
License: open
Type: model
&quot;JustRL, a simple recipe with fixed hyperparameters, achieves state-of-the-art performance on two different 1.5B base models (54.5% and 64.3% across 9 math benchmarks) while using 2× less compute than sophisticated approaches. The same hyperparameters transfer across both models without tuning, and training remains stable over thousands of steps without intervention. This suggests the field may be adding complexity to solve problems that disappear with a stable, scaled-up baseline.&quot;</description>
  </item>
  <item>
    <title>ERNIE-4.5-VL-28B-A3B-Thinking (Baidu)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#ERNIE-4.5-VL-28B-A3B-Thinking-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Baidu</category>
    <category>open</category>
    <description>Parameters: ERNIE-4.5-VL-28B-A3B-Thinking
License: open
Type: model
28B-A3B. Open-sourced 12/Nov/2025 from Jun/2025 release.</description>
  </item>
  <item>
    <title>HOPE (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#HOPE-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>partial</category>
    <description>Parameters: HOPE
License: partial
Type: model
&quot;Combining our self-modifying sequence model with the continuum memory system, we present a learning module, called HOPE, showing promising results in language modeling, continual learning, and long-context reasoning tasks.&quot; Announce: https://research.google/blog/introducing-nested-learning-a-new-ml-paradigm-for-continual-learning/ May be released after paper is public.</description>
  </item>
  <item>
    <title>Kimi K2 Thinking (Moonshot AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Kimi%20K2%20Thinking-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Moonshot AI</category>
    <category>open</category>
    <description>Parameters: Kimi K2 Thinking
License: open
Type: model
1TA32B. 1T parameters and 384 experts. Open source SOTA. HLE=51.0 on text-only subset, compare to Grok-4 HLE=50.7 also on text-only, but Grok-4 HLE=44.4 on HLE full, ∴ Kimi K2 Thinking HLE≈44 full (estimated).</description>
  </item>
  <item>
    <title>Ling-1T (Inclusion AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Ling-1T-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Inclusion AI</category>
    <category>open</category>
    <description>Parameters: Ling-1T
License: open
Type: model
1TA50B.</description>
  </item>
  <item>
    <title>GEN-0 (Generalist)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GEN-0-2025-11-01</guid>
    <pubDate>Sat, 01 Nov 2025 00:00:00 GMT</pubDate>
    <category>Generalist</category>
    <category>partial</category>
    <description>Parameters: GEN-0
License: partial
Type: model
&quot;GEN-0, a new class of embodied foundation models built for multimodal training directly on high-fidelity raw physical interaction. Its architecture builds on the strengths of vision and language models while also going beyond them—natively designed to capture human-level reflexes and physical commonsense. One core feature is Harmonic Reasoning, in which the models are trained to simultaneously think and act seamlessly... GEN-0 is pretrained on our in-house robotics dataset, which includes over 270,000 hours of real-world diverse manipulation data, growing at a rate of 10,000 hours a week and accelerating.&quot;</description>
  </item>
  <item>
    <title>Emu3.5 (Beijing Academy of Artificial Intelligence / BAAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Emu3.5-2025-10-30</guid>
    <pubDate>Thu, 30 Oct 2025 00:00:00 GMT</pubDate>
    <category>Beijing Academy of Artificial Intelligence / BAAI</category>
    <category>open</category>
    <description>Parameters: 34.1B
License: open
Type: model
AI model by Beijing Academy of Artificial Intelligence / BAAI</description>
  </item>
  <item>
    <title>Kimi Linear (Moonshot)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Kimi%20Linear-2025-10-30</guid>
    <pubDate>Thu, 30 Oct 2025 00:00:00 GMT</pubDate>
    <category>Moonshot</category>
    <category>open</category>
    <description>Parameters: 48B
License: open
Type: model
AI model by Moonshot</description>
  </item>
  <item>
    <title>Composer (Cursor)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Composer-2025-10-29</guid>
    <pubDate>Wed, 29 Oct 2025 00:00:00 GMT</pubDate>
    <category>Cursor</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Cursor</description>
  </item>
  <item>
    <title>SWE-1.5 (Cognition)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#SWE-1.5-2025-10-29</guid>
    <pubDate>Wed, 29 Oct 2025 00:00:00 GMT</pubDate>
    <category>Cognition</category>
    <category>closed</category>
    <description>Parameters: 300B
License: closed
Type: model
AI model by Cognition</description>
  </item>
  <item>
    <title>Tongyi DeepResearch (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Tongyi%20DeepResearch-2025-10-28</guid>
    <pubDate>Tue, 28 Oct 2025 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>open</category>
    <description>Parameters: 30.5B
License: open
Type: model
AI model by Alibaba</description>
  </item>
  <item>
    <title>MiniMax-M2 (MiniMax)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiniMax-M2-2025-10-27</guid>
    <pubDate>Mon, 27 Oct 2025 00:00:00 GMT</pubDate>
    <category>MiniMax</category>
    <category>open</category>
    <description>Parameters: 229B
License: open
Type: model
AI model by MiniMax</description>
  </item>
  <item>
    <title>LoongRL 7B (Microsoft Research Asia,Shanghai Jiao Tong University,Carnegie Mellon University (CMU))</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#LoongRL%207B-2025-10-27</guid>
    <pubDate>Mon, 27 Oct 2025 00:00:00 GMT</pubDate>
    <category>Microsoft Research Asia,Shanghai Jiao Tong University,Carnegie Mellon University (CMU)</category>
    <category>closed</category>
    <description>Parameters: 7B
License: closed
Type: model
AI model by Microsoft Research Asia,Shanghai Jiao Tong University,Carnegie Mellon University (CMU)</description>
  </item>
  <item>
    <title>LoongRL 14B (Microsoft Research Asia,Shanghai Jiao Tong University,Carnegie Mellon University (CMU))</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#LoongRL%2014B-2025-10-27</guid>
    <pubDate>Mon, 27 Oct 2025 00:00:00 GMT</pubDate>
    <category>Microsoft Research Asia,Shanghai Jiao Tong University,Carnegie Mellon University (CMU)</category>
    <category>closed</category>
    <description>Parameters: 14B
License: closed
Type: model
AI model by Microsoft Research Asia,Shanghai Jiao Tong University,Carnegie Mellon University (CMU)</description>
  </item>
  <item>
    <title>Lapa LLM (Ukrainian Catholic University,Igor Sikorsky Kyiv Polytechnic Institute,AGH University of Krakow,Lviv Polytechnic)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Lapa%20LLM-2025-10-25</guid>
    <pubDate>Sat, 25 Oct 2025 00:00:00 GMT</pubDate>
    <category>Ukrainian Catholic University,Igor Sikorsky Kyiv Polytechnic Institute,AGH University of Krakow,Lviv Polytechnic</category>
    <category>open</category>
    <description>Parameters: 12B
License: open
Type: model
AI model by Ukrainian Catholic University,Igor Sikorsky Kyiv Polytechnic Institute,AGH University of Krakow,Lviv Polytechnic</description>
  </item>
  <item>
    <title>Ring-mini-linear-2.0 (Ant Group)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Ring-mini-linear-2.0-2025-10-23</guid>
    <pubDate>Thu, 23 Oct 2025 00:00:00 GMT</pubDate>
    <category>Ant Group</category>
    <category>open</category>
    <description>Parameters: 16.4B
License: open
Type: model
AI model by Ant Group</description>
  </item>
  <item>
    <title>Ring-flash-linear-2.0 (Ant Group)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Ring-flash-linear-2.0-2025-10-23</guid>
    <pubDate>Thu, 23 Oct 2025 00:00:00 GMT</pubDate>
    <category>Ant Group</category>
    <category>open</category>
    <description>Parameters: 104.2B
License: open
Type: model
AI model by Ant Group</description>
  </item>
  <item>
    <title>Deepseek OCR (DeepSeek)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Deepseek%20OCR-2025-10-21</guid>
    <pubDate>Tue, 21 Oct 2025 00:00:00 GMT</pubDate>
    <category>DeepSeek</category>
    <category>open</category>
    <description>Parameters: 3B
License: open
Type: model
AI model by DeepSeek</description>
  </item>
  <item>
    <title>BAPO 32B (Fudan University,Shanghai Qiji Zhifeng)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#BAPO%2032B-2025-10-21</guid>
    <pubDate>Tue, 21 Oct 2025 00:00:00 GMT</pubDate>
    <category>Fudan University,Shanghai Qiji Zhifeng</category>
    <category>closed</category>
    <description>Parameters: 32B
License: closed
Type: model
AI model by Fudan University,Shanghai Qiji Zhifeng</description>
  </item>
  <item>
    <title>Odyssey 102B (Anthrogen)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Odyssey%20102B-2025-10-18</guid>
    <pubDate>Sat, 18 Oct 2025 00:00:00 GMT</pubDate>
    <category>Anthrogen</category>
    <category>closed</category>
    <description>Parameters: 102B
License: closed
Type: model
AI model by Anthrogen</description>
  </item>
  <item>
    <title>Odyssey 12B (Anthrogen)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Odyssey%2012B-2025-10-18</guid>
    <pubDate>Sat, 18 Oct 2025 00:00:00 GMT</pubDate>
    <category>Anthrogen</category>
    <category>closed</category>
    <description>Parameters: 12B
License: closed
Type: model
AI model by Anthrogen</description>
  </item>
  <item>
    <title>Odyssey 1.2B (Anthrogen)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Odyssey%201.2B-2025-10-18</guid>
    <pubDate>Sat, 18 Oct 2025 00:00:00 GMT</pubDate>
    <category>Anthrogen</category>
    <category>closed</category>
    <description>Parameters: 1.2B
License: closed
Type: model
AI model by Anthrogen</description>
  </item>
  <item>
    <title>Claude Haiku 4.5 (Anthropic)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Claude%20Haiku%204.5-2025-10-15</guid>
    <pubDate>Wed, 15 Oct 2025 00:00:00 GMT</pubDate>
    <category>Anthropic</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Anthropic</description>
  </item>
  <item>
    <title>Veo 3.1 (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Veo%203.1-2025-10-15</guid>
    <pubDate>Wed, 15 Oct 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Google DeepMind</description>
  </item>
  <item>
    <title>Llama 4 Scout + ScaleRL (Meta AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Llama%204%20Scout%20%2B%20ScaleRL-2025-10-15</guid>
    <pubDate>Wed, 15 Oct 2025 00:00:00 GMT</pubDate>
    <category>Meta AI</category>
    <category>closed</category>
    <description>Parameters: 109B
License: closed
Type: model
AI model by Meta AI</description>
  </item>
  <item>
    <title>MAI-Image-1 (Microsoft)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MAI-Image-1-2025-10-13</guid>
    <pubDate>Mon, 13 Oct 2025 00:00:00 GMT</pubDate>
    <category>Microsoft</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Microsoft</description>
  </item>
  <item>
    <title>C2S-Scale (Google Research,Yale University,Google DeepMind,Brown University,University of Southern California)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#C2S-Scale-2025-10-11</guid>
    <pubDate>Sat, 11 Oct 2025 00:00:00 GMT</pubDate>
    <category>Google Research,Yale University,Google DeepMind,Brown University,University of Southern California</category>
    <category>open</category>
    <description>Parameters: 27B
License: open
Type: model
AI model by Google Research,Yale University,Google DeepMind,Brown University,University of Southern California</description>
  </item>
  <item>
    <title>Ring-1T (Ant Group)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Ring-1T-2025-10-10</guid>
    <pubDate>Fri, 10 Oct 2025 00:00:00 GMT</pubDate>
    <category>Ant Group</category>
    <category>open</category>
    <description>Parameters: 1T
License: open
Type: model
AI model by Ant Group</description>
  </item>
  <item>
    <title>Ling-1T (Ant Group)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Ling-1T-2025-10-10</guid>
    <pubDate>Fri, 10 Oct 2025 00:00:00 GMT</pubDate>
    <category>Ant Group</category>
    <category>open</category>
    <description>Parameters: 1T
License: open
Type: model
AI model by Ant Group</description>
  </item>
  <item>
    <title>Grok Imagine (xAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Grok%20Imagine-2025-10-08</guid>
    <pubDate>Wed, 08 Oct 2025 00:00:00 GMT</pubDate>
    <category>xAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by xAI</description>
  </item>
  <item>
    <title>GPT-5 Pro (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT-5%20Pro-2025-10-07</guid>
    <pubDate>Tue, 07 Oct 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by OpenAI</description>
  </item>
  <item>
    <title>Gemini 2.5 Computer Use (Google)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Gemini%202.5%20Computer%20Use-2025-10-07</guid>
    <pubDate>Tue, 07 Oct 2025 00:00:00 GMT</pubDate>
    <category>Google</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Google</description>
  </item>
  <item>
    <title>Tiny Recursive Model (TRM-Att) (Samsung SAIT AI Lab)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Tiny%20Recursive%20Model%20(TRM-Att)-2025-10-06</guid>
    <pubDate>Mon, 06 Oct 2025 00:00:00 GMT</pubDate>
    <category>Samsung SAIT AI Lab</category>
    <category>closed</category>
    <description>Parameters: 7M
License: closed
Type: model
AI model by Samsung SAIT AI Lab</description>
  </item>
  <item>
    <title>Granite-4.0-H-Tiny (IBM)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Granite-4.0-H-Tiny-2025-10-02</guid>
    <pubDate>Thu, 02 Oct 2025 00:00:00 GMT</pubDate>
    <category>IBM</category>
    <category>open</category>
    <description>Parameters: 7B
License: open
Type: model
AI model by IBM</description>
  </item>
  <item>
    <title>Granite-4.0-H-Micro (IBM)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Granite-4.0-H-Micro-2025-10-02</guid>
    <pubDate>Thu, 02 Oct 2025 00:00:00 GMT</pubDate>
    <category>IBM</category>
    <category>open</category>
    <description>Parameters: 3B
License: open
Type: model
AI model by IBM</description>
  </item>
  <item>
    <title>Granite-4.0-H-Small (IBM)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Granite-4.0-H-Small-2025-10-02</guid>
    <pubDate>Thu, 02 Oct 2025 00:00:00 GMT</pubDate>
    <category>IBM</category>
    <category>open</category>
    <description>Parameters: 32B
License: open
Type: model
AI model by IBM</description>
  </item>
  <item>
    <title>CALM (Wechat)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#CALM-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>Wechat</category>
    <category>open</category>
    <description>Parameters: CALM
License: open
Type: model
&quot;Continuous Autoregressive Language Models (CALM), a paradigm shift from discrete next-token prediction to continuous next-vector prediction. CALM uses a high-fidelity autoencoder to compress a chunk of K tokens into a single continuous vector, from which the original tokens can be reconstructed with over 99.9% accuracy... We train our models on the Pile uncopyrighted dataset (Gao et al., 2020). The raw text is processed with the Llama 3 tokenizer (Grattafiori et al., 2024), resulting in a training set of ∼230B tokens.&quot;</description>
  </item>
  <item>
    <title>Kimi-Linear (Moonshot AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Kimi-Linear-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>Moonshot AI</category>
    <category>open</category>
    <description>Parameters: Kimi-Linear
License: open
Type: model
48B-A3B. &quot;Kimi Linear is a hybrid linear attention architecture that outperforms traditional full attention methods across various contexts, including short, long, and reinforcement learning (RL) scaling regimes. At its core is Kimi Delta Attention (KDA)—a refined version of Gated DeltaNet that introduces a more efficient gating mechanism to optimize the use of finite-state RNN memory.&quot;</description>
  </item>
  <item>
    <title>MiniMax-M2 (MiniMax)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MiniMax-M2-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>MiniMax</category>
    <category>open</category>
    <description>Parameters: MiniMax-M2
License: open
Type: model
230B-A10B.</description>
  </item>
  <item>
    <title>MACE-MH-1 (Cambridge/LBNL)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MACE-MH-1-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>Cambridge/LBNL</category>
    <category>open</category>
    <description>Parameters: MACE-MH-1
License: open
Type: model
MACE-MH-1 (Multi-Head 1). Features Multiple Heads (OMAT PBE, OMOL r2scan, OC20) to maintain high accuracy across domains</description>
  </item>
  <item>
    <title>DeepSeek-OCR (DeepSeek-AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#DeepSeek-OCR-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>DeepSeek-AI</category>
    <category>open</category>
    <description>Parameters: DeepSeek-OCR
License: open
Type: model
2D vision tokens for 1D text achieves huge compression. Encoder/Decoder: DeepEncoder 380M (80M SAM-base + 300M CLIP-large), DeepSeek-3B-MoE (A570M).</description>
  </item>
  <item>
    <title>UserLM-8b (Microsoft)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#UserLM-8b-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>Microsoft</category>
    <category>open</category>
    <description>Parameters: UserLM-8b
License: open
Type: model
&quot;we trained UserLM-8b to simulate the “user” role in conversation (by training it to predict user turns in a large corpus of conversations called WildChat).&quot;</description>
  </item>
  <item>
    <title>CoDA (Salesforce)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#CoDA-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>Salesforce</category>
    <category>open</category>
    <description>Parameters: CoDA
License: open
Type: model
&quot;diffusion coder trained on TPU [Google TPU v4-1024 VM]&quot;</description>
  </item>
  <item>
    <title>TRM (Samsung)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#TRM-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>Samsung</category>
    <category>open</category>
    <description>Parameters: TRM
License: open
Type: model
&quot;Tiny Recursive Model (TRM), a much simpler recursive reasoning approach that achieves significantly higher generalization than HRM, while using a single tiny network with only 2 layers&quot;</description>
  </item>
  <item>
    <title>Granite-4.0 Small (IBM)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Granite-4.0%20Small-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>IBM</category>
    <category>open</category>
    <description>Parameters: Granite-4.0 Small
License: open
Type: model
32B-A9B. Announce: https://www.ibm.com/new/announcements/ibm-granite-4-0-hyper-efficient-high-performance-hybrid-models</description>
  </item>
  <item>
    <title>Octave 2 (Hume)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Octave%202-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>Hume</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Hume</description>
  </item>
  <item>
    <title>EVI 4 mini (Hume)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#EVI%204%20mini-2025-10-01</guid>
    <pubDate>Wed, 01 Oct 2025 00:00:00 GMT</pubDate>
    <category>Hume</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Hume</description>
  </item>
  <item>
    <title>GLM-4.6 (Z.ai (Zhipu AI),Tsinghua University)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GLM-4.6-2025-09-30</guid>
    <pubDate>Tue, 30 Sep 2025 00:00:00 GMT</pubDate>
    <category>Z.ai (Zhipu AI),Tsinghua University</category>
    <category>open</category>
    <description>Parameters: 357B
License: open
Type: model
AI model by Z.ai (Zhipu AI),Tsinghua University</description>
  </item>
  <item>
    <title>Sora 2.0 (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Sora%202.0-2025-09-30</guid>
    <pubDate>Tue, 30 Sep 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by OpenAI</description>
  </item>
  <item>
    <title>Kandinsky 5.0 Video Lite (Sber)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Kandinsky%205.0%20Video%20Lite-2025-09-30</guid>
    <pubDate>Tue, 30 Sep 2025 00:00:00 GMT</pubDate>
    <category>Sber</category>
    <category>open</category>
    <description>Parameters: 2B
License: open
Type: model
AI model by Sber</description>
  </item>
  <item>
    <title>Claude Sonnet 4.5 (Anthropic)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Claude%20Sonnet%204.5-2025-09-29</guid>
    <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
    <category>Anthropic</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Anthropic</description>
  </item>
  <item>
    <title>NVIDIA Isaac GR00T N1.6 (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#NVIDIA%20Isaac%20GR00T%20N1.6-2025-09-29</guid>
    <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by NVIDIA</description>
  </item>
  <item>
    <title>Cosmos-Transfer2.5-2B (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Cosmos-Transfer2.5-2B-2025-09-29</guid>
    <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: 2B
License: open
Type: model
AI model by NVIDIA</description>
  </item>
  <item>
    <title>Cosmos-Predict2.5-14B (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Cosmos-Predict2.5-14B-2025-09-29</guid>
    <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>closed</category>
    <description>Parameters: 14B
License: closed
Type: model
AI model by NVIDIA</description>
  </item>
  <item>
    <title>Cosmos-Predict2.5 2B (NVIDIA)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Cosmos-Predict2.5%202B-2025-09-29</guid>
    <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
    <category>NVIDIA</category>
    <category>open</category>
    <description>Parameters: 2B
License: open
Type: model
AI model by NVIDIA</description>
  </item>
  <item>
    <title>DeepSeek-V3.2-Exp (DeepSeek)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#DeepSeek-V3.2-Exp-2025-09-29</guid>
    <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
    <category>DeepSeek</category>
    <category>open</category>
    <description>Parameters: 671B
License: open
Type: model
AI model by DeepSeek</description>
  </item>
  <item>
    <title>MinerU2.5 (Shanghai AI Lab,Peking University,Shanghai Jiao Tong University)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#MinerU2.5-2025-09-29</guid>
    <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
    <category>Shanghai AI Lab,Peking University,Shanghai Jiao Tong University</category>
    <category>open</category>
    <description>Parameters: 1.2B
License: open
Type: model
AI model by Shanghai AI Lab,Peking University,Shanghai Jiao Tong University</description>
  </item>
  <item>
    <title>Wan 2.5 (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Wan%202.5-2025-09-29</guid>
    <pubDate>Mon, 29 Sep 2025 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Alibaba</description>
  </item>
  <item>
    <title>Seedream 4.0 (ByteDance)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Seedream%204.0-2025-09-28</guid>
    <pubDate>Sun, 28 Sep 2025 00:00:00 GMT</pubDate>
    <category>ByteDance</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by ByteDance</description>
  </item>
  <item>
    <title>Kling 2.5 Turbo (Kuaishou Technology)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Kling%202.5%20Turbo-2025-09-26</guid>
    <pubDate>Fri, 26 Sep 2025 00:00:00 GMT</pubDate>
    <category>Kuaishou Technology</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Kuaishou Technology</description>
  </item>
  <item>
    <title>Suno v5 (Suno)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Suno%20v5-2025-09-25</guid>
    <pubDate>Thu, 25 Sep 2025 00:00:00 GMT</pubDate>
    <category>Suno</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Suno</description>
  </item>
  <item>
    <title>Gemini Robotics 1.5 (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Gemini%20Robotics%201.5-2025-09-25</guid>
    <pubDate>Thu, 25 Sep 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Google DeepMind</description>
  </item>
  <item>
    <title>Gemini Robotics-ER 1.5 (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Gemini%20Robotics-ER%201.5-2025-09-25</guid>
    <pubDate>Thu, 25 Sep 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Google DeepMind</description>
  </item>
  <item>
    <title>GigaEmbeddings (Sber,Moscow Institute of Physics and Technology)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GigaEmbeddings-2025-09-25</guid>
    <pubDate>Thu, 25 Sep 2025 00:00:00 GMT</pubDate>
    <category>Sber,Moscow Institute of Physics and Technology</category>
    <category>open</category>
    <description>Parameters: 3B
License: open
Type: model
AI model by Sber,Moscow Institute of Physics and Technology</description>
  </item>
  <item>
    <title>SimpleFold (Apple)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#SimpleFold-2025-09-23</guid>
    <pubDate>Tue, 23 Sep 2025 00:00:00 GMT</pubDate>
    <category>Apple</category>
    <category>open</category>
    <description>Parameters: 3B
License: open
Type: model
AI model by Apple</description>
  </item>
  <item>
    <title>DeepSeek-V3.1-Terminus (DeepSeek)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#DeepSeek-V3.1-Terminus-2025-09-22</guid>
    <pubDate>Mon, 22 Sep 2025 00:00:00 GMT</pubDate>
    <category>DeepSeek</category>
    <category>open</category>
    <description>Parameters: 671B
License: open
Type: model
AI model by DeepSeek</description>
  </item>
  <item>
    <title>Qwen3-Omni-Flash (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Qwen3-Omni-Flash-2025-09-22</guid>
    <pubDate>Mon, 22 Sep 2025 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>closed</category>
    <description>Parameters: 35.3B
License: closed
Type: model
AI model by Alibaba</description>
  </item>
  <item>
    <title>Qwen3-Omni-30B-A3B (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Qwen3-Omni-30B-A3B-2025-09-22</guid>
    <pubDate>Mon, 22 Sep 2025 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>open</category>
    <description>Parameters: 35.3B
License: open
Type: model
AI model by Alibaba</description>
  </item>
  <item>
    <title>Grok 4 Fast (xAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Grok%204%20Fast-2025-09-19</guid>
    <pubDate>Fri, 19 Sep 2025 00:00:00 GMT</pubDate>
    <category>xAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by xAI</description>
  </item>
  <item>
    <title>Magistral Small 1.2 (Mistral AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Magistral%20Small%201.2-2025-09-18</guid>
    <pubDate>Thu, 18 Sep 2025 00:00:00 GMT</pubDate>
    <category>Mistral AI</category>
    <category>open</category>
    <description>Parameters: 24B
License: open
Type: model
AI model by Mistral AI</description>
  </item>
  <item>
    <title>Magistral Medium 1.2 (Mistral AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Magistral%20Medium%201.2-2025-09-18</guid>
    <pubDate>Thu, 18 Sep 2025 00:00:00 GMT</pubDate>
    <category>Mistral AI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Mistral AI</description>
  </item>
  <item>
    <title>Granite-Docling (IBM)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Granite-Docling-2025-09-17</guid>
    <pubDate>Wed, 17 Sep 2025 00:00:00 GMT</pubDate>
    <category>IBM</category>
    <category>open</category>
    <description>Parameters: 258M
License: open
Type: model
AI model by IBM</description>
  </item>
  <item>
    <title>AgentFounder-30B (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#AgentFounder-30B-2025-09-16</guid>
    <pubDate>Tue, 16 Sep 2025 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>open</category>
    <description>Parameters: 30B
License: open
Type: model
AI model by Alibaba</description>
  </item>
  <item>
    <title>Fabric 1.0 (Veed)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Fabric%201.0-2025-09-15</guid>
    <pubDate>Mon, 15 Sep 2025 00:00:00 GMT</pubDate>
    <category>Veed</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Veed</description>
  </item>
  <item>
    <title>GPT‑5-Codex (OpenAI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#GPT%E2%80%915-Codex-2025-09-15</guid>
    <pubDate>Mon, 15 Sep 2025 00:00:00 GMT</pubDate>
    <category>OpenAI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by OpenAI</description>
  </item>
  <item>
    <title>Qwen3-Next-80B-A3B (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Qwen3-Next-80B-A3B-2025-09-10</guid>
    <pubDate>Wed, 10 Sep 2025 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>open</category>
    <description>Parameters: 80B
License: open
Type: model
AI model by Alibaba</description>
  </item>
  <item>
    <title>Lucid Origin (Leonardo AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Lucid%20Origin-2025-09-10</guid>
    <pubDate>Wed, 10 Sep 2025 00:00:00 GMT</pubDate>
    <category>Leonardo AI</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Leonardo AI</description>
  </item>
  <item>
    <title>Ling-mini-base-2.0-20T (Ant Group)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Ling-mini-base-2.0-20T-2025-09-10</guid>
    <pubDate>Wed, 10 Sep 2025 00:00:00 GMT</pubDate>
    <category>Ant Group</category>
    <category>open</category>
    <description>Parameters: 16B
License: open
Type: model
AI model by Ant Group</description>
  </item>
  <item>
    <title>Ling-flash-base-2.0-20T (Ant Group)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Ling-flash-base-2.0-20T-2025-09-10</guid>
    <pubDate>Wed, 10 Sep 2025 00:00:00 GMT</pubDate>
    <category>Ant Group</category>
    <category>open</category>
    <description>Parameters: 100B
License: open
Type: model
AI model by Ant Group</description>
  </item>
  <item>
    <title>K2 Think (Mohamed bin Zayed University of Artificial Intelligence (MBZUAI),G42)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#K2%20Think-2025-09-09</guid>
    <pubDate>Tue, 09 Sep 2025 00:00:00 GMT</pubDate>
    <category>Mohamed bin Zayed University of Artificial Intelligence (MBZUAI),G42</category>
    <category>open</category>
    <description>Parameters: 32B
License: open
Type: model
AI model by Mohamed bin Zayed University of Artificial Intelligence (MBZUAI),G42</description>
  </item>
  <item>
    <title>Signal Processing Transformer (Softbank)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Signal%20Processing%20Transformer-2025-09-09</guid>
    <pubDate>Tue, 09 Sep 2025 00:00:00 GMT</pubDate>
    <category>Softbank</category>
    <category>closed</category>
    <description>License: closed
Type: model
AI model by Softbank</description>
  </item>
  <item>
    <title>Qwen3-Max (Alibaba)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Qwen3-Max-2025-09-05</guid>
    <pubDate>Fri, 05 Sep 2025 00:00:00 GMT</pubDate>
    <category>Alibaba</category>
    <category>closed</category>
    <description>Parameters: 1T
License: closed
Type: model
AI model by Alibaba</description>
  </item>
  <item>
    <title>EmbeddingGemma (Google DeepMind)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#EmbeddingGemma-2025-09-05</guid>
    <pubDate>Fri, 05 Sep 2025 00:00:00 GMT</pubDate>
    <category>Google DeepMind</category>
    <category>open</category>
    <description>Parameters: 308M
License: open
Type: model
AI model by Google DeepMind</description>
  </item>
  <item>
    <title>Chatterbox Multilingual (Resemble AI)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Chatterbox%20Multilingual-2025-09-04</guid>
    <pubDate>Thu, 04 Sep 2025 00:00:00 GMT</pubDate>
    <category>Resemble AI</category>
    <category>open</category>
    <description>License: open
Type: model
AI model by Resemble AI</description>
  </item>
  <item>
    <title>Apertus 8B (ETH Zurich,Ecole Polytechnique F´ed´erale de Lausanne (EPFL),Swiss National Supercomputing Centre (CSCS),Swisscom)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Apertus%208B-2025-09-02</guid>
    <pubDate>Tue, 02 Sep 2025 00:00:00 GMT</pubDate>
    <category>ETH Zurich,Ecole Polytechnique F´ed´erale de Lausanne (EPFL),Swiss National Supercomputing Centre (CSCS),Swisscom</category>
    <category>open</category>
    <description>Parameters: 8B
License: open
Type: model
AI model by ETH Zurich,Ecole Polytechnique F´ed´erale de Lausanne (EPFL),Swiss National Supercomputing Centre (CSCS),Swisscom</description>
  </item>
  <item>
    <title>Apertus 70B (ETH Zurich,Ecole Polytechnique F´ed´erale de Lausanne (EPFL),Swiss National Supercomputing Centre (CSCS),Swisscom)</title>
    <link>https://llm-timeline.duyet.net</link>
    <guid isPermaLink="false">https://llm-timeline.duyet.net/#Apertus%2070B-2025-09-02</guid>
    <pubDate>Tue, 02 Sep 2025 00:00:00 GMT</pubDate>
    <category>ETH Zurich,Ecole Polytechnique F´ed´erale de Lausanne (EPFL),Swiss National Supercomputing Centre (CSCS),Swisscom</category>
    <category>open</category>
    <description>Parameters: 70B
License: open
Type: model
AI model by ETH Zurich,Ecole Polytechnique F´ed´erale de Lausanne (EPFL),Swiss National Supercomputing Centre (CSCS),Swisscom</description>
  </item>
  </channel>
</rss>