Mistral releases Leanstral, a 6B parameter AI agent for Lean 4 formal verification, beating larger models at 1/15th the cost under Apache 2.0 license. (Read MoreMistral releases Leanstral, a 6B parameter AI agent for Lean 4 formal verification, beating larger models at 1/15th the cost under Apache 2.0 license. (Read More

Mistral AI Launches Leanstral Open-Source Proof Agent for Lean 4

2026/03/17 03:13
3 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

Mistral AI Launches Leanstral Open-Source Proof Agent for Lean 4

Zach Anderson Mar 16, 2026 19:13

Mistral releases Leanstral, a 6B parameter AI agent for Lean 4 formal verification, beating larger models at 1/15th the cost under Apache 2.0 license.

Mistral AI Launches Leanstral Open-Source Proof Agent for Lean 4

Mistral AI released Leanstral on March 16, 2026—the first open-source AI agent built specifically for Lean 4 formal verification. The 120B parameter model runs on just 6B active parameters and ships under Apache 2.0 licensing, making production-grade theorem proving accessible without enterprise budgets.

Why does this matter for crypto? Formal verification—mathematical proof that code does exactly what it claims—has become the gold standard for securing smart contracts and blockchain protocols. Bugs in DeFi code have cost billions. Leanstral could dramatically lower the barrier for projects seeking verified security.

Performance vs. Cost Trade-offs

Mistral benchmarked Leanstral against both proprietary and open-source competitors using FLTEval, a new evaluation suite testing real proof engineering tasks from the Fermat's Last Theorem formalization project.

The numbers are striking. Leanstral at pass@2 scored 26.3 points for $36 in compute costs. Claude Sonnet 4.6 managed 23.7 points but ran up a $549 bill—over 15x the cost for worse performance. Even at pass@16, where Leanstral hits 31.9 points for $290, it still costs less than one-fifth of Claude Opus 4.6's $1,650 price tag (though Opus leads quality at 39.6).

Against open-source alternatives, the efficiency gap widens further. GLM5-744B-A40B and Kimi-K2.5-1T-A32B plateau around 16-20 points despite having 6-8x more active parameters. Qwen3.5-397B-A17B needs four passes to reach 25.4 points—Leanstral beats that with two.

Technical Architecture

Leanstral uses a sparse mixture-of-experts architecture optimized for proof engineering workflows. The model integrates with Lean's language server protocol through MCP (Model Context Protocol), specifically trained for maximal performance with lean-lsp-mcp tooling.

Lean 4 itself launched stable in September 2023 and has seen rapid adoption for formalizing mathematics. The Mathlib library—a massive collection of mathematical proofs—successfully ported to Lean 4 that same year. Projects like the formal proof of Fermat's Last Theorem demonstrate the platform's capability for serious mathematical work.

Real-World Applications

Mistral showcased Leanstral handling a genuine Stack Exchange debugging question about breaking changes in Lean 4.29.0-rc6. The agent diagnosed a definitional equality issue with type aliases and correctly identified that swapping def for abbrev would restore tactic matching.

The model also demonstrated cross-language translation, converting Rocq (formerly Coq) definitions to Lean 4 while preserving proof semantics and implementing custom notation.

Access Options

Three deployment paths exist: direct integration in Mistral Vibe (use /leanstall to start), a free API endpoint at labs-leanstral-2603 for limited-time feedback gathering, or self-hosted deployment with the Apache 2.0 weights.

For blockchain projects, the calculus is straightforward. Formal verification has traditionally required either expensive auditing firms or deep in-house expertise. An open-source agent that can prove code correctness at $36-290 per task could reshape how protocols approach security—assuming the proofs hold up under production conditions.

Image source: Shutterstock
  • mistral ai
  • leanstral
  • lean 4
  • formal verification
  • open source
Market Opportunity
4 Logo
4 Price(4)
$0.008048
$0.008048$0.008048
+3.97%
USD
4 (4) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Polygon Tops RWA Rankings With $1.1B in Tokenized Assets

Polygon Tops RWA Rankings With $1.1B in Tokenized Assets

The post Polygon Tops RWA Rankings With $1.1B in Tokenized Assets appeared on BitcoinEthereumNews.com. Key Notes A new report from Dune and RWA.xyz highlights Polygon’s role in the growing RWA sector. Polygon PoS currently holds $1.13 billion in RWA Total Value Locked (TVL) across 269 assets. The network holds a 62% market share of tokenized global bonds, driven by European money market funds. The Polygon POL $0.25 24h volatility: 1.4% Market cap: $2.64 B Vol. 24h: $106.17 M network is securing a significant position in the rapidly growing tokenization space, now holding over $1.13 billion in total value locked (TVL) from Real World Assets (RWAs). This development comes as the network continues to evolve, recently deploying its major “Rio” upgrade on the Amoy testnet to enhance future scaling capabilities. This information comes from a new joint report on the state of the RWA market published on Sept. 17 by blockchain analytics firm Dune and data platform RWA.xyz. The focus on RWAs is intensifying across the industry, coinciding with events like the ongoing Real-World Asset Summit in New York. Sandeep Nailwal, CEO of the Polygon Foundation, highlighted the findings via a post on X, noting that the TVL is spread across 269 assets and 2,900 holders on the Polygon PoS chain. The Dune and https://t.co/W6WSFlHoQF report on RWA is out and it shows that RWA is happening on Polygon. Here are a few highlights: – Leading in Global Bonds: Polygon holds 62% share of tokenized global bonds (driven by Spiko’s euro MMF and Cashlink euro issues) – Spiko U.S.… — Sandeep | CEO, Polygon Foundation (※,※) (@sandeepnailwal) September 17, 2025 Key Trends From the 2025 RWA Report The joint publication, titled “RWA REPORT 2025,” offers a comprehensive look into the tokenized asset landscape, which it states has grown 224% since the start of 2024. The report identifies several key trends driving this expansion. According to…
Share
BitcoinEthereumNews2025/09/18 00:40
Shiba Inu’s 1,549% Spike: Can Bulls Take Control Again And Trigger An Explosive Rally?

Shiba Inu’s 1,549% Spike: Can Bulls Take Control Again And Trigger An Explosive Rally?

Shiba Inu (SHIB) has experienced a sudden increase in futures net flows, skyrocketing more than 1,549% in one day. The spike comes amid broader market volatility
Share
NewsBTC2026/03/17 04:30
US Stocks Surge Higher: Major Indices Post Significant Gains in Bullish Trading Session

US Stocks Surge Higher: Major Indices Post Significant Gains in Bullish Trading Session

BitcoinWorld US Stocks Surge Higher: Major Indices Post Significant Gains in Bullish Trading Session Major US stock indices closed substantially higher today,
Share
bitcoinworld2026/03/17 04:30