REDWOOD CITY, Calif., Jan. 30, 2026 /PRNewswire/ — Zilliz, the company behind the leading open-source vector database Milvus, today announced the open-source releaseREDWOOD CITY, Calif., Jan. 30, 2026 /PRNewswire/ — Zilliz, the company behind the leading open-source vector database Milvus, today announced the open-source release

Zilliz Open Sources Industry-First Bilingual “Semantic Highlighting” Model to Slash RAG Token Costs and Boost Accuracy

2026/01/31 02:31
3 min read

REDWOOD CITY, Calif., Jan. 30, 2026 /PRNewswire/ — Zilliz, the company behind the leading open-source vector database Milvus, today announced the open-source release of its Bilingual Semantic Highlighting Model, an industry-first AI model designed to dramatically reduce token usage and improve answer quality in production RAG-powered AI applications.

This highlighting model introduces sentence-level relevance filtering, enabling AI developers to remove low-signal context before sending prompts to large language models. This approach directly addresses rising inference costs and accuracy issues caused by oversized context windows in enterprise RAG and RAG-powered AI deployments.

“As RAG systems move into production, teams are running into very real cost and quality limits,” said James Luan, VP of Engineering at Zilliz. “This model gives developers a practical way to reduce prompt size and improve answer accuracy without reworking their existing pipelines.”

Key Innovations and Technical Breakthroughs

  • Bilingual relevance by design: Optimized for both English and Chinese, the model addresses cross-lingual relevance challenges common in global RAG deployments. It is built on the MiniCPM-2B architecture, enabling low-latency, production-ready performance.
  • Sentence-level context filtering: Rather than scoring entire document chunks, the model evaluates relevance at the sentence level and retains only content that directly supports a user query before sending it to the LLM.
  • Lower token usage, higher answer quality: Zilliz reports that sentence-level filtering significantly compresses prompt size while improving downstream response quality, helping teams reduce inference costs and improve generation speed in production environments.

Availability

The Bilingual Semantic Highlighting Model is available today as an open-source release. To learn more about the training methodology and performance benchmarks, visit the Zilliz Technical Blog.

Download: : zilliz/semantic-highlight-bilingual-v1

About Zilliz

Zilliz is the company behind Milvus, the world’s most widely adopted open-source vector database. Zilliz Cloud brings that performance to production with a fully managed, cloud-native platform built for scalable, low-latency vector search and hybrid retrieval. It supports billion-scale workloads with sub-10ms latency, auto-scaling, and optimized indexes for GenAI use cases like semantic search and RAG.

Zilliz is built to make AI not just possible—but practical. With a focus on performance and cost-efficiency, it helps engineering teams move from prototype to production without overprovisioning or complex infrastructure. Over 10,000 organizations worldwide rely on Zilliz to build intelligent applications at scale.

Headquartered in Redwood Shores, California, Zilliz is backed by leading investors, including Aramco’s Prosperity 7 Ventures, Temasek’s Pavilion Capital, Hillhouse Capital, 5Y Capital, Yunqi Partners, Trustbridge Partners, and others. Learn more at  Zilliz.com.

Cision View original content to download multimedia:https://www.prnewswire.com/news-releases/zilliz-open-sources-industry-first-bilingual-semantic-highlighting-model-to-slash-rag-token-costs-and-boost-accuracy-302675291.html

SOURCE Zilliz

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Fed Decides On Interest Rates Today—Here’s What To Watch For

Fed Decides On Interest Rates Today—Here’s What To Watch For

The post Fed Decides On Interest Rates Today—Here’s What To Watch For appeared on BitcoinEthereumNews.com. Topline The Federal Reserve on Wednesday will conclude a two-day policymaking meeting and release a decision on whether to lower interest rates—following months of pressure and criticism from President Donald Trump—and potentially signal whether additional cuts are on the way. President Donald Trump has urged the central bank to “CUT INTEREST RATES, NOW, AND BIGGER” than they might plan to. Getty Images Key Facts The central bank is poised to cut interest rates by at least a quarter-point, down from the 4.25% to 4.5% range where they have been held since December to between 4% and 4.25%, as Wall Street has placed 100% odds of a rate cut, according to CME’s FedWatch, with higher odds (94%) on a quarter-point cut than a half-point (6%) reduction. Fed governors Christopher Waller and Michelle Bowman, both Trump appointees, voted in July for a quarter-point reduction to rates, and they may dissent again in favor of a large cut alongside Stephen Miran, Trump’s Council of Economic Advisers’ chair, who was sworn in at the meeting’s start on Tuesday. It’s unclear whether other policymakers, including Kansas City Fed President Jeffrey Schmid and St. Louis Fed President Alberto Musalem, will favor larger cuts or opt for no reduction. Fed Chair Jerome Powell said in his Jackson Hole, Wyoming, address last month the central bank would likely consider a looser monetary policy, noting the “shifting balance of risks” on the U.S. economy “may warrant adjusting our policy stance.” David Mericle, an economist for Goldman Sachs, wrote in a note the “key question” for the Fed’s meeting is whether policymakers signal “this is likely the first in a series of consecutive cuts” as the central bank is anticipated to “acknowledge the softening in the labor market,” though they may not “nod to an October cut.” Mericle said he…
Share
BitcoinEthereumNews2025/09/18 00:23
Robinhood Chain Public Testnet Launch: A Strategic Pivot into Ethereum’s Layer 2 Ecosystem

Robinhood Chain Public Testnet Launch: A Strategic Pivot into Ethereum’s Layer 2 Ecosystem

BitcoinWorld Robinhood Chain Public Testnet Launch: A Strategic Pivot into Ethereum’s Layer 2 Ecosystem In a significant move that expands its footprint beyond
Share
bitcoinworld2026/02/11 10:05
Russian State Duma passes bill on cryptocurrency seizure and confiscation procedures

Russian State Duma passes bill on cryptocurrency seizure and confiscation procedures

PANews reported on February 11 that, according to Bits.media, the Russian State Duma has passed a procedural law on the seizure and confiscation of cryptocurrencies
Share
PANews2026/02/11 09:54