Charts
DataOn-chain
VIP
Market Cap
API
Rankings
CoinOSNew
CoinClaw🦞
Language
  • 简体中文
  • 繁体中文
  • English
Leader in global market data applications, committed to providing valuable information more efficiently.

Features

  • Real-time Data
  • Special Features
  • AI Grid

Services

  • News
  • Open Data(API)
  • Institutional Services

Downloads

  • Desktop
  • Android
  • iOS

Contact Us

  • Chat Room
  • Business Email
  • Official Email
  • Official Verification

Join Community

  • Telegram
  • Twitter
  • Discord

© Copyright 2013-2026. All rights reserved.

简体繁體English
|Legacy

Microsoft Made GPT and Claude Work Together—And the Result Beats Every AI Research Tool Out There

CN
Decrypt
Follow
4 hours ago
AI summarizes in 5 seconds.


Deep research AI has been one of the hottest arms races in tech this year. Google announced its research agent for Gemini in December 2024, OpenAI released its own research agent in February 2025, xAI followed suit, Perplexity doubled down, and Anthropic's Claude built a loyal following among professionals who need detailed, cited answers, introducing its agent in April of last year.


Every company has been trying to convince you that their single AI model is the smartest researcher in the room. Microsoft just said: Why pick one?


The company announced two new features on Monday for Copilot's Researcher tool—called Critique and Council—that put OpenAI's GPT and Anthropic's Claude to work on the same research task in sequence. The result, according to Microsoft's testing against an industry benchmark, scores higher than every system included in that test, including models from the top AI companies.



“Critique is a new multi model deep research system designed for complex research tasks. It separates generation from evaluation and utilizes a combination of models from Frontier labs, including Anthropic and OpenAI,” Microsoft explains. “One model leads the generation phase, planning the task, iterating through retrieval, and producing an initial draft, while a second model focuses on review and refinement, acting as an expert reviewer before the final report is produced.”


Here's the basic problem Critique is designed to fix: Every AI research tool today works the same way. You ask a question, one model plans a search, scours sources, writes a report, and hands it back to you. That single model is doing everything with no one checking its work.


This can end up with some hallucinations slipping in, some errors in citations, fake or inaccurate claims, etc.





Critique breaks that workflow in two. GPT handles the first phase—it plans the research, pulls sources, and writes an initial draft. Then Claude steps in as a strict editor, reviewing the report for factual accuracy, citation quality, and whether the answer actually addressed what was asked. Only after that review does the final report reach the user. Microsoft says the roles can eventually run in the opposite direction too, with Claude drafting and GPT critiquing, though for now GPT goes first.


On the DRACO benchmark—a standardized test covering 100 complex research tasks across 10 domains including medicine, law, and technology—Copilot with Critique scored 57.4. points with Anthropic's Claude Opus 4.6 by itself hitting 42.7. Microsoft's combined system beats the next best result by nearly 14%.



Image: Microsoft

The biggest gains showed up in breadth of analysis and presentation quality, with factual accuracy also posting a significant improvement.


The second feature, Council, takes a different approach to the same problem. Instead of having one model review the other's work, Council runs GPT and Claude simultaneously and puts their full reports side by side. A third "judge" model then reads both and writes a summary explaining where the two AIs agreed, where they diverged, and what unique angles each one caught that the other missed. Comparing AI research tools manually has been something users have had to do themselves until now.


In Critique, the models essentially collaborate with each other while in Council the models compete against each other.


Critique is the default experience in Researcher whereas Council requires you to select "Model Council" from the picker to activate the side-by-side mode. Both features are currently available to users enrolled in Microsoft's Frontier program, the early-access channel for Copilot's newest capabilities. A Microsoft 365 Copilot license ($30/user/month) is required, but users also need to be enrolled in Frontier to access them.



Image: Microsoft

OpenAI and Microsoft have a multibillion-dollar partnership, but Microsoft's bet is that no single model stays on top for long, and that the real value is in the orchestration layer that routes tasks to whichever combination works best.


免责声明:本文章仅代表作者个人观点,不代表本平台的立场和观点。本文章仅供信息分享,不构成对任何人的任何投资建议。用户与作者之间的任何争议,与本平台无关。如网页中刊载的文章或图片涉及侵权,请提供相关的权利证明和身份证明发送邮件到support@aicoin.com,本平台相关工作人员将会进行核查。

返20%!OKX龙虾AI,安全+快速+自动化
广告
|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

Selected Articles by Decrypt

2 hours ago
Senator Questions SEC Over Treatment of Trump-Linked Crypto Businesses
4 hours ago
Senators Reveal \\\'Mined in America\\\' Bill to Boost Bitcoin Mining, Support Trump\\\'s Reserve
4 hours ago
Chainlink Labs, Anchorage Digital Back New Crypto Super PAC Ahead of Midterms
View More

Table of Contents

|
|
APP
Windows
Mac
Share To

X

Telegram

Facebook

Reddit

CopyLink

Related Articles

avatar
avatarbitcoin.com
2 minutes ago
Strategy’s Latest SEC Filing Shows No Bitcoin Purchases or Share Sales During Quiet Week
avatar
avatarbitcoin.com
1 hour ago
Why Higher XRP Prices Make Payments Cheaper, Ripple’s Schwartz Clarifies Misconception
avatar
avatarbitcoin.com
1 hour ago
Bitcoin, Ether ETFs Hit by $503 Million Exodus as Selling Intensifies
avatar
avatarDecrypt
2 hours ago
Senator Questions SEC Over Treatment of Trump-Linked Crypto Businesses
avatar
avatarbitcoin.com
3 hours ago
Washington State Targets Kalshi in Illegal Online Betting Lawsuit
APP
Windows
Mac

X

Telegram

Facebook

Reddit

CopyLink