Search results for the keyword "evaluation": 6 results (only the first 480 are shown)
Discover Top Agents - Global GPTs and MCP Server and Agents Evaluation Platform
The evaluation benchmark on MCP servers
Complete sandbox for augmenting LLM inference (local or cloud) with an MCP client-server setup. Low-friction testbed for MCP server validation and agentic evaluation.
A systematic reasoning MCP server implementation for Claude Desktop with beam search and thought evaluation.
An MCP server providing advanced options analysis through Yahoo Finance, supporting Greeks calculations, strategy evaluation (CCS/PCS/CSP/CC), and risk metrics. Built for MCP with Claude.ai.
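Youtu-agent is an open-source agent framework from Tencent Youtu Lab for building, running, and evaluating autonomous agents. Built on the open-source model DeepSeek-V3, it achieves leading performance, supports multiple model APIs and tool integrations, and offers strong agent capabilities such as data analysis, file processing, and deep research. Its flexible architecture supports YAML configuration and automatic agent generation, which simplifies development. Youtu-agent performs well on the WebWalkerQA and GAIA benchmarks and is suited to ...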