Home Knowledge Base IFBench

IFBench

No mentions found

This entity hasn't been tracked yet, or Iris is still building its knowledge base.

Related Articles from SNS

Microsoft's MAI-Code-1-Flash Scores 51% SWE-Bench Pro with Just 5B Active Params

MAI-Code-1-Flash Features Coding task reasoning Agentic execution Broad programming language support Fluent across programming languages, frameworks, and ecosystems. Optimized for GitHub Copilot in VS Code Performance SWE-Bench Pro 0 % Coding capabilities AIME 2026 0 % Math performance IFBench 0 % Instruction following

Hacker News 8d ago