This edition of our Changelog is a little different. No product features or release notes — instead, two big milestones to remind you of:
Details below!
We’re proud that Refact.ai Agent is now the best open-source AI Agent in SWE-bench Verified, the leading benchmark for evaluating AI Agents on software engineering tasks
Score: 70.4%. Tasks solved: 352 / 500. Fully autonomously.
We even outperformed AI Agents running Claude 4 Sonnet (we used 3.7 Sonnet).
Our full SWE-bench pipeline is open-source — check it out on GitHub.
Key elements of the approach:
debug_script()
sub-agent using pdb.strategic_planning()
tool with o3 for debug-to-solution reasoningIf you feel like sharing this milestone on LinkedIn or X, we’d really appreciate the support 🫶 Thank you for building with Refact.ai!
And yes, the new score with Claude 4 Sonnet is coming soon.
Just got back from Dublin Tech Summit! We spent two days talking with devs, CTOs, and tech leaders about what matters in enterprise AI. Refact.ai stood out — on-prem, autonomous, and ready for real-world workflows.
Here’s what teams told us they need — and what Refact.ai delivers:
✅ On-prem security. No code or telemetry ever leaves servers. Refact.ai offers on-prem deployment, giving companies 100% data privacy.
✅ Fully autonomous & transparent. Solves tasks end-to-end while letting developers preview and guide every step.
✅ Truly context-aware. Understands your project, codebase, repo structure, dependencies, and more, adapting to how your team actually works.
✅ Shared knowledge for teams. Captures successful interactions with AI and builds a team-wide memory, speeding up collaboration across developers.
Want to see what Refact.ai could do inside your company?
Discover how Refact.ai Enterprise turns AI into a real programming partner for faster, smarter development across complex codebases!
If you’re ready to work with an AI that understands your environment, works across your tools, and delivers reliable, product-ready results — Refact.ai is ready for you.
Autonomous when you want it, collaborative when you step in. Get Refact.ai for VS Code or JetBrains today and start doubling your output — without doubling working hours.