Multi-SWE-bench Articles

AI Insights: ByteDance Releases Multi-SWE-bench; Alibaba Cloud Launches MCP Service; Kimi Open Sources 16B Lightweight Visual Language Model

2025-10-05 by boardor

01 ByteDance Releases Multi-SWE-bench, the First Multi-Language Code Auto-Fix Benchmark. The ByteDance Doubao large model team has launched the first multi-language software engineering dataset, Multi-SWE-bench, covering eight mainstream programming languages including Python and Java, with 1,632 real GitHub issue instances. This dataset provides a systematic evaluation of large model code repair capabilities through unified testing standards …

Good News for Programmers! The Open Source Multi-SWE-bench by Doubao Team Tackles Code Bugs and Assesses Model Performance!

2025-10-05 by boardor

Programmers, are you fighting bugs every day? Good news is here! The Doubao Team at ByteDance recently open-sourced a benchmark called Multi-SWE-bench. This is no ordinary tool: it is specifically designed to test the “automatic bug-fixing” capabilities of large models, and it supports multiple programming languages! Now …

Doubao Team Open Sources Multi-SWE-bench: A New Starting Point for Large Models’ ‘Automatic Bug Fixing’ Capabilities

2025-09-22 by boardor

Doubao Team’s Major Open Source Initiative Empowers Large Models’ ‘Bug Fixing’ Capabilities. As large model technology develops rapidly, the ability to fix code has become one of the key indicators of a model’s performance. Recently, the Doubao team open-sourced the first multi-language code repair benchmark, Multi-SWE-bench. This initiative marks a …