Doubao Team Open Sources Multi-SWE-bench: A New Starting Point for Large Models’ ‘Automatic Bug Fixing’ Capabilities

Doubao Team Open Sources Multi-SWE-bench: A New Starting Point for Large Models' 'Automatic Bug Fixing' Capabilities

Doubao Team’s Major Open Source Initiative Empowers Large Models’ ‘Bug Fixing’ Capabilities In the rapid development of large model technology, the ability to fix code has become one of the key indicators of its performance. Recently, the Doubao team has stood out by open-sourcing the first multi-language code repair benchmark, Multi-SWE-bench. This initiative marks a … Read more

Multi-SWE-bench: The First Multilingual Code Repair Benchmark Open Source

Multi-SWE-bench: The First Multilingual Code Repair Benchmark Open Source

The ByteDance Doubao large model team has officially open-sourced the first multilingual SWE dataset – Multi-SWE-bench, which can be used to evaluate and enhance the “automatic bug fixing” capabilities of large models.Building on SWE-bench, Multi-SWE-bench covers seven mainstream programming languages beyond Python for the first time, making it a truly comprehensive benchmark for “full-stack engineering” … Read more