Multi-SWE-bench: The First Multilingual Code Repair Benchmark Open Source

Multi-SWE-bench: The First Multilingual Code Repair Benchmark Open Source

The ByteDance Doubao large model team has officially open-sourced the first multilingual SWE dataset – Multi-SWE-bench, which can be used to evaluate and enhance the “automatic bug fixing” capabilities of large models.Building on SWE-bench, Multi-SWE-bench covers seven mainstream programming languages beyond Python for the first time, making it a truly comprehensive benchmark for “full-stack engineering” … Read more