Oct 23, 2024 · We are delighted to announce that MarsCode Agent is ranked 1st place in SWE-bench Lite, a benchmark to evaluate large language models and agents on solving real-world github issues.. Original Twitter link. Recent advances in large language models (LLMs) have shown significant potential to automate various software development tasks, including code completion, test generation, and bug fixing.