Senior SWE-Bench: open-source benchmark that assesses agents as senior engineers | AI Paper Digest