©RillNews
new
show
ask
jobs
submit
login
Why SWE-bench Verified no longer measures frontier coding capabilities
openai.com
10 points by
tedsanders
8 days ago
|
0 comments
add comment