©RillNews
new
show
ask
jobs
submit
login
VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO
arxiv.org
252 points by
timhigins
11 hours ago
|
106 comments
add comment