Hacker News 5 hours ago

VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO

Summary

Score: 137 points

Article Details

Source: Hacker News
Category: Technology
Published: Tuesday, June 23, 2026
Time: 02:01 AM
