Simon Willison
@simonw
Mar 28

A year ago nobody outside OpenAI had trained a model as good as GPT-4

Today there are dozens - and if you trust the benchmarks that includes some that you can run on a laptop (Qwen2.5-32B perhaps?)

What changed? What techniques are used now that weren't known a year ago?

Here's what I wrote about this in December last year

Tweet image 1

Could this be about other labs getting better at instruction tuning their models?

@simonw
@Yuchenj_UW That's an interesting possibility: maybe GPT-4's strength was in its instruction and fine-tuning, not so much its size - and the other labs have caught up in terms of that training data by now
