OpenAI's Strawberry models can reason like an…

Sep 13, 2024

When models can think

14 Comments

I've written about why just depending on pretraining won't be enough to create AGI, but it seems like we may have a viable solution now. Reasoning is all you need.

Expand full comment

Reply (1)

J.K. Lund

Sep 13

I know that sounded familiar, I read that :)

Expand full comment

Étienne Fortier-Dubois

Sep 13

Minor point: that Latin magic square is a famous example (as o1 itself points out), so it seems to be displaying ability to retrieve known solutions rather than come up with new ones. But at least it clearly understood what you were looking for!

Expand full comment

Reply (1)

Rohit Krishnan

Sep 13

Yes precisely it figured out a meta answer

Expand full comment

J.K. Lund

Sep 13

Nice piece Rohit. Is it too much to ask for OpenAI to come up with a naming scheme that is not so confusing?

The golden question is, with GPT-o1, how far are we from AGI? Months? Years?

I would like to know so I can update my recent essay on the topic.

Expand full comment

Reply (1)

Rohit Krishnan

Sep 13

A version of this grounded in search or documents might as well be called AGI I think, at least in the narrow sense

Expand full comment

Reply (1)

J.K. Lund

Sep 13

That's somewhat what I was thinking. I am reading this is PhD level intelligence, at least across narrow areas.

Expand full comment

R Meager

Sep 14

i think that the fact that they were sitting on this is why they were so confident / breezy about the stochastic parrot diss

Expand full comment

Jon

Sep 13

Is it possible that you don't know very well how to judge short stories and poems, and that calling AI output "very good" or "pretty good" for both these things is out of your domain of expertise? I'm not accusing, just wondering. What if AI just churns out completely passable examples of these forms, because almost has the ability to properly judge the quality of them? It makes me wonder: what would Helen Vendler say, for example?

Expand full comment

Reply (2)