Discussion about this post

User's avatar
Paulin's avatar

I thought this was going to be a very technical point about what LLMs *really* predict

As opposed to "the next token", a guess most readers would make after reading the first part of the title (because they are next-token predictors themselves)

But the actual article is equally lovely

Ignacio's avatar

Cool experiments and interesting results!

Could you also share the actual equations of (some of) the models, including their assumptions? I’m curious why two models came to a much lower temperature at t=0

13 more comments...

No posts

Ready for more?