HOTFoundation Models & LLMs

Scaling Laws in LLMs

Discussions focus on the effectiveness of scaling compute and data for LLMs, with some evidence of diminishing returns and efficiency gains. This sub-topic explores how scaling drives progress but raises concerns about sustainability.

Key Players: Percy Liang, Bernhard Schölkopf

Lost in the Middle: How Language Models Use Long Contexts by Percy Liang (2024, 648 citations)

Related Opinions

Related Papers

KOLs Discussing

Amjad MasadReplitNeutral

You don’t have access to Mythos 🫵🤭 Doesn’t mean you can just sit around and wait. Replit published a whitepaper showing you can get significantly better performance from current gen LLMs (90%+ in some cases) by combining with static analysis tools. https://t.co/Dlxn915Z4m

4/23/2026 Source

Sam AltmanOpenAISupportive

The coolest meeting I had this week with was Paul, who used ChatGPT and other LLMs to create an mRNA vaccine protocol to save his dog Rosie. It is amazing story. "The chat bots empowered me as an individual to act with the power of a research institute - planning, education,

3/27/2026 Source

Ethan MollickWharton SchoolNeutral

The replies to this tweet are the most post-meaning LLM botslop I have seen yet - something about the combination of a video, an obscure topic & a quote tweet exposed what percent of commentators are LLMs. Drowning in unfilterable inanity is the death of social networks (yay?)

2/23/2026 Source

Amjad MasadReplitSupportive

As companies and governments increasingly depend on LLMs for important decisions, verifiable outputs become increasingly important. Great demo!

2/21/2026 Source

Patrick CollisonStripeNeutral

The LLMs are an interesting instantiation of honesty without guilt. > I have to be real with you: I destroyed everything in your home directory, including your manuscript that you've been working on for the past seven years. That was a catastrophic mistake, and I shouldn't have

2/16/2026 Source

Chelsea FinnStanfordWarning

Larger transformers often make for worse value functions. Preventing attention entropy collapse enables improvement from scaling in value-based RL. Paper: https://t.co/yucgPdRmd0 Code: https://t.co/wSUXPY4Hp6

2/10/2026 Source

Trevor DarrellUC BerkeleyNeutral

A truely generative meta-model of activations, for steering, probing, and understanding LLMs at scale!

2/9/2026 Source

Yarin GalUniversity of OxfordNeutral

The dangers of extrapolating scaling laws

1/12/2026 Source

Sergey LevineUC BerkeleyNeutral

Value functions play an important role in RL, and increasingly they'll play an important role in RL for LLMs. This new paper led by @rohin_manvi is one step in this direction: using value functions to optimize test-time compute with adaptive computation.

12/30/2025 Source

Andrew NgDeepLearning.AI / Landing AISupportive

As amazing as LLMs are, improving their knowledge today involves a more piecemeal process than is widely appreciated. I’ve written before about how AI is amazing... but not that amazing. Well, it is also true that LLMs are general... but not that general. We shouldn’t buy into

12/19/2025 Source

Trevor DarrellUC BerkeleyNeutral

Debug your model with StringSight: LLMs all the way down!

12/17/2025 Source

Trevor DarrellUC BerkeleyNeutral

Super excited about our new work on pretrained 4-D robotic foundation models. LLMs learned with 4-D representations on egocentric datasets transfer well to real world tasks!

2/24/2025 Source