Recent claims that large language model (LLM) capabilities are doubling every 5-14 months appear unfounded when checked against benchmark performance data. While there were huge leaps from GPT-2 to GPT-3 and from GPT-3 to GPT-4, progress from GPT-4 to more recent models such as GPT-4 Turbo has been far more modest, suggesting diminishing returns. Plots of accuracy on tasks like MMLU and The New York Times' Connections game show this flattening trend, and qualitatively, core issues like hallucinations and errors persist in the latest models. With multiple models now clustered around GPT-4-level performance but none decisively better, a sense is emerging that the rapid progress of recent years may be stalling as LLMs approach inherent limitations, a slowdown that could bring disillusionment and a market correction after last year's AI hype.
Summarized by Claude 3 Sonnet