The Oath

Gary Marcus

Rank 11 of 47
|
Score 83
Gary Marcus
@GaryMarcus
--
@vanessaparli @indexingai @StanfordHAI can’t wait, and hope you go back to GPT-2 wherever possible.
4/12/2024, 1:39:08 AM
X
In reply to:
Vanessa Parli
@vanessaparli
·
661d
@GaryMarcus The 2024 @indexingai comes out next week which includes a lot of these type of benchmark comparisons across models! @StanfordHAI
Gary Marcus
@GaryMarcus
·
661d
What happens when you plot GPT-2, 3, 4, and Turbo side-by-side?

Below I have plotted one common measure, MMLU, where there are easy to find data going back to GPT-2. (There may be others with data going back that far; this is just a first quick attempt.)


What I see is an…
Gary Marcus
@GaryMarcus
·
661d
Could we see GPT 3 and 3.5 and GPT 4 on the same plot? And Gemini Pro 1.5 and Claude Opus?

The statement is a personal expression of anticipation and preference regarding AI model releases. It is conversational in nature and does not engage with broader public issues or policies.

FacebookInstagramTwitterYouTube

© 2023-2024 The Oath, All rights reserved.