Common types of “AI beats humans at X” studies
“We compared ChatGPT to a bunch of reddit trolls!”
“We measured the productivity of people who do stuff for $5 at tasks nobody ever does then generalise that to everything”
“We benchmarked the AI on tests in its training data”