“I tested how well ChatGPT can pull data out of messy PDFs”
Bullshitting, misgendering, typo injection, and a 1-6% error rate 😬
Using these tools to extract structured data from unstructured seems like a huge mistake.
... works as a web developer in Hveragerði, Iceland, and writes about the web, digital publishing, and web/product development
These are his notes
“I tested how well ChatGPT can pull data out of messy PDFs”
Bullshitting, misgendering, typo injection, and a 1-6% error rate 😬
Using these tools to extract structured data from unstructured seems like a huge mistake.