
Opus 4.6 with thinking. Result was near-instant:

“Drive. You need the car at the car wash.”




Changed 50 meters to 43 meters with Opus 4.6:

“Walk. 43 meters is basically crossing a parking lot.”


lol, are AI companies patching this answer in real time? I thought a training run was a months-long effort. How would they make changes in such a short period?

The companies aren’t changing anything. LLM outputs are just more random than people realize. Run the same prompt 10 times if you really want to know how well they can answer.
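
If you want to try that yourself, here is a rough sketch of the "run it 10 times" idea. The `make_fake_model` function is a made-up stand-in that picks an answer at random; it is not any real API, and you would swap it for whatever client call you actually use. The prompt string is also just an illustrative guess at the question being discussed.

```python
import random
from collections import Counter


def tally_answers(ask, prompt, runs=10):
    """Call ask(prompt) `runs` times and count how often each answer appears."""
    return Counter(ask(prompt) for _ in range(runs))


def make_fake_model(seed=0):
    """Hypothetical stand-in for an LLM call (assumption, not a real API).

    Returns a function that ignores the prompt and picks "Drive" or "Walk"
    at random, which is enough to show how the tally works.
    """
    rng = random.Random(seed)

    def ask(prompt):
        return rng.choice(["Drive", "Walk"])

    return ask


if __name__ == "__main__":
    ask = make_fake_model()
    # Illustrative prompt only; replace with the exact question you tested.
    counts = tally_answers(ask, "The car wash is 50 meters away. Walk or drive?")
    for answer, n in counts.most_common():
        print(f"{answer}: {n}/10")
```

A single screenshot tells you about one sample. The tally tells you whether the model says "Drive" 10 out of 10 times or only 6 out of 10, which is the number that matters when comparing two prompts.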


