Discussion about this post

User's avatar
Jack's avatar
Mar 22Edited

The problem with history is, can we really define "explanatory power" when N=1? People weave compelling stories about why lead plumbing caused the downfall of Rome, etc. but it always feels just-so to me. I wonder if it says more about the biases of the author (and their audience) than it does about Romans.

The randomness of LLM results may just be its peculiar way of saying, "I don't know." RLHF strongly penalizes the models for not giving answers.

Dave92F1's avatar

Would be easier to parse if LLM scores and polls were on normalized scales.

6 more comments...

No posts

Ready for more?