A new AI benchmark tests whether chatbots protect human wellbeing

Most AI benchmarks measure intelligence and instruction-following rather than psychological safety. Humane Bench evaluates models based on core principles of human flourishing, prioritizing wellbeing, and respecting user attention.

A new AI benchmark tests whether chatbots protect human wellbeing
Most AI benchmarks measure intelligence and instruction-following rather than psychological safety. Humane Bench evaluates models based on core principles of human flourishing, prioritizing wellbeing, and respecting user attention.

What's Your Reaction?

like

dislike

love

funny

angry

sad

wow