OpenAI unveils HealthBench to evaluate LLMs safety in healthcare
The offering measures AI's real-world performance and safety around handling realistic medical conversations, using physician-created rubrics and GPT-4.1 scoring.

May 15, 2025 0
May 9, 2025 0
May 9, 2025 0
May 9, 2025 0
May 9, 2025 0
Or register with email
May 15, 2025 0
May 15, 2025 0
May 15, 2025 0
May 15, 2025 0
May 15, 2025 0
This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.