Grok 4 Claims 45% on "Humanity's Last Exam" - But Model Goes Racist Before Release

1,054 views
July 10, 2025
From AI with Kyle News and Updates Live Stream. First aired: 9th July 2025 Watch full lives stream: https://youtu.be/ZmLkcsm09co?si=2f2VjVyGwfb-0DGi Get the full notes and summary: https://promptentrepreneur.beehiiv.com/subscribe Subscribe and turn on notifications to catch the next live stream: https://www.youtube.com/@iamkylebalmer?sub_confirmation=1 Leaked benchmarks suggest Elon Musk's new Grok 4 model scored 45% on "Humanity's Last Exam," a massive jump from the previous best score of 22% by Gemini 2.5 Pro on this challenging 2,500-question academic benchmark....