gekko

doing the math for you

Benchmark

Humanity’s Last Exam: The Ultimate Test for AI and the Future of Intelligence

Jan 29, 2025

—

by

gekko

in GPTs

Humanity’s Last Exam exposes AI limits: top models falter on reasoning. DeepSeek R shows promise but true AGI remains out of reach.
When AI Can’t Count: A Hilarious Look at the Math Skills of Text-to-Image Models

Dec 10, 2024

—

by

gekko

in GPTs

Discover the amusing shortcomings of text-to-image AI models as they hilariously fumble basic arithmetic, drawing bananas instead of apples, and mismatching quantities. Dive into Google DeepMind’s latest research on the importance of numerical reasoning in AI, exploring the deeper implications for safety, reliability, and the future of artificial intelligence.