How to build a better AI benchmark

The limits of traditional testing
If AI companies have been slow to respond to the growing failure of benchmarks, it’s partially because the test-scoring approach has been so effective for so long. One of the biggest early successes of contemporary AI was the ImageNet challenge, a kind of antecedent to contemporary benchmarks. Released in 2010 […]

How To Pick The Right AI Agent

If generative AI was the first frontier in artificial intelligence, AI agents are the next. We’ve reached a point where startups are posting job listings, not for humans, but for AI agents (or those who build them). It might be a clever PR move, but it’s not a gimmick. […]

How To Start Using AI Agents To Transform Your Business

I’m calling it now: 2025 is the year of the AI agent. These autonomous systems can actually make decisions and perform actions without human prompting, so it makes sense that savvy business leaders are looking to integrate them into their operations. But like all things AI, implementing agents into […]