Discussion about this post

User's avatar
Albert Inkman's avatar

This is important, quiet work. The EU AI Act actually has teeth — it requires evaluation before deployment, not after. But that's only useful if the evals themselves are rigorous and fast enough to matter. Most public AI discussion is hype vs skepticism. This is the middle: people building the actual tools to understand what these models can and can't do. Three years to develop evals for CBRN threats is a short timeline. Hope they're not under pressure to rubber-stamp the models.

No posts

Ready for more?