There wasn't a post yet on this somehow, but Google's latest AI is reportedly quite a bit ahead of others in performance:
https://www.thealgorithmicbridge.com/p/google-gemini-3-just-killed-every
There wasn't a post yet on this somehow, but Google's latest AI is reportedly quite a bit ahead of others in performance:
https://www.thealgorithmicbridge.com/p/google-gemini-3-just-killed-every
Comments
Second, there are so many metrics that the companies can just pick the few that their models excel at and claim they are the best. Regardless, they can game the test by training the model to do well specifically on certain metrics.
Lastly, since we are in an AI bubble, the media is in a frenzy over every minor, incremental improvement. If one model is 10% better than the others, it is a revolution and AGI is around the corner. But we were promised exponential scaling of performance and since GPT 4, all we've gotten is AI video generators and higher hallucination rates.
I am not sure any of the is is true. I predict that with at LANL the opposite will will happen. I am not saying that AI will not be super intelligent and could do the job of 5 people. What will happen at the NNSA labs is someone will say "AI tell me how to make make more paper work, make more crazy rules, and absurd procedures and tell us how we can justify hiring 5 more people for every one person we actually need" AI in all its power will figure out how to do it. In other words the NNSA labs will use AI to SUPERCHARGE inefficiencies, what use to take only week in paperwork will not take months, things that could be done in day or two will be three more weeks. AI will make paper work so insanely complex that the only way to solve these propels will be to use another AI! In the end we will be 10 times slower thanks to AI, but at the same time become twice as fast with AI, and will be spun as great success.
I am serious. In the past two to three years I have seen the inefficiencies actually grow, things are slower and far more inefficient, than before. Some claim this is due to a backlash against DOGE, or Covid I am not sure.