Researchers compare two solutions for approximating LLM rankings of Claude 4, GPT-4o, Gemini 2.5, and Grok-3. Researchers ...