I put Gemini 3.0 and Grok 4.1 through 9 head-to-head prompts — from logic and creativity to humor and tone. Here's what happened and how the models each surprised me.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results