Grok 3 Has Better Reasoning Than ChatGPT: Experts React To Elon Musk, xAI's Latest Chatbot

Grok 3 unveiling — Elon Musk joined xAI engineers and experts Monday night to unveil the much-anticipated Grok 3 AI chatbot. xAI on X / Live Broadcast Screenshot

KEY POINTS

Lex Fridman was mind-blown by Grok 3 after 'extensively' testing the chatbot's early version
Robert Scoble hailed the AI assistant's learning speed, which he noted was faster than others
Brian Roemmele compared Grok 3's impressive reasoning capabilities to ChatGPT's 'lame' outputs
Grok 3 is now No. 1 across all categories on AI chatbot benchmarking platform LM Arena

Elon Musk's artificial intelligence startup xAI has unveiled its latest flagship model, Grok 3, and so far, industry experts are impressed with the AI assistant's capabilities, with some praising the chatbot's "reasoning" prowess.

The tech titan previously said Grok 3 is the "smartest AI on Earth," having been trained on synthetic data and was built with a self-correction feature to avoid making errors.

Industry experts who were given early access to the latest iteration of the Grok AI chatbot and those who kept up with the live demo on X have shared their thoughts on the model. So far, things are going all too well for xAI.

Tech Experts Impressed with Grok 3's Learning Benchmarks

Prominent American computer scientist Lex Fridman revealed he was able to use the chatbot "extensively" and ahead of the model's unveiling.

"My mind is blown, very impressive model," he said, sending his congratulations to Musk and the xAI team for bringing the AI assistant to life.

I got to use Grok 3 extensively (early). My mind is blown, very impressive model 🤯 Congrats to Elon and the team for bringing it to life 👊
— Lex Fridman (@lexfridman) February 18, 2025

Well-known AI pioneer Robert Scoble was also impressed. "The thing to really pay attention to in AI is learning speed. And @xai is learning way faster than any other," he wrote, along with a screenshot of Grok 3's learning benchmarks compared to the earlier Grok iterations, Google's Gemini 2 Pro model, Chinese DeepSeek, Claude 3.5 Sonnet, and OpenAI's ChatGPT4.

Grok 3 benchmarks.

The thing to really pay attention to in AI is learning speed. And @xai is learning way faster than any other.

Who said that?

Apple Siri cofounder Tom Gruber. He told me at dinner a decade ago that that is the most important thing to pay attention to. pic.twitter.com/yWCiJsN9pU
— Robert Scoble (@Scobleizer) February 18, 2025

Software expert and tech executive Aaron Levie said Grok 3 was coming out "very strong." He said the model's learning abilities are "great proof that the scaling laws are not, in fact, over."

DeepSeek's V3 scored 40 on the Coding benchmark, ChatGPT scored 43, while Grok 3 logged a score of 57. The numbers are even higher on Science (GPQA), with Grok 3 scoring 75, DeepSeek scoring 65, and ChatGPT settling at 50.

Grok 3 Ranks 1^st on LM Arena

Within hours since the early version of Grok 3 went out on popular AI chatbot comparison platform LM Arena, Musk's AI assistant took the top spot, besting Google's Gemini 2.0 Flash, ChatGPT, and several other chatbots.

BREAKING: @xAI early version of Grok-3 (codename "chocolate") is now #1 in Arena! 🏆

Grok-3 is:
- First-ever model to break 1400 score!
- #1 across all categories, a milestone that keeps getting harder to achieve

Huge congratulations to @xAI on this milestone! View thread 🧵… https://t.co/p8z8lccNd5 pic.twitter.com/hShGy8ZN1o
— lmarena.ai (formerly lmsys.org) (@lmarena_ai) February 18, 2025

According to LM Arena, Grok 3 was the first model to break a score of 1,400 and it is also the top chatbot "across all categories, a milestone that keeps getting harder to achieve."

Grok 3 Has Better Reasoning Skills Than ChatGPT?

The xAI chatbot's LM Arena rankings on various benchmarks are already impressive as is, but for some industry experts, Grok 3's reasoning abilities are what make it stand out.

"Grok 3 has one of the best reasoning engines available today," said tech expert and researcher Brian Roemmele. He said the model shows "significantly more than 'OpenAI' lame PowerPoint 'reasoning' output."

Grok 3 has one of the best Reasoning Engines available today. Although it is not fully naked (showing all element of reasoning) it show significantly more than “OpenAI” lame PowerPoint “reasoning” output.

Congratulations ⁦@elonmusk⁩ and the ⁦@xai⁩ team! pic.twitter.com/FojsXQlRML
— Brian Roemmele (@BrianRoemmele) February 18, 2025

Scale AI CEO Alexander Wang had similar thoughts, noting how Grok 3 scored "impressively on pretraining and reasoning" evaluations.

Grok 3 is a new best model in the world from the @xai team!

Grok 3 ranks #1 on Chatbot Arena w/a big gap, and scores impressively on pretraining and reasoning evals.

congrats to @elonmusk @ibab @jimmybajimmyba @Yuhu_ai_

looking forward to more partnership on grok4 & beyond 🚀 pic.twitter.com/BrPGz17P51
— Alexandr Wang (@alexandr_wang) February 18, 2025

For Amjad Masad, the CEO of AI agent developer Replit, xAI's accomplishment with Grok 3 shouldn't be ignored, considering "how late in the game they started."

Grok 3 appears to be a state-of-the-art frontier model. This is a huge accomplishment, especially considering how late in the game they started.

Congrats @ibab, @elonmusk, and the rest of the @xai team. Can’t wait to start building on it. pic.twitter.com/fYMkwKYmDD
— Amjad Masad (@amasad) February 18, 2025

The model's unveiling came after Chinese startup DeepSeek's AI assistant rocked the tech world due to its model that it supposedly developed on a small budget.

Originally published on IBTimes