Get All Access for $5/mo

Elon Musk's Newest AI Chatbot Outperformed ChatGPT in One Key Area Musk's AI startup announced an upgrade to its Grok chatbot on Thursday.

By Sherin Shibu

Key Takeaways

  • Elon Musk's xAI company is upgrading its Grok AI chatbot.
  • The new model outperformed OpenAI's AI model on one key HumanEval test.
  • Musk stated in a Friday social media post that Grok 1.5 should be available on X, formerly Twitter, by next week.

Nearly two weeks after Elon Musk's xAI startup opened up the AI model behind Grok to the public, its AI chatbot is set to get an upgrade.

The company announced Grok-1.5 on Thursday and claimed that its latest model can understand longer documents, handle more complex prompts, and perform more advanced reasoning.

While Grok-1.5 appears to be a step up from the original 1.0 with improvements in coding and math skills, its announcement post shows that it still lags behind Google's Gemini Pro 1.5 AI, OpenAI's GPT-4, and Anthropic's Claude 3 Opus in some benchmark tests, while outperforming OpenAI on one key HumanEval test.

Related: Meet Grok: Elon Musk Unveils 'Spicy' AI Chatbot Riddled With 'Sarcasm' and 'Humor'

Grok-1.5 scored higher than GPT-4 on the HumanEval benchmark, which consists of 164 challenging programming problems not included in the AI model's training data. GPT-4 had a score of 67% and Gemini Pro 1.5 scored 71.9%, while Grok-1.5 received 74.1%.

Elon Musk's xAI company is set to release a new version of the Grok AI chatbot, a ChatGPT competitor. Photo by Jaap Arriens/NurPhoto via Getty Images.

With a score of 81.3% on the MMLU test, which covers knowledge of 57 subjects from an elementary to an advanced level, Grok-1.5 performed close to Google Gemini's score (83.7%).

It also scored close to GPT-4's score of 52.9% with a score of 50.6% on the MATH test, a benchmark that covers grade school to high school math competition problems.

Related: Elon Musk Sues ChatGPT-Maker OpenAI, Accuses the Company of Working to 'Maximize Profits For Microsoft, Rather Than For the Benefit of Humanity'

Musk stated in a Friday social media post that Grok 1.5 should be available on X, formerly Twitter, by next week.

The X owner has high expectations for the next generation of Grok, writing that the next step after Grok-1.5 will outperform the AI currently available "on all metrics." Grok 2 is "in training now," he wrote in the post.

Grok AI is currently only available to those with a $16 a month or higher Premium+ subscription on X.

Musk sued OpenAI, a competitor of xAI, earlier this month and asked for a court ruling that would force OpenAI to make the research and technology behind its AI public.

Sherin Shibu

Entrepreneur Staff

News Reporter

Sherin Shibu is a business news reporter at Entrepreneur.com. She previously worked for PCMag, Business Insider, The Messenger, and ZDNET as a reporter and copyeditor. Her areas of coverage encompass tech, business, strategy, finance, and even space. She is a Columbia University graduate.

Want to be an Entrepreneur Leadership Network contributor? Apply now to join.

Side Hustle

At Age 15, He Used Facebook Marketplace to Start a Side Hustle — Then It Became Something Much Bigger: 'Raised Over $1.6 Million'

Dylan Zajac, now a 21-year-old senior at Babson College, wanted to bridge the digital divide.

Franchise

McDonald's Announces the Return of the Snack Wrap in 2025 — Here's What to Expect From Its Comeback

The decision comes after years of persistent customer demand for the portable snack, which debuted nearly two decades ago.

Side Hustle

'I Just Hustled': She Earned More Than $300,000 Wrapping Gifts Last Year — and It All Started With a Side Hustle

When Michelle Hensley lost her husband to cancer, she needed to figure out how to earn an income for her family.

Business News

OpenAI Just Released Its Text-to-Video Generator, Sora. Here's How the New AI Could Impact Small Businesses and Creators.

Sora has a variety of use cases for businesses, from social media campaigns to video creation.

Productivity

6 Habits That Help Successful People Maximize Their Time

There aren't enough hours in the day, but these tips will make them feel slightly more productive.

Business Ideas

63 Small Business Ideas to Start in 2024

We put together a list of the best, most profitable small business ideas for entrepreneurs to pursue in 2024.