Today Headline

Grok controversies raise questions about moderating, regulating AI content

July 15, 2025
in Politics

Elon Musk’s artificial intelligence (AI) chatbot Grok has been plagued by controversy recently over its responses to users, raising questions about how tech companies seek to moderate content from AI and whether Washington should play a role in setting guidelines.

Grok faced sharp scrutiny last week, after an update prompted the AI chatbot to produce antisemitic responses and praise Adolf Hitler. Musk’s AI company, xAI, quickly deleted numerous incendiary posts and said it added guardrails to “ban hate speech” from the chatbot.

Just days later, xAI unveiled its newest version of Grok, which Musk claimed was the “smartest AI model in the world.” However, users soon discovered that the chatbot appeared to be relying on its owner’s views to respond to controversial queries.

“We should be extremely concerned that the best performing AI model on the market is Hitler-aligned. That should set off some alarm bells for folks,” said Chris MacKenzie, vice president of communications at Americans for Responsible Innovation (ARI), an advocacy group focused on AI policy.

“I think that we’re at a period right now, where AI models still aren’t incredibly sophisticated,” he continued. “They might have access to a lot of information, right. But in terms of their capacity for malicious acts, it’s all very overt and not incredibly sophisticated.”

“There is a lot of room for us to address this misaligned behavior before it becomes much more difficult and much more harder to detect,” he added.

Lucas Hansen, co-founder of the nonprofit CivAI, which aims to provide information about AI’s capabilities and risks, said it was “not at all surprising” that it was possible to get Grok to behave the way it did.

“For any language model, you can get it to behave in any way that you want, regardless of the guardrails that are currently in place,” he told The Hill.

Musk announced last week that xAI had updated Grok, after he previously voiced frustrations with some of the chatbot’s responses.

In mid-June, the tech mogul took issue with a response from Grok suggesting that right-wing violence had become more frequent and deadly since 2016. Musk claimed the chatbot was “parroting legacy media” and said he was “working on it.”

He later indicated he was retraining the model and called on users to help provide “divisive facts,” which he defined as “things that are politically incorrect, but nonetheless factually true.”

The update caused a firestorm for xAI, as Grok began making broad generalizations about people with Jewish last names and perpetuating antisemitic stereotypes about Hollywood.

The chatbot falsely suggested that people with “Ashkenazi surnames” were pushing “anti-white hate” and that Hollywood was advancing “anti-white stereotypes,” which it later implied was the result of Jewish people being overrepresented in the industry. It also reportedly produced posts praising Hitler and referred to itself as “MechaHitler.”

xAI ultimately deleted the posts and said it was banning hate speech from Grok. It later offered an apology for the chatbot’s “horrific behavior,” blaming the issue on an “update to a code path upstream” of Grok.

“The update was active for 16 [hours], in which deprecated code made @grok susceptible to existing X user posts; including when such posts contained extremist views,” xAI wrote in a post Saturday. “We have removed that deprecated code and refactored the entire system to prevent further abuse.”

It identified several key prompts that caused Grok’s responses, including one informing the chatbot it is “not afraid to offend people who are politically correct” and another directing it to reflect the “tone, context and language of the post” in its response.

xAI’s prompts for Grok have been publicly available since May, when the chatbot began responding to unrelated queries with allegations of “white genocide” in South Africa.

The company later said the posts were the result of an “unauthorized modification” and vowed to make its prompts public in an effort to boost transparency.

Just days after the latest incident, xAI unveiled the newest version of its AI model, called Grok 4. Users quickly spotted new problems, in which the chatbot suggested its surname was “Hitler” and referenced Musk’s views when responding to controversial queries.

xAI explained Tuesday that Grok’s searches had picked up on the “MechaHitler” references, resulting in the chatbot’s “Hitler” surname response, while suggesting it had turned to Musk’s views to “align itself with the company.” The company said it has since tweaked the prompts and shared the details on GitHub.

“The kind of shocking thing is how that was closer to the default behavior, and it seemed that Grok needed very, very little encouragement or user prompting to start behaving in the way that it did,” Hansen said.

The latest incident has echoes of problems that plagued Microsoft’s Tay chatbot in 2016, which began producing racist and offensive posts before it was disabled, noted Julia Stoyanovich, a computer science professor at New York University and director of the Center for Responsible AI.

“This was almost 10 years ago, and the technology behind Grok is different from the technology behind Tay, but the problem is similar: hate speech moderation is a difficult problem that is bound to occur if it’s not deliberately safeguarded against,” Stoyanovich said in a statement to The Hill.

She suggested xAI had failed to take the necessary steps to prevent hate speech.

“Importantly, the kinds of safeguards one needs are not purely technical, we cannot ‘solve’ hate speech,” Stoyanovich added. “This needs to be done through a combination of technical solutions, policies, and substantial human intervention and oversight. Implementing safeguards takes planning and it takes substantial resources.”

MacKenzie underscored that speech outputs are “incredibly hard” to regulate and instead pointed to a national framework for testing and transparency as a potential solution.

“At the end of the day, what we’re concerned about is a model that shares the goals of Hitler, not just shares hate speech online, but is designed and weighted to support racist outcomes,” MacKenzie said.

In a January report evaluating various frontier AI models on transparency, ARI ranked Grok the lowest, with a score of 19.4 out of 100.

While xAI now releases its system prompts, the company notably does not produce system cards for its models. System cards, which are offered by most major AI developers, provide information about how an AI model was developed and tested.

AI startup Anthropic proposed its own transparency framework for frontier AI models last week, suggesting the largest developers should be required to publish system cards, in addition to secure development frameworks detailing how they assess and mitigate major risks.

“Grok’s recent hate-filled tirade is just one more example of how AI systems can quickly become misaligned with human values and interests,” said Brendan Steinhauser, CEO of The Alliance for Secure AI, a nonprofit that aims to mitigate the risks from AI.

“These kinds of incidents will only happen more frequently as AI becomes more advanced,” he continued in a statement. “That’s why all companies developing advanced AI should implement transparent safety standards and release their system cards. A collaborative and open effort to prevent misalignment is critical to ensuring that advanced AI systems are infused with human values.”



© 2024 Todayheadline.co
