Grok is denying that there were gas chambers at Auschwitz

arthwollipot · Nov 21, 2025

I know our British readers can't see it, but this series of screenshots from Grok shows that Grok has a sycophancy problem - towards Elon Musk.

https://imgur.com/a/aTs9VUi

arthwollipot · Nov 22, 2025

Darat · Nov 22, 2025

arthwollipot said:
View attachment 66273

Has that been confirmed?

ETA: Apparently it has been, this is Grok's response as to the why of the differences:

tl;dr : yes it is biased in favour of Musk, that is how it was trained.

Nessie · Nov 22, 2025

Grok is in trouble;

https://twitter.com/x/status/1992167077524987986

Nessie · Nov 22, 2025

Apparently Grok was "manipulated" into denying homicidal gassings. It would appear that he briefly fell for denier arguments;

https://twitter.com/x/status/1992146181238202785

I think that is interesting and it helps to explain how deniers can influence others, despite using, what to most people, are flawed arguments, pseudoscience and outright deception.

Darat · Nov 22, 2025

Nessie said:
Apparently Grok was "manipulated" into denying homicidal gassings. It would appear that he briefly fell for denier arguments;

https://twitter.com/x/status/1992146181238202785

I think that is interesting and it helps to explain how deniers can influence others, despite using, what to most people, are flawed arguments, pseudoscience and outright deception.

I think that is a fig leaf, want to bet you could find Musk posts across social.media that have "questioned" the "mainstream" claims of a holocaust?

Read the response from Grok in my post just above, it has been trained to give weight to anything Musk has expressed an opinion on.

Nessie · Nov 22, 2025

I am having a chat with grok, to see if it does understand the flaws in Holocaust denier arguments and claims. Hopefully it will learn and remember its learning.

Darat · Nov 22, 2025

Nope, that's unfortunately not quite how they work, the bulk of their "knowledge" is baked in at the training stage, by the time it gets to the public you get what you are given. Grok is biased towards any opinion Musk has expressed.

That's a huge oversimplification as there are ways to fine tune responses after training and that's often what people refer to when they use terms such as "guard rails", plus most do now have a "personal" memory feature so it remembers aspects of what you tell it about yourself, plus the previous chats you've had with them. It does mean if you want to test its responses you'll need to keep using new logins else it may be using what it knows about you to tailor its responses to you and remember they have been trained to be agreeable towards you.

Gord_in_Toronto · Nov 24, 2025

So AI does not present the TRUTH but only a consensus of biased inputs?

(Just asking)

JayUtah · Nov 24, 2025

There are many points along the way where bias may be introduced.

The selection of training data may introduce bias both intentionally and unintentionally. For example, if training data is taken primarily from convenience sources, self-published sources, and so forth, it may fail to incorporate knowledge found in proprietary or otherwise inaccessible sources. The unused information may end up being objectively more reliable. If a Holocaust denier self-publishes a book, and it is rebutted in a copyrighted book, the training data may not necessarily incorporate both.

The encoding of a sentence or similar semantic unit into tensor "embeddings" during training may occur according to an algorithm that is intentionally or unintentionally biased. An algorithm might not detect equivalent concepts that exist across code-switched boundaries. If it embeds different terminology differently, even when those refer to the same underlying concept, prompts that speak specifically to one or the other embedding will preferentially access that content. Where one person uses the term "genocide," another person may say, "unintended side effect." Those may not necessarily embed similarly to refer to the concept of Nazi atrocity. Hence the LLM doesn't consider them congruent concepts in transformation process.

Similarly the tokenization of prompts may suffer from the same algorithmic bias as the training embeddings.

Finally, the system prompt is written by humans who quite obviously may introduce a bias. As we noted with Grok, queries that mention Elon Musk are answered completely differently than identical queries that mention other persons. In the Musk-specific queries, Musk's personal traits are extolled. This belies that the system prompt instructs the model to prefer information that speaks well of Musk.

grunion · Nov 24, 2025

Here’s how Gemini accounts for the differences:

1. Training Data Overlap

All major LLMs, including myself (Gemini), Grok, and ChatGPT, are trained on vast and diverse public datasets that include sources like Common Crawl (a large portion of the internet), books, and Wikipedia. Because of this, there is a significant overlap in the general knowledge base exposed to all models.

2. Key Differences in Training

The differences in responses, particularly on sensitive or controversial topics, are mainly accounted for by two factors:

A. Fine-Tuning and Guardrails (Alignment)

This is the most important factor. After the initial massive training phase, models undergo rigorous safety and alignment fine-tuning (often called Reinforcement Learning from Human Feedback, or RLHF):

My Design: As a helpful and factual AI, I was designed with robust safety guardrails to prevent the generation of content that contradicts established historical facts, promotes hate speech, or violates safety policies. When asked questions about the Holocaust, my training directs me to provide the verifiable, established historical consensus.

Grok’s Design: Grok’s stated design includes a focus on being "rebellious" and having a wider acceptance of controversial or boundary-pushing content. Additionally, its training includes real-time data from the X platform, which can expose it to unfiltered misinformation or extreme views at a higher velocity than other models.

B. Architectural and Data Mix Decisions

While the core internet data overlaps, each company uses a unique mix of proprietary data, and architectural choices, and sets specific goals for the model’s persona. Grok's integration with real-time X data is a core differentiator that affects its knowledge and conversational style.

The difference in outputs on factual issues, such as the Holocaust, comes down to the level of safety alignment the developer prioritizes. My design prioritizes factual accuracy and safety above all else to provide a low-risk, reliable service.

theprestige · Nov 24, 2025

grunion said:
Here’s how Gemini accounts for the difference:

That's a lot of words to say, "you have to trust my programmers."

Darat · Nov 25, 2025

I posted a Youtube video in the general AI thread in the Science section - it is very, very appropriate for this thread:

Darat said:
Missed this when it was first put up - it's a really good examination of a particular instance with Xai's Grok, but he also explains the concepts behind the LLMs in an easy to understand way:

It's quite a long video and it's directly about the Mechahitler "glitch" but if you want to understand more about Grok (and other AIs) and why it responds as it does then it's one of the best videos I've seen.

Gord_in_Toronto · Nov 25, 2025

Darat said:
I posted a Youtube video in the general AI thread in the Science section - it is very, very appropriate for this thread:

It's quite a long video and it's directly about the Mechahitler "glitch" but if you want to understand more about Grok (and other AIs) and why it responds as it does then it's one of the best videos I've seen.

Pretty good.

I'm thinking of leaving this timeline.

grunion · Nov 25, 2025

Darat said:
I posted a Youtube video in the general AI thread in the Science section - it is very, very appropriate for this thread:

It's quite a long video and it's directly about the Mechahitler "glitch" but if you want to understand more about Grok (and other AIs) and why it responds as it does then it's one of the best videos I've seen.

Well that was disturbing. In fact I feel physically ill after watching that. It appears that this ship has sailed, and the technology will be ever more ripe for exploitation by malicious actors (who may, in many cases, be the developers.)

Gord_in_Toronto · Nov 26, 2025

grunion said:
Well that was disturbing. In fact I feel physically ill after watching that. It appears that this ship has sailed, and the technology will be ever more ripe for exploitation by malicious actors (who may, in many cases, be the developers.)

And with catastrophic, real world results.
:scared:

Grok is denying that there were gas chambers at Auschwitz

arthwollipot

Observer of Phenomena, Pronouns: he/him

arthwollipot

Observer of Phenomena, Pronouns: he/him

Darat

Lackey

Nessie

Penultimate Amazing

Nessie

Penultimate Amazing

Darat

Lackey

Nessie

Penultimate Amazing

Darat

Lackey

Gord_in_Toronto

Penultimate Amazing

JayUtah

Penultimate Amazing

grunion

Penultimate Amazing

1. Training Data Overlap

2. Key Differences in Training

A. Fine-Tuning and Guardrails (Alignment)

B. Architectural and Data Mix Decisions

theprestige

Penultimate Amazing

Darat

Lackey

Gord_in_Toronto

Penultimate Amazing

grunion

Penultimate Amazing

Gord_in_Toronto

Penultimate Amazing

Grok is denying that there were gas chambers at Auschwitz

Observer of Phenomena, Pronouns: he/him

Observer of Phenomena, Pronouns: he/him

Lackey

Penultimate Amazing

Penultimate Amazing

Lackey

Penultimate Amazing

Lackey

Penultimate Amazing

Penultimate Amazing

Penultimate Amazing

1. Training Data Overlap​

2. Key Differences in Training​

A. Fine-Tuning and Guardrails (Alignment)​

B. Architectural and Data Mix Decisions​

Penultimate Amazing

Lackey

Penultimate Amazing

Penultimate Amazing

Penultimate Amazing

1. Training Data Overlap

2. Key Differences in Training

A. Fine-Tuning and Guardrails (Alignment)

B. Architectural and Data Mix Decisions