Resources

Subliminal Learning in AIs

Today’s freaky LLM behavior:

We study subliminal learning, a surprising phenomenon where language models learn traits from model-generated data that is semantically unrelated to those traits. For example, a “student” model learns to prefer owls when trained on sequences of numbers generated by a “teacher” model that prefers owls. This same phenomenon can transmit misalignment through data that appears completely benign. This effect only occurs when the teacher and student share the same base model.

Interesting security implications.

I am more convinced than ever that we need serious research into AI integrity if we are ever going to have trustworthy AI.

Related resources

8 September 2025

AI in Government

Just a few months after Elon Musk’s retreat from his unofficial role leading the Department of Government Efficiency (DOGE), we have a clearer picture of his vision of government powered […]

2 September 2025

1965 Cryptanalysis Training Workbook Released by the NSA

In the early 1960s, National Security Agency cryptanalyst and cryptanalysis instructor Lambros D. Callimahos coined the term “Stethoscope” to describe a diagnostic computer program used to unravel the internal structure […]

25 August 2025

Poor Password Choices

Look at this: McDonald’s chose the password “123456” for a major corporate system.

SharePoint under fire: ToolShell attacks hit organizations worldwide

U.S. Sanctions Firm Behind N. Korean IT Scheme; Arizona Woman Jailed for Running Laptop Farm

Resources

Subliminal Learning in AIs

Related resources

AI in Government

1965 Cryptanalysis Training Workbook Released by the NSA

Poor Password Choices

Links

Enterprise Solutions

SMB Solutions

Contact Us

Follow Us