Mohammed Abu Baker (Shahoyi) – Home

ARENA 6.0 Capstone: Model Organism of Encoded Reasoning

AI Safety

Technical

RL

TL;DR For our capstone project in ARENA 6.0 (Sep 2025), we tried to create a model organism of encoded reasoning from Qwen3-4B using RL. Specifically, we tried to make the…

Luca Baroni, Mo Baker

AI Safety Mindmap

AI Safety

Technical

AI safety is a broad field with many subfields, each prioritising and focusing on different aspects of making AI safe. Therefore, I decided to create a mindmap to hopefully capture…

Changes in Attention Patterns of Data Poisoned Sleeper Agent LLMs

Sleeper Agents

AI Safety

Technical

I’m excited to share some findings from a recent project I undertook as part of my master's studies. I dived into the attention patterns of “sleeper agent” Large Language…