The Bayesian Conspiracy
213 – Are Transformer Models Aligned By Default?
Our species has begun to scrute the inscrutable shoggoth! With Matt Freeman LINKS Anthropics latest AI Safety research paper, on interpretability Anthropic is hiring Episode 93 of The Mind Killer T