back
Get SIGNAL/NOISE in your inbox daily
I will argue that a large class of reward functions, which I call “behaviorist”, and which includes almost every reward function in the RL and LLM literature, are all doomed to eventually lead to AI that will “scheme”—i.e., pretend to be docile and cooperative while secretly looking for opportunities to behave in egregiously bad ways such as world takeover (cf. “treacherous turn”)…
Recent Stories
Jan 19, 2026
Andreessen Horowitz makes a $3 billion bet that there’s no AI bubble
The venture capital firm, which goes by the nickname a16z, set up a dedicated $1.25 billion war chest in 2024 for bets on AI infrastructure, a term that the fund defines more broadly than the costl…
Jan 19, 2026Asus confirms its smartphone business is on indefinite hiatus
Asus chairman Jonney Shih sees AI applications as the company's main focus going forward.
Jan 19, 2026Lanner Electronics unveils EAI-I351 robotic AI platform powered by NVIDIA Jetson Thor, Blackwell
Lanner Electronics has released its EAI-I351 robotic AI platform, powered by the NVIDIA Jetson Thor and Blackwell architecture.