Discussion about this post

User's avatar
Brad & Butter's avatar

The idiot in me can't help but hope this problem can be rephrased as equivalent to how political nepotism happens to the initially innocent ("vandals"). When the goal is consistently aligned, but the environment adapts based on previous behavior, the current recommended behavior would change to minimize goal-acquisition and tolerating unintended side effects. This idea can be transferred to collaborative agents as well, including some deceptive agents ("deputies"). https://graymirror.substack.com/p/3-descriptive-constitution-of-the?s=r

Expand full comment
Bicolio's avatar

“Luckily, most doctors are smarter than addicts”

Yikes. I dunno about that.

Expand full comment
2 more comments...

No posts