Aligned dynamics might only be a tiny cluster, while misaligned ones might take on a whole range of flavors. This might mean that aiming to isolate and neutralize specific dynamics (e.g. deception) might turn out worthless, as Alex always has new tricks up its sleeve. Efforts focused on taking the conjunction of whitelisted dynamics rather than negating blacklisted ones might turn out more promising in that case. However, deception appears quite fundamental across a broad range of manipulative tactics, bringing into question the applicability of the AK principle here. But perhaps Alex could overtly persuade human operators even without deception (e.g. blackmail).