Hypothesis Subspace

How does representational alignment relate to concrete challenges in alignment?

Similar to abstraction inductors, physicalist fluency provides a different path to promoting shared abstractions between human and AI, which can then be made use of as part of the objective function. However, it has the added benefit of providing an arbitrary limit on the richness of internal representations in relation to those of one human. In this, it addresses inner alignment.

How does representational alignment relate to concrete challenges in alignment?