Hypothesis Subspace

Representational Tethers

Representational tethers are a merger of [[bridger-languages]] and [[representational-alignment]]. They are means of relating the internal representations of ML models with internal representations employed by humans. One part of this process is incentivizing the artificial representations to be cross-compatible with humans ones (the legacy of [[representational-alignment]]). Another aspect is enabling people to perceive and understand said artificial representations by making them more legible and cognitively ergonomic for humans (the legacy of [[bridger-languages]]). The two previous frames complement each other: it's easier to understand a language containing concepts you're familiar with than a more alien one (a shortcoming of [[differentiable-cosmogonies]]), and it's easier to keep a representation within reach if you make sure it's consciously accessible at the other end. Representational tethers are meant to further alignment by improving human oversight or automated versions thereof.

Representational Tethers