Numbered distributions over lettered ones because one model's output is another model's inputs in interesting arrangements. Over named ones because names create clutter.
Name learned maps by letter on tail. Shared weights translated to identical names across arrows.