Anthropic on Twitter: "Neural networks often pack many unrelated concepts into a single neuron – a puzzling phenomenon known as 'polysemanticity' which makes interpretability much more challenging..."