Shannon's Scaling Laws
A Folder from David
Attention Is All You Need
A Mathematical Theory of Communication
Prediction and Entropy of Printed English
Scaling Laws for Neural Language Models
Scaling Laws and Interpretability of Learning
A Mathematical Framework for Transformer Circuits
Training Compute-Optimal Large Language Models
The Entropy of Words—Learnability and Expressivity across More than 1000 Languages
Human languages order information efficiently
Different languages, similar encoding efficiency: Comparable information rates across the human communicative niche
Towards a universal model of reading | Behavioral and Brain Sciences
The Collected Papers of Charles Sanders Peirce
Deep Learning and the Information Bottleneck Principle