HKU IDS Scholar Seminar Series #24: Unlocking Interpretable Control for Large Language Models and Beyond via Sparse Autoencoders Read More
HKU IDS Scholar Seminar Series #23: A Tangram Theory of Generalization: Rethinking Machine Learning via the Lens of Composition Read More