Skip to content

HKU IDS Guest Seminar: Demystifying Attention Mechanism in Transformer and its application to Better Inference of Large Language Models (LLMs)