PRIMATE: Processing in Memory Acceleration for Dynamic Token-pruning Transformers
Published in 29th Asia and South Pacific Design Automation Conference (ASP-DAC), 2024
Recommended citation: Yue Pan, Minxuan Zhou, Chonghan Lee, Zheyu Li, Rishika Kushwah, Vijaykrishnan Narayanan, and Tajana Rosing, “PRIMATE: Processing in Memory Acceleration for Dynamic Token-pruning Transformers”, 29th Asia and South Pacific Design Automation Conference (ASP-DAC), 2024