Events

  • Apr 19

    CS Seminar: Hardware-Aware Efficient Primitives for Machine Learning (Dan Fu)

    Department of Computer Science (CS)

    12:00 PM 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    EVENT DETAILS

    Friday / CS Seminar
    April 19th / 12:00 PM
    In Person / Mudd 3514

    Speaker
    Dan Fu, Stanford University

    Talk Title
    Hardware-Aware Efficient Primitives for Machine Learning

    Abstract
    Efficiency is increasingly tied to quality in machine learning, with more efficient training algorithms leading to more powerful models. However, today's most popular machine learning models are built on asymptotically inefficient primitives. For example, attention in Transformers scales quadratically in the input size, while MLPs scale quadratically in model dimension. In this talk, I discuss my work on improving the efficiency of the core primitives in machine learning, with an emphasis on hardware-aware algorithms and long-context applications. First, I focus on replacing attention with gated state space models (SSMs) and convolutions, which scale sub-quadratically in context length. I describe the H3 (Hungry Hungry Hippos) architecture, a gated SSM architecture that matches Transformers in quality up to 3B parameters and achieves 2.4x faster inference. Second, I focus on developing hardware-aware algorithms for SSMs and convolutions. I describe FlashFFTConv, a fast algorithm for computing SSMs and convolutions on GPU by optimizing the Fast Fourier Transform (FFT). FlashFFTConv yields up to 7x speedup and 5x memory savings, even over vendor solutions from Nvidia. Third, I will briefly touch on how these same techniques can also be used to develop sub-quadratic scaling in the model dimension. I will describe Monarch Mixer, which uses a generalization of the FFT to achieve sub-quadratic scaling in both sequence length and model dimension. Throughout the talk, I will give examples of how these ideas are beginning to take hold, with gated SSMs and their variants now leading to state-of-the-art performance in long-context language models, embedding models, and DNA foundation models.

    Biography
    Dan Fu is a PhD student in the Computer Science Department at Stanford University, where he is co-advised by Christopher Ré and Kayvon Fatahalian. His research interests are at the intersection of systems and machine learning. Recently, he has focused on developing algorithms and architectures to make machine learning more efficient, especially for enabling longer-context applications. His research has appeared as oral and spotlight presentations at NeurIPS, ICML, and ICLR, and he has received the best student paper runner-up award at UAI. Dan has also been supported by an NDSEG fellowship.

    Research Area/Interests
    machine learning, systems

    TIME Friday, April 19, 2024 at 12:00 PM - 1:00 PM

    LOCATION 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    CONTACT Wynante R Charles    wynante.charles@northwestern.edu

    CALENDAR Department of Computer Science (CS)

  • May 1

    ECE Distinguished Seminar - Gu-Yeon Wei

    Department of Electrical and Computer Engineering (ECE)

    11:00 AM L440, Technological Institute

    EVENT DETAILS

    TIME Wednesday, May 1, 2024 at 11:00 AM - 12:00 PM

    LOCATION L440, Technological Institute

    CONTACT Catherine Healey    catherine.healey@northwestern.edu

    CALENDAR Department of Electrical and Computer Engineering (ECE)

  • May 1

    CS Distinguished Lecture: Tong Zhang

    Department of Computer Science (CS)

    12:00 PM 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    EVENT DETAILS

    TIME Wednesday, May 1, 2024 at 12:00 PM - 1:00 PM

    LOCATION 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    CONTACT Wynante R Charles    wynante.charles@northwestern.edu

    CALENDAR Department of Computer Science (CS)

  • May 8

    CS Distinguished Lecture: Monica Lam

    Department of Computer Science (CS)

    12:00 PM 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    EVENT DETAILS

    TIME Wednesday, May 8, 2024 at 12:00 PM - 1:00 PM

    LOCATION 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    CONTACT Wynante R Charles    wynante.charles@northwestern.edu

    CALENDAR Department of Computer Science (CS)

  • May 9

    MS in Information Technology (MSIT) Online Information Session

    MS in Information Technology (MSIT) Program

    6:00 PM

    EVENT DETAILS

    TIME Thursday, May 9, 2024 at 6:00 PM - 7:00 PM

    CONTACT Svetlana Korzeniowski    msit@northwestern.edu

    CALENDAR MS in Information Technology (MSIT) Program

  • May 10

    CS Distinguished Lecture: Trust, Backdoor Vulnerabilities and Possible Mitigations (Shafi Goldwasser)

    Department of Computer Science (CS)

    12:00 PM 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    EVENT DETAILS

    TIME Friday, May 10, 2024 at 12:00 PM - 1:00 PM

    LOCATION 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    CONTACT Wynante R Charles    wynante.charles@northwestern.edu

    CALENDAR Department of Computer Science (CS)

  • May 30

    TGI Bagel Thursday

    Department of Computer Science (CS)

    9:00 AM 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    EVENT DETAILS

    TIME Thursday, May 30, 2024 at 9:00 AM - 11:00 AM

    LOCATION 3514, Mudd Hall (formerly Seeley G. Mudd Library)

    CONTACT Wynante R Charles    wynante.charles@northwestern.edu

    CALENDAR Department of Computer Science (CS)

  • May 30

    End of Year Awards

    Department of Computer Science (CS)

    3:00 PM TBA, Mudd Hall (formerly Seeley G. Mudd Library)

    EVENT DETAILS

    TIME Thursday, May 30, 2024 at 3:00 PM - 5:00 PM

    LOCATION TBA, Mudd Hall (formerly Seeley G. Mudd Library)

    CONTACT Wynante R Charles    wynante.charles@northwestern.edu

    CALENDAR Department of Computer Science (CS)