State-Space Models: Is There a New Revolution on Our Doorstep
December 17, 2024
11:15
10:45
Hall A
English | Advanced | Sequential Data Processing (SSMs)
State-space models (SSMs) are a powerful mathematical framework for describing dynamic systems using sets of first-order differential or difference equations governed by state variables. Recently, deep learning researchers have unlocked a fascinating capability of SSMs: constructing efficient memory representations for sequential data, including text, time series, and audio. The beauty of SSM-based architectures lies in their duality. On one hand, they can be trained in a fully parallel regime, similar to Transformers, allowing for the utilization of massive datasets and computational resources. On the other hand, inference can be performed with the efficiency and speed of Recurrent Neural Networks (RNNs), even for extremely long sequences like DNAs, audio, and lengthy texts.

In the past four years, several innovative variants of SSM-based deep learning architectures have been proposed, including S3, S4, H3, and S5, each bringing its unique contributions and advancements. Among these, Mamba, or S6, stands out as a cutting-edge SSM recently introduced by leading researchers Albert Gu and Tri Dao in their paper, "Mamba: Linear-Time Sequence Modeling with Selective State Spaces." Mamba is specifically designed to tackle sequences with highly complex structures, showcasing its advanced capabilities in this domain.

In this presentation, we embark on a journey through the evolution of SSMs, starting from their early beginnings and delving into the mathematical foundations that underpin their remarkable capabilities. We explore how SSMs have revolutionized our approach to sequential data, offering new avenues for efficient and effective processing of highly-complex sequential data.
Mike E
Footer Social media icons - LinkedIn
Footer Social media icons - Twitter X
 Mike Erlihson
Head of AI

Mike Erlihson is a seasoned AI professional currently leading AI development at a stealth company, leveraging his PhD in Mathematics and extensive expertise in deep learning and data science. As a prolific scientific content creator and lecturer, he has reviewed over 250 deep learning papers and hosted more than 20 recorded podcasts in the field, building a substantial following of over 5OK in LinkedIn. In addition to his professional work, Mike is committed to education and knowledge-sharing in the AI community, making complex topics accessible through his various content platforms.

Cancellation Policy

Sponsor Cancellation:

In case of cancellation of the event, we will offer a full refund to all attendees and sponsors.

Attendee cancellations:

Up to 30 days prior to the event – 100% Refund 30-14 days prior to the event – 50% Refund No refund will be offered later than that.

Cancellation Policy

Sponsor Cancellation:

In case of cancellation of the event, we will offer a full refund to all attendees and sponsors.

Attendee cancellations:

Up to 30 days prior to the event – 100% Refund.
30-14 days prior to the event – 50% Refund.
No refund will be offered later than that.