[Submitted on 25 Feb 2025]
Abstract:Conventional self-attention mechanisms incur quadratic complexity, limiting their scalability on long sequences. We introduce FFTNet, an adaptive spectral filtering framework that leverages the Fast Fourier Transform (FFT) to achieve global
10 Comments
larodi
Man, this all really gets increasingly complex, with increasingly complex math…
avereveard
Isn't flash attention already n log n?
pointlessone
OK, I admit that the math flies way over my head and I barely understand the text around the math. Can someone please explain (in basic English) how this is equivalent to the attention mechanism? What frequencies does it talk about? How does it encode positional relations between tokens?
xeonmc
Basically it leverages the convolution theorem[0]: expensive convolutions in direct space become simple multiplications in reciprocal space, and vice versa.
Wherever you have a convolution operation on your data, transform it to the conjugate domain to turn it into a multiplication.
In other words, work in the domain that is natural to your data.
[0] https://en.wikipedia.org/wiki/Convolution_theorem
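A minimal numpy sketch of that idea (circular convolution; the array length and names are just for illustration): the direct O(n²) sum and the FFT-based O(n log n) route give the same result.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(8)   # signal
h = rng.standard_normal(8)   # filter

# Direct circular convolution: O(n^2)
direct = np.array([sum(x[j] * h[(i - j) % 8] for j in range(8)) for i in range(8)])

# Same operation via the convolution theorem: pointwise multiply in the
# frequency domain, then transform back. O(n log n).
spectral = np.fft.ifft(np.fft.fft(x) * np.fft.fft(h)).real

assert np.allclose(direct, spectral)
```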
yorwba
I don't see how you could fit causal masking into this framework without having to do n different FFTs, and there's no mention of positional embeddings either, so I guess the self-attention implementation being compared against is noncausal NoPE, which would make this a case of baseline sandbagging and maybe not so impressive.
If the results were close to state-of-the-art, probably the author would've mentioned it?
yagizdegirmenci
Google introduced this idea in 2021 with "FNet: Mixing Tokens with Fourier Transforms" [0].
Later they found that their TPUs' matrix multiplication was faster than the FFT in most scenarios.
[0]: https://arxiv.org/abs/2105.03824
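For context, FNet's mixing sublayer is tiny: as I recall from the paper, it replaces self-attention with an FFT along the hidden dimension, another along the sequence dimension, and keeps only the real part. A rough PyTorch sketch (shapes assumed to be (batch, seq_len, hidden)):

```python
import torch

def fnet_mixing(x: torch.Tensor) -> torch.Tensor:
    # x: (batch, seq_len, hidden)
    # FFT over the hidden dim, then over the sequence dim; keep the real part.
    return torch.fft.fft(torch.fft.fft(x, dim=-1), dim=-2).real
```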
cs702
TL;DR:
1. Take FNet (https://arxiv.org/abs/2105.03824).
2. Replace the fixed (frequency-domain) convolution filter with one that is dynamically computed from the data.
3. Apply non-linear functions to both real and imaginary components, before mapping the convolved data back to the time domain.
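A rough sketch of steps 1-3 in PyTorch; the module name, the MLP that produces the data-dependent filter, the mean-pooled context, and the GELU activations are my assumptions for illustration, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class AdaptiveSpectralFilter(nn.Module):
    """Hypothetical sketch of an FFTNet-style layer, not the authors' code."""

    def __init__(self, d_model: int):
        super().__init__()
        # Small MLP that produces a per-channel spectral filter from the data (step 2).
        self.filter_mlp = nn.Sequential(
            nn.Linear(d_model, d_model),
            nn.GELU(),
            nn.Linear(d_model, d_model),
        )
        self.act_re = nn.GELU()
        self.act_im = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model)
        X = torch.fft.rfft(x, dim=1)                    # to the frequency domain, O(n log n)
        ctx = x.mean(dim=1, keepdim=True)               # global summary of the sequence
        filt = self.filter_mlp(ctx)                     # data-dependent filter (step 2)
        X = X * filt                                    # multiplication here = convolution in time
        X = torch.complex(self.act_re(X.real),          # step 3: non-linearity on real
                          self.act_im(X.imag))          #         and imaginary parts
        return torch.fft.irfft(X, n=x.size(1), dim=1)   # back to the time domain

# Example usage:
# layer = AdaptiveSpectralFilter(64)
# y = layer(torch.randn(2, 128, 64))   # (batch=2, seq_len=128, d_model=64)
```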
A7C3D5
I'll never not read FFT as Final Fantasy Tactics.
DrNosferatu
Can someone confirm the big-O time complexities of
1. traditional self-attention;
2. FlashAttention;
3. any novel others?
nnnnico
It really is waves all the way down