FNET EXPERIMENTATION

This document presents a comparative study of Discrete Fourier Transform (DFT) and Discrete Cosine Transform (DCT) within FNet architectures, highlighting the advantages of FNet over traditional transformers in terms of memory usage and training speed. The study utilizes the Jigsaw Toxic Comment Classification dataset to evaluate model performance based on various metrics. Key methodologies include the implementation of FNet with different DCT variants and preprocessing steps for data preparation.


Comparative Study of DFT and DCT in FNet Architectures

Submitted by Param Rastogi and Sachin
Dayalbagh Educational Institute
November 2024
Introduction to Transformers
• What are Transformers?
  - Revolutionized NLP with self-attention mechanisms (Vaswani et al., 2017).
  - Applications: Machine Translation, Text Summarization, Sentiment Analysis.
• Limitations:
  - High memory usage due to the quadratic complexity of self-attention.
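The quadratic cost noted above can be seen directly in a minimal NumPy sketch of scaled dot-product attention (illustrative code, not taken from the report): for a length-n sequence, the intermediate score matrix is n × n, so memory grows quadratically with n.

```python
import numpy as np

def self_attention(q, k, v):
    """Single-head scaled dot-product attention (illustrative sketch).
    The score matrix is (n, n), so memory is quadratic in sequence length n."""
    scores = q @ k.T / np.sqrt(q.shape[-1])         # (n, n) score matrix
    scores -= scores.max(axis=-1, keepdims=True)    # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ v

n, d = 128, 16
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((n, d)) for _ in range(3))
out = self_attention(q, k, v)
print(out.shape)  # (128, 16); the intermediate score matrix was (128, 128)
```

Doubling n doubles the output size but quadruples the score matrix, which is the memory bottleneck FNet is designed to remove.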
FNet Model Overview
• What is FNet?
  - Introduced by Lee-Thorp et al. in 2021.
  - Replaces self-attention with a Fourier Transform (FT).
• Key Advantages:
  - Faster training: roughly 80% faster on GPUs and 70% faster on TPUs.
  - Reduced memory requirements.
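The Fourier mixing step can be sketched in a few lines of NumPy (an illustrative reconstruction, not the report's code): the token-mixing sublayer applies a 2D DFT across the sequence and hidden dimensions and keeps only the real part, with no learned parameters, which is the source of the speed and memory savings.

```python
import numpy as np

def fourier_mixing(x):
    """FNet token-mixing sublayer (Lee-Thorp et al., 2021): 2D DFT over the
    sequence and hidden dimensions, keeping only the real part.
    No learned parameters, unlike self-attention."""
    return np.fft.fft2(x).real

x = np.zeros((128, 64))  # (seq_len, hidden)
x[0, 0] = 1.0            # a single impulse at the origin
mixed = fourier_mixing(x)
print(mixed.shape)       # (128, 64): the input shape is preserved
```

The DFT of a unit impulse at the origin is 1 everywhere, so this example also shows how a single token position is mixed into every output position.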
Methodology
• Model Architecture: FNet with DFT, 1D-DCT, and 2D-DCT variants.
• Evaluation Metrics: Accuracy, Precision, Recall, F1-score, training time, and memory usage.
• Experimental Design: Preprocessing steps include tokenization, normalization, and padding.
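The 1D-DCT and 2D-DCT mixing variants can be sketched with SciPy's DCT routines (an assumption — the report does not name a library, and the function names here are illustrative): the 1D variant transforms along the sequence axis only, while the 2D variant transforms along both axes, mirroring the 2D DFT in standard FNet.

```python
import numpy as np
from scipy.fft import dct  # assumed library; the report does not specify one

def dct_mixing_1d(x):
    # 1D-DCT variant: orthonormal DCT-II along the sequence axis only.
    return dct(x, type=2, norm="ortho", axis=0)

def dct_mixing_2d(x):
    # 2D-DCT variant: DCT-II along the sequence axis, then the hidden axis.
    return dct(dct(x, type=2, norm="ortho", axis=0), type=2, norm="ortho", axis=1)

x = np.random.default_rng(1).standard_normal((128, 64))  # (seq_len, hidden)
print(dct_mixing_1d(x).shape, dct_mixing_2d(x).shape)    # both (128, 64)
```

Unlike the DFT, the DCT is real-valued, so no real-part projection is needed; with `norm="ortho"` the transform is orthogonal and preserves the signal's energy.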
Dataset Description
• Dataset: Jigsaw Toxic Comment Classification
  - Purpose: classify toxicity in online comments.
  - Size: 230,000 labeled comments.
  - Labels: Toxic, Severe Toxic, Obscene, Threat, Insult, Identity Hate.
• Preprocessing Steps:
  - Tokenization using NLTK's TweetTokenizer.
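The preprocessing steps (tokenization, normalization, padding) can be sketched in plain Python; a whitespace tokenizer stands in for NLTK's TweetTokenizer here, and `max_len` and the pad token are illustrative choices, not values from the report:

```python
def preprocess(texts, max_len=8, pad_token="<pad>"):
    """Illustrative pipeline: tokenize (whitespace here; the report uses
    NLTK's TweetTokenizer), lowercase-normalize, then truncate/right-pad
    every comment to a fixed length so batches have uniform shape."""
    batch = []
    for text in texts:
        tokens = text.lower().split()[:max_len]          # tokenize + normalize + truncate
        tokens += [pad_token] * (max_len - len(tokens))  # right-pad to max_len
        batch.append(tokens)
    return batch

batch = preprocess(["This comment is fine", "You are TOXIC"])
print(batch[1])  # ['you', 'are', 'toxic', '<pad>', '<pad>', '<pad>', '<pad>', '<pad>']
```

Fixed-length padded sequences are what allow the FFT/DCT mixing layers to operate on a regular (batch, seq_len, hidden) tensor.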
FNet Model Architecture
• Refer to Figure 1 in the report for the
architecture diagram of FNet variants.
