24CH10039 AGV Task 4
Introduction to FPGA
FPGAs, or Field Programmable Gate Arrays, have become increasingly important in recent times. Like CPUs and GPUs they are general-purpose compute devices, but unlike them their hardware can be reconfigured, which lets them perform many calculations simultaneously in custom logic.
A neural network involves many matrix multiplications, and FPGAs can perform them in far fewer clock cycles than CPUs and GPUs, which makes them better suited to this workload.
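To make that contrast concrete, below is a minimal C sketch of the multiply-accumulate (MAC) pattern at the heart of matrix multiplication. The vector width and the HLS unroll directive mentioned in the comment are illustrative assumptions, not details from the paper.

/* Minimal C sketch of the multiply-accumulate (MAC) pattern an FPGA
 * parallelizes. On a CPU this loop runs sequentially; on an FPGA,
 * HLS-style tools can unroll it so all N multiplications happen in
 * the same clock cycle. N and the data here are illustrative. */
#include <stdio.h>

#define N 8  /* illustrative vector width */

float dot(const float x[N], const float w[N]) {
    float acc = 0.0f;
    /* #pragma HLS UNROLL -- on an FPGA, a directive like this would
       instantiate N parallel multipliers feeding an adder tree */
    for (int i = 0; i < N; i++)
        acc += x[i] * w[i];
    return acc;
}

int main(void) {
    float x[N] = {1, 2, 3, 4, 5, 6, 7, 8};
    float w[N] = {0.5f, 0.5f, 0.5f, 0.5f, 0.5f, 0.5f, 0.5f, 0.5f};
    printf("dot = %f\n", dot(x, w));  /* prints dot = 18.000000 */
    return 0;
}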
Some key advantages of FPGAs mentioned in the paper include:
1. Higher energy efficiency.
2. Parallel processing capabilities.
3. Real-time computation performance.
4. Flexibility in implementing custom algorithms.
Major companies have already started using FPGAs in their AI systems, such as Microsoft with the Bing search engine and Baidu with its speech recognition applications.
The model covers forward propagation in detail: input layer to hidden layer, hidden layer to hidden layer, and hidden layer to output layer. During the first stage, input vectors are multiplied by the
first hidden layer weight matrix using the multiply-add bank, which consists of many parallel
multiplication and accumulation units that can be adjusted. The results are then stored in memory
before being passed through activation functions implemented using lookup tables. This approach
allows different activation functions like sigmoid, ReLU, or tanh to be used by simply loading different
parameters into the lookup table. The output from the activation function is stored in another RAM
module before being processed by the next layer. For multi-layer networks, these components can be
reused to process each subsequent layer, making the architecture very flexible and efficient.
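The following C sketch mirrors that dataflow in software: a MAC loop stands in for the multiply-add bank, an array stands in for the intermediate RAM, and a precomputed table implements the activation. The layer sizes, table resolution, input range, and the choice of sigmoid are assumptions for illustration, not parameters from the paper.

/* Hedged sketch of one forward-propagation stage: multiply-add bank,
 * intermediate storage, then a lookup-table activation. Reloading the
 * table swaps the activation function without changing the datapath. */
#include <math.h>
#include <stdio.h>

#define IN  4            /* illustrative layer sizes */
#define OUT 3
#define LUT_SIZE 256     /* assumed table resolution */
#define LUT_MIN (-8.0f)  /* assumed input range covered by the table */
#define LUT_MAX  (8.0f)

static float act_lut[LUT_SIZE];

/* Fill the table with sigmoid samples; tanh or ReLU samples could be
   loaded instead, as the text describes. */
void load_sigmoid_lut(void) {
    for (int i = 0; i < LUT_SIZE; i++) {
        float z = LUT_MIN + (LUT_MAX - LUT_MIN) * i / (LUT_SIZE - 1);
        act_lut[i] = 1.0f / (1.0f + expf(-z));
    }
}

float lut_activate(float z) {
    if (z <= LUT_MIN) return act_lut[0];
    if (z >= LUT_MAX) return act_lut[LUT_SIZE - 1];
    int idx = (int)((z - LUT_MIN) / (LUT_MAX - LUT_MIN) * (LUT_SIZE - 1));
    return act_lut[idx];
}

/* One layer: the inner MAC loop is what a real design parallelizes;
   pre[] plays the role of the result RAM between the two stages. */
void forward_layer(const float x[IN], const float W[OUT][IN],
                   const float b[OUT], float y[OUT]) {
    float pre[OUT];
    for (int j = 0; j < OUT; j++) {
        float acc = b[j];
        for (int i = 0; i < IN; i++)   /* parallel MACs on the FPGA */
            acc += W[j][i] * x[i];
        pre[j] = acc;
    }
    for (int j = 0; j < OUT; j++)
        y[j] = lut_activate(pre[j]);   /* output RAM feeds the next layer */
}

int main(void) {
    load_sigmoid_lut();
    float x[IN] = {1, 0, -1, 2};
    float W[OUT][IN] = {{0.1f, 0.2f, 0.3f, 0.4f},
                        {0.5f, -0.5f, 0.5f, -0.5f},
                        {0.0f, 0.0f, 0.0f, 1.0f}};
    float b[OUT] = {0, 0, 0};
    float y[OUT];
    forward_layer(x, W, b, y);
    for (int j = 0; j < OUT; j++)
        printf("y[%d] = %f\n", j, y[j]);
    return 0;
}

For a multi-layer network, the same forward_layer routine is simply invoked once per layer, which matches the paper's point about reusing the same hardware components for each subsequent layer.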
The backward propagation process implements the learning algorithm that adjusts the network
weights. The paper uses cross-entropy as the loss function to calculate error derivatives for the
backpropagation algorithm. The core calculation involves multiplying the error (the difference
between predicted and actual values) by the derivative of the activation function.
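A minimal sketch of that calculation follows, assuming a sigmoid activation: with cross-entropy loss the output delta then simplifies to prediction minus target, and the hidden-layer step multiplies the backpropagated error by the sigmoid derivative a*(1-a). The layer sizes and values are illustrative assumptions.

/* Hedged sketch of the core backward-propagation step the text
 * describes: error multiplied by the activation derivative. */
#include <stdio.h>

#define HID 3
#define OUT 2

void backward_step(const float h[HID],       /* hidden activations */
                   const float y[OUT],       /* predicted outputs */
                   const float t[OUT],       /* target values */
                   const float W2[OUT][HID], /* hidden->output weights */
                   float dW2[OUT][HID],      /* weight gradients out */
                   float dh[HID])            /* hidden deltas out */
{
    float dy[OUT];
    for (int k = 0; k < OUT; k++)
        dy[k] = y[k] - t[k];          /* cross-entropy + sigmoid delta */

    for (int k = 0; k < OUT; k++)
        for (int j = 0; j < HID; j++)
            dW2[k][j] = dy[k] * h[j]; /* gradient for each weight */

    for (int j = 0; j < HID; j++) {
        float e = 0.0f;
        for (int k = 0; k < OUT; k++)
            e += W2[k][j] * dy[k];    /* error propagated backward */
        dh[j] = e * h[j] * (1.0f - h[j]); /* times sigmoid derivative */
    }
}

int main(void) {
    float h[HID] = {0.2f, 0.7f, 0.5f};
    float y[OUT] = {0.8f, 0.3f};
    float t[OUT] = {1.0f, 0.0f};
    float W2[OUT][HID] = {{0.1f, -0.2f, 0.3f}, {0.4f, 0.5f, -0.6f}};
    float dW2[OUT][HID], dh[HID];
    backward_step(h, y, t, W2, dW2, dh);
    printf("dh[0] = %f\n", dh[0]);
    return 0;
}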
The design is implemented on the Xilinx ZU9CG FPGA SoC platform, which offers 2520 DSP slices and 32 Mb of on-chip memory. For larger networks, multiple FPGAs can be clustered. Additionally, the paper mentions the possibility of deploying deep learning frameworks such as TensorFlow directly on the 64-bit FPGA SoC platform, with the frameworks calling FPGA hardware resources directly.