Query Processing

This document discusses query processing in a database management system. It involves five main steps: 1) parsing the query syntax and checking privileges, 2) translating the query from SQL to relational algebra, 3) optimizing the query using statistical data, 4) generating an execution plan, and 5) evaluating and executing the plan to retrieve the query results from the database efficiently. The goal is to choose the execution plan that minimizes the time and resources needed to run the query.

Uploaded by

anon_189503955

Query Processing

Query processing is the entire activity of translating a query into low-level
instructions, optimizing it to save resources, estimating the cost of evaluating it,
and extracting the requested data from the database.
It is the step-by-step process of breaking a high-level language down into a low-level
language that the machine can understand, so that it can perform the action the user
requested. The query processor in the DBMS performs this task.

Let us consider the following two relations as the example tables for our
discussion:
Employee (Eno, Ename, Phone)
Proj_Assigned (Eno, Proj_No, Role, DOP)
where
• Eno is the employee number,
• Ename is the employee name,
• Proj_No is the number of the project to which an employee is assigned,
• Role is the role of an employee in a project,
• DOP is the duration of the project in months.
Here we write a query to list all employees who are working on a project whose
duration (DOP) is more than 10 months.
SELECT Ename
FROM Employee, Proj_Assigned
WHERE Employee.Eno = Proj_Assigned.Eno AND DOP > 10;
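To try the query, one option is an in-memory SQLite database driven from Python. This is only a sketch: the sample rows are made up for illustration, the column types are assumptions, and an ORDER BY has been added so that the output order is predictable.

```python
import sqlite3

# In-memory database; the rows below are invented sample data, not from the text.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Employee (Eno INTEGER PRIMARY KEY, Ename TEXT, Phone TEXT);
    CREATE TABLE Proj_Assigned (Eno INTEGER, Proj_No TEXT, Role TEXT, DOP INTEGER);
    INSERT INTO Employee VALUES
        (1, 'Asha', '555-0101'), (2, 'Ravi', '555-0102'), (3, 'Meena', '555-0103');
    INSERT INTO Proj_Assigned VALUES
        (1, 'P1', 'Developer', 12), (2, 'P2', 'Tester', 6), (3, 'P1', 'Manager', 12);
""")

# The example query, with ORDER BY added only to make the output deterministic.
rows = conn.execute("""
    SELECT Ename
    FROM Employee, Proj_Assigned
    WHERE Employee.Eno = Proj_Assigned.Eno AND DOP > 10
    ORDER BY Ename
""").fetchall()

names = [row[0] for row in rows]
print(names)  # ['Asha', 'Meena']
```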
Step 1: Parsing
In this step, the parser of the query processor module checks the syntax of the
query, the user's privileges to execute it, and the table names and attribute
names it uses. The correct table names, attribute names, and user privileges
are taken from the system catalog (data dictionary).
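The catalog checks in this step can be pictured with a small Python sketch. The dictionaries CATALOG and PRIVILEGES, the user names, and the function check_query are all hypothetical stand-ins for the system catalog; a real parser would also check the SQL grammar, which is left out here.

```python
# Hypothetical in-memory "system catalog": table name -> set of attribute names.
CATALOG = {
    "Employee": {"Eno", "Ename", "Phone"},
    "Proj_Assigned": {"Eno", "Proj_No", "Role", "DOP"},
}
# Hypothetical privilege table: user -> tables that user may read.
PRIVILEGES = {"alice": {"Employee", "Proj_Assigned"}, "bob": {"Employee"}}


def check_query(user, tables, attributes):
    """Return a list of error messages; an empty list means all checks passed."""
    errors = []
    for table in tables:
        if table not in CATALOG:
            errors.append(f"unknown table: {table}")
        elif table not in PRIVILEGES.get(user, set()):
            errors.append(f"{user} may not read {table}")
    # Attributes are valid if any table named in the query defines them.
    known = set().union(*(CATALOG.get(t, set()) for t in tables))
    for attr in attributes:
        if attr not in known:
            errors.append(f"unknown attribute: {attr}")
    return errors


print(check_query("alice", ["Employee", "Proj_Assigned"], ["Ename", "Eno", "DOP"]))
print(check_query("bob", ["Employee", "Proj_Assigned"], ["Ename"]))
```

The first call returns an empty list; the second reports that bob lacks the privilege on Proj_Assigned.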

Step 2: Translation
If the query is valid, it is converted from the high-level language SQL into a
lower-level representation in relational algebra.
For example, our SQL query can be converted into the following equivalent
relational algebra expression:
π_{Ename}(σ_{DOP>10 ∧ Employee.Eno=Proj_Assigned.Eno}(Employee × Proj_Assigned))
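As an illustration, the expression that applies the selection and projection to the cross product of the two tables can be evaluated literally over small in-memory relations. The helper functions cross, select, and project and the sample tuples are assumptions made for this sketch; attribute names are qualified so that the two Eno columns stay distinct.

```python
from itertools import product

# Invented sample tuples; only the attributes the query needs are included.
employee = [
    {"Employee.Eno": 1, "Ename": "Asha"},
    {"Employee.Eno": 2, "Ename": "Ravi"},
]
proj_assigned = [
    {"Proj_Assigned.Eno": 1, "DOP": 12},
    {"Proj_Assigned.Eno": 2, "DOP": 6},
]


def cross(r, s):
    """Cartesian product: every tuple of r paired with every tuple of s."""
    return [{**a, **b} for a, b in product(r, s)]


def select(pred, r):
    """Selection: keep the tuples for which pred is true."""
    return [t for t in r if pred(t)]


def project(attrs, r):
    """Projection with duplicate elimination, keeping first-seen order."""
    seen, out = set(), []
    for t in r:
        key = tuple(t[a] for a in attrs)
        if key not in seen:
            seen.add(key)
            out.append(dict(zip(attrs, key)))
    return out


result = project(
    ["Ename"],
    select(
        lambda t: t["DOP"] > 10 and t["Employee.Eno"] == t["Proj_Assigned.Eno"],
        cross(employee, proj_assigned),
    ),
)
print(result)  # [{'Ename': 'Asha'}]
```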
Step 3: Optimizer
The optimizer uses the statistical data stored as part of the data dictionary. These
statistics describe the size of each table, the length of its records, the indexes
created on it, and so on. The optimizer also examines the conditions in the query
and the attributes those conditions refer to.
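One common textbook use of such statistics is to estimate how many rows a condition will keep. The sketch below assumes values are uniformly distributed; the ColumnStats class, its field names, and the numbers are hypothetical, and real optimizers typically use richer statistics such as histograms.

```python
from dataclasses import dataclass


@dataclass
class ColumnStats:
    """Hypothetical per-column statistics, as a catalog might store them."""
    n_rows: int       # number of tuples in the table
    n_distinct: int   # number of distinct values in the column
    min_val: float
    max_val: float


def estimate_equals(stats):
    """Estimated rows matching `col = v`, assuming uniformly distributed values."""
    return stats.n_rows / stats.n_distinct


def estimate_greater_than(stats, v):
    """Estimated rows matching `col > v`, assuming a uniform spread over [min, max]."""
    if v >= stats.max_val:
        return 0.0
    if v < stats.min_val:
        return float(stats.n_rows)
    return stats.n_rows * (stats.max_val - v) / (stats.max_val - stats.min_val)


# Invented statistics for the DOP column of Proj_Assigned.
dop = ColumnStats(n_rows=1000, n_distinct=24, min_val=1, max_val=25)
print(estimate_greater_than(dop, 10))  # 625.0
```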
Step 4: Execution Plan
At this stage, the query processor module uses the information collected in step 3
to find different relational algebra expressions that are equivalent to the one we
have already written, that is, expressions that return the same result.
For our example, the query written in relational algebra can also be written
as follows:
π_{Ename}(Employee ⋈_{Eno} (σ_{DOP>10}(Proj_Assigned)))

Step 5: Evaluation
Many execution plans can be constructed with the help of the statistical data. Although
they return the same result, they differ in the time they take and the space they
require to execute the query. Hence, it is necessary to choose the plan with the
lowest cost.
At this stage, we choose one execution plan. This execution plan accesses data
from the database to give the final result.
• In our example, the second plan is likely to be better. In the first plan, we join
two relations (a costly operation) and then apply the condition (conditions act
as filters) to the joined relation. This consumes more time as well as more
space.
• In the second plan, we filter one of the tables (Proj_Assigned) and join the
result with the Employee table. This join may need to compare a smaller
number of records. Hence, the second plan is the better choice given the
information we have, although this is not always the case.
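The difference between the two plans can be made concrete by counting the intermediate tuples each one produces. The data below is invented (100 employees with one assignment each, 10 of them on projects longer than 10 months), and the dictionary lookup merely stands in for a join algorithm; it is a sketch, not a cost model.

```python
# Invented data: (Eno, Ename) and (Eno, DOP) tuples.
employee = [(eno, f"emp{eno}") for eno in range(100)]
proj_assigned = [(eno, 12 if eno < 10 else 6) for eno in range(100)]

# Plan 1: build the cross product first, then filter on the join condition and DOP.
plan1_intermediate = [(e, p) for e in employee for p in proj_assigned]
plan1_result = sorted(
    e[1] for e, p in plan1_intermediate if e[0] == p[0] and p[1] > 10
)

# Plan 2: filter Proj_Assigned first, then join the survivors with Employee.
plan2_intermediate = [p for p in proj_assigned if p[1] > 10]
by_eno = {e[0]: e[1] for e in employee}  # lookup table standing in for the join
plan2_result = sorted(by_eno[p[0]] for p in plan2_intermediate if p[0] in by_eno)

print(len(plan1_intermediate), len(plan2_intermediate))  # 10000 10
print(plan1_result == plan2_result)                      # True
```

Both plans return the same names, but the first materializes 10,000 intermediate tuples where the second needs only 10.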

Query Optimization:
A single query can be executed through different algorithms or rewritten in
different forms and structures. This raises the central question of query
optimization: which of these forms or pathways is the most efficient? The
query optimizer attempts to determine the most efficient way to execute a given
query by considering the possible query plans.
Importance: The goal of query optimization is to reduce the system resources
required to fulfill a query, and ultimately to provide the user with the correct
result set faster.
• First, it provides the user with faster results, which makes the application
seem faster to the user.
• Secondly, it allows the system to service more queries in the same amount of
time, because each request takes less time than an unoptimized query would.
• Thirdly, query optimization ultimately reduces the amount of wear on the
hardware (e.g. disk drives) and allows the server to run more efficiently
(e.g. lower power consumption, less memory usage).
The query optimizer uses the following two techniques to determine which process
or expression to consider for evaluating the query.

Cost based Optimization

This is based on the estimated cost of the query. The query can follow different
access paths depending on indexes, constraints, sorting methods, etc. This method
mainly uses statistics such as record size, number of records, number of records per
block, number of blocks, table size, whether the whole table fits in a block,
organization of tables, uniqueness of column values, size of columns, etc.

Techniques used in cost-based optimization include:
• Dynamic programming
• Left-deep trees
• Interesting sort orders
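To give a feel for how dynamic programming and left-deep trees fit together, here is a minimal sketch of a join-order search. The cost model (the sum of estimated intermediate result sizes), the function name best_left_deep_order, and the table sizes and selectivities are all assumptions for illustration; production optimizers use far more detailed cost formulas.

```python
from itertools import combinations


def best_left_deep_order(sizes, selectivity):
    """Pick a left-deep join order minimising the total size of intermediate results.

    sizes: table name -> estimated row count.
    selectivity: frozenset({a, b}) -> fraction of the cross product of a and b
    kept by their join predicate (1.0, i.e. a cross product, when absent).
    """
    tables = sorted(sizes)
    # best[subset] = (cost so far, estimated rows, join order)
    best = {frozenset([t]): (0.0, float(sizes[t]), (t,)) for t in tables}
    for k in range(2, len(tables) + 1):
        for combo in combinations(tables, k):
            subset = frozenset(combo)
            for right in combo:  # left-deep: the right input is always a base table
                left = subset - {right}
                cost, rows, order = best[left]
                sel = 1.0
                for t in left:
                    sel *= selectivity.get(frozenset({t, right}), 1.0)
                out_rows = rows * sizes[right] * sel
                candidate = (cost + out_rows, out_rows, order + (right,))
                if subset not in best or candidate[0] < best[subset][0]:
                    best[subset] = candidate
    return best[frozenset(tables)]


# Invented statistics for three hypothetical tables A, B and C.
sizes = {"A": 1000, "B": 100, "C": 10}
selectivity = {frozenset({"A", "B"}): 0.01, frozenset({"B", "C"}): 0.1}
cost, rows, order = best_left_deep_order(sizes, selectivity)
print(order, cost)
```

With these numbers the search joins the two small tables B and C first and brings in the large table A last. Each subset of tables is solved once and reused, which is what makes the approach dynamic programming rather than brute-force enumeration of every order.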
Heuristic Optimization (Logical)
This method is also known as rule-based optimization. It is based on the
equivalence rules for relational expressions, so the number of alternative
query forms that must be considered is reduced. Hence the cost of the query is
reduced too.
Some of the common heuristic rules are:
• Perform select and project operations before join operations. This is done by
moving the select and project operations down the query tree. This reduces
the number of tuples available for the join.
• Perform the most restrictive select/project operations first, before the other
operations.
• Avoid cross-product operations, since they result in very large intermediate
tables.
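The first rule, moving a selection below a join, can be sketched as a rewrite over a small query tree. The classes Table, Select, and Join and the function push_down are hypothetical; the sketch only handles a selection on a single attribute and pushes it down when exactly one join input supplies that attribute.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Table:
    name: str
    attrs: frozenset


@dataclass(frozen=True)
class Select:
    attr: str        # attribute tested by the predicate
    predicate: str   # predicate text, kept only for display
    child: object


@dataclass(frozen=True)
class Join:
    left: object
    right: object


def attrs_of(node):
    """Attributes available in the output of a node."""
    if isinstance(node, Table):
        return node.attrs
    if isinstance(node, Select):
        return attrs_of(node.child)
    return attrs_of(node.left) | attrs_of(node.right)


def push_down(node):
    """Move each selection below a join when exactly one input has its attribute."""
    if isinstance(node, Table):
        return node
    if isinstance(node, Join):
        return Join(push_down(node.left), push_down(node.right))
    child = push_down(node.child)
    if isinstance(child, Join):
        in_left = node.attr in attrs_of(child.left)
        in_right = node.attr in attrs_of(child.right)
        if in_left and not in_right:
            return Join(push_down(Select(node.attr, node.predicate, child.left)), child.right)
        if in_right and not in_left:
            return Join(child.left, push_down(Select(node.attr, node.predicate, child.right)))
    return Select(node.attr, node.predicate, child)


employee = Table("Employee", frozenset({"Eno", "Ename", "Phone"}))
proj = Table("Proj_Assigned", frozenset({"Eno", "Proj_No", "Role", "DOP"}))
before = Select("DOP", "DOP > 10", Join(employee, proj))
after = push_down(before)
print(after == Join(employee, Select("DOP", "DOP > 10", proj)))  # True
```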

Steps for Query Optimization

Query optimization involves three steps, namely query tree generation, plan
generation, and query plan code generation.

Step 1 − Query Tree Generation

A query tree is a tree data structure representing a relational algebra expression. The
tables of the query are represented as leaf nodes. The relational algebra operations are
represented as the internal nodes. The root represents the query as a whole.
During execution, an internal node is executed whenever its operand tables are
available. The node is then replaced by the result table. This process continues for all
internal nodes until the root node is executed and replaced by the result table.

For example, let us consider the following schemas −

EMPLOYEE (EmpID, EName, Salary, DeptNo, DateOfJoining)

DEPARTMENT (DNo, DName, Location)
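A query tree over these two schemas, and its bottom-up evaluation, might be sketched as follows. The query itself (names of employees whose department is located in a given city), the sample rows, and the Node class are assumptions made for illustration; they do not come from the text.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Node:
    """One query-tree node: a leaf holds a table, an internal node an operation."""
    label: str
    op: Optional[Callable[..., List[dict]]] = None  # None for leaf nodes
    children: tuple = ()
    rows: Optional[List[dict]] = None               # set up front for leaves


def evaluate(node):
    """Post-order evaluation: operands first, then the node takes its result table."""
    if node.rows is None:
        operands = [evaluate(child) for child in node.children]
        node.rows = node.op(*operands)
    return node.rows


# Leaf nodes with invented rows (only the attributes this query needs).
employee = Node("EMPLOYEE", rows=[
    {"EmpID": 1, "EName": "Asha", "DeptNo": 10},
    {"EmpID": 2, "EName": "Ravi", "DeptNo": 20},
])
department = Node("DEPARTMENT", rows=[
    {"DNo": 10, "DName": "Sales", "Location": "Pune"},
    {"DNo": 20, "DName": "HR", "Location": "Delhi"},
])

# Internal nodes: selection, join, and projection at the root.
select_loc = Node(
    "select Location = 'Pune'",
    lambda d: [t for t in d if t["Location"] == "Pune"],
    (department,),
)
join = Node(
    "join DeptNo = DNo",
    lambda e, d: [{**x, **y} for x in e for y in d if x["DeptNo"] == y["DNo"]],
    (employee, select_loc),
)
root = Node("project EName", lambda r: [{"EName": t["EName"]} for t in r], (join,))

print(evaluate(root))  # [{'EName': 'Asha'}]
```

Storing the result in node.rows mirrors the description above: once an internal node has been executed, it is replaced by its result table.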

Step 2 − Query Plan Generation

After the query tree is generated, a query plan is made. A query plan is an extended
query tree that includes access paths for all operations in the query tree. Access paths
specify how the relational operations in the tree should be performed. For example, a
selection operation can have an access path that gives details about the use of B+ tree
index for selection.
Besides, a query plan also states how the intermediate tables should be passed from
one operator to the next, how temporary tables should be used and how operations
should be pipelined/combined.
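Pipelining can be illustrated with Python generators, where each operator hands a tuple to the next as soon as it is produced instead of building a temporary table first. The operator functions scan, select, and project and the sample rows are hypothetical; this is a sketch of the idea, not of how any particular DBMS implements it.

```python
def scan(rows):
    """Leaf operator: yield stored tuples one at a time."""
    for row in rows:
        yield row


def select(pred, source):
    """Pass on each tuple that satisfies pred as soon as it arrives."""
    for row in source:
        if pred(row):
            yield row


def project(attrs, source):
    """Keep only the listed attributes of each incoming tuple."""
    for row in source:
        yield {a: row[a] for a in attrs}


# Invented rows for Proj_Assigned.
proj_assigned = [
    {"Eno": 1, "Role": "Developer", "DOP": 12},
    {"Eno": 2, "Role": "Tester", "DOP": 6},
    {"Eno": 3, "Role": "Manager", "DOP": 14},
]

# The three operators are chained; no intermediate list is materialised.
pipeline = project(["Eno"], select(lambda r: r["DOP"] > 10, scan(proj_assigned)))
first = next(pipeline)   # only the first input row has been read at this point
rest = list(pipeline)    # drain the remaining tuples
print(first, rest)       # {'Eno': 1} [{'Eno': 3}]
```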

Step 3 − Code Generation

Code generation is the final step in query optimization. It is the executable form of the
query, whose form depends upon the type of the underlying operating system. Once the
query code is generated, the Execution Manager runs it and produces the results.
