Data Warehouse: Bilal Hussain

The document provides an overview of key concepts for designing and optimizing a data warehouse including dimensional modeling, ETL processes, indexing, partitioning, parallelism, compression and query optimization techniques. It outlines a course plan covering these topics and provides examples of how to implement indexing, partitioning, parallelism and compression to improve query performance and reduce the physical storage requirements in a data warehouse.

Uploaded by

Daneil Radcliffe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views

Data Warehouse: Bilal Hussain

Uploaded by

Daneil Radcliffe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

Data Warehouse

Bilal Hussain
• Course Outlines:
1. Introduction & Background.
2. De-Normalization.
3. OLAP & Dimensional Modeling.
4. ETL and Data Quality Management (DQM).
5. Database Performance (Parallelism, Partitioning).
6. ETL Implementation using ODI.
7. Data Visualization using OBIEE.
8. Project (Design Data warehouse for any organization using any
ETL and BI Tool).
Course
Week #
plan: Assignment # Quiz No
1
2
3 Assign #1 Quiz # 1
4
5
6 Assign #2 Quiz # 2
7
8
9 Mid-Term
10-23-12-2021 Assign #3 Quiz # 3
11-30-12-2021
12-06-01-2022 Assign #4 Quiz # 4
13-13-01-2022
14-20-01-2022
15-27-01-2022
16-03-02-2022 Final Exam
How to improve response time in DWH.
• Indexes
• Partitioning
• Parallelism
• Compression
• Minimize bottleneck.
Types of Queries
• Point Query.
• Select count(*) from emp where empno=1;
• Full Table Scan.
• Select count(*) from emp;
• Range.
• Select count(*) from emp where hiredate between firstdate and seconddate;
What is Index
• An index is a database structure/segment that provides quick lookup
of data in a column or columns of a table.
Where Index can be used.
• How many customers I have in Islamabad.
• What is total sale amount in Jan.
• Total Students in MS-CS.
• I/O Bottleneck.
Types of Indexes.
• B-Tree
• Bitmap
• Function Based Index.
• Partitioned Index.
• Clustered Index.
• Index organized Tables.
What is Table Partitioning?
• Partitioning enables tables and indexes to be subdivided into individual
smaller pieces. Each piece of the database object is called a partition. A
partition has its own name, and may optionally have its own storage
characteristics. From the perspective of a database administrator, a
partitioned object has multiple pieces that can be managed either
collectively or individually. This gives the administrator considerable
flexibility in managing a partitioned object. However, from the perspective
of the application, a partitioned table is identical to a non-partitioned table;
no modifications are necessary when accessing a partitioned table using
SQL DML commands. Logically, it is still only one table and any application
can access this one table as they do for a non-partitioned table.
Types of Partitioning
• List
• Range
• Hash
Partition Pruning.
• Partition pruning is an essential performance feature for Data
warehouse. In partition pruning, the optimizer analyzes from and
where clauses in SQL statements to eliminate unneeded partitions.
Example
• CREATE TABLE Sales_part
• ( "PRODKEY" NUMBER(5,0), "PERIODKEY" NUMBER(10,0),
• "INVNBR" NUMBER(10,0), "CUSTKEY" NUMBER(5,0),
• "DWACOSTEXTND" FLOAT(126),"REPCOSTEXTND" FLOAT(126),
• "ACTLEXTND" FLOAT(126), "UNITSHPD" NUMBER(10,0),
• "UNITORDD" NUMBER(10,0), "NETWGHTSHPD" FLOAT(126),
• "CMDOLRS" FLOAT(126), "NULL_FIELD" NUMBER(10,0)
• )
• partition by range (prodkey)
•(
• partition p01 values less than (1094),
• partition p02 values less than (9999)
• );
• Insert into sales_part select * from sales;commit;
What is Parallelism?
• Parallelism is the idea of breaking down a task so that, instead of one
process doing all of the work in a query, many processes do part of
the work at the same time. An example of this is when 12 processes
handle 12 different months in a year instead of one process handling
all 12 months by itself. The improvement in performance can be quite
high.
Parallelism Advantages.
Parallel execution improves processing for
• Large Table scans and joins.
• Creation of large indexes.
• Partitioned index scans.
• Bulk inserts, updates, and deletes.
• Aggregations and copying.
Query Example:

• set autotrace on;

• select /*+ PARALLEL(5) */
• count(*)
• from sales_compressed s
• inner join d1_products p
• on s.prodkey=p.productkey
• where suppliercode=2300;
What is Compression?
• Database compression is a set of techniques that reorganizes
database content to save on physical storage space and improve
performance.

• 111000000111110011111
• 13#06#15#02#15
Example:
• CREATE TABLE Sales_Compressed
• ( "PRODKEY" NUMBER(5,0), "PERIODKEY" NUMBER(10,0),
• "INVNBR" NUMBER(10,0), "CUSTKEY" NUMBER(5,0),
• "DWACOSTEXTND" FLOAT(126),"REPCOSTEXTND" FLOAT(126),
• "ACTLEXTND" FLOAT(126), "UNITSHPD" NUMBER(10,0),
• "UNITORDD" NUMBER(10,0), "NETWGHTSHPD" FLOAT(126),
• "CMDOLRS" FLOAT(126), "NULL_FIELD" NUMBER(10,0)
• )
• COMPRESS for oltp;
Space and Query Speed Comparison.
• set autotrace on;
• select count(*)
• from sales s
• inner join d1_products p
• on s.prodkey=p.productkey
• where suppliercode=2300;
• 66sec – 216
• Set autotrace on;
• select count(*)
• from sales_compressed s
• inner join d1_products p
• on s.prodkey=p.productkey
• where suppliercode=2300;
• 9sec –168 – 13%
End

StableNet Administrator Manual
100% (1)
StableNet Administrator Manual
122 pages
Software Engineering PDF
50% (2)
Software Engineering PDF
162 pages
Python Question Bank-BCCA SEM6
67% (3)
Python Question Bank-BCCA SEM6
7 pages
Oracle DBA Basics 2
No ratings yet
Oracle DBA Basics 2
19 pages
Parallel Databases
No ratings yet
Parallel Databases
19 pages
Oracle Partitioning
100% (1)
Oracle Partitioning
3 pages
Oracle Partitioning
No ratings yet
Oracle Partitioning
6 pages
NYOUG08 Part
No ratings yet
NYOUG08 Part
10 pages
18 Partitioned Tables and Indexes: Introduction To Partitioning
No ratings yet
18 Partitioned Tables and Indexes: Introduction To Partitioning
84 pages
Partitioned Tables and Indexes: Introduction To Partitioning
No ratings yet
Partitioned Tables and Indexes: Introduction To Partitioning
18 pages
O9ir2 Partitioning TWP
No ratings yet
O9ir2 Partitioning TWP
7 pages
20762C 03
No ratings yet
20762C 03
29 pages
Partitioning
No ratings yet
Partitioning
224 pages
Partitioned Tables and Indexes
100% (1)
Partitioned Tables and Indexes
24 pages
SQL Interview Success From Beginner To Pro
From Everand
SQL Interview Success From Beginner To Pro
Shana
No ratings yet
3. Table Optimizations
No ratings yet
3. Table Optimizations
31 pages
Five Tuning Tips For Your Data Warehouse
No ratings yet
Five Tuning Tips For Your Data Warehouse
46 pages
Partitioning in Oracle Database 10g: An Oracle White Paper Feburary, 2005
No ratings yet
Partitioning in Oracle Database 10g: An Oracle White Paper Feburary, 2005
7 pages
Oracle Tables Defragmentation
No ratings yet
Oracle Tables Defragmentation
10 pages
Parallel Aware Optimizer - Provides All Possible Access Paths Matured Optimizer - Select Best Possible Path Out of All Possible Access
No ratings yet
Parallel Aware Optimizer - Provides All Possible Access Paths Matured Optimizer - Select Best Possible Path Out of All Possible Access
16 pages
Performance Tuning: SAP HANA Course
No ratings yet
Performance Tuning: SAP HANA Course
3 pages
Performance Tunning
No ratings yet
Performance Tunning
7 pages
Partitioning With Oracle 11G: Bert Scalzo, Domain Expert, Oracle Solutions
No ratings yet
Partitioning With Oracle 11G: Bert Scalzo, Domain Expert, Oracle Solutions
45 pages
Microsoft - Strategies For Partitioning Relational Data Warehouses in SQL Server
No ratings yet
Microsoft - Strategies For Partitioning Relational Data Warehouses in SQL Server
27 pages
IO Parallelism
No ratings yet
IO Parallelism
4 pages
Indexes and Frgmentation and Stats
No ratings yet
Indexes and Frgmentation and Stats
7 pages
Learn SQL: Database Management Basics
From Everand
Learn SQL: Database Management Basics
Kiet Huynh
No ratings yet
Oracle Partitioning - Yesterday, Today, and Tomorrow
100% (1)
Oracle Partitioning - Yesterday, Today, and Tomorrow
53 pages
Oracle 11g Partitioning
No ratings yet
Oracle 11g Partitioning
11 pages
Oracle Partitioning For Developers
No ratings yet
Oracle Partitioning For Developers
70 pages
CH14
No ratings yet
CH14
43 pages
SQL Tuning
No ratings yet
SQL Tuning
27 pages
Database Performance Optimization. Andrey Avtomonov
100% (1)
Database Performance Optimization. Andrey Avtomonov
26 pages
Mastering DuckDB: High-Performance Analytics Made Easy
From Everand
Mastering DuckDB: High-Performance Analytics Made Easy
Robert Johnson
No ratings yet
Oracle Partitioning Interview Questions and Answers
0% (1)
Oracle Partitioning Interview Questions and Answers
3 pages
Basics of Partitioning
100% (1)
Basics of Partitioning
2 pages
Fundamentals of Database Systems: (Parallel and Distributed Databases)
No ratings yet
Fundamentals of Database Systems: (Parallel and Distributed Databases)
46 pages
Oracle 12 C
100% (1)
Oracle 12 C
87 pages
Internal Tables: Why We Need Internal Table
No ratings yet
Internal Tables: Why We Need Internal Table
5 pages
PracticalPartitioning v2
No ratings yet
PracticalPartitioning v2
76 pages
Unit I
No ratings yet
Unit I
43 pages
Oracle Database Performance and Tuning
No ratings yet
Oracle Database Performance and Tuning
69 pages
Oracle Database Performance and Tuning
100% (1)
Oracle Database Performance and Tuning
69 pages
Things You Always Wanted To Know About Oracle Partitioning
No ratings yet
Things You Always Wanted To Know About Oracle Partitioning
43 pages
What Is Mutating Trigger?: Test Empno Test 1001
No ratings yet
What Is Mutating Trigger?: Test Empno Test 1001
24 pages
Tuning
No ratings yet
Tuning
20 pages
The Database Knowledgebase On The Web: Database Wisdom: General - Oracle 11g Partitioni..
No ratings yet
The Database Knowledgebase On The Web: Database Wisdom: General - Oracle 11g Partitioni..
4 pages
Chapter_4_Data Warehouse Indexes
No ratings yet
Chapter_4_Data Warehouse Indexes
11 pages
Erfo Rma Nce With L5. 1 An D5. 5 Tion Ing: Giuseppe Maxia Mysql Community Team Lead Sun Microsystems
No ratings yet
Erfo Rma Nce With L5. 1 An D5. 5 Tion Ing: Giuseppe Maxia Mysql Community Team Lead Sun Microsystems
103 pages
Segmentspace Management
No ratings yet
Segmentspace Management
15 pages
SQL Server 2014 Development Essentials
From Everand
SQL Server 2014 Development Essentials
Basit A. Masood-Al-Farooq
4.5/5 (2)
Partitioning Fundamentals
No ratings yet
Partitioning Fundamentals
1 page
Oracle 12c Partitioned and Subpartitioned Tables
No ratings yet
Oracle 12c Partitioned and Subpartitioned Tables
24 pages
Getting To Know The Ins and Outs of Oracle Partitioning in Oracle Database 11g
No ratings yet
Getting To Know The Ins and Outs of Oracle Partitioning in Oracle Database 11g
48 pages
Parallel_Database_QA_Detailed
No ratings yet
Parallel_Database_QA_Detailed
2 pages
Administering Microsoft Azure SQL Solutions DP 300
From Everand
Administering Microsoft Azure SQL Solutions DP 300
Manish Soni
No ratings yet
DBMS Interview
No ratings yet
DBMS Interview
6 pages
Oracle SQL High Performance Tuning: Guy Harrison Director, R&D Melbourne
100% (1)
Oracle SQL High Performance Tuning: Guy Harrison Director, R&D Melbourne
56 pages
SQL Tuning
No ratings yet
SQL Tuning
69 pages
New Features in Oracle Database 12c
No ratings yet
New Features in Oracle Database 12c
3 pages
Column Store Indices and Batch Processing
No ratings yet
Column Store Indices and Batch Processing
2 pages
03 - Partitioning Basics
No ratings yet
03 - Partitioning Basics
33 pages
Deep dive Dynamo DB
No ratings yet
Deep dive Dynamo DB
88 pages
Data Warehouse Lec-3
No ratings yet
Data Warehouse Lec-3
38 pages
Data Warehouse: Bilal Hussain
No ratings yet
Data Warehouse: Bilal Hussain
34 pages
3 - Block Ciphers and The Data Encryption Standard Part 2
No ratings yet
3 - Block Ciphers and The Data Encryption Standard Part 2
33 pages
1 - Introduction To Number Theory
No ratings yet
1 - Introduction To Number Theory
45 pages
2 - Finite Fields
No ratings yet
2 - Finite Fields
23 pages
4 - Advance Encryption Standard
No ratings yet
4 - Advance Encryption Standard
33 pages
3 - Block Ciphers and The Data Encryption Standard
No ratings yet
3 - Block Ciphers and The Data Encryption Standard
32 pages
Logistic Regression
100% (1)
Logistic Regression
21 pages
Agglomerative Hierarchical Clustering
No ratings yet
Agglomerative Hierarchical Clustering
22 pages
Lecture Decision Trees
No ratings yet
Lecture Decision Trees
46 pages
K Means Clustering Lecture
No ratings yet
K Means Clustering Lecture
32 pages
K Nearest Neighbors: Probably A Duck."
No ratings yet
K Nearest Neighbors: Probably A Duck."
14 pages
Nemo File Format 2.25 PDF
No ratings yet
Nemo File Format 2.25 PDF
642 pages
Specifications For The IManager U2000 Northbound Interface 07 (20170808)
No ratings yet
Specifications For The IManager U2000 Northbound Interface 07 (20170808)
23 pages
Kpi Type KPI 3G: Ps Volume Hsdpa Ps Volume Hsupa
No ratings yet
Kpi Type KPI 3G: Ps Volume Hsdpa Ps Volume Hsupa
12 pages
Association - Aggregation and Composition OOPs
No ratings yet
Association - Aggregation and Composition OOPs
8 pages
Instant Ebooks Textbook (Ebook PDF) Microsoft Office 365: in Practice, 2019 Edition Download All Chapters
100% (7)
Instant Ebooks Textbook (Ebook PDF) Microsoft Office 365: in Practice, 2019 Edition Download All Chapters
51 pages
MS Office Guide
No ratings yet
MS Office Guide
10 pages
PP (5th) Dec2022
No ratings yet
PP (5th) Dec2022
2 pages
Demo 10 Disha Essential Static GK For Competitive Exams 2023 Edition English Medium
No ratings yet
Demo 10 Disha Essential Static GK For Competitive Exams 2023 Edition English Medium
10 pages
Introduction to Android Development
No ratings yet
Introduction to Android Development
11 pages
ARIETTA50 - Catalog (E) With Marks
No ratings yet
ARIETTA50 - Catalog (E) With Marks
2 pages
Can PPT
No ratings yet
Can PPT
13 pages
Session 11-Introduction To ERP PDF
No ratings yet
Session 11-Introduction To ERP PDF
25 pages
Final Project Report CSC186
No ratings yet
Final Project Report CSC186
20 pages
Unit 3
No ratings yet
Unit 3
13 pages
Unit-1 Java
100% (1)
Unit-1 Java
23 pages
Booth Multiplier
No ratings yet
Booth Multiplier
54 pages
Fortimanager v7.2.6 Release Notes
No ratings yet
Fortimanager v7.2.6 Release Notes
59 pages
PM Configuration
No ratings yet
PM Configuration
13 pages
Brksec 2464 1
No ratings yet
Brksec 2464 1
143 pages
BCA-1 FOC&IT Lab
No ratings yet
BCA-1 FOC&IT Lab
4 pages
A 3 M 50 Aa
No ratings yet
A 3 M 50 Aa
2 pages
11.2.3.10 Packet Tracer - Explore A NetFlow Implementation Instruc
No ratings yet
11.2.3.10 Packet Tracer - Explore A NetFlow Implementation Instruc
9 pages
Week 2 Quiz-2
No ratings yet
Week 2 Quiz-2
2 pages
TP2 Linux
No ratings yet
TP2 Linux
5 pages
State of Port 20160906
No ratings yet
State of Port 20160906
14 pages
(url to pdf)https___www.exploit-db.com_ (3)
No ratings yet
(url to pdf)https___www.exploit-db.com_ (3)
5 pages
Advanced Encryption Standard Implementation in C
No ratings yet
Advanced Encryption Standard Implementation in C
16 pages
Amarisoft Ue Simbox
No ratings yet
Amarisoft Ue Simbox
51 pages
Parameter Declarations
No ratings yet
Parameter Declarations
1,338 pages
Scepter of Goth History
100% (2)
Scepter of Goth History
52 pages
IJCRT2310264
No ratings yet
IJCRT2310264
8 pages

Data Warehouse: Bilal Hussain

Uploaded by

Data Warehouse: Bilal Hussain

Uploaded by

Data Warehouse

• set autotrace on;

You might also like