Principal Component Analysis of Non-stationary Time Series Data

Nabeel Ahmed, Dr Zahid Hussain Shaikh.

1. Abstract
This article explores the use of Principal Component Analysis (PCA) on non-stationary time series data. Understanding
the underlying patterns in such data matters because dynamic and complex datasets are increasingly common in areas
such as healthcare, environmental studies, and finance. The study uses PCA as a statistical method to identify the main
characteristics of non-stationary time series data and to reduce their complexity. PCA transforms the original variables
into new variables that are uncorrelated with one another [1]; these new variables are then used to find the main
patterns of variation within the dataset. We describe the preparatory and analytical stages for a dynamic dataset, the
Karachi Stock Exchange 100 Index (KSE-100), and review the theoretical foundations of PCA in the context of moving
averages.

Keywords: PCA, Non-stationary, VAR, KSE-100.

2. Introduction
Financial markets change often and quickly, which presents investors, policymakers, and analysts with both opportunities
and challenges. Studying and predicting them requires advanced tools and methodologies, because their turbulence
depends on many factors such as economic statistics, political events, and investor sentiment. The KSE 100 Index, a
stock market index representing Pakistan's major stock exchange, is an important dataset for investigating the
properties of non-stationary time series in financial markets. This article proposes an innovative approach to analyzing
and forecasting values of the KSE 100 Index using Principal Component Analysis (PCA) in conjunction with the Vector
Autoregression (VAR) model.

Non-stationary time series data pose a considerable challenge for conventional time series analysis techniques. More
often than not, real financial time series do not satisfy the stationarity that common methods presuppose, which leads
to imprecise predictions and evaluations. To address this challenge, our study employs Principal Component Analysis
(PCA), which offers a simple method for reducing dimensionality [1], to identify and extract the important features of
the KSE 100 Index data. Converting the original dataset into a smaller number of linearly uncorrelated variables
simplifies the analysis while improving its quality. This allows for a more manageable and informative examination of
the data's fundamental structure and behavior. Building on the PCA step, we then fit a VAR model to the processed
dataset. Because a VAR model captures linear associations among several time series variables, including strongly
correlated ones, it is well suited to forecasting financial indices. We use the transformed components as the input
variables of the VAR model so that the non-stationarity of the original KSE 100 Index data is carried into the
forecasting step, with the aim of improving the precision and dependability of our forecasts.

3. Literature Review
This project uses an AR(1) model, a VAR model, and Principal Component Analysis. We give concise
explanations of each of these models and techniques.
Autoregressive (AR) Model [2]
An AR(1) model, or autoregressive model of order one, is a basic time series model that represents the
current value of a variable as a linear function of its value one time period earlier. Mathematically, an
AR(1) model may be expressed as:

$$X_t = \phi X_{t-1} + \epsilon_t$$

where

• $X_t$ is the value of the time series at time $t$,
• $\phi$ is the autoregressive parameter, representing the influence of the previous time on the current one,
• $X_{t-1}$ is the value of the time series at the previous time (lag 1),
• $\epsilon_t$ is the white noise or error term at time $t$, representing random shocks or disturbances.

The autoregressive parameter $\phi$ determines the strength and direction of the relationship between the
immediate past value and the current value. If $|\phi| < 1$, the model is stable and the influence of earlier
values decays over time. The AR(1) model is widely used in time series analysis to forecast processes that
depend linearly on their immediately preceding value.
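
As a minimal sketch of the AR(1) equation above, the following Python snippet simulates a series and recovers the
autoregressive parameter by least squares. The value 0.6, the sample size, the seed, and the function names are
illustrative assumptions, not values or code from this study.

```python
import random

def simulate_ar1(phi, n, seed=0):
    """Simulate X_t = phi * X_{t-1} + eps_t with standard normal white noise."""
    rng = random.Random(seed)
    x = [0.0]
    for _ in range(n - 1):
        x.append(phi * x[-1] + rng.gauss(0.0, 1.0))
    return x

def estimate_phi(x):
    """Least-squares estimate of phi (no intercept): sum(x_t x_{t-1}) / sum(x_{t-1}^2)."""
    num = sum(x[t] * x[t - 1] for t in range(1, len(x)))
    den = sum(x[t - 1] ** 2 for t in range(1, len(x)))
    return num / den

# Illustrative run: |phi| < 1, so the simulated process is stable.
series = simulate_ar1(phi=0.6, n=5000, seed=42)
phi_hat = estimate_phi(series)
print(f"estimated phi: {phi_hat:.3f}")
```

With a long enough series the estimate should land close to the value used in the simulation.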
VAR model [3]
A Vector Autoregressive (VAR) model is a statistical model used to analyze the interdependencies among
several time series variables. It extends the univariate AR model to the multivariate case, allowing
numerous time series variables to be analyzed simultaneously.

Below is an analysis of the fundamental elements and attributes of a VAR model:

Vector: The name "vector" signifies that the model handles multiple time series variables. These variables
are arranged in a vector in which each element is an individual variable.

Autoregressive: Every variable in a VAR model is regressed on its own lagged values as well as on the lagged
values of the other variables in the system, extending the univariate autoregressive model.

Multivariate: VAR models are multivariate and can process several related variables simultaneously,
allowing their interdependence and feedback effects to be analyzed.

Order: The order p of a VAR model is the number of lagged values of each variable included in every
equation.

Parameter estimation: Parameter estimation in a VAR model is often accomplished through approaches such as
ordinary least squares (OLS), maximum likelihood estimation (MLE), or Bayesian methods.

Impulse response functions: Impulse response functions are computed using VAR models to analyze the
dynamic reactions of the system to shocks.
Granger Causality [4]: VAR models may be used to examine Granger causality, a method that evaluates
whether previous values of one variable provide valuable insights into forecasting another variable.

A Vector Autoregressive (VAR) model may be formally defined as follows:

Let p be the order of the VAR model and k the number of time series variables. For each variable $y_i$,
where $i = 1, 2, 3, \ldots, k$, the p-order VAR model can be written as:

$$y_{i,t} = c_i + \sum_{j=1}^{k} \sum_{l=1}^{p} A^{l}_{i,j}\, y_{j,t-l} + \epsilon_{i,t}$$

where,
$y_{i,t}$ represents the value of the ith variable at time t.
$c_i$ is the intercept term for the ith variable.
$A^{l}_{i,j}$ is the coefficient on the lth lag of the jth variable in the equation for the ith variable.
p is the number of lagged values in the VAR model.
$\epsilon_{i,t}$ is the ith variable's error term at time t, capturing the part of $y_{i,t}$ that is not explained
by the lagged values of the variables.
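
To make the double sum concrete, here is a small Python sketch that evaluates the VAR(p) equation for a one-step
forecast, with the error term set to its mean of zero. The function name and the coefficient values are hypothetical
and chosen only for illustration; they are not estimates from the KSE-100 data.

```python
def var_one_step(c, A, history):
    """One-step-ahead VAR(p) forecast with the error term set to zero.

    c       : list of k intercepts c_i
    A       : list of p coefficient matrices; A[l-1][i][j] multiplies y_{j,t-l}
    history : list of past observation vectors, most recent last
    """
    k, p = len(c), len(A)
    if len(history) < p:
        raise ValueError("need at least p past observations")
    forecast = list(c)
    for l in range(1, p + 1):
        lagged = history[-l]  # the vector y_{t-l}
        for i in range(k):
            for j in range(k):
                forecast[i] += A[l - 1][i][j] * lagged[j]
    return forecast

# Hypothetical VAR(2) with k = 2 variables.
c = [0.1, -0.2]
A = [
    [[0.5, 0.1], [0.0, 0.3]],  # lag-1 coefficients
    [[0.2, 0.0], [0.1, 0.1]],  # lag-2 coefficients
]
history = [[1.0, 2.0], [3.0, 4.0]]  # y_{t-2}, then y_{t-1}
forecast = var_one_step(c, A, history)
print(forecast)
```

Each entry of the forecast is the intercept plus the contributions of every variable at every lag, which is exactly
the structure of the equation above.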

Principal Component Analysis [1]

Principal component analysis (PCA) is a statistical method for extracting the key variables from a complex
dataset. It is frequently used in domains such as machine learning, pattern recognition, and image
analysis. The main objective of PCA is to transform a dataset of possibly correlated variables into a new set of
uncorrelated variables, called principal components. Principal components are constructed as linear
combinations of the original variables and are ranked by the amount of variation in the data that each
explains.
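
The transformation described above can be sketched with an eigendecomposition of the sample covariance matrix. This
is a minimal illustration on synthetic data, assuming NumPy is available; the function name and the data are
assumptions made for the example and do not reproduce the study's computation.

```python
import numpy as np

def pca(X):
    """PCA via eigendecomposition of the sample covariance matrix.

    X has shape (n_samples, n_variables).
    Returns (scores, components, explained_variance), ordered by decreasing variance.
    """
    Xc = X - X.mean(axis=0)                 # center each variable
    cov = np.cov(Xc, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    scores = Xc @ eigvecs                   # the principal components
    return scores, eigvecs, eigvals

# Synthetic data: two variables driven by one common factor, plus one independent variable.
rng = np.random.default_rng(0)
z = rng.normal(size=(500, 1))
X = np.hstack([
    z + 0.1 * rng.normal(size=(500, 1)),
    2 * z + 0.1 * rng.normal(size=(500, 1)),
    rng.normal(size=(500, 1)),
])
scores, components, explained = pca(X)
score_cov = np.cov(scores, rowvar=False)    # diagonal up to rounding error
print(explained / explained.sum())
```

The covariance matrix of the scores is diagonal, which is the sense in which the new variables are uncorrelated.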

4. Methodology
Data Preprocessing
Identify and acquire non-stationary time series data from suitable sources. Clean and prepare the data,
including handling missing values and outliers. Detrend the time series using a suitable procedure to bring
it closer to stationarity.
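
The text does not name a specific detrending procedure. One common choice is first differencing, sketched below
under that assumption; the function name and the toy series are hypothetical.

```python
def first_difference(series):
    """Return the first differences x_t - x_{t-1}, which remove a linear trend."""
    return [b - a for a, b in zip(series, series[1:])]

# Toy series with a pure linear trend of slope 2.
trend = [100.0 + 2.0 * t for t in range(6)]
diffed = first_difference(trend)
print(diffed)
```

After differencing, the linear trend becomes a constant, and the series is one observation shorter.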

Data Engineering
We started with five variables and then created the following additional variables.

a) P7: the price one week later.

b) Momentum: the rate of change of a security's price, that is, the speed at which the price is
changing.
$$M = P_7 - P$$
c) Volatility:
$$V = \mathrm{Std}(P)$$
d) Price Return:
$$R = \frac{P_{close} - P_{open}}{P_{open}} \times 100$$
e) Range:
$$\mathrm{Range} = \ln(\mathrm{Open}) - \ln(\mathrm{Close})$$
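
The derived variables can be computed as in the following sketch. Several details are assumptions made for the
example because the text leaves them open: P is taken to be the closing price, "one week" is taken as 7 observations,
and volatility is taken as the sample standard deviation over a trailing 7-observation window. The function name and
the toy prices are hypothetical.

```python
import math
import statistics

def engineer_features(open_, close, horizon=7, window=7):
    """Build one feature row per day t that has a full trailing window and a price at t + horizon."""
    rows = []
    for t in range(window - 1, len(close) - horizon):
        p, p7 = close[t], close[t + horizon]
        rows.append({
            "P7": p7,                                                       # price one week later
            "momentum": p7 - p,                                             # M = P7 - P
            "volatility": statistics.stdev(close[t - window + 1 : t + 1]),  # V = Std(P), trailing window
            "price_return": (close[t] - open_[t]) / open_[t] * 100.0,       # percent return, open to close
            "range": math.log(open_[t]) - math.log(close[t]),               # ln(Open) - ln(Close)
        })
    return rows

# Toy prices for illustration only.
open_ = [100.0 + i for i in range(20)]
close = [101.0 + i for i in range(20)]
rows = engineer_features(open_, close)
print(rows[0])
```

Rows near the start and end of the sample are dropped because either the trailing window or the week-ahead price is
not available for them.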

Correlation matrix
We then examine the correlations among these variables by computing their variance-covariance matrix.

Figure 1

Some of the variables are highly correlated. We dropped those and introduced new
variables that are uncorrelated.
Figure 2
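
A correlation screen of this kind can be sketched as follows, assuming NumPy is available. The 0.9 threshold, the
greedy keep-first rule, the function name, and the synthetic data are all assumptions for illustration; the text does
not state which rule was used to drop variables.

```python
import numpy as np

def drop_highly_correlated(X, names, threshold=0.9):
    """Keep a variable only if its |correlation| with every already-kept variable is below threshold."""
    corr = np.corrcoef(X, rowvar=False)
    keep = []
    for j in range(X.shape[1]):
        if all(abs(corr[j, i]) < threshold for i in keep):
            keep.append(j)
    return [names[j] for j in keep], corr

# Synthetic example: b is nearly a copy of a, while c is independent of both.
rng = np.random.default_rng(1)
a = rng.normal(size=300)
b = a + 0.01 * rng.normal(size=300)
c = rng.normal(size=300)
kept, corr = drop_highly_correlated(np.column_stack([a, b, c]), ["a", "b", "c"])
print(kept)
```

Here the near-duplicate variable is dropped and the two weakly correlated variables are kept.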

Component selection
We now select the number of components using a scree plot.

Figure 3

The scree plot indicates that three components should be retained.
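
The quantities plotted in a scree plot are the eigenvalues of the correlation matrix in decreasing order. The sketch
below computes them and the share of variance each explains, assuming NumPy is available. The data are synthetic and
built from three latent factors purely for illustration; they are not the study's data, and the function name is
hypothetical.

```python
import numpy as np

def scree_values(X):
    """Eigenvalues of the correlation matrix in decreasing order, and their share of total variance."""
    corr = np.corrcoef(X, rowvar=False)
    eigvals = np.linalg.eigvalsh(corr)[::-1]  # eigvalsh returns ascending order
    return eigvals, eigvals / eigvals.sum()

# Synthetic data: six observed variables, each a noisy copy of one of three latent factors.
rng = np.random.default_rng(2)
factors = rng.normal(size=(400, 3))
mix = np.array([
    [1, 1, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 1, 1],
], dtype=float)
X = factors @ mix + 0.1 * rng.normal(size=(400, 6))
eigvals, ratios = scree_values(X)
cumulative = np.cumsum(ratios)
print(np.round(ratios, 3))
```

In this synthetic case the explained-variance shares drop sharply after the third eigenvalue, which is the kind of
elbow a scree plot is read for.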

Principal Component Analysis (PCA)

After selecting the components, we compute their loadings.
Figure 4

PCA yields these loadings for the respective components.

Figure 5

Through this analysis, we concluded that Factor 1 (containing Price and its Range) is stationary.
We then proceeded to forecast the Price Return of the Karachi stock market, using a VAR model for
the prediction.
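
The lag order and estimation method of the VAR model are not stated here. As a hedged sketch, the following fits a
VAR(1) by ordinary least squares and produces a one-step forecast, assuming NumPy is available. The function names,
the simulated data, and the coefficient matrix are assumptions for illustration only.

```python
import numpy as np

def fit_var1(Y):
    """OLS fit of y_t = c + A y_{t-1} + e_t for Y of shape (T, k). Returns (c, A)."""
    X = np.hstack([np.ones((len(Y) - 1, 1)), Y[:-1]])  # regressors [1, y_{t-1}]
    B, *_ = np.linalg.lstsq(X, Y[1:], rcond=None)      # B has shape (k + 1, k)
    return B[0], B[1:].T

def forecast_var1(c, A, y_last):
    """One-step-ahead forecast with the error term set to zero."""
    return c + A @ y_last

# Simulate a stable bivariate VAR(1) with a known coefficient matrix.
rng = np.random.default_rng(3)
A_true = np.array([[0.5, 0.1], [0.0, 0.3]])
Y = np.zeros((3000, 2))
for t in range(1, len(Y)):
    Y[t] = A_true @ Y[t - 1] + rng.normal(size=2)

c_hat, A_hat = fit_var1(Y)
next_value = forecast_var1(c_hat, A_hat, Y[-1])
print(np.round(A_hat, 2))
```

In an application of the kind described here, the columns of Y would be the retained component scores rather than
simulated series.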

5. Results and Discussion

We forecast the results by applying our selected model, the VAR model.
The study shows that PCA is applicable to the analysis of non-stationary time series data. Its ability to
lower dimensionality while bringing out essential patterns is especially advantageous for complex and
dynamic datasets such as those in finance, healthcare, or environmental studies.

The study emphasized PCA's ability to recognize and separate the major patterns of variation in the data. This
is particularly important for non-stationary time series, since ordinary time series analysis techniques may
fail to reveal such patterns owing to the dynamic nature of the data.

For the KSE-100 index, for instance, overall trends and periodic fluctuations can be identified, which gives
investors some guidance on their investment decisions and on how to manage risk.

References
[1]. Lansangan, J. R. G., & Barrios, E. B. (2009). Principal components analysis of nonstationary time series
data. Statistics and Computing, 19, 173-187.
[2]. Hamilton, J. D. (2020). Time series analysis. Princeton University Press.
[3]. Lütkepohl, H. (2005). New introduction to multiple time series analysis. Springer Science & Business Media.
[4]. Granger, C. W. (1969). Investigating causal relations by econometric models and cross-spectral methods.
Econometrica: Journal of the Econometric Society, 424-438.
