CRSP US Stock & Indexes Database Data Descriptions Guide
CRSP US Stock & Indexes Database Data Descriptions Guide
Phone: 312.263.6400
Fax: 312.263.6430
Email: [email protected]
For further information, please visit our website at www.crsp.org or email [email protected]
CONTENTS
The Bats exchange was added with data beginning efforts to check and improve data quality both
January 24, 2012. historically and in the current update. Data corrections
to historical information are made as errors are
NASDAQ MARKETS identified and are detailed in the Release Notes that
accompany each data cut.
The NASDAQ Stock MarketSM consists of three
subsets of securities, the NASDAQ Global Market,
Historical corrections to security data may result in
Global Select Market, and the NASDAQ Capital
changes to historical CRSP index returns and levels.
Market. The Securities are tiered based on criteria
In any given year, the calendar year-end stock database
setting minimum levels for: annual income, numbers
may ship as the standard “Z-Cut”, as well as a second
of publicly traded shares, market capitalization, share
“X-Cut” if there have been substantial or significant
price, and number of market makers. A security may
corrections applied at year end. Both data cuts reflect
move between tiers as its status changes.
corrections, however, the “Z-Cut” contains routine
minor monthly corrections while the “X-Cut’s”
Prior to July 2006, the Global and Global Select
year-end data edits typically required more extensive
Markets were combined in a single tier called the
research efforts and may hold a greater potential to
NASDAQ National Market. The NASDAQ Capital
impact the CRSP indexes.
Market was called the NASDAQ SmallCap Market.
The NASDAQ National Market was initiated in April
1982 for larger and generally more actively traded DATA SOURCES
NASDAQ securities. The NASDAQ National Market
Securities had to meet higher financial and non- ORIGINAL CRSP MONTHLY DATABASE
financial criteria than other NASDAQ stocks, and
December 1925-January 1928: Commercial and
were subject to last-sale reporting. In June of 1992 the
Financial Chronicle, “Bank and Quotation Section”
regular NASDAQ segment of The NASDAQ Stock
Market was renamed The NASDAQ SmallCap Market February 1928-December 1960: Bank and Quotation
and for the first time these became subject to real-time Record, an expansion of the Bank and Quotation
price and volume reporting. Section.
The CRSPAccess NASDAQ security data include The collection and initial correction of cash dividends
6 in the original CRSP Monthly Database was performed April 1987-September 1999: Interactive Data
as follows: Services, Inc. (IDSI)
1999-present: Interactive Data Corporation
1937-1960: annual issues of Moody’s, Standard and Arca
Poor’s Dividend Records, or the annual section
of the Standard Corporation Records were used, March 2006-present: Interactive Data Corporation
Chapter 1: Background
to NASD
information was either missing from The New York
February 1986: Interactive Data Services, Inc. (IDSI)
Times or the available data were questionable, The
March 2004-present: Interactive Data Corporation
Wall Street Journal Newspaper served as CRSP’s
used as secondary source
secondary data source. For the years prior to 1933,
CRSP compared distribution data from The New SIC Codes
York Times and The Wall Street Journal with that
in the original database. CRSP determined that The Mergent was the primary source for SIC Code for
New York Times and The Wall Street Journal ex- NYSE, NYSE MKT & Arca securities from 20010824
date data were more complete and accurate than the through 2009. IDC has always been a continuous
original data. Based on these findings, CRSP made the alternate source of SIC codes, so no holes in coverage
decision to rely on The New York Times as the primary were introduced by the elimination of the Mergent
data source for distribution ex-date data from 1925 data. The differences in codes resulting from our
through 1933. change in source did not impact the CRSP Indexes.
NYSE and NYSE MKT In the December 2009 Stock Database, CRSP removed
NAICS codes provided by our source, Mergent, and
July 1962-September 1, 1972: daily price and
replaced them with NAICS codes from Interactive
dividend data provided by Standard & Poor’s Price
Data Corporation. Mergent was CRSP’s only source
Tape and Punched Card Dividend Service
for NAICS beginning 20010824. The IDCI NAICS
July 1962-March 1987: High, low, and volume data
codes begin 20040610.
provided by Interactive Data Services, Inc. (IDSI), a
subsidiary of Interactive Data Corporation (IDC)
September 1972-April 1987: Interactive Data
Corporation (IDC)
Dividend Amount in Period, Beginning Excess Returns Without Dividends vs. Index Function Code for Buy Rules, 46
Basis, 33 Index Series, Cumulative, 39
Index Function Code for Generating
Dividend Amount in Period, Ordinary, Exchange Code, End of Period, 39 Statistics, 46
Adjusted, 34
Exchange Code, End of Previous Index Function Code for Sell Rules, 46
Dividend Amount in Period, Ordinary, Period, 40
Beginning Basis, 34 Index Group Name, 47
Exchange Code, Most Recent, 40
Dividend Cash Amount, 34 Index Income Return, 47
Exchange Code - Header, 40
E. Index Ineligible Issues Flag, 47
F.
End of Group Data, 35 Index Level of Returns, 48
Factor to Adjust Price in Period, 41
End of Stock Data, 35 Index Level of Returns on Income, 48
Factor to Adjust Shares
End of Valid Data, 35 Outstanding, 41 Index Level of Returns Without
Dividends, 48
Entity Begin Date Range or Event First Date Included in List, 42
Date, 36 Index Level of Total Returns, 48
G.
Entity End Date Range, 36 Index Method Type Code, 49
Group Flag of Associated Index, End
Ex-Distribution Date, 36 of Period, 42 Index Missing Data Flag, 49
Excess Returns on Income vs. Group Flag of Associated Index, End Index Name, 50
Associated Portfolios, 36 of Previous Period, 42
Index New Issues Flag, 50
Excess Returns on Income vs. Group Flag of Associated Index, Last
Associated Portfolios, Cumulative, 37 Flag, All Periods, 43 Index Primary Link, 50
Excess Returns Without Dividends vs. Index Count Total, 45 Index Subcategory Code, 53
Associated Portfolios, 38
Index Count Used, 45 Index Subset Screening Structure, 54
Excess Returns Without Dividends vs.
Index Series, 39 Index Exception Handling Flags, 45 Index Total Return, 54
NAICs, End of Previous Period, 59 Number of Index List Types, 66 Price, Last Available Nonmissing, 74
NAICs, Most Recent, 59 Number of Index Types, 66 Primary Exchange, End of Period, 74
NASDAQ Index Code, End of Number of Periods in Calendar, 67 Primary Exchange, End of Previous
Period, 60 Period, 75
Number of Portfolio Types, 67
NASDAQ Index Code, End of Primary Exchange, Most Recent, 75
Previous Period, 61 Number of Rebalancing Types, 67
R.
NASDAQ Index Code, Most O.
Recent, 61 Record Date, 75
Object Array, 68
NASDAQ Issue Number, 61 Related Assignment Information, 75
Object Type Code, 68
NASDAQ Market Makers, End of Restriction Begin Date (Partition or
Returns on Income, Cumulative, 78 SIC Code, End of Previous Period, 85 Universe Subset Type Code (Partition
or Index Restriction), 91
Returns Without Dividends, 78 SIC Code, Most Recent, 85
V.
Returns Without Dividends, SIC Code - Header, 85
Cumulative, 79 Valid Exchange Codes in Universe
Spread Between Bid and Ask, 85 (Partition or Index Restriction), 92
S.
Statistic Average in Period, 86 Valid First Digit of Share Code
Security Status, End of Period, 79 (Partition or Index Restriction), 92
Statistic Maximum Identifier, 86
Security Status, End of Previous Valid Incorporation of Securities
Period, 79 Statistic Maximum in Period, 86 in Universe (Partition or Index
Restriction), 92
Security Status, Most Recent, 80 Statistic Median in Period, 87
Valid Second Digit of Share Code
Share Class, End of Period, 80 Statistic Minimum Identifier, 87 (Partition or Index Restriction), 93
Share Class, End of Previous Statistic Minimum in Period, 87 Valid When-Issued Securities
Period, 80 in Universe (Partition or Index
T. Restriction), 94
Share Class, Most Recent, 80
Ticker, End of Period, 88 Volume, Average, 94
Share Code Groupings for Subsets
(Partition or Index Restriction), 81 Ticker, End of Previous Period, 88 Volume, Median, 94
A. ACQUIRING PERMNO
Acquiring PERMNO is the PERMNO of another security
ACQUIRING PERMCO
linked to a distribution where a stock was received in a spin-
Acquiring PERMCO is the PERMCO of another company off, exchange, merger, or other distribution event. It can also
linked to a distribution. If the Acquiring PERMNO is link to a security that was acquired in a merger causing a
nonzero and represents an associated security, Acquiring shares increase.
PERMCO is set to the PERMCO of that security. If
Acquiring PERMNO is less than 1000, then Acquiring Acquiring PERMNO is set to a number less than 1000 if
PERMCO can still be set. In this case, it represents a link to inapplicable or unknown. If multiple distributions exist with
a company tracked by CRSP rather than a specific issue. For the same Distribution Code and Ex-Distribution Date, they
example, if a company pays cash to shareholders in a merger, are numbered in the Acquiring PERMNO field. Data in this
then the Acquiring PERMCO is set to the PERMCO of that field are incomplete prior to 1985. Acquiring PERMNO may
company. point to four-digit securities. Data for these securities are
not included in the databases. Acquiring PERMNO values
Acquiring PERMCO is zero if not applicable, unknown, or between 1 and 9 do not represent securities. These values
associated with a company not tracked by CRSP. Data in this as used to distinguish multiple distribution records with the
field is incomplete prior to 1985. same Distribution Code on the same Ex-Distribution Date.
Amount After Delisting is the value of a security Unit of Item Set (no. of bytes)
after it delists from an exchange. The amount can DATE RANGE AVAILABILITY
Daily -
be either an off-exchange price, an off-exchange
Monthly -
price quote, or the sum of a series of distribution
CHAPTER 2: Data Definitions
ts_print Daily Usage n/a Primary Concepts Base CRSPAccess Data Structures, Time
Series Objects, Event Array Objects,
ts_print Monthly Usage n/a
Header Objects
stk_print Option(s) /de
Data Type integer number
C USAGE
Unit of Item Code
Object delist_arr
DATE RANGE AVAILABILITY
Array delist[ ]
Daily -
Element dlamt
Monthly -
FORTRAN-95 USAGE
DATABASE AVAILABILITY AND UTILITY USAGE
Type or Subtype delist_arr
Database Formats CRSPAccess
Member and/or Array delist()
Product Types STK, IND
Element dlamt
ts_print Daily Usage n/a
ts_print Monthly Usage n/a
stk_print or ind_print n/a
ARRAY STRUCTURE SIZE Option(s)
Array Structure Size is the number of bytes needed C USAGE
in each structure element for this array type in a Object CRSP_*
CRSPAccess object structure. Array n/a
Element arrtype
GENERAL INFORMATION FORTRAN-95 USAGE
Primary Concepts Base CRSPAccess Data Structures, Time Type or Subtype crsp_ts or crsp_array
Series Objects, Event Array Objects, Member and/or Array n/a
Header Objects
Element arrtype
13 ASK ADJUSTED, END OF PERIOD ASK, END OF PERIOD
Category: Prices Category: Prices
Data Type: Floating Point Data Type: Floating Point
Description: Description:
CHAPTER 2: Data Definitions
• Daily — Closing ask on the trading date being • Daily — Closing ask on the trading date being
accessed, adjusted for distributions. accessed.
• Monthly — Closing ask on the last trading date of • Monthly — Closing ask on the last trading date of
the month of the period being accessed, adjusted the month of the period being accessed.
for distributions.
Extended Information
DATE RANGE AVAILABILITY
Ask is available both daily and monthly for all
Daily 1925
securities on the NYSE, NYSE MKT, NASDAQ, and
Monthly 1925
Arca. Ask prices for NASDAQ are handled a little
DATABASE AVAILABILITY AND PRODUCT TYPES
differently than for NYSE/NYSE MKT and outlined as
Database Formats CRSPAccess
follows:
Product Types STK
TS_PRINT/TSQUERY USAGE NASDAQ
Daily ITEMID adjask
Ask is available for issues trading on the NASDAQ
Stock and Index Data Description Guide
for securities when no closing price is available. the day, or the closing ask if trading price not
Between December 31, 1925 and February 23, 1942, available, adjusted for distributions. Ask identified
a continuous series of ask data are available whether by a leading dash -.
or not a closing price is available. Between February • Monthly — Highest trading price during the
24, 1942 and December 27, 1992, ask is available only month, or the highest bid-ask spread if trading
in cases when a closing price is missing. Beginning price not available, adjusted for distributions. Bid-
December 28, 1992, a continuous series of ask data are ask spreads identified by preceding dash -.
available.
DATE RANGE AVAILABILITY
DATE RANGE AVAILABILITY
Daily 1925
Daily 1925
Monthly 1925
Monthly 1925
DATABASE AVAILABILITY AND PRODUCT TYPES
DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess
Database Formats CRSPAccess
Product Types STK
Product Types STK
TS_PRINT/TSQUERY USAGE
Stock and Index Data Description Guide
TS_PRINT/TSQUERY USAGE
Daily ITEMID adjaskhi
Daily ITEMID ask
Monthly ITEMID madjaskhi
Monthly ITEMID mask
Header Adjaskhi
Header Ask
SUBNO 0
SUBNO 0
SUBNO INDNO
DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess
Product Types STK
TS_PRINT/TSQUERY USAGE ASSOCIATED INDEX RETURNS ON INCOME
Daily ITEMID askhi
Category: Returns Related to an Index
Monthly ITEMID maskhi
Header Askhi Data Type: Floating Point
SUBNO 0
Description: Returns on income only of an index that
a user selects to be associated with a security or group
of securities.
ASSOCIATED INDEX RETURNS DATE RANGE AVAILABILITY
Category: Returns Related to an Index Daily Monthly
1925 1925
Data Type: Floating Point
DATABASE AVAILABILITY AND PRODUCT TYPES
Description: Total returns of an index that a user Database Formats Product Types
selects to be associated with a security or group of CRSPAccess STK
securities. TS_PRINT/TSQUERY USAGE
Daily ITEMID indiret
DATE RANGE AVAILABILITY
Monthly ITEMID mindiret
Daily 1925
Header Indiret
Monthly 1925
SUBNO INDNO
DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess
Product Types STK
TS_PRINT/TSQUERY USAGE
Daily ITEMID indtret
Monthly ITEMID mindtret
Header Indtret
SUBNO INDNO
16 ASSOCIATED INDEX RETURNS ON INCOME, ASSOCIATED INDEX RETURNS WITHOUT
CUMULATIVE DIVIDENDS, CUMULATIVE
Category: Returns Related to an Index Category: Returns Related to an Index
Data Type: Floating Point Data Type: Floating Point
CHAPTER 2: Data Definitions
Description: Compounded return, on income only, Description: Compounded price appreciation only,
of an index that a user selects to be associated with a of an index that a user selects to be associated with a
security or group of securities. Each period in the time security or group of securities. Each period in the time
series contains a cumulative return since the beginning series contains a cumulative return since the beginning
period. period.
DATE RANGE AVAILABILITY DATE RANGE AVAILABILITY
Daily 1925 Daily 1925
Monthly 1925 Monthly 1925
DATABASE AVAILABILITY AND PRODUCT TYPES DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess Database Formats CRSPAccess
Product Types STK Product Types STK
TS_PRINT/TSQUERY USAGE TS_PRINT/TSQUERY USAGE
Daily ITEMID cumindiret Daily ITEMID cumindaret
Monthly ITEMID mcumindiret Monthly ITEMID mcumindaret
Stock and Index Data Description Guide
Description: Returns on income only of a portfolio DATABASE AVAILABILITY AND UTILITY USAGE
Begin of Valid Data is the index of the first calendar Header Adjbid
SUBNO 0
period with valid data in a time series. If no data of
this type are available, it is set to zero. The Calendar
Trading Date at this index is the date of the first
CHAPTER 2: Data Definitions
calendar period with data of this type. BID ADJUSTED, LAST AVAILABLE
GENERAL INFORMATION NONMISSING
Primary Concepts Base CRSPAccess Data Structures, Time Category: Prices
Series Objects
Data Type integer number
Data Type: Floating Point
Unit of Item Array index Description: Daily — Last available non-missing closing
DATE RANGE AVAILABILITY bid as of the trading date being accessed, adjusted for
Daily 1925 distributions.
Monthly 1925
DATABASE AVAILABILITY AND UTILITY USAGE
Monthly — Last available non-missing month-end
Database Formats CRSPAccess
closing bid as of the trading date being accessed,
Product Types STK, IND
adjusted for distributions.
ts_print Daily Usage n/a DATE RANGE AVAILABILITY
ts_print Monthly Usage n/a Daily 1925
Stock and Index Data Description Guide
Bid identified by a leading dash -. CRSPAccess calendar structure needed to assign the
time to data in a time series array.
Monthly — Lowest trading price during the month, or
the lowest bid-ask spread if trading price not available. GENERAL INFORMATION
Bid-ask spreads identified by preceding dash -. Primary Concepts Base CRSPAccess Data Structures,
Calendar Objects, Time Series Objects
Extended Information: Daily: Bid or Low Price is the Data Type structure
lowest trading price during the day or the closing bid Unit of Item Array index
price on days when the closing price is not available. DATE RANGE AVAILABILITY
The field is set to zero if no Bid or Low Price is Daily 1925
available. Monthly 1925
DATABASE AVAILABILITY AND UTILITY USAGE
Daily trading prices for the NASDAQ National Market Database Formats CRSPAccess
securities were first reported November 1, 1982. Daily Product Types STK, IND
trading prices for The NASDAQ SmallCap Market ts_print Daily Usage n/a
were first reported June 15, 1992. Therefore, Bid ts_print Monthly Usage n/a
or Low Price for NASDAQ securities is always a bid stk_print or ind_print Option(s) n/a
before these dates. C USAGE
Object CRSP_TIMESERIES
Monthly: Monthly files contain the lowest daily Price Array n/a
or Bid/Ask Average during the month. The field is Element cal
set to zero when no Price or Bid/Ask Average was FORTRAN-95 USAGE
available during the month a value is found for Type or Subtype crsp_ts
incomplete months. If Price or Bid/Ask Average Member and/or Array n/a
contains any bid/ask averages, these will be marked Element cal
with a negative symbol. The absolute value of Price or
Bid/Ask Average is used to select the lowest, but the
sign is preserved if a bid/ask average is selected.
21 CALENDAR IDENTIFICATION NUMBER The calendar periods of the Calendar Identification
Calendar Identification Number is an integer code Number of Rebalancing Calendar, Calendar
assigned by CRSP to trading calendars. A Calendar Identification Number of Assignment Calendar, and
Name and Calendar Identification Number identify Calendar Identification Number of Calculations
each calendar. The calendars supported in CRSPAccess Calendar are synchronized, although the actual date
CHAPTER 2: Data Definitions
databases are: ranges for each period may differ. The assignment
calendar uses the same calendars listed in Calendar
CALENDARS CALENDAR ID CALENDAR NAME BEGINNING DATE
Identification Number.
Daily 100 Daily Trading 19251231
Calendar GENERAL INFORMATION
Monthly 101 Month-End 19251231 Primary Concepts Index Header, Calendars
Trading Calendar Data Type integer number
Annual 300 Annual Trading 19251231 Unit of Item Code
Calendar
DATE RANGE AVAILABILITY
Quarterly 310 Quarterly Trading 19251231
Daily 1925
Calendar
Monthly 1925
Weekly 500 Weekly Trading 19260102
Calendar DATABASE AVAILABILITY AND UTILITY USAGE
Database Formats CRSPAccess
Product Types IND
GENERAL INFORMATION
ts_print Daily Usage n/a
Primary Concepts Base CRSPAccess Data Structures,
Stock and Index Data Description Guide
period reported.
CAPITALIZATION, END OF PERIOD
Extended Information: Company Name is the name
Category: Capitalization
of the company at the time of its name history record.
Data Type: Floating Point CRSP allocates a 35 character name Description: field
Description: Closing price * shares outstanding for all securities. Preference is given to the spellings
(in 1000s), as of end of the period. If an index, and abbreviations provided in Standard & Poor’s
capitalization is the total market value of the issues CUSIP Directory. In cases where name sources provide
used in the index at the beginning of the period. descriptions in excess of 35 characters, CRSP furnishes
its own abbreviations.Daily: The daily file includes all
DATE RANGE AVAILABILITY historical names included in the file beginning in 1925.
Daily 1925
DATE RANGE AVAILABILITY
Monthly 1925
Daily 1925
DATABASE AVAILABILITY AND PRODUCT TYPES
Monthly 1925
Database Formats CRSPAccess
Stock and Index Data Description Guide
to CRSP.
GENERAL INFORMATION
DATE RANGE AVAILABILITY Primary Concepts Index Rebalancing History Arrays
Daily 1925 Data Type integer number
Monthly 1925 Unit of Item Count
DATABASE AVAILABILITY AND PRODUCT TYPES DATE RANGE AVAILABILITY
Database Formats CRSPAccess Daily 1925
Product Types STK Monthly 1925
TS_PRINT/TSQUERY USAGE DATABASE AVAILABILITY AND UTILITY USAGE
Daily ITEMID comnam Database Formats CRSPAccess
Monthly ITEMID mcomnam Product Types IND
Header Last Company Name ts_print Daily Usage n/a
SUBNO 2 ts_print Monthly Usage n/a
ind_print Option(s) /rs#
C USAGE
Stock and Index Data Description Guide
Element usdcnt
Data Type: Floating Point Summary: The 8 character CUSIP identifier for a
security at the end of the period reported.
Description: Cumulative factor from a base date used
to adjust prices after distributions so that equivalent Extended Information: CUSIP refers to the CUSIP
comparisons can be made between prices before and identifier valid during the date range of a name
after the distribution. structure. All non-blank CUSIPs are 8 characters long.
Monthly 1925
CRSP has preserved all CUSIPs assigned to a given
DATABASE AVAILABILITY AND PRODUCT TYPES
issue over time. CUSIP identifiers were first assigned
Database Formats CRSPAccess
in 1968. All CUSIPs in a name history before that date
Product Types STK
are unavailable. Dummy CUSIP identifiers are not
TS_PRINT/TSQUERY USAGE
included in the name history.
Daily ITEMID cumfacpr
Monthly ITEMID mcumfacpr
For more details of the CUSIP identifier; See CUSIP -
Header Cumfacpr
Header.
SUBNO 0
a security at the end of period preceding the period assigned CUSIP with a dummy issuer number (the
reported. first 6 character positions) and a dummy issue number
(the last 2 character positions). The other, ******9*,
DATE RANGE AVAILABILITY
(containing a 9 in the seventh character position)
Daily 19680102
Monthly 19680102
represents a CRSP-assigned CUSIP with a real issuer
DATABASE AVAILABILITY AND PRODUCT TYPES
number but a dummy issue number. For example:
Database Formats CRSPAccess
A CUSIP - Header such as 12399099 or 12345699 is
Product Types STK
assigned by CRSP, and an identifier such as 12345610
TS_PRINT/TSQUERY USAGE
is assigned by the CUSIP Agency.
Daily ITEMID ncusip
Monthly ITEMID mncusip
Securities actively traded on an international basis,
Header NCUSIPE
domiciled outside the United States and Canada,
SUBNO 1
will be identified by a CINS (CUSIP International
Numbering System) number. CINS numbers
Stock and Index Data Description Guide
Database Formats CRSPAccess Primary Concepts Base CRSPAccess Data Structures, Time
Product Types STK Series Objects, Event Array Objects,
TS_PRINT/TSQUERY USAGE Header Objects
SUBNO 2 Daily
Monthly
the same structure, such as the difference between a DATABASE AVAILABILITY AND PRODUCT TYPES
return and price data item. It is set to zero if unused. Database Formats CRSPAccess
29
Product Types STK research.
TS_PRINT/TSQUERY USAGE
GENERAL INFORMATION
Daily ITEMID caldt
Primary Concepts Delisting History Array
Monthly ITEMID mcaldt
Data Type integer number
Header Caldt
Unit of Item Code
CHAPTER 2: Data Definitions
SUBNO 0
DATE RANGE AVAILABILITY
Daily 1925
Monthly 1925
DATE - YYYYMMDD TRADING DATE (PARTIAL DATABASE AVAILABILITY AND UTILITY USAGE
PERIOD DATA) Database Formats CRSPAccess
Category: Other Product Types STK
ts_print Daily Usage n/a
Data Type: Double Precision Floating Point
ts_print Monthly Usage n/a
Description: Trading dates used with partial period stk_print Option(s) /de
data. C USAGE
Object delist_arr
DATE RANGE AVAILABILITY
Array delist[ ]
Daily 1925
Element dlstcd
Monthly 1925
FORTRAN-95 USAGE
DATABASE AVAILABILITY AND PRODUCT TYPES
Stock and Index Data Description Guide
Delisting Code is a 3-digit integer code. It either (1) Primary Concepts Header Identification and Summary
Data
indicates that a security is still trading or (2) provides
Data Type integer number
a specific reason for delisting. All coded delistings are
Unit of Item Code
categorized by the first digit of the delisting code.
DATE RANGE AVAILABILITY
PRIMARY FIRST Daily 1925
DIGIT OF CODE CATEGORY Monthly 1925
1 still trading or halted but not yet delisted DATABASE AVAILABILITY AND UTILITY USAGE
2 merger Database Formats CRSPAccess
3 exchange Product Types STK
4 liquidation ts_print Daily Usage n/a
5 delisted by NYSE, NYSE MKT, NASDAQ or Arca ts_print Monthly Usage n/a
7 delisted by the Securities and Exchange Commission stk_print Option(s) /hh, /hn
8 trading simultaneously on more than one exchange
C USAGE
Object header_row
The second and third digits of the delisting codes
Array header
provide further detail of delisting events. Additional Element dlstcd
delisting codes, specific to various delisting categories, FORTRAN-95 USAGE
have been created to indicate when an issue is closed Type or Subtype stkhdr
to further research, or if the issue is pending further Member and/or Array n/a
30
Element dlstcd date is set to the date of the announcement. If the
DELISTING DATE security became worthless after delisting and there is
no evidence of any trading after delisting, then the
Delisting Date is an integer containing the date in
Delisting Date of Next Available Information is set to
YYYYMMDD format of a security’s last price on the
one trading day after the Delist Date, and the Delisting
current exchange. If the security is still active, Delisting
CHAPTER 2: Data Definitions
Type or Subtype delist_arr delisted from NYSE, NYSE MKT, NASDAQ, or Arca.
Member and/or Array delist() The Delisting Return is calculated by comparing the
Element dlpdt
security’s Amount After Delisting with its price on the
last day of trading. The Amount After Delisting can
be either an off-exchange price, an off-exchange price
DELISTING PRICE quote, or the sum of a series of distribution payments.
The effective date of the delisting return is specified in
Delisting Price refers to a trade price or a price quote
the Delisting Payment Date.
(given as the average of bid and ask quotes) on another
exchange or over-the-counter. The date of this price
The return for any issue that has been closed to further
or quote is specified in the Delisting Date of Next
research is calculated as follows:
Available Information.
• If a price within 10 periods of the delist date is
If the Delisting Price is positive, then it is a trade price.
available, then the delisting return is calculated
If the Delisting Price is negative, then it is the average
using that price.
of bid and ask quotes. A Delisting Price is set to zero
if the security is still active, if there was no further • If a final distribution is available, then the delisting
trading for the security after the delist date, or if prices return is calculated using all known distribution
or price quotes are not available after the delist date. If information occurring after the date of last price.
delisting payments were made using distributions, the
Delisting Price is also set to zero, and the sum of the • If distributions occurring after the date of last
distribution payments is specified in the Amount After price are available, but no final distribution has
Delisting. been found, then the delisting return is calculated
as if a final distribution were found. (This applies
GENERAL INFORMATION only to issues closed to further research.)
Primary Concepts Delisting History Array
Data Type real number • If there is evidence that no distributions will
Unit of Item USD ever be paid to shareholders, then the stock is
DATE RANGE AVAILABILITY considered worthless. The delisting return is set to
32 -1 (i.e. a 100% loss). Element dlret
FORTRAN-95 USAGE
Type or Subtype delist_arr
• If there is evidence that the stock has been Member and/or Array delist()
declared worthless, then the delisting return is set Element dlret
CHAPTER 2: Data Definitions
-99.0 security trades on a new exchange after delisting, but CRSP currently
Primary Concepts Delisting History Array
has no sources to gather price information
Data Type real number
Unit of Item Ratio
Monthly: If Amount After Delisting is non-zero and
DATE RANGE AVAILABILITY
Delisting Payment Date is less than or equal to the
Daily 1925
Delisting Date, the Delisting Return represents a
Monthly 1925
partial-month return, not a Delisting Return. The
DATABASE AVAILABILITY AND UTILITY USAGE
partial-month returns compare the value on the last
Database Formats CRSPAccess
day of trading with the value on the last month-end Product Types STK
date and do not factor in additional after-delisting ts_print Daily Usage retx/0 when the DLRET option is
information. included in the FORMAT option
ts_print Monthly Usage mretx/0 when the DLRET option is
GENERAL INFORMATION included in the FORMAT option
Primary Concepts Delisting History Array stk_print Option(s) /de
Data Type real number C USAGE
Unit of Item Ratio Object delist_arr
DATE RANGE AVAILABILITY Array delist[ ]
Daily 1925 Element dlretx
Monthly 1925 FORTRAN-95 USAGE
DATABASE AVAILABILITY AND UTILITY USAGE Type or Subtype delist_arr
Database Formats CRSPAccess Member and/or Array delist()
Product Types STK Element dlretx
ts_print Daily Usage ret/0 when the DLRET option is included
in the FORMAT option
ts_print Monthly Usage mret/0 when the DLRET option is DISTRIBUTION CODE
included in the FORMAT option
CRSP describes company distributions and corporate
stk_print Option(s) /de
actions in the distribution history with a 4-digit code.
C USAGE
The first digit describes the type of distribution. The
Object delist_arr
second digit describes the payment method. The third
Array delist[ ]
33 digit augments the type denoted by the first digit. The Primary Concepts Distribution Event Array
fourth digit provides information regarding the tax Data Type integer number
status of the distribution for details. Unit of Item YYYYMMDD date
DATE RANGE AVAILABILITY
CRSP has not verified the tax status of ordinary cash Daily 1925
CHAPTER 2: Data Definitions
dividends since 1987. CRSP assigns the most common Monthly 1925
tax code, taxable as dividend, to ordinary dividends DATABASE AVAILABILITY AND UTILITY USAGE
to these issues. CRSP does verify the tax status of Database Formats CRSPAccess
stock distributions and distributions associated with Product Types STK
rights offerings, spin-offs, liquidations, mergers, ts_print Daily Usage n/a
GENERAL INFORMATION
Description: Ordinary and return-of-capital dividends
during the period, adjusted to beginning of period
34 basis. TS_PRINT/TSQUERY USAGE
Daily ITEMID odivamt
DATE RANGE AVAILABILITY Monthly ITEMID modivamt
Daily 1925 Header Odivamt
Monthly 1925 SUBNO 0
CHAPTER 2: Data Definitions
Description: Ordinary cash dividends paid, adjusted be adjusted using the Factor to Adjust Price. These are
using the price adjustment factor. identified by a Distribution Code with the first digit 6
DATE RANGE AVAILABILITY and a Factor to Adjust Price between -1 and 0. Note:
Daily 1925 regular income dividends for ADRs use the gross.
Monthly 1925
DATABASE AVAILABILITY AND PRODUCT TYPES GENERAL INFORMATION
C USAGE
Data Type integer number
Object group_arr
Unit of Item Array index
Array group
DATE RANGE AVAILABILITY
Element grpenddt
Daily 1925
FORTRAN-95 USAGE
Monthly 1925
Type or Subtype group_arr()
DATABASE AVAILABILITY AND UTILITY USAGE
Member and/or Array group(,)
Database Formats CRSPAccess
Element grpenddt
Product Types STK, IND
ts_print Daily Usage n/a
ts_print Monthly Usage n/a
END OF STOCK DATA
stk_print or ind_print Option(s) /hr
End of Stock Data is the date that data ends for the
C USAGE
security, in YYYYMMDD format. It is the date of
Object CRSP_TIMESERIES
the last period in the time series arrays and is always Array n/a
greater than zero. Element end
Data Type: Double Precision Floating Point Ex-Distribution Dates of liquidating payments after
Description: First period in a selected date range, or delistings are reported when available, and set to
CHAPTER 2: Data Definitions
event date for an entity. Record Date or Delisting Payment Date if unavailable.
security is first traded without the right to receive Product Types STK
YYYYMMDD format and is always a daily trading date. Daily ITEMID portxsiret
Monthly ITEMID mportxsiret
For distributions in a merger or exchange where the Header Portxsiret
Header Cumxsptret
SUBNO PORTID Description: Difference between a security’s capital
appreciation and the capital appreciation of a portfolio
EXCESS RETURNS VS. INDEX SERIES a user selects to be associated with the security.
Category: Returns Related to an Index DATE RANGE AVAILABILITY
Daily 1925
Data Type: Floating Point
Monthly 1925
Description: Difference between a security’s total DATABASE AVAILABILITY AND PRODUCT TYPES
return and the total return of an index that a user Database Formats CRSPAccess
selects to be associated with the security. Product Types STK
TS_PRINT/TSQUERY USAGE
DATE RANGE AVAILABILITY Daily ITEMID exchcd
Daily 1925 Monthly ITEMID mexchcd
Monthly 1925 Header EXL
DATABASE AVAILABILITY AND PRODUCT TYPES SUBNO 2
Database Formats CRSPAccess
Product Types STK
TS_PRINT/TSQUERY USAGE EXCHANGE CODE - HEADER
Daily ITEMID exchcd
Exchange Code - Header displays the Exchange Code
Monthly ITEMID mexchcd
on which a security was last listed. Valid Exchange
Header EX
Code - Header values are 1, 2, 3, 4 or 5, which
SUBNO 0
correspond to the NYSE, NYSE MKT, NASDAQ, Arca
and Bats respectively. Other Exchange Codes are not
included in the Exchange Code - Header field.
Stock and Index Data Description Guide
Daily 1925
Data Type: Double Precision Floating Point
Monthly 1925
DATABASE AVAILABILITY AND UTILITY USAGE Description: Code identifying a group to which a
Database Formats CRSPAccess security belongs. Currently, S&P 500 group flag 16 is
Product Types STK the only active group. If the security belongs to the
ts_print Daily Usage n/a selected group type, the Group Flag of Associated
ts_print Monthly Usage n/a Index will contains a one (1). If it does not belong to
stk_print or ind_print Option(s) /di the group, or is not valid according to the group rules,
C USAGE the field contains a zero (0).
Object dists_arr
DATE RANGE AVAILABILITY
Array dists[ ]
Daily 1925
Element facshr
Monthly 1925
FORTRAN-95 USAGE
DATABASE AVAILABILITY AND PRODUCT TYPES
Type or Subtype dists_arr
Database Formats CRSPAccess
Member and/or Array dists( )
Stock and Index Data Description Guide
Description: Last known code identifying a group to based on a purchase on the most recent time previous
which a security belongs. Currently, S&P 500 group to this day when the security had a valid price. Usually,
flag 16 is the only active group. this time is the previous calendar period. See “Returns”
DATE RANGE AVAILABILITY in the Calculations section.
Daily 1925 Daily: In daily databases, dividends are reinvested on
Monthly 1925
the Ex-Distribution Date.
DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess
Monthly: In monthly databases, returns are holding
Product Types STK period returns from month-end to month-end, not
TS_PRINT/TSQUERY USAGE compounded from daily returns, and ordinary
Daily ITEMID lgrpflg dividends are reinvested at month-end.
Monthly ITEMID mlgrpflg A series of special missing return codes specify the
Header LSPInd
reason a return is missing.
SUBNO 16
Stock and Index Data Description Guide
H. -66.0 Valid current price but no valid previous price. Either first price, unknown
exchange between current and previous price, or more than 10 periods
between time t and the time of the preceding price t’.
HIGHEST CLOSE
-77.0 Not trading on the current exchange at time t.
Category: Prices -88.0 No data available to calculate returns.
Data Type: Floating Point -99.0 Missing return due to missing price at time t; usually due to suspension
in trading or trading on unknown exchange.
Description: Daily — Highest daily closing price within
the selected output calendar. GENERAL INFORMATION
Primary Concepts Price, Volume, and Return Time Series
Monthly — Highest month end closing price within the
Arrays
selected calendar. Appropriate to use with quarterly Data Type real number
and annual output calendars. Unit of Item Ratio
DATABASE AVAILABILITY AND PRODUCT TYPES DATABASE AVAILABILITY AND UTILITY USAGE
CRSP includes a public index such as the S&P 500 Data Type: Double Precision Floating Point
Composite or the NASDAQ Composite, Index Capital
Appreciation Return is derived from data provided by Description: Number of issues used to create a specific
the creator of the index. index or portfolio during one calendar period. A
security must be a member of the index or portfolio
GENERAL INFORMATION with valid prices for both the current and the previous
Primary Concepts Index Time Series trading periods to be included in the count. See the
Data Type real number Index Methodologies section for information including
Unit of Item Ratio rebalancing frequency and universe inclusion for
DATE RANGE AVAILABILITY specific indexes.
Daily 1925
Monthly 1925
DATE RANGE AVAILABILITY
DATABASE AVAILABILITY AND UTILITY USAGE
Daily 1925
Database Formats CRSPAccess
Monthly 1925
Stock and Index Data Description Guide
Object indhdr_row
3 Standard deviation over previous year
Array indhdr
Element flags
GENERAL INFORMATION
FORTRAN-95 USAGE
Primary Concepts Index Header
Type or Subtype indhdr
Data Type integer number
Member and/or Array flags
Unit of Item Code
Element n/a
DATE RANGE AVAILABILITY
Daily 1925
Monthly 1925
INDEX FUNCTION CODE FOR BUY RULES DATABASE AVAILABILITY AND UTILITY USAGE
Index Function Code for Buy Rules is a code defining Database Formats CRSPAccess
Product Types IND
a function used to determine whether an issue is added
ts_print Daily Usage n/a
to a portfolio during rebalancing. This variable is not
ts_print Monthly Usage n/a
yet available, and is always set to 0.
Stock and Index Data Description Guide
distributions, relative to its value at one fixed point in Description: Value of an index, excluding ordinary
time. dividends, relative to its value at one fixed point in
time.
DATE RANGE AVAILABILITY
Daily 1925 DATE RANGE AVAILABILITY
Monthly 1925 Daily 1925
DATABASE AVAILABILITY AND PRODUCT TYPES Monthly 1925
Database Formats CRSPAccess DATABASE AVAILABILITY AND PRODUCT TYPES
Product Types STK Database Formats CRSPAccess
TS_PRINT/TSQUERY USAGE Product Types STK
Daily ITEMID tlvl TS_PRINT/TSQUERY USAGE
Monthly ITEMID mtlvl Daily ITEMID alvl
Header TLvl Monthly ITEMID malvl
SUBNO 0 Header ALvl
SUBNO 0
Stock and Index Data Description Guide
Reweighting Type Flag, and Index Reweighting Timing Primary Concepts Index Header
Flag characteristics. Current codes are: Data Type structure
Unit of Item Set (methodology)
CODE DESCRIPTION
DATE RANGE AVAILABILITY
1 CRSP Cap-Based Portfolios
Daily 1925
3 CRSP Risk-Based Decile Indexes
Monthly 1925
4 CRSP Value-Weighted Market Indexes
DATABASE AVAILABILITY AND UTILITY USAGE
5 CRSP Equal-Weighted Market Indexes
Stock and Index Data Description Guide
CODE DESCRIPTION
DATABASE AVAILABILITY AND UTILITY USAGE
Database Formats 0 Unknown or not available
CRSPAccess
Product Types 1 New securities are included the first period they meet existing portfolio
IND
restrictions
ts_print Daily Usage n/a
2 Securities are never added until next rebalancing period
ts_print Monthly Usage n/a
stk_print or ind_print Option(s) /hr
C USAGE GENERAL INFORMATION
GENERAL INFORMATION
Primary Concepts Index Header INDEX REBALANCING END DATE
Data Type integer number
Index Rebalancing End Date is the integer date,
Unit of Item Code
in YYYYMMDD format, of the last date in the
DATE RANGE AVAILABILITY
rebalancing period of an index.
Daily 1925
Monthly 1925 GENERAL INFORMATION
DATABASE AVAILABILITY AND UTILITY USAGE Primary Concepts Index Rebalancing History Arrays
Element method.wgttype
FORTRAN-95 USAGE CODE DESCRIPTION
GENERAL INFORMATION
Monthly 1925
DATABASE AVAILABILITY AND UTILITY USAGE
INDEX SUBCATEGORY CODE
Database Formats CRSPAccess
Product Types IND Index Subcategory Code is an integer flag indicating
ts_print Daily Usage n/a a subcategory of the primary index in an index list
ts_print Monthly Usage n/a history to which the security belongs. It is set to zero if
ind_print Option(s) /hr no subcategory is applicable.
C USAGE
GENERAL INFORMATION
Object indhdr_row
Primary Concepts Index List History Array
Array indhdr
Data Type integer number
Element method.subtype
Unit of Item Code
FORTRAN-95 USAGE
DATE RANGE AVAILABILITY
Type or Subtype indhdr
Daily 1925
Member and/or Array method
Monthly 1925
Element subtype
DATABASE AVAILABILITY AND UTILITY USAGE
Database Formats CRSPAccess
Product Types IND
54
ts_print Daily Usage n/a INDEX TOTAL RETURN
ts_print Monthly Usage n/a Index Total Return is the return, including all
ind_print Option(s) n/a
distributions, of an index. See “Index Returns” in the
C USAGE
Calculations section for details on how CRSP index
Object list_arr[ ]
returns are calculated. Index Total Return is only
CHAPTER 2: Data Definitions
Array list[ ][ ]
available for CRSP-generated indexes.
Element subind
FORTRAN-95 USAGE GENERAL INFORMATION
Index Subset Screening Structure, like the Partition DATABASE AVAILABILITY AND UTILITY USAGE
used to restrict a database using various screening Product Types STK*, IND
ts_print Daily Usage n/a
variables. The screen fields are: Universal Subset Type
ts_print Monthly Usage n/a
Code, First Trading Date Allowed in Restriction,
ind_print Option(s) /tr
Index Restriction End Date, Valid Exchange Codes
Stock and Index Data Description Guide
C USAGE
in Universe, Valid NASDAQ Market Groups in
Object tret_ts[ ]
Universe, Valid When-Issued Securities in Universe,
Array tret[ ][ ]
Valid Incorporation of Securities in Universe, and
Element n/a
Share Code Screen Structure. Index Subset Screening
FORTRAN-95 USAGE
Structure screens are used to restrict the securities used
Type or Subtype tret_ts
in the actual index.
Member and/or Array tret(,)
GENERAL INFORMATION Element n/a
Primary Concepts Index Header
Data Type structure
Unit of Item Set (screen markets)
INDEX TOTAL VALUE
DATE RANGE AVAILABILITY Index Total Value is the total market value of the
Daily 1925 non-ADR securities in the index universe, in $1000s,
Monthly 1925 with valid prices and shares outstanding amounts on
DATABASE AVAILABILITY AND UTILITY USAGE the selected Calendar Trading Date. See the Index
Database Formats CRSPAccess Methodologies section for information including
Product Types IND rebalancing frequency and universe inclusion for
ts_print Daily Usage n/a specific indexes.
ts_print Monthly Usage n/a
ind_print Option(s) /hr GENERAL INFORMATION
C USAGE Primary Concepts Index Time Series
Object indhdr_row Data Type real number
Array indhdr Unit of Item USD
Element induniv
Date Range Availability Daily 1925
FORTRAN-95 USAGE
Monthly 1925
Type or Subtype indhdr
DATABASE AVAILABILITY AND UTILITY USAGE
Member and/or Array induniv
Database Formats CRSPAccess
Element n/a
Product Types STK, IND
ts_print Daily Usage cap/0
55
ts_print Monthly Usage mcap/0 INDNO
ind_print Option(s) /tv INDNO indicates the unique permanent identifier
C USAGE assigned by CRSP to every supported index. All
Object totval_ts[ ] INDNO identifiers are 7-digit integers. There is no
Array totval[ ][ ] inherent meaning in the numbers. The indexes sets
CHAPTER 2: Data Definitions
weighted index the Index Used Value is the weight of Database Formats CRSPAccess
the index. Product Types IND
ts_print Daily Usage permno
Stock and Index Data Description Guide
For standard CRSP market indexes the beginning total ts_print Monthly Usage mpermno
market value is calculated using prices and shares from ind_print Option(s) /hh, /hr
the previous trading day. In these indexes a security
C USAGE
cannot be an ADR and must have prices and shares
Object indhdr_row
on the current and previous trading dates. See “Index
Array indhdr
Returns” in the Calculations Section, and see the
Element indno
Index Methodologies Section.
FORTRAN-95 USAGE
GENERAL INFORMATION
and annual output calendars.
Primary Concepts Base CRSPAccess Data Structures, Time
DATE RANGE AVAILABILITY Series Objects, Event Array Objects,
Daily 1925 Calendar Objects
Description: Compounded total returns of a portfolio Description: Compounded price appreciation only,
that a user selects to be associated with a security or of a portfolio that a user selects to be associated with a
group of securities. Each period in the time series security or group of securities. Each period in the time
contains a cumulative return since the beginning series contains a cumulative return since the beginning
period. period.
DATE RANGE AVAILABILITY DATE RANGE AVAILABILITY
Daily 1925 Daily 1925
Monthly 1925 Monthly 1925
DATABASE AVAILABILITY AND PRODUCT TYPES DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess Database Formats CRSPAccess
Product Types STK Product Types STK
TS_PRINT/TSQUERY USAGE TS_PRINT/TSQUERY USAGE
Daily ITEMID cumptret Daily ITEMID cumparet
Monthly ITEMID mcumptret Monthly ITEMID mcumparet
Stock and Index Data Description Guide
N.
MEMBER PORTFOLIO RETURNS ON INCOME,
CUMULATIVE NORTH AMERICAN INDUSTRY
Category: Returns Related to a Portfolio Type CLASSIFICATION SYSTEM (NAICS) CODE, END
OF PERIOD
Data Type: Floating Point
Category: Name History
Description: Compounded return, on income only,
Data Type: String
of a portfolio that a user selects to be associated with a
Description: North American Industry Classification
security or group of securities. Each period in the time System, 6-character industry code, at the end of period
series contains a cumulative return since the beginning reported.
period. Extended Information: NAICS codes were adopted
DATE RANGE AVAILABILITY in 1997 and implemented in 1999, by the Office of
Daily 1925 Management and Budget (OMB), to replace the U.S.
Monthly 1925 Standard Industrial Classification (SIC) system. The
DATABASE AVAILABILITY AND PRODUCT TYPES 6-character code is used to group establishments with
Database Formats CRSPAccess similar products or services. While there are exceptions
Product Types STK to the definition, an establishment “is generally a
TS_PRINT/TSQUERY USAGE single, physical location at which economic activity
Daily ITEMID cumpiret occurs (e.g., store, factory, farm, etc.)1”. NAICS was
Monthly ITEMID mcumpiret designed to encompass all fields of economic activities,
Header Cumpiret producing and non-producing. Each establishment
SUBNO PORTID is assigned to one industry that matches its primary
activity. The codes were developed by the US, Canada,
and Mexico to provide a business activity standard
throughout North America, to facilitate economic
analyses of North America’s economies.
59 NAICS is a hierarchical code, containing up to six Monthly 20010824
digits: The first two fields, NAICS sectors, designate DATABASE AVAILABILITY AND PRODUCT TYPES
general categories of economic activity, the third field, Database Formats CRSPAccess
sub-sector, further defines the sector, the fourth field Product Types STK
is the industry group, the fifth field is the NAICS TS_PRINT/TSQUERY USAGE
CHAPTER 2: Data Definitions
industry, and the sixth field represents the national Daily ITEMID snaics
industry (a zero in the 6th digit generally indicates that Monthly ITEMID msnaics
the NAICS industry and the country industry are the Header Naicse
same). For example, 1123 represents Poultry and Egg SUBNO 1
Production, 11231 represents Chick Egg Production,
and 112310 represents Chicken Egg Production.
NAICS, MOST RECENT
NAICS codes are available for securities in the CRSP
Category: Name History
database from August 24, 2001 onwards. Unknown
NAICS codes are blank. For additional information on Data Type: String
NAICS codes, please refer to the Executive Office of
Description: The most recently known North
the President Office of Management and Budget’s most
American Industry Classification System, 6-character
current North American Industry Classification System
industry code.
manual, or visit the US Census Bureau’s website at
http://www.census.gov/epcd/www/naics.html. DATE RANGE AVAILABILITY
Stock and Index Data Description Guide
Daily 20010824
In the December 2009 stock database, CRSP removed
Monthly 20010824
NAICS Codes provided by our source, Mergent,
DATABASE AVAILABILITY AND PRODUCT TYPES
from our Stock Databases and replaced them with
Database Formats CRSPAccess
NAICS Codes from Interactive Data Corporation.
Product Types STK
Mergent was CRSP’s only source for NAICS beginning
TS_PRINT/TSQUERY USAGE
20010824. The IDCI NAICS Codes begin 20040610.
Daily ITEMID snaics
DATE RANGE AVAILABILITY Monthly ITEMID msnaics
Daily 20010824 Header Naicsl
Monthly 20010824 SUBNO 2
DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess
Product Types STK
NAICS - HEADER
TS_PRINT/TSQUERY USAGE
North American Industry Classification System -
Daily ITEMID snaics
Header is an 6-character code used to group companies
Monthly ITEMID msnaics
Header Naics
with similar products or services. It contains the
SUBNO 0
most current NAICS code in the database. See North
American Industry Classification System (NAICS)
for additional detail on NAICS codes. N.B. this field
NAICS, END OF PREVIOUS PERIOD includes data starting on 20040610.
Description: Integer code indicating the issue’s trading on NASDAQ. The NASDAQ Issue Number in
classification within NASD’s internal business the CRSP Data File may change if NASDAQ assigns
Description: categories, at the end of the period a new number to an issue CRSP considers to be a
preceding the period reported. continuation of an existing issue.
DATE RANGE AVAILABILITY GENERAL INFORMATION
Daily 198211 Primary Concepts Header Identification and Summary Data
Monthly 198211 Data Type integer number
DATABASE AVAILABILITY AND PRODUCT TYPES Unit of Item Id
Database Formats CRSPAccess DATE RANGE AVAILABILITY
Product Types STK Daily 198211
TS_PRINT/TSQUERY USAGE Monthly 198211
Daily ITEMID nsdinx DATABASE AVAILABILITY AND UTILITY USAGE
Monthly ITEMID mnsdinx Database Formats CRSPAccess
Header Nsdinxe Product Types STK
Stock and Index Data Description Guide
(NASD) to each listed security on The NASDAQ DATABASE AVAILABILITY AND PRODUCT TYPES
Description: One-digit integer code indicating an since November 1, 1982, and all NASDAQ securities
issue’s membership within the NASDAQ Market tier since June 15, 1992. Due to lack of sources, NASDAQ
system, at the end of the previous period. Number of Trades data are missing for 15 NASDAQ
DATE RANGE AVAILABILITY National Market securities in December, 1982, and all
Daily 19920615 The NASDAQ National Market securities in February,
Monthly 199206 1986.
DATABASE AVAILABILITY AND PRODUCT TYPES
Monthly: Not available.
Database Formats CRSPAccess
Product Types STK
TS_PRINT/TSQUERY USAGE GENERAL INFORMATION
Daily ITEMID nmsind Primary Concepts Supplemental NASDAQ Time Series
Monthly ITEMID mnmsind Data Type integer number
Header Nmsinde Unit of Item Count
SUBNO 1 DATE RANGE AVAILABILITY
Stock and Index Data Description Guide
Daily 19821101
Monthly n/a
DATABASE AVAILABILITY AND UTILITY USAGE
NASDAQ NATIONAL MARKET INDICATOR,
Database Formats CRSPAccess
MOST RECENT Product Types STK
Category: Nasdaq ts_print Daily Usage numtrd/0
ts_print Monthly Usage n/a
Data Type: Double Precision Floating Point
stk_print Option(s) /pn
Description: One-digit integer code indicating an C USAGE
issue’s membership within the NASDAQ Market tier Object numtrd_ps
system, most recently known value. Array numtrd[ ]
Element numtrd
DATE RANGE AVAILABILITY
FORTRAN-95 USAGE
Daily 19920615
Type or Subtype numtrd_ts
Monthly 199206
Member and/or Array numtrd( )
DATABASE AVAILABILITY AND PRODUCT TYPES
Element numtrd
Database Formats CRSPAccess
Product Types STK
TS_PRINT/TSQUERY USAGE
NASDAQ STATUS CODE, END OF PERIOD
Daily ITEMID nmsind
Monthly ITEMID mnmsind
Category: Nasdaq
Header Nmsindl Data Type: Double Precision Floating Point
SUBNO 2
Description: One-digit integer describing the trading
status of an issue listed on NASDAQ, at the end of
each period reported.
NASDAQ NUMBER OF TRADES
Daily: NASDAQ Number of Trades contains the
CODE DESCRIPTION
number of trades made on the NASDAQ Stock Market
0 Unknown or not applicable
each date for a security. Trades on all exchanges are
1 Active
connected to NASDAQ’s composite pricing network
64
CODE DESCRIPTION NASDAQ STATUS CODE, MOST RECENT
2 Trading with only one market maker Category: Nasdaq
3 Suspended
4 Inactive
Data Type: Double Precision Floating Point
5 Delisted Description: One-digit integer describing the most
CHAPTER 2: Data Definitions
stk_print Option(s) /q
C USAGE NEW PERMNO
Object nasdin_arr New PERMNO is an integer pointer to a new security
Array nasdin[ ] assigned when an issue ceases trading as a result of a
Element trtsenddt merger or exchange where shareholders receive stock
FORTRAN-95 USAGE in the acquiring company. The New PERMNO is
Type or Subtype nasdin_arr
the PERMNO of the primary security received from
Member and/or Array nasdin()
the acquiring company. It acts as a forward pointer,
Element trtsenddt
allowing the user to trace the ongoing history of
surviving companies. New PERMNO may identify
NEW PERMCO an issue that exists on a different CRSP Stock File.
It is set to zero if there is no new primary security
New PERMCO is an integer link to a company
applicable, the issue is unknown, or the delisting does
assigned when an issue ceases trading as a result of
not represent a merger or exchange. The distribution
a merger or exchange when shareholders receive
history arrays contain an itemized record of all types
some payment from the acquiring company. If
of payments to shareholders in an exchange or merger.
New PERMNO is nonzero, New PERMCO is the
See Acquiring PERMNO for companies associated
PERMCO of that security. If New PERMNO is zero,
with individual payments.
New PERMCO can still be nonzero if the shareholders
receive a payment from an acquiring company known GENERAL INFORMATION
to CRSP, but the payments are not primarily in the Primary Concepts Delisting History Array
stock of the company. New PERMCO is zero if the Data Type integer number
Array delist[ ]
the list is set to zero.
Element nwperm
FORTRAN-95 USAGE GENERAL INFORMATION
Number of Array Elements is the count of actual event DATABASE AVAILABILITY AND UTILITY USAGE
structures available in a CRSP event object for the Database Formats CRSPAccess
Object n/a
series, Begin of Valid Data and End of Valid Data are
Array n/a
both set to zero.
Element indtypes
FORTRAN-95 USAGE GENERAL INFORMATION
C USAGE CRSP_TIMESERIES 2
Object CRSP_ARRAY 3
n/a
Array CRSP_ROW 5
n/a
Element rebaltypes GENERAL INFORMATION
time series, event, or header data in CRSPAccess object Database Formats CRSPAccess
FORTRAN-95 USAGE
Statistic Grouping Code. Type or Subtype indhdr
Member and/or Array n/a
GENERAL INFORMATION
Element portnum
Primary Concepts Index Header
Data Type structure
Unit of Item Set (portfolio building rules)
PORTFOLIO NUMBER IN ASSOCIATED INDEX
DATE RANGE AVAILABILITY
Daily 1925
Portfolio Number in Associated Index is the portfolio
Monthly 1925 number within an associated index group defined in
DATABASE AVAILABILITY AND UTILITY USAGE INDNO of Associated Index. The associated index
Database Formats CRSPAccess breakpoint information for that portfolio is used for
Product Types IND this index. It is set to zero if no outside rebalancing
ts_print Daily Usage n/a information is used to build this index.
ts_print Monthly Usage n/a
GENERAL INFORMATION
ind_print Option(s) /hr
Primary Concepts Index Header
C USAGE
Data Type integer number
Object indhdr_row
Unit of Item Id
Array indhdr
DATE RANGE AVAILABILITY
Element rules
Daily 1925
FORTRAN-95 USAGE
Monthly 1925
Type or Subtype indhdr
DATABASE AVAILABILITY AND UTILITY USAGE
Member and/or Array rules
Database Formats CRSPAccess
Element n/a
Product Types IND
ts_print Daily Usage n/a
ts_print Monthly Usage n/a
PORTFOLIO NUMBER IF SUBSET SERIES
ind_print Option(s) /hr
Portfolio Number if Subset Series is the portfolio C USAGE
number within an index group to which this index Object indhdr_row
series belongs. The Index Primary Link variable Array indhdr
contains the Permanent Index Group Identification
72
Element assign.asport unavailable, the number in the price field is replaced
FORTRAN-95 USAGE with a bid/ask average (marked by a leading dash).
Type or Subtype indhdr
Member and/or Array assign DATE RANGE AVAILABILITY
Monthly 1925
DATABASE AVAILABILITY AND PRODUCT TYPES
PORTFOLIO STATISTIC Database Formats CRSPAccess
Monthly: Price Alternate is an alternate monthly price Unit of Item YYYYMMDD date
derived from daily prices. Price Alternate contains the DATE RANGE AVAILABILITY
Daily n/a
last non-missing price in the month. The date of this
Monthly 196207
price is stored in the Price Alternate Date field. Price
CHAPTER 2: Data Definitions
All prices are raw prices as they were reported at the Data Type: Floating Point
time of trading. Description: Daily — The last non-missing daily
CHAPTER 2: Data Definitions
Market were first reported June 15, 1992. Price or Daily 1925
negative bid/ask average before this time. DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess
Average is the price on the last trading date of the TS_PRINT/TSQUERY USAGE
month. The price series begins the first month-end Daily ITEMID prc
after the security begins trading and ends the last Monthly ITEMID mprc
Header Prcprev
complete month of trading. If the closing price is not
SUBNO 1
available on any given end of month trading day, the
number in the price field is a bid/ask average, not an
actual closing price. Trading prices for The NASDAQ
National Market securities were first reported PRIMARY EXCHANGE, END OF PERIOD
November 1, 1982. Trading prices for The NASDAQ Category: Name History
SmallCap Market were first reported June 15, 1992.
Data Type: Character
Price or Bid/Ask Average for NASDAQ securities is
always a bid/ask average, (identified with “-“ before the Description: Character code indicating the exchange
value) before these dates. on which the security has its primary listing at the end
of the period reported. (N = NYSE, A = NYSE MKT,
DATE RANGE AVAILABILITY Q = NASDAQ, R = Arca, B = Bats, I= IEX, X = Other)
Daily 1925
DATE RANGE AVAILABILITY
Monthly 1925
Daily 1925
DATABASE AVAILABILITY AND PRODUCT TYPES
Monthly 1925
Database Formats CRSPAccess
DATABASE AVAILABILITY AND PRODUCT TYPES
Product Types STK
Database Formats CRSPAccess
TS_PRINT/TSQUERY USAGE
Product Types STK
Daily ITEMID prc
ts_print/TsQuery Usage
Monthly ITEMID mprc
Daily ITEMID primexch
Header Prc
Monthly ITEMID mprimexch
SUBNO 0
Header Primexch
SUBNO 0
75 PRIMARY EXCHANGE, END OF PREVIOUS R.
PERIOD
RECORD DATE
Category: Name History
Record Date is the record date on which the
Data Type: Character
stockholder must be registered as holder of record on
CHAPTER 2: Data Definitions
Description: Character code indicating the exchange the stock transfer records of the company in order
on which the security has its primary listing at the to receive a particular distribution directly from the
end of the period preceding the period reported. (N = company. This integer date is coded as YYYYMMDD,
NYSE, A = NYSE MKT, Q = NASDAQ, R = Arca, B = and set to 0 if unavailable.
Bats, X = Other)
For a merger, exchange, or total liquidation in
DATE RANGE AVAILABILITY which the company disappeared, Record Date is, by
Daily 1925 convention, set equal to the date of the last price or
Monthly 1925 Delisting Date.
DATABASE AVAILABILITY AND PRODUCT TYPES
Record dates of liquidating payments after delisting are
Database Formats CRSPAccess
reported when available, and set to 0 when unavailable.
Product Types STK
TS_PRINT/TSQUERY USAGE GENERAL INFORMATION
Daily ITEMID primexch Primary Concepts Distribution Event Array
Monthly ITEMID mprimexch Data Type integer number
Stock and Index Data Description Guide
Element begdt
Data Type structure
Unit of Item Set (partition assignment)
DATE RANGE AVAILABILITY RESTRICTION END DATE (PARTITION OR
Daily 1925
INDEX)
Monthly 1925
Restriction End Date is the last date, in YYYYMMDD
DATABASE AVAILABILITY AND UTILITY USAGE
format, of data included in a partition universe
Database Formats CRSPAccess
Product Types IND
restriction or an index universe restriction. Restriction
ts_print Daily Usage n/a
End Date is set to 0 if there is no date restriction.
ts_print Monthly Usage n/a GENERAL INFORMATION
ind_print Option(s) /hr Primary Concepts Index Header
C USAGE Data Type integer number
Object indhdr_row Unit of Item YYYYMMDD date
Array indhdr DATE RANGE AVAILABILITY
Stock and Index Data Description Guide
Monthly — Month-end to month-end change in total -55.0 CRSP has no sources to establish a value after delisting or is unable to
assign a value to one or more known distributions after delisting
investment of a security, with ordinary dividends
-66.0 more than 10 trading periods between a security’s last price and its
reinvested at the month-end. first available price on a new exchange
Extended Information: Delisting Return is the return -88.0 security is still active
of a security after it has delisted from NYSE, NYSE -99.0 security trades on a new exchange after delisting, but CRSP currently
has no sources to gather price information
MKT, or NASDAQ. The Delisting Return is calculated
by comparing the security’s Amount After Delisting
with its price on the last day of trading. The Amount Monthly: If Amount After Delisting is non-zero and
After Delisting can be either an off-exchange price, Delisting Payment Date is less than or equal to the
an off-exchange price quote, or the sum of a series Delisting Date, the Delisting Return represents a
of distribution payments. The effective date of the partial-month return, not a Delisting Return. The
delisting return is specified in the Delisting Payment partial-month returns compare the value on the last
Date. day of trading with the value on the last month-end
date and do not factor in additional after-delisting
The return for any issue that has been closed to further information.
research is calculated as follows: DATE RANGE AVAILABILITY
Daily 1925
• If a price within 10 periods of the delist date is Monthly 1925
DATABASE AVAILABILITY AND PRODUCT TYPES DATE RANGE AVAILABILITY
78
Database Formats CRSPAccess Daily 1925
Product Types STK Monthly 1925
TS_PRINT/TSQUERY USAGE DATABASE AVAILABILITY AND PRODUCT TYPES
Daily ITEMID ret Database Formats CRSPAccess
CHAPTER 2: Data Definitions
the time series contains a cumulative return since the return since the beginning period.
beginning period.
Monthly — Monthly returns on income compounded
DATE RANGE AVAILABILITY from the beginning month in the range. Each period
Daily 1925 in the time series contains a cumulative return since
Monthly 1925 the beginning period.
DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess DATE RANGE AVAILABILITY
Product Types STK Daily 1925
TS_PRINT/TSQUERY USAGE Monthly 1925
Daily ITEMID cumtret DATABASE AVAILABILITY AND PRODUCT TYPES
Monthly ITEMID mcumtret Database Formats CRSPAccess
Header Cumtret Product Types STK
SUBNO 0 TS_PRINT/TSQUERY USAGE
Daily ITEMID cumiret
Monthly ITEMID mcumiret
RETURNS ON INCOME Header Cumiret
Category: Returns SUBNO 0
-99.0 Missing return due to missing price at time t; usually due to when issued, R = regular way, E = ex-distributed, Q =
suspension in trading or trading on unknown exchange. non-leading when issued, X = untracked exchange or
unknown)
DATE RANGE AVAILABILITY
Daily 1925 DATE RANGE AVAILABILITY
For example, a Share Type Code of 14 represents DATABASE AVAILABILITY AND PRODUCT TYPES
ordinary common shares of a closed-end fund. Database Formats CRSPAccess
Product Types STK
DATE RANGE AVAILABILITY
TS_PRINT/TSQUERY USAGE
Daily 1925
Daily ITEMID shrcd
Monthly 1925
Monthly ITEMID mshrcd
DATABASE AVAILABILITY AND PRODUCT TYPES
Header SCL
Database Formats CRSPAccess
SUBNO 2
Product Types STK
TS_PRINT/TSQUERY USAGE
Daily ITEMID shrcd
SHARES OUTSTANDING
Monthly ITEMID mshrcd
Header SC Category: Shares
SUBNO 0
Data Type: Double Precision Floating Point
Stock and Index Data Description Guide
observation: SUBNO 1
Monthly: Spread Between Bid and Ask is the Product Types IND
ts_print Daily Usage n/a
difference between the closing bid and ask quotes for
ts_print Monthly Usage n/a
a security. It is available only when Ask or High Price
ind_print Option(s) n/a
and Bid or Low Price are available and Closing Price
CHAPTER 2: Data Definitions
C USAGE
or Bid/Ask Average is a bid/ask average. If Closing
Object rebal_arr[ ]
Price or Bid/Ask Average is zero and Spread between
Array rebal[ ][ ]
Bid and Ask is negative, the spread represents a Bid or
Element avgstat
Low Price. If Closing Price or Bid/Ask Average is zero
FORTRAN-95 USAGE
and Spread between Bid and Ask is positive, Spread
Type or Subtype rebal_arr()
Between Bid and Ask represents an Ask or High Price.
Member and/or Array rebal(,)
It is set to zero if unavailable. Element avgstat
GENERAL INFORMATION
Primary Concepts Auxiliary Time Series Data
Data Type real number STATISTIC MAXIMUM IDENTIFIER
Unit of Item Mathematical value
Statistic Maximum Identifier is the identifier of the
DATE RANGE AVAILABILITY
entity in a portfolio with the maximum statistic at
Daily n/a
the beginning of a rebalancing period. The identifier
Monthly 1925
Stock and Index Data Description Guide
Daily 1925
Statistic Grouping Code. It is set to zero if unavailable.
Monthly 1925
DATABASE AVAILABILITY AND UTILITY USAGE GENERAL INFORMATION
Element minstat
share type. The Share Code variable should be used to
determine the security’s share type. NASDAQ tickers
T. before 1982 in an issue’s name history are presumed
to represent legitimate trading symbols for that issue
TICKER, END OF PERIOD at some point in time, although these symbols may
Category: Name History be listed out of proper chronological sequence. In
addition, the NASDAQ file ticker symbols provided
Data Type: String
do not necessarily constitute a definitive list of all
Description: An alphabetic symbol assigned to a symbols used throughout an issue’s trading history.
security by an exchange at the end of the period Due to source limitations, the ticker field may be blank
reported. in name histories of NASDAQ securities that stopped
Extended Information trading from the early 1970s through the early 1980s.
Tickers can be reused over time. The combination NYSE tickers prior to July 1962 are blank.
Stock and Index Data Description Guide
NYSE MKT securities must be changed to comply with Category: Name History
the new data. This change provides a field in the CRSP Data Type: Character
database that contains an exact match with symbols
Description: One-character field describing the status
available directly from exchanges and other sources.
of a security at the end of the period. (A = active, H =
halted, S = suspended, X = unknown)
GENERAL INFORMATION
Primary Concepts Header Identification and Summary DATE RANGE AVAILABILITY
Data Daily 1925
Data Type character Monthly 1925
Unit of Item Id DATABASE AVAILABILITY AND PRODUCT TYPES
DATE RANGE AVAILABILITY Database Formats CRSPAccess
Daily 1962 Product Types STK
Monthly 196207 TS_PRINT/TSQUERY USAGE
DATABASE AVAILABILITY AND UTILITY USAGE Daily ITEMID trdstat
Database Formats CRSPAccess Monthly ITEMID mtrdstat
Product Types STK Header Trdstat
ts_print Daily Usage n/a SUBNO 0
ts_print Monthly Usage n/a
ind_print Option(s) /hh, /hn
C USAGE
TRADING STATUS, END OF PREVIOUS
Object header_row
PERIOD
Array header
Element htick Category: Name History
FORTRAN-95 USAGE Data Type: Character
Type or Subtype stkhdr
Member and/or Array n/a
Description: One-character field describing the
Element htick
status of a security at the end of the period preceding
the period reported. (A = active, H = halted, S =
suspended, X = unknown)
DATE RANGE AVAILABILITY DATE RANGE AVAILABILITY
90
Daily 1925 Daily 1925
Monthly 1925 Monthly 1925
DATABASE AVAILABILITY AND PRODUCT TYPES DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess Database Formats CRSPAccess
CHAPTER 2: Data Definitions
Data values, share classes and share type suffixes, at the end
Data Type character
of the period reported. There is no punctuation (no
Unit of Item Code
periods) in the Trading Ticker Symbol. N.B. this field
DATE RANGE AVAILABILITY
includes data starting on 20020102 for NYSE/NYSE
Daily 1925
MKT, and 19821101 for NASDAQ.
Monthly 1925
DATABASE AVAILABILITY AND UTILITY USAGE DATE RANGE AVAILABILITY
Data
Data Type character CODE DESCRIPTION
Unit of Item Id 0 Identifier restriction not applicable
DATE RANGE AVAILABILITY 10 NYSE common excluding foreign, ADRs, REIT, Closed End Funds
Daily 19821101 11 NYSE/NYSE MKT common excluding foreign, ADRs, REIT, Closed End
Monthly 198211 Funds
DATABASE AVAILABILITY AND UTILITY USAGE 12 NYSE/NYSE MKT/The NASDAQ National Market common excluding
foreign, ADRs, REIT, Closed End Funds
Database Formats CRSPAccess
20 NYSE common excluding ADRs
Product Types STK
21 NYSE MKT common excluding ADRs
ts_print Daily Usage n/a
22 NYSE/NYSE MKT common excluding ADRs
ts_print Monthly Usage n/a
23 NASDAQ common excluding ADRs
ind_print Option(s) /hn
24 NYSE/NYSE MKT/NASDAQ common excluding ADRs
C USAGE
30 NYSE common
Object header_row
31 NYSE MKT common
Array header
32 NYSE/NYSE MKT common
Element htsymbol
33 NASDAQ common
FORTRAN-95 USAGE
34 NYSE/NYSE MKT/NASDAQ common
Type or Subtype stkhdr
35 NYSE common excluding ADRs and foreigns
Member and/or Array n/a
36 NYSE MKT common excluding ADRs and foreigns
Element htsymbol
37 NYSE/NYSE MKT common excluding ADRs and foreigns
38 NASDAQ common excluding ADRs and foreigns
TRADING TICKER SYMBOL, MOST RECENT 39 NYSE/NYSE MKT/NASDAQ common excluding ADRs and foreigns
40 Arca common excluding ADRs
Category: Name History
41 Arca common
Data Type: String 42 NYSE/NYSE MKT/NASDAQ/Arca common excluding ADRs
43 NYSE/NYSE MKT/NASDAQ/Arca common
Description: Trading symbol listed by exchanges and
consolidated quote systems, including all temporary
values, share classes and share type suffixes, most
recently known.
GENERAL INFORMATION ts_print Daily Usage n/a
92
Primary Concepts Index Header ts_print Monthly Usage n/a
Data Type integer number ind_print Option(s) /hr
Unit of Item Code C USAGE
DATE RANGE AVAILABILITY Object indhdr_row
CHAPTER 2: Data Definitions
Element univcode
the decimal representation of a 10-digit binary number.
The nth bit of the binary number is 1 if an n in the
first digit of the Share Code is valid in the subset, and
V.
a 0 otherwise.
VALID EXCHANGE CODES IN UNIVERSE
GENERAL INFORMATION
(PARTITION OR INDEX RESTRICTION)
Primary Concepts Index Header
Valid Exchange Codes in Universe is an integer code Data Type integer number
indicating the base exchanges in the universe used to Unit of Item Code
partition an index or to populate the actual index. The DATE RANGE AVAILABILITY
following table lists the base codes used. The sum of Daily 1925
two or more codes indicates all selected exchanges are Monthly 1925
valid. DATABASE AVAILABILITY AND UTILITY USAGE
Database Formats CRSPAccess
CODE DESCRIPTION
Product Types IND
0 No exchange restriction
ts_print Daily Usage n/a
1 NYSE
ts_print Monthly Usage n/a
2 NYSE MKT
ind_print Option(s) /hr
4 NASDAQ Stock Market
C USAGE
8 Arca
Object indhdr_row
Array indhdr
GENERAL INFORMATION
Element partuniv.shrcd.fstdig or induniv.shrcd.
Primary Concepts Index Header
fstdig
Data Type integer number
FORTRAN-95 USAGE
Unit of Item Code
Type or Subtype indhdr
DATE RANGE AVAILABILITY
Member and/or Array partuniv or induniv
Daily 1925
Element fstdig
Monthly 1925
DATABASE AVAILABILITY AND UTILITY USAGE
Database Formats CRSPAccess
Product Types IND
93 VALID INCORPORATION OF SECURITIES GENERAL INFORMATION
IN UNIVERSE (PARTITION OR INDEX Primary Concepts Index Header
RESTRICTION) Data Type integer number
Unit of Item Code
Valid Incorporation of Securities in Universe describes
DATE RANGE AVAILABILITY
the incorporation of companies selected in a subset
CHAPTER 2: Data Definitions
Daily 19820402
universe used to partition an index or in the actual
Monthly 198204
index. The following integer codes are used.
DATABASE AVAILABILITY AND UTILITY USAGE
CODE DESCRIPTION Database Formats CRSPAccess
0 Not applicable or no restriction by country of incorporation Product Types IND
1 Companies incorporated outside of the US are excluded ts_print Daily Usage n/a
ts_print Monthly Usage n/a
GENERAL INFORMATION ind_print Option(s) /hr
Primary Concepts Index Header C USAGE
Data Type integer number Object indhdr_row
Unit of Item Code Array indhdr
DATE RANGE AVAILABILITY Element partuniv.wantnms or induniv.wantnms
Daily 1962 FORTRAN-95 USAGE
Monthly 1925 Type or Subtype indhdr
DATABASE AVAILABILITY AND UTILITY USAGE Member and/or Array partuniv or induniv
Stock and Index Data Description Guide
in the actual index. The NASDAQ National Market DATABASE AVAILABILITY AND UTILITY USAGE
Element secdig within the selected output calendar. For example, the
weekly calendar will average the 5 trading days within
each week.
VALID WHEN-ISSUED SECURITIES IN
UNIVERSE (PARTITION OR INDEX Monthly — Average monthly volume traded within the
RESTRICTION) selected output calendar. For example, the quarterly
Valid When-Issued Securities in Universe is an integer calendar will average the 3 month-end volumes in the
code describing the types of when-issued trading quarter.
allowed in a subset universe used in an index partition Date Range Availability
or in the actual index. The following codes are used: Daily 1925
Monthly 1925
CODE DESCRIPTION
Database Availability and Product Types
0 No when-issued restrictions, or not applicable
Database Formats CRSPAccess
10 Initial when-issued trading is included when available. Ex-distribution
trading is excluded. When-issued trading during reorganizations is Product Types STK
included. ts_print/TsQuery Usage
Stock and Index Data Description Guide
110 Initial when-issued trading is excluded until issue attains regular-way Daily ITEMID volavg
status. Ex-distribution trading is excluded. When-issued trading during Monthly ITEMID mvolavg
reorganizations is included.
Header Volavg
SUBNO 0
GENERAL INFORMATION
Primary Concepts Index Header
Data Type integer number VOLUME, MEDIAN
Unit of Item Code
Category: Volume
DATE RANGE AVAILABILITY
Daily 1925 Data Type: Double Precision Floating Point
Monthly 1925
DATABASE AVAILABILITY AND UTILITY USAGE Description: Daily — Median daily volume traded
Database Formats CRSPAccess within the selected output calendar. For example, the
Product Types IND weekly calendar will select the median value for the 5
ts_print Daily Usage n/a trading days within each week.
ts_print Monthly Usage n/a
ind_print Option(s) /hr Monthly — Median monthly volume traded within the
C USAGE selected output calendar. For example, the quarterly
Object indhdr_row calendar will select the median value for the 3 month-
Array indhdr end volumes in the quarter.
Element partuniv.wantwi or induniv.wantwi
FORTRAN-95 USAGE DATE RANGE AVAILABILITY
Type or Subtype indhdr Daily 1925
Member and/or Array partuniv or induniv Monthly 1925
Element wantwi DATABASE AVAILABILITY AND PRODUCT TYPES
Database Formats CRSPAccess
Product Types STK
TS_PRINT/TSQUERY USAGE
Daily ITEMID volmed
95
Monthly ITEMID mvolmed number of shares they bought and the total number of
Header Volmed shares they sold. The NASDAQ system summed the
SUBNO 0 greater figure (whether buy or sell) from the market
reports to create daily volume figures.
VOLUME, TOTAL
CHAPTER 2: Data Definitions
reported in units of 100, and are not adjusted for splits DATABASE AVAILABILITY AND PRODUCT TYPES
units of one share, for daily data, and on hundred Monthly ITEMID mtvol
shares for monthly data. Up until March, 2014, our Header Vol
On NASDAQ, volumes of after-hours trades are Monthly: Total volume traded within the selected
included in the current day, while the trades or quotes output calendar, adjusted for splits. For example, the
are included the next day. Therefore, it is possible quarterly calendar will sum the 3 months within each
to have bid/ask or missing price quotes paired with quarter.
nonzero volumes. Trades on all exchanges connected
to the consolidated pricing network and all late trades DATE RANGE AVAILABILITY
are included in the volume. There are no volumes Daily 1925
available on NASDAQ prior to November 1, 1982. Monthly 1925
DATABASE AVAILABILITY AND PRODUCT TYPES
Until June 15, 1992, NASDAQ reported volumes Database Formats CRSPAccess
differently on the NASDAQ National Market and Product Types STK
NASDAQ SmallCap Market. On the National Market, TS_PRINT/TSQUERY USAGE
the volume of each transaction was reported by one Daily ITEMID adjvol
party involved in the transaction. On the SmallCap Monthly ITEMID madjvol
Market, all market makers of a security made two Header Adjvol
volume reports at the end of the market day, the total SUBNO 0
96 W.
WEIGHT OF ISSUE
Weight of Issue is the defined weight of an issue within
the index during the range indicated in a list defining
CHAPTER 2: Data Definitions
This area contains formulas and methodologies used to if t > C0 and no split events since t-1,
derive CRSP variables in the stock and index files and C(t) = C(t-1)
generated by the CRSP data utilities. if t > C0 and a split event with factor f since t-1,
C(t) = C(t-1) * f
ADJUSTED DATA if t > C0 and split event change
Price, dividend, shares, and volume data are historically C(t-1)/f
adjusted for split events to make data directly comparable if t < C0 and a split event change
at different times during the history of a security. CRSP C(t+1)*f
provides raw, Unadjusted Data, but data utilities stk_print
and ts_print can be used to generate Adjusted Data. Where factor is typically the Factor to Adjust Price
variable + 1.
An adjustment base date is chosen as the anchor date.
All data on this date are unadjusted, and other data are If there is a gap in trading where possible split events are not
converted based on the split events between the base date known, all adjusted values are set to missing when the gap is
and the time of that data. The adjustment base date is between the observation and the adjustment base date.
usually chosen to be the last available day of trading.
Monthly: If monthly summary data (id or Low Price, Ask or
Split events always include stock splits, stock dividends, High Price, and Volume Traded) are adjusted, the adjustment
and other distributions with price factors such as spin-offs, factor cannot take into account adjustments that take place
stock distributions, and rights. Shares and volumes are only in the middle of the month. Therefore, the result assumes all
adjusted using stock splits and stock dividends. Split events adjustment events occur on the last trading day of the month.
are applied on the Ex-Distribution Date. A more accurate monthly adjusted value can be derived by
adjusting and resummarizing the underlying daily data.
Price and dividend data are adjusted with the calculation:
security return at time t, and I(t) is the index return gn = the Geometric Average Return applicable on
at time t. If the security return R(t) is based on a each subset period n
previous price t’ that is not the previous time period, rc = the cumulative return over the entire period
I(t) is the compounded index return from t’ + 1 to t. n = the number of equal subset periods to average
the return
If an Excess Return is based on associated portfolios,
the Excess Return for a period is
E(t) = R(t)-I(p(t),t) INCOME RETURN
Income Return is the return on the ordinary dividends
where E(t) is the Excess Return at time t, R(t) is
paid to shareholders of a security. It is the ratio of
the security return at time t, p(t) is the portfolio
the amount of ordinary dividends since the end of
assignment of the security at time t, and I(p(t),t) is
the previous period up to and including the end of
the return of that portfolio at time t. If the security
the period of interest to the price at the end of the
return R(t) is based on a previous price t’ that is not
previous period. It is similar to a dividend yield.
Stock and Index Data Description Guide
INDEX COUNT
Index Count is the count in an index for a time period
FACTOR TO ADJUST PRICES IN PERIOD (TS_PRINT
is the number of securities in the portfolio during the
ITEM)
time period. Rules are based on the specific index or
Factor to Adjust Prices in Period is the amount the portfolio methodology. See Total and Used Counts
current price is multiplied by in returns calculations for more details.
so that current and previous prices are on the same
split-adjusted basis. Factor to Adjust Prices in Period INDEX LEVEL
is derived from the Factor to Adjust Price field of Index Level is the value of an investment relative to
distributions with Ex-Distribution Dates after the its value at one fixed point in time. Index Levels allow
previous period and up to and including the current convenient comparison of the relative performance
period. In simple stock splits, Factor to Adjust Prices in of the different portfolios or asset classes. Differences
Period is distribution Factor to Adjust Price plus one. arise when indexes are based on different underlying
databases such as daily and monthly CRSP stock
products.
GEOMETRIC AVERAGE RETURN
The initial date and value are set arbitrarily, but must
A Geometric Average Return is the constant return be consistent if comparing multiple indexes. The Index
applied to each period in a range that would result in Level for any series at any time after the initial point
the compounded return over that range. indicates the value at that time of the initial value
100 invested at the initial point. The Index Level for any INDEX RETURN
series at any time before the initial point, indicates the An Index Return is the change in value of a portfolio
value invested at that time that will result in the initial over some holding period. The return on an index (Rt)
value at the initial point. The Index Level of a series is calculated as the weighted average of the returns for
missing prior to its first available return. Let: the individual securities in the index:
CHAPTER 3: CRSP Calculations
if t-1 < F then It is set to missing- Note: Missing values index would consist of n stocks, with the same dollar
are file format specific. amount invested in each stock.
Defined CRSP indexes use the following initial dates The security returns can be total returns or capital
and levels: appreciation (returns without dividends). This
determines whether the index is a total return index or
CRSP Stock File Indexes a capital appreciation index.
initial level 100.00
initial date December 29, 1972 In an index where the individual components are not
known, but an index level is available from an external
source, such as the Standard & Poor’s 500 Composite
CRSP Cap-Based Portfolios Index, the return Rt is calculated as follows:
initial level 1.00
initial date December 31, 1925
CRSP US Government Treasury and Inflation Rt is the index return for time t
Indexes
It is the index level at time t
initial level 100.00
initial date December 29, 1972 It-1 is the index level at end of the previous period (time
t-1)
Publicly available indexes such as for the S&P 500
Composite and NASDAQ Composite have initial
values set by their creators and differ from the CRSP
initializations.
101 INDEX WEIGHT RETURN
The weight of an index for a time period is the total A Return is the change in the total value of an
market value of the securities in the index at the end of investment in a security over some period of time
the previous trading period. Vt = ∑(wi,t)= ∑(vi,t) where: per dollar of initial investment. Total Return is the
vi,t = pi,t-1* si,t-1 in which: Holding Period Total Return for a sale of a security
CHAPTER 3: CRSP Calculations
average from the Price or Bid/Ask Average variable p(t) = last sale price or closing bid/ask average at
and the applicable shares observation from the Shares time t
Outstanding Observation Array for each calendar
d(t) = dividend amount for t
period to calculate Market Capitalization.
f(t) = factor to adjust price in period t
REBASING INDEX LEVELS p(t’) = last sale price or closing bid/ask average at
It is possible to rebase an index to make index levels of time of last available price < t
two index level series comparable. To rebase an index,
choose a new initial date and value, find the current
index level on the new initial date, and multiply the
levels on all dates by the new initial value divided by
the old initial date index level:
t’ is usually one period before t, but t’ can be up to
ten periods before t if there are no valid prices in the
interval. If there is a trading gap with unknown status
between t and t’, the previous price is considered
invalid.
where:
It = Original Index Level for the series at time t
In daily databases, dividends are reinvested in the
Nt = New Index Level for the series at time t
security on the Ex-Distribution Date. In monthly
D = New Initial Date.
databases, the returns are holding period returns from
ID = Original Index Level for the series on the new
month-end to month-end, not compounded daily
initial date
returns, and dividends are reinvested in the security at
L = New Initial Level.
month-end.
time range, Factor to Adjust Price is 1.0, Factor to window of the above market return
Adjust Prices in Period is 2.0, and Dividend Cash ni is the number of non-missing returns for security i
Amount is 0.0. If a one dollar dividend is the only during the year
distribution event in the time range, both Dividend
Cash Amount and dividend amount are 1.0. where the summations are over t and include all days
on which security i traded, beginning with the first
A series of special return codes specify the reason a trading day of the year and ending with the last trading
return is missing: day of the year. There are two index families based
on Scholes- Williams Beta calculations: NYSE/NYSE
-66.0 Valid current price, but no valid MKT and NASDAQ-only.
previous price; either first price,
unknown exchange between current In the NYSE/NYSE MKT family, only trading prices
and previous price, or more than are considered in the beta calculation, and a security
10 periods between time t and the must have traded half the days in a year to be given
time of the preceding price t’ a non-missing beta for that year. The index used in
Stock and Index Data Description Guide
-77.0 Not trading on the current exchange the calculation is the total returns on the Trade-only
at time t NYSE/NYSE MKT Value-Weighted Market Index.
-88.0 Outside the range of the security’s
price range Betas for the NASDAQ family do not use the standard
-99.0 Missing return due to missing price at Scholes-Williams trade-only data restriction, since
time t most NASDAQ securities were not required to report
transactions until 1992. Removing bid/ask averages
would restrict NASDAQ data to only NASDAQ
SCHOLES-WILLIAMS BETA National Market securities after 1982. NASDAQ
Beta is a statistical measurement of the relationship returns based on bid/ask averages have different
between two time series, and has been used to compare characteristics from trade-based returns, and betas are
security data with benchmark data to measure risk in provided for comparison. NASDAQ betas are based
financial data analysis. CRSP provides annual betas on the total returns on the NASDAQ Value-Weighted
computed using the methods developed by Scholes Market Index.
and Williams (Myron Scholes and Joseph Williams,
“Estimating Betas from Nonsynchronous Data,”
Journal of Financial Economics, vol 5, 1977, 309-327).
where:
The ranges for individual exchange data are listed below. Index Levels of CRSP Market Indexes are set to 100.0 on
The series containing combinations of exchanges begin December 29, 1972.
at the earliest point that data for any of the exchanges is
available. The NYSE/NYSE MKT/NASDAQ/Arca Market Indexes are
available in Daily and Monthly Stock Files. Other exchange
The New York Stock Exchange (NYSE) all series begins combinations are available in the CRSP US Index Database
December 31, 1925 and Security Portfolio Assignment Module.
NYSE MKT all series begins July 2, 1962
The NASDAQ Stock Market (NASDAQ) all series begins B. PUBLISHED S&P 500 AND NASDAQ COMPOSITE
December 14, 1972 INDEX DATA
The Arca Exchange (Arca) all series begins The S&P 500 Composite Index is a value-weighted index
March 8, 2006 created by Standard & Poor’s. Since March 1957, the
index contains 500 securities. Prior to that time the index
NOTE: Quarterly and annual index returns are not
contained 90 securities. These have been combined into a
available for the series including Arca.
single time series. S&P Composite levels are collected from
Daily and monthly index returns are calculated based public sources such as the Dow Jones New Service, the Wall
on daily and monthly security holding period returns Street Journal and the Standard & Poor’s Statistical Service.
respectively. Quarterly and annual frequency index returns
The NASDAQ Composite Index is a value-weighted index
are calculated by compounding monthly index returns.
created by the NASDAQ Stock Market.
A. CRSP MARKET INDEXES
C. CRSP STOCK FILE CAPITALIZATION DECILE CRSP Stock File Risk-Based Decile Indexes are
INDEXES rebalanced each year by ranking the statistics at the
end of the previous year. If there are no data for the
CRSP Stock File Capitalization Decile Indexes are
previous year for an issue but a valid statistic can be
calculated for each of the Stock File Indexes market
calculated for the current year, that statistic is used in
groups. All securities excluding American Depositary
the rankings.
Receipts on a given exchange or combination of
exchanges are ranked according to capitalization and CRSP Beta Deciles are ranked with Portfolio 1
then divided into ten equal parts each rebalancing containing the securities with the largest positive betas
period. and 10 containing securities with the smallest and
most negative.
Stock and Index Data Description Guide
as the breakpoints that are applied to various exchange three sets of Cap-Based Indexes represents one Index
groupings of the universe. Group of index results and one Portfolio Type of
portfolio assignments and statistics. Seventeen Index
Decile results are created for three exchange groups: Series, one for each decile and each composite, are
created for each Portfolio Type.
NYSE only
NYSE and NYSE MKT. NYSE MKT data are added
beginning July 1962 III. CRSP INDEXES FOR THE
NYSE, NYSE MKT and the NASDAQ National S&P 500 UNIVERSE
Market. The NASDAQ National Market data are
added beginning April 1982 CRSP Indexes for the S&P 500® Universe are
standard CRSP Market Indexes derived from CRSP
Individual decile portfolios are created for each Stock Files but include only issues from the CRSP
exchange group, the largest being in decile 1 and stock data that are in the S&P 500® universe.
the smallest in decile 10. In addition to each decile
Stock and Index Data Description Guide
portfolio, returns are calculated for the following: The CRSP Indexes for the S&P 500® series contain
CRSP 1-2, CRSP 3-5, CRSP 6- 8, CRSP 9-10, CRSP value- and equal-weighted returns with and without
6-10 and CRSP 1-10. dividends for a market of stocks in the S&P 500®
universe. Daily and monthly data beginning December
Companies becoming eligible or ineligible during a 25, 1925 are provided. The published S&P 500®
quarter are handled with the following rules: index and returns are also included for comparison.
Previous period market capitalizations are used for For a security to be included in the CRSP indexes for
assigning deciles and weights.
the S&P 500 Universe, it must have a price at the end
Securities added during a quarter are assigned to of the current period, a price at the end of the previous
appropriate portfolios when two consecutive month- period, and it must be a member of the S&P 500
end prices are available. Universe at the end of the current period. See CRSP
When a security’s last price is a month-end price, Market Indexes for the variables calculated and the
its month’s return is included in the portfolios’ methodology used.
quarterly return.
Prior to March, 1957, the index contains 90 issues.
When the month-end price is missing, a replacement
CRSP does not have data for two securities between
month-end value is derived from the delisting return
1925 and 1931 as follows.
including merger terms, regional exchanges, etc.
If the derived replacement month-end price is not Company Name Start Date End Date
available, the last available daily price is used.
INT’L MERCANTILE MARINE PFD 31-dec-1925 22-jul-1929
If an issue becomes ineligible for an index in the STANDARD POWER & LIGHT “B” 06-feb-1930 16-nov-1931
middle of a quarter but is still active, such as after an
exchange change or because the issue is leaving the Due to differences in handling mergers,
NASDAQ National Market, the issue is considered reorganizations, and other major corporate actions,
held until the end of the month and then dropped. CRSP data and the S&P 500® universe do not always
Index Total Returns, Index Capital Appreciation, have a one-to-one mapping. In some cases this results
and Index Income Returns are calculated from a in a short period where CRSP is missing prices or has
value-weighted portfolio of securities in the portfolio multiple prices per company listed by S&P.
each period. Index Levels are calculated for each of
The Count of Securities Used is not always 500 (90
107 prior to March 1957) due to missing prices. Known than ninety days to maturity may be selected. Due to
reasons for missing prices are when-issued trading, the lack of data, the selection process in periods prior
halts, and suspensions. to 1942 is somewhat subjective and the maturities of
the selected issues may deviate more than several days
A. CRSP PORTFOLIOS FOR THE S&P 500
CHAPTER 4: Index Methodologies
from the thirty and ninety day targets. Where bills were
UNIVERSE not available, certificates or notes may have been used.
The CRSP Portfolios for the S&P 500 Universe Exclusions may include:
include an alternate value- and equal-weighted version
of the CRSP indexes for the S&P 500 Universe. The suspicious quotes,
methodology differences are: issues that did not mature on their next coupon
payment data, or
Issues are selected based on membership in the S&P bid quotations that implied negative yields.
500 at the end of the previous period instead of the
end of the current period. Each monthly return is calculated as price change plus
Delisting returns are used to evaluate the value of interest, divided by last month’s price. The returns and
securities that delist before the end of a period they corresponding index values are set to -99 for months
were selected. in which a return cannot be calculated, i.e. if the price
is missing for either this month or last month, or if no
valid issue was available.
IV. CRSP TREASURY AND INFLATION
Stock and Index Data Description Guide
INDEXES The issue chosen for the 30, 20, 10, 7, 5, 2, and 1 year
Fixed Term Index series for a given date was selected
The CRSP US Treasury and Inflation Series (CTI) based on its length to maturity as of the date. The
Files are provided on a monthly frequency. The series returns contained in these series are calculated under
contains returns adapted from the CRSP US Treasury the assumption that the relevant issue is bought one
Fixed Term Index Series, the CRSP Risk Free Rates month prior to the quote date and sold on the date.
File, and the US Government Consumer Price Index.
These derived files offer 10 groups of indexes: 30 The issue chosen for the 90 and 30 day Treasury
year, 20 year, 10 year, 7 year, 5 year, 2 year, 1 year, 90 Bill series on a given date was selected based on its
day, and 30 day target maturity indexes, as well as the length to maturity as of the month immediately prior
Consumer Price Index. to the date. The 90 and 30 day series returns were
calculated on the basis of buying the relevant issue
For fixed-term series with maturities of one year or one month prior to the date and selling it on the
greater, a representative Treasury bond or note for each date. For example, a 90 day bill return is calculated
series is selected. Available issues are filtered on the between a date approximately 90 days prior to the
basis of their characteristics. Each month, the most bill’s maturity, and the date which is a month after
recent non-callable, non-flower, and fully taxable issue this date. Likewise, a 30 day bill return is calculated
closest to the target maturity is selected. If none are between a date approximately 30 days prior to the bill’s
found, a second pass allows flower bonds. Note that maturity, and the date which is a date one month later.
all these series begin in 1941 or 1942 due to the lack of In cases where the date chronologically approached
suitable issues in the early history. or exceeded the maturity date, thereby making a final
price unavailable, the return was calculated based on a
For thirty and ninety day risk-free series, a final price of $100.
representative Treasury bill for each series is selected.
Each month the issue maturing closest to the target The associated index levels of the CRSP US Treasury
duration, as measured from the end of the previous and Inflation Series all have been initialized so that
month, is selected. Bills must have at least thirty days December 29, 1972 (19721229) equals 100. This
to their maturity date to be selected for the thirty day facilitates comparison between the CTI Indexes and
series. However, for ninety day series, bills with less Stock File Indexes.
108 V. CRSP SELECT FILE SPECIFICATIONS move up to the next highest maturity and so forth.
A. LONG TERM BOND SELECTION For the period 1934-1942, always choose a non-flower
bond (iflwr of 1) and preference is given to a bond
1. Select the 20-year bond that is the closest to having
that is partially tax-exempt (itax of 2). If a partially tax-
CHAPTER 4: Index Methodologies
3. If a 25-year bond does not meet the above criteria, Choose the Treasury Bill closest to 90 days to maturity
choose the 30-year bond with at least 19.7 years to on the quote date. A bill can be within 4 days of target
maturity at the beginning of the year. If more than maturity, i.e. 90 days plus or minus 4 days. If a bill is
one exists, choose the bond closest to 20 years to not available use a certificate or a note.
Stock and Index Data Description Guide
The bond is held for one full year in the index. Bonds
chosen for this index are either non-callable or callable
Treasury bonds with a type of Bond or Callable Bond.
A 20-year bond can be selected from a universe of
bonds that were issued as having a term to maturity
of 7305-7693 days, a 25-year bond from an issue of
8766-9892 days, and a 30- year bond from an issue of
10955-11288 days.
Note: *Americus Trust Components are exclusive to First Digit 2 and 17 Philadelphia Stock Exchange
Second Digit 3. 19 Toronto Stock Exchange
20 Over-The-Counter (Non-NASDAQ Dealer Quotations)
Note: **Exchange Traded Funds are exclusive to First Digit 7 and Second
Digit 3. 31 When-Issued Trading On NYSE
32 When-Issued Trading On NYSE MKT
SHARE TYPE - SECOND DIGIT 33 When-Issued Trading On NASDAQ
CODE DEFINITION 34 When-Issued Trading On Arca
0 Securities Which Have Not Been Further Defined
1 Securities Which Need Not Be Further Defined
2 Companies Incorporated Outside The U.S II. DISTRIBUTION CODES
3 Americus Trust Components (Prime, Score, & Units)*, Exchange Traded Funds**
4 Closed-End Funds and Unit Investment Trusts A four-digit code describes distribution events. The first digit
5 Closed-End Fund Companies Incorporated Outside The US describes the distribution in general terms. The second digit
8 REIT’s (Real Estate Investment Trusts) describes the form or method of payment. The meaning of
the third digit varies with the value of the first digit, and
Note: *Americus Trust Components are exclusive to First Digit 2 and
Second Digit 3.
gives a more detailed description of the event. The fourth
digit provides information about the tax status of the
Note: **Exchange Traded Funds are exclusive to First Digit 7 and Second distribution.
Digit 3.
The coding and meanings of the four digits are described
below. For digits 2, 3, and 4, special conventions apply: a
value of “0” implies that CRSP has not yet discovered the
descriptive information for the corresponding digit; a value
of “1” implies sources have been checked and the status
for the corresponding attribute is actually unspecified, not
applicable, or not available for the distribution.
SECOND DIGIT: PAYMENT METHOD Rights Valuation Method (for first digit = 4 only)
CODE MEANING
CODE MEANING
0 unknown, not yet coded
0 unknown, not yet coded
1 Transferable unknown value (no price or assigned value)
1 unspecified or not applicable
2 market value of trading right on exdate
2 cash, United States dollars
3 Fair market value
3 cash, foreign currency converted to US dollars
4 Value at exdate, calculate
4 cash, Canadian dollars (now obsolete, converted to US dollars)
5 Non-transferable fair market value
5 same issue of common stock
6 Non-transferable value at exdate, calculated (based on recdate if
6 units including same issue of common stock
Stock and Index Data Description Guide
exdate is unavailable)
7 an issue of a different common stock which is on the file
7 Non-transferable, unknown value
8 other property
4 return of capital (i.e., gain recognized, loss not) 1222 US cash dividend, monthly, taxable same rate as dividends
5 gain or loss realized compared with cost 1224 US cash dividend, monthly, tax status - return of capital, gain
recognized, loss not
6 realized capital gain (Investment Companies)
1228 US cash dividend, monthly, fully taxable as ordinary income to
7 capital gains tax credit
individuals
8 fully taxable as ordinary income to individuals
1232 US cash dividend, quarterly, taxable same rate as dividends
9 dividend reinvestment plan qualifies for the limited exclusion
1234 US cash dividend, quarterly, tax status - return of capital, gain
provided by Sec. 305(e) of the Internal Revenue Code
recognized, loss not
1238 US cash dividend, quarterly, fully taxable as ordinary income to
Coding convention note for distribution codes with individuals
the fourth digit (tax status) coded as 2 or 8: Until 1986, 1239 US cash dividend, quarterly, tax status - dividend reinvestment plan
distribution codes 2 and 8 were used in conjunction qualifies for the limited exclusion provided by Sec. 305(e) of the
with one another such that the 2 represented the part Internal Revenue Code
of the dividend qualifying for the dividend exclusion 1242 US cash dividend, semi-annual, taxable same rate as dividends
and the 8 representing the part that did not. Since the 1244 US cash dividend, semi-annual, tax status - return of capital, gain
tax reform act of 1986, which eliminated the exclusion, recognized, loss not
these have been coded as 2’s. 1248 US cash dividend, semi-annual, fully taxable as ordinary income to
individuals
Commonly Coded Distribution Events in CRSP 1252 US cash dividend, annual, taxable same rate as dividends
Stock Files 1254 US cash dividend, annual, tax status - return of capital, gain
recognized, loss not
The following table describes some of the most 1258 US cash dividend, annual, fully taxable as ordinary income to
individuals
commonly coded distribution events in the CRSP
1262 US cash dividend, year-end or final, taxable same rate as dividends
stock files. CRSP did not verify the tax status of
1272 US cash dividend, extra or special, taxable same rate as dividends
ordinary dividends in the NYSE/NYSE MKT file after
1274 US cash dividend, extra or special, tax status - return of capital, gain
April, 1987 or in the Supplemental NASDAQ file at
recognized, loss not
any time. Instead, CRSP assigned ordinary dividends
1278 US cash dividend, extra or special, fully taxable as ordinary income to
the default tax code (12*2); that is, US cash dividend, individuals
taxable in the normal way as a dividend. If a dividend 1282 US cash dividend, interim, taxable same rate as dividends
received is in the form of a security which is traded on
117
DIVIDEND LIQUIDATION
CODE DESCRIPTION CODE DESCRIPTION
1292 US cash dividend, non-recurring, or proceeds from sale of rights, 2235 Cash paid in partial liquidation, tax status - return of capital, gain,
taxable same rate as dividends loss realized
CHAPTER 5: Data Coding Schemes
1312 Cash dividend (foreign currency converted to US), unspecified 2243 Cash paid as a step in liquidation, non-taxable
frequency, tax status - unspecified or not applicable 2244 Cash paid as a step in liquidation tax status - return of capital, gain
1318 Cash dividend (foreign currency converted to US), unspecified recognized, loss not
frequency, fully taxable as ordinary income to individuals 2245 Cash paid as a step in liquidation, tax status - return of capital, gain
1332 Cash dividend (foreign currency converted to US), quarterly, taxable or loss realized
same rate as dividends 2255 Cash paid as a final liquidating payment, tax status - return of
1338 Cash dividend (foreign currency converted to US), quarterly; fully capital, gain or loss realized
taxable as ordinary income to individuals 2744 Other issue on file distributed as a step in liquidation, tax status -
1342 Cash dividend (foreign currency converted to US), semi-annual, return of capital, gain recognized, loss not
taxable same rate dividends 2817 Issue not on file distributed as a step in unspecified liquidation
1348 Cash dividend (foreign currency converted to US), semi-annual, fully process, tax status - capital gains tax credit
taxable as ordinary income to individuals 2844 Issue not on file distributed as a step in liquidation, tax status return
1352 Cash dividend (foreign currency converted to US), annual, taxable of capital, gain recognized, loss not
same rate as dividends 2999* Missing liquidation information, tax status - dividend reinvestment
1372 Cash dividend (foreign currency converted to US), extra or special, plan qualifies for the limited exclusion provided by Sec. 305(e) of the
taxable same rate as dividends Internal Revenue Code
1378 Cash dividend (foreign currency converted to US), extra or special,
Stock and Index Data Description Guide
3823 Issue not on file, received in a merger, non-taxable 5773 Initial stock distribution of other class of common, same company,
3824 Issue not on file, received in a merger, tax status - return of capital, which is on the file, non-taxable
gain recognized, loss not 5872 Initial stock distribution in different issue of common, same company,
3825 Issue not on file, received in a merger, tax status - gain or loss realized which is not on file, taxable same rate as dividends
compared with cost 5873 Initial stock distribution in different issue of common, same company,
3852 Issue not on file, received as a non-ordinary distribution in another which does not trade on the file, non-taxable
stock, taxable same rate as dividends
3853 Issue not on file, received as a non-ordinary distribution, non-taxable
OFFER/ISSUANCES
3854 Issue not on file, received as a non-ordinary distribution, tax status -
CODE DESCRIPTION
return of capital, gain recognized, loss not
6235 Common shares increased by merger with company not on file, tax
3862 Issue not on file, received in a reorganization, taxable as dividend
status - gain or loss realized compared with cost
3863 Issue not on file, received in a reorganization, non-taxable
6261 Common shares decreased through a companies own tender offer, tax
3864 Issue not on file, received in a reorganization, tax status - return of status - unknown
capital, gain recognized, loss not
6511 Common shares increased or decreased for reasons not specified
3883 Issue not on file, received in an exchange of stock, non-taxable
6521 Common shares increased by merger with company on file, tax status
3884 Issue not on file, received in an exchange of stock, return of capital - unspecified or not applicable
Stock and Index Data Description Guide
RIGHTS 6571 Common shares increased through company’s own exchange offer, tax
status - unspecified or not applicable
CODE DESCRIPTION
6581 Common shares increased through sale of stock other than rights
4523 Rights to buy more of this security, at market value, non-taxable
issue, tax status - unspecified or not applicable
4533 Rights to buy more of this security at indicated value, non-taxable
7111 Bankruptcy filing (for any reason) tax status - unspecified or not
4563 Rights to buy more of this security, non-transferable value at exdate, applicable
calculated (based on recdate if exdate unavailable), non-taxable
7121 Negative financial performance tax status - unspecified or not
4623 Rights to buy ‘units’ that include this security, non-taxable applicable
4722 Rights to buy another common issue on file, taxable same rate as 7131 External tender offer results in too few shareholders tax status -
dividends unspecified or not applicable
4822 Rights to buy other securities at market value, taxable same rate as 7141 Internal tender offer results in too few shareholders tax status -
dividends unspecified or not applicable
4823 Rights to buy other securities, nontaxable 7151 US government intervention (SEC intervention, other government
4833 Rights to buy other securities at indicated value, non-taxable intervention or request)
4999* Missing rights distribution, tax status - dividend reinvestment plan 7161 Foreign or external intervention (non-US government intervention,
qualifies for the limited exclusion provided by Sec. 305(e) of the foreign non-government intervention, “acts of god”) tax status -
Internal Revenue Code unspecified or not applicable
7171 Company request (any reason except bankruptcy) tax status -
unspecified or not applicable
STOCK
7181 Failure to meet exchange requirements tax status - unspecified or not
CODE DESCRIPTION
applicable
5523 Stock split, non-taxable
5533 Stock dividend, non-taxable *This code alerts the user to information that is not coded, and is
inconsistent with the conventional distribution-coding scheme.
5538 Stock dividend, fully taxable as ordinary income to individuals
119 III. DELISTING CODES
MERGERS
CODE DESCRIPTION
ACTIVE 290 Flags a merger with missing final distribution information. Replaces
CODE DESCRIPTION code 240. Code 240 is no longer assigned.
CHAPTER 5: Data Coding Schemes
202 Merged into or in order to form an issue trading on NYSE MKT. 332 Issue exchanged, primarily for another class of common stock. (Other
stock is not maintained on the CRSP file.)
203 Merged into or in order to form an issue trading on NASDAQ.
333 Issue exchanged, primarily for cash.
205 When merged, shareholders primarily receive shares of mutual funds.
334 Issue exchanged, primarily for preferred stock, or rights, or warrants, or
231 When merged, shareholders primarily receive common stock or ADRs.
debentures, or notes.
Replaces codes 201, 202 and 203. Codes 201-203 are no longer
Stock and Index Data Description Guide
232 When merged, shareholders primarily receive common stock or ADRs. 340* Flags an exchange with missing final distribution information.
(Merged stock is not maintained on the CRSP file.) Replaces codes 210- 341 Flags an exchange, shareholders receive common stock and cash. Issue
220. Codes 210-220 are no longer assigned. on CRSP file.
233 When merged, shareholders receive cash payments. 342 Flags an exchange, shareholders receive common stock and preferred
234 When merged, shareholders primarily receive preferred stock, bundled stock or warrants or rights or debentures or notes. Issue on CRSP file.
units, warrants, or rights, or debentures, or notes, or bundled units. 343 Flags an exchange, shareholders receive common stock and other
235 When merged, shareholders primarily receive other property. property. Issue on CRSP file.
240* Flags merger with missing final distribution information. 350* Flags an exchange attempt that was not sufficient to “kill” issue.
241 When merged, shareholders primarily receive common stock and cash, 351 Flags an exchange, shareholders receive common stock and cash. Issue
issue on CRSP file. not on CRSP file.
242 When merged, shareholders primarily receive common stock and 352 Flags an exchange, shareholders receive common stock and preferred
preferred stock or warrants or rights or debentures or notes, issue on stock, or warrants, or rights, or debentures, or notes. Issue not on CRSP
CRSP file. file.
243 When merged, shareholders primarily receive common stock, issue on 353 Flags an exchange, shareholders receive common stock and other
CRSP file and other property, issue on CRSP file. property. Issue not on CRSP file.
244 When merged, shareholders primarily receive common stock or ADR, 361 When exchanged, sharesholders primarily receive cash and preferred
and cash and preferred stock or warrants or rights or debentures or stock or warrants or rights or debentures or notes.
notes. Issue on CRSP file. 362 When exchanged, shareholders primarily receive cash and other
251 When merged, shareholders primarily receive common stock or ADRs property.
and cash. (Merged stock is not maintained on the CRSP file.) 371 When exchanged, shareholders primarily receive preferred stock or
252 When merged, shareholders primarily receive common stock or ADRs warrants or rights or debentures or notes and other property.
and preferred stock, or warrants, or rights, or debentures, or notes. 390* Flags an unsuccessful exchange attempt with missing distribution
253 When merged, shareholders primarily receive common stock or ADRs information.
and other property.
261 When merged, shareholders primarily receive cash and preferred stock,
LIQUIDATIONS
or warrants, or rights, or debentures, or notes.
CODE DESCRIPTION
262 When merged, shareholders primarily receive cash and other property.
400 Issue stopped trading as result of company liquidation.
271 When merged, shareholders primarily receive preferred stock or
warrants, or rights, or debentures, or notes and other property. 401 Issue liquidated, for issue trading on NYSE.
280 Issue delisted due to merger attempt, but merger attempt failed. 403 Issue liquidated for issue trading on NASDAQ.
120
LIQUIDATIONS DROPPED
CODE DESCRIPTION CODE DESCRIPTION
450 Issue liquidated, final distribution verified, issue closed to further 583 Delisted by current exchange - denied temporary exception
research. requirement.
CHAPTER 5: Data Coding Schemes
460 Issue liquidated, no final distribution is verified, issue closed to further 584 Delisted by current exchange - does not meet exchange’s financial
research. guidelines for continued listing.
470 Issue liquidated, no final distribution is verified, issue pending further 585 Delisted by current exchange - protection of investors and the public
research. interest.
480 Issue liquidated, no distribution information is available, issue is 586 Delisted by current exchange - composition of unit is not acceptable.
pending further research. 587 Delisted by current exchange - corporate governance violation.
490 Issue liquidated, no distributions are to be paid, issue closed to further 588 Conversion of a closed-end investment company to an open-end
research. investment company.
589 Delisted by current exchange - unlisted trading privileges
503 Issue stopped trading current exchange - to NASDAQ. 600 Expired warrant or right
Stock and Index Data Description Guide
505 Issue stopped trading current exchange - to Mutual Funds. 601 Warrants, rights, preferreds, or units called for redemption
510 Issue stopped trading current exchange - to Boston Exchange. 610 Unit split into its component parts
2 Trading with only one market maker 43 IXBKR - Nasdaq SCM Bank
1 The NASDAQ SmallCap Market before June 15, 1992 53 NTTR - NQ100 Tech Sec Tot Ret In
2 The NASDAQ National Market 54 NXTR - NQ100 Ex-Tech Sec Tot Ret
3 The NASDAQ SmallCap Market after June 15, 1992 55 BIXX - BetterInvesting 100 Index
62 IXF - Nasdaq Financial 100
NASDAQ INDEX CODE 63 NDXL - NQ 100 Double Leverage IN
20 DIVQ - NQ Div. Acheivers Index 86 QCLX - NQ OMX Glob Coal Tot Ret
22 IXIDN - Nasdaq NNM Industrial 88 QSTS - NQ OMX Glob Steel Tot Ret
24 IXFNN - Nasdaq NNM Other Finance 90 QGLX - NQ OMX Glob Gold&Prec Tot
25 IXISN - Nasdaq NNM Insurance 91 CDGR - Copeland Risk Man Div GTR
28 IXTCN - Nasdaq NNM Telecommunicat 92 ABQI - NQ OMX ABA Comm Bank Indx
29 IXCON - Nasdaq NNM Computer 93 ABQX - NQ OMX ABA Com Bank IndxR
31 IXCMN - Nasdaq NNM Composite 95 CDGX - Copeland Risk Man Div Gro
32 DTEC - NQ Dallas Regional Chambe 96 XABQ - ABA NQ COM BANK TOT RET
33 DVQT - NQ Div. Achievers Tot Ret 97 QCRX - NQ OMX CRD GL SUS TOT RET
34 NDXT - NQ-100 Tech. Sector Ind. 98 QCRD - NQ OMX CRD GLOB SUST 50
99 XCMP - NQ COMP TOT RET
122 V. MISSING RETURN CODES
information needed to place the observations in time. Data Description Object Type Code objtype
Information Array Type Code arrtype
Data Description Object Type Code objtype Array Structure Size size_of_ array_width
Information Array Type Code arrtype Object Array Object Array arr
maintained in the file. There is only one header structure, contains name and classification fields and
structure per issue for any data iteration. Note that the effective date ranges of those fields. Each security
Ticker Symbol - Header only contains tickers for active has at least one name structure.
securities.
Variable Name Variable
Variable Name Variable Secondary Identifiers CUSIP ncusip
Primary Permanent PERMCO permco North American Industry Classification naics
Identifiers PERMNO permno System (NAICS) Code
Secondary Permanent CUSIP - Header hcusip Ticker Symbol ticker
Identifiers NASDAQ Company Number compno Standard Industrial Classification siccd
NASDAQ Issue Number issuno (SIC) Code
Security Date Ranges Begin of Stock Data begdt Date Range of Name Name Effective Date namedt
History Record Last Date of Name 1
nameenddt
End of Stock Data enddt
Most Recent Header Company Name - Header hcomnam Identifying Information Company Name comnam
Identification and Convertible Code - Header hconvcd Convertible Code convcd
Summary Data
Stock and Index Data Description Guide
Interest Rate or Strike Price hrating Interest Rate or Stricke Price rating
Intermarket Trading System Indicator hits Intermarket Trading System Indicator its
- Header Issuer Code issuercd
Issue Description - Header hnamedesc Name Code namecd
Issuer Code - Header hissuercd Name Description namedesc
Name Code - Header hnamecd Name Flag nameflag
Name Description - Header hnamedesc Primary Exchange primexch
Name Flag - Header hnameflag Security Status secstat
North American Industry Classification hnaics Share Class shrcls
Code - Header Share Code shrcd
Primary Exchange - Header hprimexch Share Type shrtype
Security Status - Header hsecstat Sub-exchange subexch
Share Code - Header hshrcd Trading Denomination denom
Share Type - Header hshrtype Trading Status trdstat
Standard Industrial Classification hsiccd Trading Ticker Symbol tsymbol
(SIC) Code - Header
Sub-exchange - Header hsubexch CRSPAccess only
1
information on trading status and location for a given Liquidation payments - All partial and final
date range. liquidation payments are included. These contain
the value of each payment and relevant dates that
C. DISTRIBUTION EVENT ARRAY - DISTS are known. If the payment is in the form of stock,
The Distribution Event Array is a list of events or if a payment is known to come from the purchase
describing cash dividends, capital adjustments, and of assets by a known company, the Acquiring
other distributions made to shareholders of a security. PERMNO and Acquiring PERMCO are set to that
company or issue.
Variable Name Variable Return of capital distributions.
Distribution Information Distribution Code DISTCD Rights offerings.
Dividend Cash Amount DIVAMT Merger, acquisition, and reorganization
Factors to Adjust Prices Factor to Adjust Price FACPR distributions.
and Shares Factor to Adjust Shares Outstanding FACSHR Limited tender offers.
Dates Associated with Distribution Declaration Date DCLRDT Information on announcements related to
the Distribution Ex-Distribution Date EXDT liquidations and tender offers that resulted in
Stock and Index Data Description Guide
Periodic and special cash dividends - the cash Variable Name Variable
amount in US dollars, frequency, and related dates Shares Information Shares Outstanding SHROUT
factors to adjust price and shares, type of action, and Share Observation Date Shares Outstanding SHRSENDDT
Range Observation End Date1
related dates of all splits are provided.
Shares Outstanding SHRSDT
Spin-offs - All spin-off events are included. The cash
Observation Date
value of the spin-off is the price at the end of the ex-
distribution date of the stock received. A price factor 1
CRSPAccess data access only.
is calculated by dividing the cash amount by the
126 There are two types of Shares Outstanding Variable Name Variable
Observations: Delist Information Amount After Delisting dlamt
Delisting Code dlstcd
1. Primary shares observations contain a shares Delisting Price dlprc
CHAPTER 6: Database Structures
Exactly one shares structure is effective each date in the In current CRSP files only the most recent delisting
security’s history. One shares outstanding observation event is coded in the Delisting Event Array. If an issue
is effective until the next observation or the delisting leaves an exchange in the CRSP data files and later
date. The first shares observation is effective from the returns, the gap is marked in the Name History Array
Stock and Index Data Description Guide
Shares Observation Date backward to the beginning of with an Exchange Code of 0. During this time, event
data. data are not tracked and time series data are filled in
with missing values.
The Shares Outstanding Observations Array cannot
be used to directly find the shares outstanding each Delisting information is determined by several
calendar period. Utility functions and programs are factors: the exchange of shares at the earliest possible
available to map observations to time series used to opportunity, trade on a secondary market, payments
calculate market capitalization. from the company, or outstanding tender offer. The
information is coded as it becomes available. An issue
E. DELISTING EVENT ARRAY - DELIST is considered closed to further research if any of the
Every security on the CRSP file is assigned one following conditions apply:
delist record. The Delisting Event Array contains
information on the status of a security after it is no Research has verified that a final distribution has
longer listed on an exchange in a CRSP file. Each been paid to stockholders.
delisting history event contains a code describing A price is found on another exchange.
the reason for delisting, the value after delisting Research has verified that no distributions were ever
(when available), forward links to acquiring issue and paid to stockholders.
company traded on NYSE, NYSE MKT, NASDAQ, Some distributions have been paid to stockholders,
or Arca and delisting return. Active issues have a but no final distribution information can be found
delisting history event where Delisting Date is set to and 10 years have passed since the date of the most
the last date of available price data. The Distribution recent delisting information.
History Array includes itemized data on the payments No information concerning the delisting can be
made to shareholders after the delisting, and includes found and 10 years have passed since the delist date.
announcement information related to the delisting If none of these conditions applies to a delisted issue,
when available. the issue is pending, which means that further research
is required until one of the above conditions has been
met. If no information is found or the information
found is incomplete, no delisting return will be
calculated by CRSP.
127 Monthly: If no delisting information exists, and G. PRICE, VOLUME, AND RETURN TIME
daily data exist after the last month-end trading date, SERIES ARRAYS
CRSP generates partial-month delisting amounts and Price, Volume, and Return Time Series Arrays are
returns by using the price on the last daily trading date. a set of time series that makes up the core of CRSP
CHAPTER 6: Database Structures
Although the partial month returns are stored in the stock data. This includes three price time series, total
Delisting Return field, they are not delisting returns. returns, and trading volumes. All these time series in a
stock file use the same calendar(s).
F. NASDAQ INFORMATION ARRAY - NASDIN
The NASDAQ Information Array contains a history Variable Name Variable
of an issue’s trading status on The NASDAQ Stock Price, Volume, and Return Time Ask or High Price askhi
Market. Each set of information, or structure, contains Series Data Bid or Low Price bidlo
status and classification fields and the effective date Holding Period Total Return ret
ranges of those fields. If the NASDAQ Traits Code, Price or Bid/Ask Average prc
NASDAQ National Market Indicator, NASD Index Volume Traded vol
Code, or Market Makers Count changes, then a
new structure is added, and the date of the change H. AUXILIARY TIME SERIES DATA
is recorded in the NASDAQ Traits Date. Each issue Auxiliary Data Time Series are additional time series
traded on the NASDAQ Stock Market since November provided in CRSPAccess stock files at the same
1982 has at least one NASDAQ Information Array. frequency as the Price, Volume, and Return Time
Stock and Index Data Description Guide
Series Arrays.
Variable Name Variable
NASDAQ Information Market Maker Count mmcnt Variable Name Variable
Array Data NASD Index Code nsdinx Monthly Alternate Price Price Alternate (monthly only) altprc
NASDAQ National Market Indicator nmsind and Date, Daily Open Price Alternate Date (monthly only) altprcdt
Price, Returns Without
NASDAQ Traits Code trtscd Return Without Dividends retx
Dividends, Spread Time
NASDAQ Information NASDAQ Traits Date trtsdt Spread Between Bid and Ask spread
Series Data
Date Range NASDAQ Traits End Date trtsenddt (monthly only)
Open Price (daily only) openprc
NASDAQ information structures are available for securities trading on NASDAQ
Ask ask
beginning on April 1, 1982 for NASDAQ Traits Date and NASDAQ National Market
Indicator. All fields are available beginning November 1, 1982. 1 Bid bid
NASDAQ Number of Trades (daily numtrd
1
NASDAQ information data are missing in December, 1982 for all issues with NASD only)
company numbers less than 1025 (approximately 20 percent of the securities active
at that time), and are missing in February, 1986 for all issues. NASDAQ Traits Date, I. PORTFOLIO STATISTICS AND ASSIGNMENT
NASDAQ Traits Code, and NASDAQ National Market Indicator are complete. All other
TIME SERIES - PORT
fields are missing.
Portfolio Statistics and Assignment Time Series is
The NASDAQ National Market was initiated in April a set of portfolio time series. Each portfolio time
1982 for larger and generally more actively traded series is based on a portfolio type defined by CRSP
NASDAQ securities. The NASDAQ National Market and contains a history of statistics and portfolio
Securities must meet higher financial and non- assignments for a security. Two variables are available
financial criteria than other NASDAQ stocks, and were for each calendar period:
always subject to last-sale reporting. In June of 1992
the regular NASDAQ segment of the NASDAQ Stock Variable Name Variable
Market was renamed The NASDAQ SmallCap Market Portfolio Statistic and Portfolio Assignment port
Assignment Times Series Number
and for the first time these issues became subject to
Data Portfolio Statistic Value stat
real-time price and volume reporting.
128 Each Portfolio Statistics and Assignment Time Series Variable Name Variable
in the set is called a Portfolio Type. Portfolio Types Group Data Group Flag of Associated grpflag
are predefined groupings based on CRSP indexes. Index
The portfolio time series can be linked to CRSP index Group Secondary Flag grpsubflag
CHAPTER 6: Database Structures
returns data to calculate excess returns of a security Group Date Range Begin of Group Data grpdt
against its assigned index portfolio at any time during End of Group Data grpenddt
its history.
The only grouptype currently available is 16 - S&P
Each Portfolio Type represents a predefined index 500 Universe. Only inclusion events are added to this
group with its own methodology and rebalancing group, so Group Flag of Associated Index is always 1.
period. The portfolio time series can be linked to Begin of Group Data and End of Group Data identify
different calendars based on the rebalancing frequency the range the security was included in the S&P 500
of the index. The timing and calculation of the index. The user must subscribe to the appropriate
statistic and assignment rules are also dependent on stock and index databases to extract group data.
the index. Calendars used in portfolios are not the
same calendars used with security price and returns
data. Portfolio ranges and calendars can differ for III. INDEX DATA STRUCTURES
all portfolio types. In a portfolio time series, the
Data Subtype Code is set to the Permanent Index A. INDEX HEADER - INDHDR
Identification Number, INDNO, of an index that
Stock and Index Data Description Guide
Methodology Description Index Method Type Code methcode Share Code Screen Share Code Groupings sccode
Structure Index Primary primtype Structure for Subsets in a
Methodology Type Restriction
2. Index Exception Handling Flags – flags - Expansion 5. Portfolio Building Rules Structure - rules -
Expansion
Variable Name Variable
Exception Handling Flag Index Basic Exception flagcode Variable Name Variable
Structure Types Code Portfolio Building Rules Index Basic Rule Types rulecode
Stock and Index Data Description Guide
level historical rebalancing statistical information for Date Range First Date Included begdt
in List
rebalancing periods in an index. Each event array
Last Date Included in enddt
structure within the history contains the characteristics
List
for one portfolio for one time range in the index,
Security Characteristics Index Subcategory Code subind
including the breakpoints used to assign securities to
Weight of Issue weight
the portfolio.
Statistic Minimum in minstat Index Levels Index Total Return Index tind
Period Level