FRB Dallas Home » Research & Data »Basics » Indexing Data to a Common Starting Point
 
 

Research & Data
DataBasics

Indexing Data to a Common Starting Point

How to index any economic data series to a common starting point to facilitate the comparison of numeric data.

The Economic Problem

Indexing Is Kind of Like a Race
That a racehorse can run is relatively uninteresting. Of more intrigue to bookies and bettors is that a given racehorse can run relatively faster than another. Few would come to watch randomly placed horses gallop around a track, each starting and stopping at will and each with its own finish line. It’s the comparison of competing horses and subsequent ranking that make a race compelling.

To create a fair comparison, track officials normalize the beginning point with a start gate, release all horses at the same time and use precision measuring instruments to determine a winner. Clearly, some racehorses are faster and stronger than others. But without a common starting point, any determination of physical supremacy would be dubious.

A similar case holds true with economic data. Economists like to compare data. They do so to gain perspective and to put things in context. For instance, knowing that a state’s employment is growing over time is useful. But knowing its growth rate relative to other states is more telling. For example, a state’s rate of employment change, though positive, could be the weakest of the 50 states in a sample.

Start Data at the Same Point
A relatively simple way to make such comparisons is by indexing data to a common starting point. In effect, the variables in question must be set equal to each other and then examined over time for differences. Indexed data are handy because they allow an observer to quickly determine rates of growth by looking at a chart's vertical axis. They also allow for comparison of variables with different magnitudes.

Indexing Enables Comparison of Data of Any Magnitude
For example, suppose an analyst wants to use a graph to compare the gross domestic product (GDP) of three different countries. Drawing such a chart with absolute values would be difficult because of the size disparity between countries. One country’s GDP might register in the trillions, another in the hundreds of billions and the other in the tens of billions. All these amounts wouldn’t fit well on the chart.

As another example, Chart 1 shows how dissimilar magnitudes in quarterly employment levels in Texas and the United States make for difficult graphical interpretation.

Chart 1
Chart 1: U.S. and Texas employment levels (dissimilar magnitudes)

Indexing numerical data is useful in a variety of contexts. It shows up all the time in economic, financial and business analysis. Equity traders index stock prices and stock indices to compare performance over time. Economists index data to prominent events—say economic peaks (or troughs)—to see how the data decline (or rise) relative to each other. In all cases, it allows for quick comparison and ranking.

The Technical Solution

Indexing Mechanics
To index numerical data, values must be adjusted so they are equal to each other in a given starting time period. By convention, this value is usually 100. From there on, every value is normalized to the start value, maintaining the same percentage changes as in the nonindexed series. Subsequent values are calculated so that percent changes in the indexed series are the same as in the nonindexed.

Consider the data in Table 1. Variables X and Y represent hypothetical data series. On average variable Y is one order of magnitude larger than variable X. To index the two series, apply the following equation to the raw data:

X hat sub t equals the ratio of x sub t and x sub 0 multiplied by 100

where X sub tis the raw data value in a given time period from t = 1990, 1991â?¦2003, X sub 0 is the data value in the initial time period, 1990 and X hat sub t is the new indexed value of the variable.

Table 1        
Indexing Two Data Series
Year
X
Y
Indexed value of X
Indexed value of Y
1990
250
2000
100
100
1991
500
3000
200
150
1992
810
6000
324
300
1993
925
6500
370
325
1994
1010
6500
404
325
1995
1052
7100
421
355
1996
1030
7300
412
365
1997
1240
7600
496
380
1998
1470
7800
588
390
1999
1500
8300
600
415
2000
1525
9200
610
460
2001
1580
9900
632
495
2002
1740
10,200
696
510
2003
1890
9800
756
490

 

Between 1990 and 1991, variable X increased from 250 to 500, or 100 percent. Consequently, the indexed value of X must also increase 100 percent, from 100 to 200. Similarly, Y increased 50 percent between 1990 and 1991. Thus the indexed value of Y increased 50 percent, from 100 to 150, over the same time period.

Indexing allows you to quickly gauge percentage changes between the initial time period and any subsequent time period. For example, between 1990 and 2003, variables X and Y increased 656 and 390 percent, respectively.

Real-World Example

Applying the Technique to Texas and U.S. Employment
Indexing improves the ability to analyze changes in data over a specified time period. In the example of the U.S. and Texas employment levels, it was difficult to see how job growth in Texas compared with job growth at the national level. However, such a comparison is possible with indexed data.

The Calculations
In Table 2, each value in the U.S. column is divided by 121,744 and multiplied by 100 to arrive at an indexed value. Likewise, each value in the Texas column is divided by 8,501 and multiplied by 100.

Table 2        
Indexing Texas and U.S. Employment Data
Period
U.S.
Texas
U.S. indexed
Texas indexed
Q1_1997
121,744
8,501
100.0
100.0
Q2_1997
122,537
8,600
100.7
101.2
Q3_1997
123,358
8,694
101.3
102.3
Q4_1997
124,270
8,763
102.1
103.1
Q1_1998
124,903
8,847
102.6
104.1
Q2_1998
125,756
8,924
103.3
105.0
Q3_1998
126,492
9,010
103.9
106.0
Q4_1998
127,297
9,076
104.6
106.8
Q1_1999
128,006
9,112
105.1
107.2
Q2_1999
128,721
9,142
105.7
107.5
Q3_1999
129,448
9,208
106.3
108.3
Q4_1999
130,406
9,273
107.1
109.1
Q1_2000
131,397
9,367
107.9
110.2
Q2_2000
131,925
9,426
108.4
110.9
Q3_2000
132,023
9,494
108.4
111.7
Q4_2000
132,319
9,531
108.7
112.1
Q1_2001
132,461
9,560
108.8
112.5
Q2_2001
132,108
9,538
108.5
112.2
Q3_2001
131,819
9,484
108.3
111.6
Q4_2001
130,890
9,432
107.5
110.9
Q1_2002
130,701
9,461
107.4
111.3
Q2_2002
130,736
9,458
107.4
111.2

Texas Grew Faster than the U.S. over the Study Period
Chart 2 illustrates the effect of indexing the two data series. Between 1997 and mid-2002, employment in Texas has grown at a slightly higher rate than that of the nation. There is, however, a downside to indexing data. For example, Chart 2 does not show how many jobs there are in Texas and the nation respectively. To solve this, chart makers will often insert the last numerical value in a time series on the chart.

Chart 2
Chart 2: U. S. and Texas employment

Conclusion

The indexing methodology can be used with various types of economic data. It can be an effective means of normalizing data to a common starting point and observing how variables change over time relative to each other. It is a common method used by economists and businesspeople to enhance perspective and understanding of economic trends.

Glossary at a Glance

Indexing: Modifying two or more numeric data series so that the resulting series start at the same value and change at the same rate as the unmodified series.

 

Federal Reserve Bank of Dallas Seal
Federal Reserve Bank of Dallas

2200 N. Pearl St., Dallas, Texas 75201 | 214.922.6000 or 800.333.4460
Disclaimer / Privacy Policy