Cover: A Million Random Digits with 100,000 Normal Deviates

A Million Random Digits with 100,000 Normal Deviates

Download

Download eBook for Free

Foreword, Acknowledgement & Introduction

FormatFile SizeNotes
PDF file 1.6 MB

Use Adobe Acrobat Reader version 10 or higher for the best experience.

Table of Random Digits

FormatFile SizeNotes
PDF file 34.8 MB

Use Adobe Acrobat Reader version 10 or higher for the best experience.

Table of Random Normal Deviates

FormatFile SizeNotes
PDF file 12.6 MB

Use Adobe Acrobat Reader version 10 or higher for the best experience.

Errata

FormatFile SizeNotes
PDF file 0.1 MB

Use Adobe Acrobat Reader version 10 or higher for the best experience.

Download Support Files

Datafile: A Million Random Digits

This file contains the Table of Random Digits in plain text form.

FormatFile SizeNotes
zip file 0.7 MB

The file(s) provided above are ZIP-formatted archives, which most modern systems can natively unpack. If your computer does not unpack the archive when you double-click it, you may need to use a separate decompression program such as UnZip.

Datafile: 100,000 Normal Deviates

This file contains the Table of Random Normal Deviates in plain text form.

FormatFile SizeNotes
zip file 0.3 MB

The file(s) provided above are ZIP-formatted archives, which most modern systems can natively unpack. If your computer does not unpack the archive when you double-click it, you may need to use a separate decompression program such as UnZip.

Purchase

Purchase Print Copy

 FormatList Price Price
Add to Cart Paperback628 pages $68.00 $54.40 20% Web Discount

Foreword to the Online Edition

This book was a product of RAND's computing power (and patience). The tables of random numbers in the book have become a standard reference in engineering and econometrics textbooks and have been widely used in gaming and simulations that employ Monte Carlo trials. Still the largest known source of random digits and normal deviates, the work is routinely used by statisticians, physicists, polltakers, market analysts, lottery administrators, and quality control engineers.

A humorous sidelight: The New York Public Library originally indexed this book under the heading "Psychology."

Acknowledgments

The following persons participated in the production, testing, and preparation for publication of the tables of random digits and random normal deviates: Paul Armer, E. C. Bower, Mrs. Bernice Brown, G. W. Brown, Walter Frantz, J. J. Goodpasture, W. F. Gunning, Cecil Hastings, Olaf Helmer, M. L. Juncosa, J. D. Madden, A. M. Mood, R. T. Nash, J. D. Williams. These tables were prepared in connection with analyses done for the United States Air Force.

Introduction

Early in the course of research at The RAND Corporation a demand arose for random numbers; these were needed to solve problems of various kinds by experimental probability procedures, which have come to be called Monte Carlo methods. Many of the applications required a large supply of random digits or normal deviates of high quality, and the tables presented here were produced to meet those requirements. The numbers have been used extensively by research workers at RAND, and by many others, in the solution of a wide range of problems during the past seven years.

One distinguishing feature of the digit table is its size. On numerous RAND problems the largest existing table of Kendall and Smith, 1939, would have had to be used many times over, with the consequent dangers of introducing unwanted correlations. The feasibility of working with as large a table as the present one resulted from developments in computing machinery which made possible the solving of very complicated distribution problems in a reasonable time by Monte Carlo methods. The tables were constructed primarily for use with punched card machines. With the high-speed electronic computers recently developed, the storage of such tables is usually not practical and, in fact, much larger tables than the present one are often required; these machines have caused research workers to turn to pseudo-random numbers which are computed by simple arithmetic processes directly by the machine as needed. These developments are summarized in Juncosa, 1953; Meyer, Gephart, and Rasmussen, 1954; and Moshman, 1954, where other references may be found. The Monte Carlo Method, 1951; Curtiss, 1949; Kahn and Marshall, 1953; and Kahn, 1956, discuss the uses and applications of the Monte Carlo methods and give references to other applications.

Production of the Random Digits

The random digits in this book were produced by rerandomization of a basic table generated by an electronic roulette wheel. Briefly, a random frequency pulse source, providing on the average about 100,000 pulses per second, was gated about once per second by a constant frequency pulse. Pulse standardization circuits passed the pulses through a 5-place binary counter. In principle the machine was a 32-place roulette wheel which made, on the average, about 3000 revolutions per trial and produced one number per second. A binary-to-decimal converter was used which converted 20 of the 32 numbers (the other twelve were discarded) and retained only the final digit of two-digit numbers; this final digit was fed into an IBM punch to produce finally a punched card table of random digits.

Production from the original machine showed statistically significant biases, and the engineers had to make several modifications and refinements of the circuits before production of apparently satisfactory numbers was achieved. The basic table of a million digits was then produced during May and June of 1947. This table was subjected to fairly exhaustive tests and it was found that it still contained small but statistically significant biases. For example, the following table[1] shows the results of three tests (described later) on two blocks of 125,000 digits:

  Block 1 Block 2
  χ2 Probability χ2 Probability
Frequency (9 d.f.a) 6.0 .74 21.0 .02
Odd-even (1 d.f) 3.0 .09 7.0 <.0l
Serial (81 d.f.) 78.7 .55 105.6 .03

aThe letters "d.f." (degrees of freedom) identify a parameter associated with the test. A discussion of the test may be found in any textbook on statistics.

Block 1 was produced immediately after a careful tune-up of the machine; Block 2 was produced after one month of continuous operation without adjustment. Apparently the machine had been running down despite the fact that periodic electronic checks indicated it had remained in good order.

The table was regarded as reasonably satisfactory because the deviations from expectations in the various tests were all very small—the largest being less than 2 per cent—and no further effort was made to generate better numbers with the machine. However, the table was transformed by adding pairs of digits modulo 10 in order to improve the distribution of the digits. There were 20,000 punched cards with 50 digits per card; each digit on a given card was added modulo 10 to the corresponding digit of the preceding card to yield a rerandomized digit. It is this transformed table which is published here and which is the subject of the tests described below.

The transformation was expected to, and did, improve the distribution in view of a limit theorem to the effect that sums of random variables modulo 1 have the uniform distribution over the unit interval as their limiting distribution. (See Horton and Smith, 1949, for a version of this theorem for discrete variates.)

These tables were reproduced by photo-offset from pages printed by the IBM model 856 Cardatype. Because of the very nature of the tables, it did not seem necessary to proofread every page of the final manuscript in order to catch random errors of the Cardatype. All pages were scanned for systematic errors, every twentieth page was proofread (starting with page 10 for both the digits and deviates), and every fortieth page (starting with page 5 for both the digits and deviates) was summed and the totals checked against sums obtained from the cards.[2]

Tests on the Random Digits

Frequency Tests. The table was divided into 1000 blocks of 1000 digits each and the frequency of each digit was recorded for each block. Then for each block a goodness-of-fit χ2 was computed with 9 d.f. These 1000 values of χ2 provided an empirical fit to the χ2 distribution (with 9 d.f.); to test the fit, a goodness-of-fit χ2 was computer using 50 class intervals, each of which was expected to contain 2 per cent of the values. (The number of intervals was chosen in accordance with the result of Wald and Mann, 1942.) The value of the test χ2 was 54.6 which, for 49 d.f., corresponded to about the 0.45 probability level.

To examine further the frequencies, the digits were tallied in 20 blocks of 50,000 digits each. The results are shown in Table 1 together with the goodness-of-fit χ2 for each block. On the total frequencies the χ2 (13.316) for 9 d.f. has been partitioned into three components as follows:

χ2 d.f. Probability
Odd versus even digits 1.37 1 0.25
Within groups of odd digits 7.90 4 0.10
Within groups of even digits 4.04 4 0.40
Block No. 0 1 2 3 4 5 6 7 8 9 χ2
149235013491649515109499350555080498649747.556
2487049565080509750665034490249745012500910.132
350655014503450574902506149424946496050196.078
4500950534966489150314895503750625170488615.004
5503349825180507448924992501150054959487213.846
649764993493250394965503449434932511650707.076
7501151524990504749745107486949255023490214.116
8500350925163493650205069491449434914494613.051
9486048995138495950895047503050395002493713.410
1049984957496451244909499550534946499550597.212
1149485048504150775051500450244886491750047.142
1249584993506449875041498449914987511348826.992
1349684961502950385022502350104988493650252.162
14511049235025497550955051503549624942488210.172
15509449624945489150145002503850235179485216.261
1649575035505150215036492750224988491050534.856
1750884989504249484999502850374893500449725.347
1849705034499650085049501649544989497050141.625
1949984981498451074874498050575020497850216.584
2049635013510150844956497250184971502149016.584
Total9980210005010064110031110009410021499942995591001079928013.316

Of the 200 frequencies recorded in Table 1, 59 (29.5 per cent) deviate from 5000 by more than σ (= 30√5 = 67.08), and 8 (4 per cent) deviate from 5000 by more than 2σ. Of the twenty χ2 values in Table 1, eight exceed the 50 per cent value (8.34), two fall below the 10 per cent value (4.17), and two exceed the 90 per cent value (14.7).

Poker Tests.Sets of 5 digits in blocks of 5000 digits were taken to be poker hands and were classified as:

Class Symbol Expected Frequency Per Block
Bustsabcde302.4
Pairsaabcd504
Two pairsaabbc108
Threesaaabc72
Full houseaaabb9
Foursaaaab{4.5}
Fiveaaaaa{0.1}

There were 200 sets of 1000 poker hands in the table, and for each set a goodness-of-fit χ2 was computed with 5 d.f. (the fours and fives were combined). The manner in which these 200 values fit the χ2 distribution is shown in Table 2.

Table 2. Distribution of Chi-square Values

Probability Values of χ2 Expected Frequency Observed Frequency
P > .90 0 – 1.60 20 22
.90 >P > .80 1.61 – 2.35 20 19
.80 > P > .70 2.36 – 3.00 20 22
.70 > P > .60 3.01 – 3.70 20 19
.60 > P > .50 3.71 – 4.35 20 20
.50 > P > .40 4.36 – 5.20 20 29
.40 > P > .30 5.21 – 6.10 20 22
.30 > P > .20 6.11 – 7.30 20 15
.20 > P > .10 7.31 – 9.20 20 15
P < .10 9.21 or more 20 17
Total 200 200

The goodness-of-fit test gives:

χ2 = 7.7 for 9 d.f., P = 0.55.

The combined frequencies of poker hands in the whole table are shown in Table 3. The largest difference between expected and observed frequencies (for threes) is about 2.25 times its standard deviation, which is roughly at about the 9 or 10 per cent probability level (looking merely at the largest of five independent normal observations).

Table 3. Poker Test on The Million Digits (200,000 Poker Hands)

Classes Expected Frequency Observed Frequency
Busts (abcde) 60,480 60,479
Pairs (aabcd) 100,800 100,570
Two pairs (aabbc) 21,600 21,572
Threes (aaabc) 14,400 14,659
Full house (aaabb) 1,800 1,788
Fours (aaaab) {900} {914}
Fives (aaaaa) {20} {18}
Total 200,000 200,000

The goodness-of-fit test gives:

χ2 = 5.5 for 5 d.f., P = 0.35.

Also, the frequencies of poker hands were computed for each of ten blocks of 100,000 digits and the mean and standard deviation was computed from the ten values for each kind of hand. The results are shown in Table 4.

Table 4. Mean and Standard Deviation of Frequencies in Seven Classes of Poker Hands

Classes Theoretical Mean Actual Mean Theoretical Std. Dev. Actual Std. Dev.
Busts 6048 6047.9 64.9 60.3
Pairs 10080 10057.0 70.7 78.4
Two pairs 2160 2157.2 43.9 45.8
Threes 1440 1465.9 36.9 26.6
Full house 180 178.8 13.4 8.9
Fours 90 91.4 9.5 11.5
Fives 2 1.8 1.4 1.9

Serial and Run Tests. Some further tests were made on the first block of 50,000 digits to look particularly for any evidence of serial association among the digits. The serial test classified every successive pair of digits by each digit of the pair in a ten-by-ten table. The frequencies of the different pairs are given in Table 5, where the first digit of the pair is shown in the left column of the table and the second digit is shown at the top. Thus there were 510 cases in which a zero followed a one. The frequency χ2 for the row (or column) totals is 7.56, which is about the 0.60 probability level for 9 d.f.

Table 5. Frequencies of Ordered Pairs of Digits

  Second Digit  
First Digit 0 1 2 3 4 5 6 7 8 9 Total
05084565095075024894715044884894923
15105144745145044814964865075275013
24515234934845024665145064934844916
35004724764665134785405135304634951
45135614814855265134855105245115109
54754905275074934814895124655544993
64944864914835255045305395134905055
75085124544985505335165044855205080
84635034755145205445144915204424986
95014965364934745045005154614944974
Total492350134916495151094993505550804986497450000

Table 5 can be tested by a criterion originally due to Kendall and Smith, 1939, and revised by Good, 1953. Assuming all pairs equally likely we get a normalized sum of squared deviations of 107.8. However, this statistic does not have a χ2-distribution. On the other hand, it is the sum of the error variation and twice the row (or column) variation, where under the assumption of perfect randomness, the error variation is asymptotically distributed like χ2 with 81 degrees of freedom. We take the error variation as our test criterion. This gives a χ2 of 107.8 – 2(7.56) ~ 92.7, which is about the 0.18 level for 81 degrees of freedom.

Finally, in the same block of 50,000 digits all runs were counted with the obviously satisfactory results shown in Table 6.

Table 6. Run Test

Length of Run Expected Frequency Observed Frequency
r = 1 40500 40410
r = 2 4050 4055
r = 3 405 421
r = 4 40.5 48
r ≥ 5 4.5 5

Normal Deviates

Half of the random digit table was used to produce 100,000 standard normal deviates by solving for x in Equation (1),

where D is a five-digit number from the table and

is the cumulative standard normal distribution. The Bureau of Standards tables of F(x) were used (1948).

The deviates were determined by the five-digit numbers on the left-hand half of every page of the digit table. The deviates in the first column correspond page by page with the five-figure digits in the first column of the first 200 pages of the digit table; the deviates in the second column correspond page by page with the first column of the second 200 pages of the digit table. Similarly, the third and fourth columns of deviates were derived from the second column of five-figure digits, etc.

A χ2 test of the fit of the entire table of deviates to the normal distribution was performed using 400 class intervals (Mann and Wald, 1942) with roughly 250 expected in each. The χ2 value was found to be 346.4, which for 399 d.f. indicates a very close fit; the probability of a larger value of χ2 is about 0.97. The detailed data for this test are given in Table 7.

Table 7: Goodness-of-fit Test for Normal Deviates

Class Limits Expected Number Observed Number (Obs.- Exp.)2/Exp.
    Left Tail Right Tail Left Tail Right Tail
.000 239.364 229 267 .449 3.191
.006 239.355 253 240 .778 .002
.012 239.338 225 232 .859 .225
.018 279.195 275 286 .063 .166
.025 239.271 243 245 .058 .137
.031 239.227 232 255 .218 1.040
.037 239.173 285 221 8.781 1.381
.043 278.958 284 277 .091 .014
.050 239.029 225 244 .823 .103
.056 238.948 225 218 .814 1.836
.062 238.860 260 231 1.871 .259
.068 278.546 225 287 10.293 .257
.075 238.638 253 211 .864 3.201
.081 238.522 224 224 .884 .884
.087 278.118 254 290 2.091 .508
.094 238.242 212 222 2.891 1.107
.100 238.098 225 250 .721 .595
.106 277.590 297 272 1.357 .113
.113 237.760 217 235 1.813 .032
.119 237.590 232 224 .132 .777
.125 237.413 230 266 .231 3.442
.131 276.744 262 272 .786 .081
.138 236.998 244 226 .207 .510
.144 236.792 235 250 .014 .737
.150 275.989 271 275 .090 .004
.157 236.320 246 273 .397 5.693
.163 275.415 267 279 .257 .047
.170 235.810 250 221 .854 .930
.176 235.561 233 233 .028 .028
.182 274.495 265 291 .328 .992
.189 234.994 230 236 .106 .004
.195 234.718 250 233 .995 .013
.201 273.481 263 283 .402 .331
.208 234.095 232 228 .019 .159
.214 272.731 302 292 3.141 1.361
.221 233.435 243 224 .392 .381
.227 233.117 223 241 .439 .267
.233 271.557 276 286 .073 .768
.240 232.401 234 248 .011 1.047
.246 270.701 292 302 1.676 3.619
.253 231.649 229 225 .030 .191
.259 269.802 274 253 .065 1.046
.266 230.859 251 227 1.757 .065
.272 268.860 269 271 .000 .017
.279 230.034 226 233 .071 .038
.285 267.876 266 274 .013 .140
.292 229.173 235 221 .148 .291
.298 266.850 248 238 1.332 3.119
.305 266.282 249 283 1.122 1.050
.312 227.779 240 216 .656 .609
.318 265.193 244 274 1.694 .292
.325 226.830 213 234 .843 .227
.331 264.065 284 284 1.505 1.505
.338 263.440 282 269 1.308 .117
.345 225.302 230 214 .098 .567
.351 262.251 299 265 5.150 .029
.358 261.596 267 279 .112 1.158
.365 223.693 229 217 .126 .200
.371 260.347 275 279 .825 1.336
.378 259.659 266 253 .155 .171
.385 222.009 224 225 .018 .040
.391 258.353 266 247 .226 .499
.398 257.633 250 244 .226 .721
.405 256.905 268 269 .479 .569
.412 256.164 260 283 .057 2.811
.419 255.415 243 250 .603 .115
.426 254.654 284 269 3.382 .808
.433 217.661 230 208 .699 .429
.439 253.215 265 259 .548 .132
.446 252.425 251 250 .008 .023
.453 251.626 257 235 .115 1.099
.460 250.817 264 249 .693 .013
.467 249.999 256 231 .144 1.444
.474 249.170 249 271 .000 1.913
.481 248.333 244 268 .076 1.558
.488 247.486 233 274 .848 2.841
.495 246.630 239 246 .236 .002
.502 280.803 276 264 .082 1.005
.510 244.765 275 262 3.735 1.214
.517 243.881 249 254 .107 .420
.524 242.988 231 250 .591 .202
.531 242.087 223 242 1.505 .000
.538 275.555 288 256 .562 1.388
.546 240.126 229 228 .516 .612
.553 239.199 246 244 .193 .096
.560 272.224 282 281 .351 .283
.568 237.183 235 252 .020 .926
.575 236.231 255 229 1.491 .221
.582 268.802 255 273 .709 .066
.590 234.163 239 211 .100 2.291
.597 266.419 281 276 .798 .345
.605 232.062 218 220 .852 .627
.612 263.999 270 249 .136 .852
.620 262.692 261 285 .011 1.894
.628 228.776 253 207 2.565 2.073
.635 260.216 272 260 .534 .000
.643 258.881 256 265 .032 .145
.651 225.417 233 230 .255 .093
.658 256.352 263 270 .172 .727
.666 254.989 272 248 1.135 .192
.674 253.618 266 263 .605 .347
.682 252.239 245 235 .208 1.178
.690 250.849 255 274 .069 2.137
.698 249.453 239 240 .438 .358
.706 248.048 228 252 1.620 .063
.714 246.635 248 243 .008 .054
.722 245.215 230 254 .944 .315
.730 243.787 235 246 .317 .020
.738 272.544 296 240 2.019 3.886
.747 240.729 231 230 .393 .478
.755 239.279 240 250 .002 .480
.763 267.449 269 266 .009 .008
.772 236.177 229 256 .218 1.664
.780 263.944 292 283 2.982 1.376
.789 233.048 241 215 .271 1.398
.797 260.410 246 276 .797 .933
.806 258.526 239 227 1.475 3.844
.815 228.216 228 203 .000 2.786
.823 254.953 235 236 1.562 1.409
.832 253.050 262 250 .317 .037
.841 251.143 262 258 .469 .187
.850 249.228 241 261 .272 .556
.859 247.309 245 225 .022 2.012
.868 245.385 237 247 .287 .011
.877 270.387 253 270 1.118 .001
.887 241.307 220 245 1.881 .057
.896 239.369 237 231 .023 .293
.905 263.687 267 281 .042 1.137
.915 235.266 241 211 .140 2.503
.924 259.121 226 236 4.234 2.063
.934 256.713 266 252 .336 .087
.944 254.300 235 250 1.465 .073
.954 251.886 268 235 1.031 1.132
.964 249.469 272 244 2.035 .120
.974 247.051 238 249 .332 .015
.984 244.633 243 239 .011 .130
.994 242.212 230 231 .616 .519
1.004 263.640 274 260 .407 .050
1.015 237.132 250 250 .698 .698
1.025 258.052 267 270 .310 .553
1.036 255.128 256 241 .003 .782
1.047 252.207 219 254 4.372 .013
1.058 249.289 238 245 .511 .074
1.069 246.374 258 244 .549 .023
1.080 243.465 253 235 .373 .294
1.091 262.286 271 256 .290 .151
1.103 237.399 229 215 .297 2.113
1.114 255.683 267 265 .501 .340
1.126 252.252 280 275 3.052 2.051
1.138 248.830 243 227 .137 1.915
1.150 245.421 231 271 .847 2.666
1.162 242.022 256 234 .807 .266
1.174 258.370 270 263 .524 .083
1.187 254.414 235 256 1.481 .010
1.200 250.476 260 247 .362 .048
1.213 246.557 271 260 2.423 .733
1.226 242.659 255 227 .628 1.010
1.239 256.990 256 242 .004 .874
1.253 252.521 251 262 .009 .356
1.267 248.081 257 225 .321 2.147
1.281 243.672 244 220 .000 2.300
1.295 256.219 257 232 .002 2.289
1.310 251.234 273 215 1.886 5.226
1.325 246.291 244 241 .021 .114
1.340 257.309 249 285 .268 2.980
1.356 251.786 268 266 1.044 .802
1.372 246.320 240 229 .162 1.218
1.388 255.788 265 229 .332 2.805
1.405 249.751 252 278 .020 3.195
1.422 243.787 280 237 5.379 .189
1.439 251.707 268 267 1.055 .929
1.457 245.192 240 237 .110 .274
1.475 251.846 247 257 .093 .105
1.494 257.489 259 268 .009 .429
1.514 249.809 238 256 .558 .153
1.534 242.262 233 238 .354 .075
1.554 257.931 247 237 .463 1.699
1.576 249.141 242 263 .205 .771
1.598 251.267 239 265 .599 .751
1.621 242.072 236 249 .152 .198
1.644 252.941 252 222 .004 3.785
1.669 252.098 226 281 2.702 3.313
1.695 250.295 254 224 .055 2.762
1.722 247.560 263 276 .963 3.267
1.750 252.118 247 217 .104 4.892
1.780 246.755 265 265 1.349 1.349
1.811 255.166 255 257 .000 .013
1.845 246.473 246 226 .001 1.701
1.880 249.853 249 259 .003 .335
1.918 249.912 266 237 1.036 .667
1.959 252.136 248 238 .068 .793
2.004 249.874 249 251 .003 .005
2.053 252.080 239 273 .679 1.736
2.108 251.207 248 217 .041 4.658
2.170 249.038 238 251 .489 .015
2.241 250.376 242 251 .280 .002
2.326 250.143 226 240 2.330 .411
2.432 249.585 248 241 .010 .295
2.575 251.174 262 256 .467 .093
2.807 249.526 254 264 .080 .840
Total 50124 49876 158.967 187.450

A more refined test of the fit in the tails was made on the deviates exceeding 2.326 in absolute value. Eighty intervals (Mann and Wald, 1942) were used, each with an expectation of approximately 25. The χ2 value was 76.26, with 80 d.f.; the probability of a larger value is about 0.61. The details of this test are given in Table 8.

Table 8: Goodness-of-fit in 1 Per Cent Tails

Class Limits Expected Number Observed Number (Obs.- Exp.)2/Exp.
    Left Tail Right Tail Left + Right Tails
2.326 26.366 29 22 .986
2.336 25.757 21 20 2.165
2.346 25.159 26 30 .960
2.356 24.574 17 24 2.348
2.366 23.999 19 21 1.416
2.376 25.748 24 22 .664
2.387 25.082 20 21 1.694
2.398 24.428 20 21 1.284
2.409 23.790 20 24 .606
2.420 25.240 30 35 4.672
2.432 26.524 32 36 4.516
2.445 23.748 23 26 .237
2.457 24.948 30 23 1.175
2.470 25.986 25 30 .657
2.484 25.098 23 14 5.083
2.498 24.236 20 23 .803
2.512 25.037 29 24 .670
2.527 25.682 28 25 .227
2.543 24.657 22 24 .304
2.559 23.669 16 16 4.970
2.575 26.868 28 25 .178
2.594 24.261 21 26 .563
2.612 25.652 24 18 2.389
2.632 24.336 24 31 1.829
2.652 25.321 21 23 .950
2.674 24.926 26 26 .093
2.697 24.412 29 32 3.221
2.721 25.624 31 26 1.133
2.748 24.639 31 24 1.659
2.776 25.135 27 25 .139
2.807 25.164 26 39 7.635
2.841 24.759 24 21 .594
2.878 25.087 25 28 .339
2.920 25.144 26 35 3.893
2.968 24.731 19 24 1.350
3.023 25.063 29 18 2.609
3.090 25.160 35 22 4.245
3.175 25.002 20 32 2.959
3.291 24.939 24 15 3.996
3.481 24.977 26 30 1.052
Total 1000.928 990 1001 76.263

The only tests made on the squares of the deviates consisted in computing sums of k squares and comparing the distribution of the sums with the χ2 distribution with k d.f., employing again the standard goodness-of-fit test. This was done for k = 25, 50, 100, 300, with the following results:

k Number of Sums Number of Intervals (i) χ2 with i – 1 Degrees of Freedom Probability of a Larger χ2
25 4000 100 92.92 0.66
50 2000 100 92.45 0.67
100 1000 50 57.75 0.19
300 333 34 38.70 0.23

The fourth column gives the goodness-of-fit χ2 value for the fit to the χ2 distribution with k degrees of freedom. Intervals of approximately equal probability were used in all cases.

Use of the Tables

The lines of the digit table are numbered from 00000 to 19999. In any use of the table, one should first find a random starting position. A common procedure for doing this is to open the book to an unselected page of the digit table and blindly choose a five-digit number; this number with the first digit reduced modulo 2 determines the starting line; the two digits to the right of the initially selected five- digit number are reduced modulo 50 to determine the starting column in the starting line. To guard against the tendency of books to open repeatedly at the same page and the natural tendency of a person to choose a number toward the center of the page: every five-digit number used to determine a starting position should be marked and not used a second time for this purpose.

The digit table is also used to find a random starting position in the deviate table: Select a five-digit number as before; the first four digits give the starting line (the lines being numbered from 0000 to 9999) and the fifth digit gives the starting position in the line.

Ordinarily, the table is read in the same direction as a book is read; however, the size of the table may be effectively increased by varying the direction in which it is read. Thus, one may read columns instead of lines, may read the table backward, may read lines forward but pages from bottom to top, etc. Of course, care must be taken in using these devices to avoid introducing correlations when the table is used more than once on the same problem.

To obtain a random permutation of the integers 1, 2, . . . , n, select a random starting position; use the five-digit number containing the starting position and the following n – 1 five-digit numbers; put the integers in the same order as these n five-digit numbers. In case of ties among the five-digit numbers, use additional columns to the right to make six or more digit numbers. The same procedure is used to obtain a random permutation of n objects, some of which are indistinguishable, by merely numbering the objects arbitrarily from 1 to n.

To obtain random observations from any distribution G(x), use Eq. (1), substitute G(x) for F(x), and employ as many digits in D as required for the desired accuracy of the observations. Of course the negative exponent of 10 in Eq. (1) must be equal to the number of digits in D. If G(x) has a discontinuity at x0, define it to be continuous on the right and take the solution of Eq. (1) to be x0 when the left side of Eq. (1) falls between G(x0-) and G(x0). For example, if

G(x) = 1 – e–x

and one is content with three-figure accuracy, then the three-digit number 082 determines an observation from a population distributed by G(x) as follows:

.0825 = 1 – e–x,

x = .086.

A technique suggested by von Neumann, called the "rejection method," enables one to substitute for the solution of Eq. (1) a stochastic process involving a much simpler computation; this technique is discussed in Kahn, 1956.

To obtain pairs of normal deviates with given correlation ρ, use pairs (x, y) of independent deviates from the table and transform them to

Thus for ρ = –.6, for example, if (.732, –1.205) are two deviates from the table, then

(.732, –1.403)

is a pair of deviates from a normal population with the desired correlation.[3]

In general, to obtain a random observation from a bivariate population with distribution G(x,y), one uses a marginal distribution on one variate, say, G1(x), and the conditional distribution, say, G2(y/x), on the other. Two random numbers determine the observation: one determines x by employing G1(x) in Eq. (1), and the other determines y by employing G2(y/x) in Eq. (1). Thus, if a probability density is uniform (and equal to two) over the triangle bounded by x = 0, y = 0, x + y = 1 and is zero elsewhere, then

and two four-digit random numbers, 5402 and 1770, determine the observation (.3220, .1200). The direct generalization of this procedure will determine observations from multivariate populations.

Complete tables available for download at the top of the page ⤴

View addendum on data quality issues

References

1. Kendall, M. G., and B. B. Smith, Random Sampling Numbers, Cambridge University Press, 1939.

2. Juncosa, M. L., Random Number Generation on the BRL High-Speed Computing Machines, Ballistic Research Laboratories Report No. 855, Aberdeen Proving Ground, Maryland, 1953.

3. Meyer, H. A., L. S. Gephart, and N. L. Rasmussen, On the Generation and Testing of Random Digits, WADC Technical Report 54-55, Wright-Patterson Air Force Base, Ohio, 1954.

4. Moshman, Jack, "Generation of Pseudo-random Numbers on a Decimal Calculator," J. Assoc. Computing Machinery, Vol. 1, 1954, p. 88.

5. The Monte Carlo Method (Proceedings of a Symposium held in 1949), National Bureau of Standards Report AMS 12, Government Printing Office, Washington 25, D.C., 1951.

6. Curtiss, J. H., "Sampling Methods Applied to Differential and Difference Equations," Seminar on Scientific Computation, International Business Machines Corp., New York, 1949.

7. Kahn, H., and A. W. Marshall, "Methods of Reducing Sample Size in Monte Carlo Computations," J. Operations Res. Soc. of Amer., Vol. 1, 1953, pp. 263–278.

8. Kahn, H., Applications of Monte Carlo, The RAND Corporation, RM-1237, 1956.

9. Horton, H. B., and R. T. Smith, "A Direct Method for Producing Random Digits in Any Number System," Ann. Math. Statistics, Vol. 20, 1949, pp. 82–90.

10. Mann, H. B., and A. Wald, "On the Choice of the Number of Class Intervals in the Application of the Chi-Square Test," Ann. Math. Statistics, Vol. 13, 1942, pp. 306–317.

11. Tables of Probability Functions, 2d ed., U.S. Government Printing Office, Washington, D.C., 1948.

12. Good, I.J., "The Serial Test for Sampling Numbers and Other Tests for Randomness," Proc. Camb. Phil. Soc., Vol. 49, 1953, pp. 276–284.

Notes

  • [1] For readers who are not statisticians: The χ2 (chi-square) test is a standard statistical criterion used to measure discrepancy from expectation. The probability value associated with the criterion ranges between zero and one. A small probability value, e.g., less than .05, indicates the possibility of a discrepancy or bias. For very large samples, such as is the case here, with 125,000 digits, the test is extremely sensitive and will result in a small probability value even though the bias may be quite trivial from a practical standpoint.
  • [2] Some issues concerning discrepancies in the numbers presented here are discussed in an addendum prepared in April 1997.
  • [3] RAND has a table of deviations for Gaussian-Markov chains with a discrete time parameter. There is one chain of 10,000 observations for each of the correlations: .600, .800, .900, .950, .970, .980, .990, .995.

This report is part of the RAND Corporation Monograph report series. The monograph/report was a product of the RAND Corporation from 1993 to 2003. RAND monograph/reports presented major research findings that addressed the challenges facing the public and private sectors. They included executive summaries, technical documentation, and synthesis pieces.

This research in the public interest was supported by RAND, using discretionary funds made possible by the generosity of RAND's donors, the fees earned on client-funded research, and independent research and development (IR&D) funds provided by the Department of Defense.

This document and trademark(s) contained herein are protected by law. This representation of RAND intellectual property is provided for noncommercial use only. Unauthorized posting of this publication online is prohibited; linking directly to this product page is encouraged. Permission is required from RAND to reproduce, or reuse in another form, any of its research documents for commercial purposes. For information on reprint and reuse permissions, please visit www.rand.org/pubs/permissions.

The RAND Corporation is a nonprofit institution that helps improve policy and decisionmaking through research and analysis. RAND's publications do not necessarily reflect the opinions of its research clients and sponsors.