It has been stated that probability theory is the study of mathematical models of random phenomena; in other words, probability theory is concerned with the statements one can make about a random phenomenon about which one has postulated certain properties. The question immediately arises: how does one formulate postulates concerning a random phenomenon? This is done by introducing the sample description space of the random phenomenon.
The sample description space of a random phenomenon, usually denoted by the letter \(S\) , is the space of descriptions of all possible outcomes of the phenomenon.
To be more specific, suppose that one is performing an experiment or observing a phenomenon. For example, one may be tossing a coin, or two coins, or 100 coins; or one may be measuring the height of people, or both their height and weight, or their height, weight, waist size, and chest size; or one may be measuring and recording the voltage across a circuit at one point of time, or at two points of time, or for a whole interval of time (by photographing the effect of the voltage upon an oscilloscope). In all these cases one can imagine a space that consists of all possible descriptions of the outcome of the experiment or observation. We call it the sample description space , since the outcome of an experiment or observation is usually called a sample . Thus a sample is something that has been observed; a sample description is the name of something that is observable.
A remark may be in order on the use of the word “space”. The reader should not confuse the notion of space as used in this book with the use of the word space to denote certain parts of the world we live in, such as the region between planets. A notion of great importance in modern mathematics, since it is the starting point of all mathematical theories, is the notion of a set. A set is a collection of objects (either concrete objects, such as books, cities, and people, or abstract objects, such as numbers, letters, and words). A set that is in some sense complete, so that only those objects in the set are to be considered, is called a space. In developing any mathematical theory, one has first to define the class of things with which the theory will deal; such a class of things, which represents the universe of discourse, is called a space. A space has neither dimension nor volume; rather, a space is a complete collection of objects.
Techniques for the construction of the sample description space of a random phenomenon are systematically discussed in Chapter 2 . For the present, to give the reader some idea of what sample description spaces look like, we consider a few simple examples.
Suppose one is drawing a ball from an urn containing six balls, of which four are white and two are red. The possible outcomes of the draw may be denoted by \(W\) and \(R\) , and we write \(W\) or \(R\) accordingly, as the ball drawn is white or red. In symbols, we write \(S=\{W, R\}\) . On the other hand, we may regard the balls as numbered 1 to 6; then we write \(S=\{1,2,3,4,5,6\}\) to indicate that the possible outcome of a draw is a number, 1 to 6.
Next, let us suppose that one draws two balls from an urn containing six balls, numbered 1 to 6. We shall need a notation for recording the outcome of the two draws. Suppose that the first ball drawn bears number 5 and the second ball drawn bears number 3; we write that the outcome of the two draws is \((5,3)\) . The object \((5,3)\) is called a 2-tuple. We assume that the balls are drawn one at a time and that the order in which the balls are drawn matters. Then \((3,5)\) represents the outcome that first ball 3 and then ball 5 were drawn. Further, \((3,5)\) and \((5,3)\) represent different possible outcomes. In terms of this notation, the sample description space of the experiment of drawing two balls from an urn containing balls numbered 1 to 6 (assuming that the balls are drawn in order and that the ball drawn on the first draw is not returned to the urn before the second draw is made) has 30 members:
\[\begin{array}{rllll} S=\{(1,2), & (1,3), & (1,4), & (1,5), & (1,6) \tag{3.1} \\ (2,1), & (2,3), & (2,4), & (2,5), & (2,6) \\ (3,1), & (3,2), & (3,4), & (3,5), & (3,6) \\ (4,1), & (4,2), & (4,3), & (4,5), & (4,6) \\ (5,1), & (5,2), & (5,3), & (5,4), & (5,6) \\ (6,1), & (6,2), & (6,3), & (6,4), & (6,5)\} \end{array}\]
We next consider an example that involves the measurement of numerical quantities. Suppose one is observing the ages (in years) of couples who apply for marriage licenses in a certain city. We adopt the following notation to record the outcome of the observation. Suppose one has observed a man and a woman (applying for a marriage license) whose ages are 24 and 22, respectively; we record this observation by writing the 2-tuple \((24,22)\) . Similarly, \((18,80)\) represents the age of a couple in which the man’s age is 18 and the woman’s age is 80. Now let us suppose that the age (in years) at which a man or a woman may get married is any number, 1 to 200. It is clear that the number of possible outcomes of the observation of the ages of a marrying couple is too many to be conveniently listed; indeed, there are \((200)(200)=40,000\) possible outcomes! One thus sees that it is often more convenient to describe , rather than to list, the sample descriptions that constitute the sample description space \(S\) . To describe \(S\) in the example at hand, we write \begin{align} S=\{2 \text {-tuples }(x, y):\ & x \text { is any integer, } 1 \text { to } 200, \tag{3.2} \\ & y \text { is any integer, } 1 \text { to } 200\} . \end{align}
We have the following notation for forming sets. We draw two braces to indicate that a set is being defined. Next, we can define the set either by listing its members (for example, \(S=\{W, R\}\) and \(S=\{1,2,3,4,5,6\}\) ) or by describing its members, as in (3.2). When the latter method is used, a colon will always appear between the braces. On the left side of the colon, one will describe objects of some general kind; on the right side of the colon, one will specify a property that these objects must have in order to belong to the set being defined.
All of the sample description spaces so far considered have been of finite size. 1 However, there is no logical necessity for a sample description space to be finite. Indeed, there are many important problems that require sample description spaces of infinite size. We briefly mention two examples. Suppose that we are observing a Geiger counter set up to record cosmic-ray counts. The number of counts recorded may be any integer. Consequently, as the sample description space \(S\) we would adopt the set \(\{1,2,3, \ldots\}\) of all positive integers. Next, suppose we were measuring the time (in microseconds) between two neighboring peaks on an electrocardiogram or some other wiggly record; then we might take the set \(S=\) {real numbers \(x\) : \(0
It should be pointed out that the sample description space of a random phenomenon is capable of being defined in more than one way. Observers with different conceptions of what could possibly be observed will arrive at different sample description spaces. For example, suppose one is tossing a single coin. The sample description space might consist of two members, which we denote by \(H\) (for heads) and \(T\) (for tails). In symbols, \(S=\{H, T\}\) . However, the sample description space might consist of three members, if we desired to include the possibility that the coin might stand on its edge or rim. Then \(S=\{H, T, R\}\) , in which the description \(R\) represents the possibility of the coin standing on its rim. There is yet a fourth possibility; the coin might be lost by being tossed out of sight or by rolling away when it lands. The sample description space would then be \(S=\{H, T, R, L\}\) , in which the description \(L\) denotes the possibility of loss.
Insofar as probability theory is the study of mathematical models of random phenomena, it cannot give rules for the construction of sample description spaces. Rather the sample description space of a random phenomenon is one of the undefined concepts with which the mathematical theory begins. The considerations by which one chooses the correct sample description space to describe a random phenomenon are a part of the art of applying the mathematical theory of probability to the study of the real world.
- Given any set \(A\) of objects of any kind, the size of \(A\) is defined as the number of members of \(A\) . Sets are said to be of finite size if their size is one of the finite numbers \(\{1,2,3, \ldots\}\) . Examples of sets of finite size are the following: the set of all the continents in the world, which has size 7; the set of all the planets in the universe, which has size 9; the set \(\{1,2,3,5,7,11,13\}\) of all prime numbers from 1 to 15, which has size 7; the set \(\{(1,4),(2,3),(3,2),(4,1)\}\) of 2-tuples of whole numbers between 1 and 6 whose sum is 5, which has size 4.
However, there are also sets of infinite (that is, nonfinite) size. Examples are the set of all prime numbers \(\{1,2,3,5,7,11,13,17, \ldots\}\) and the set of all points on the real line between the numbers 0 and 1, called the interval between 0 and 1. If a set \(A\) has as many members as there are integers \(1,2,3,4, \ldots\) (by which is meant that a one-to-one correspondence may be set up between the members of \(A\) and the members of the set \(\{1,2,3, \ldots\}\) of all integers) then \(A\) is said to be countably infinite . The set of even integers \(\{2,4,6,8 \ldots\}\) contains a countable infinity of members, as does the set of odd integers \(\{1,3,5, \ldots\}\) and the set of primes. A set that is neither finite nor countably infinite is said to be non-countably infinite . An interval on the real line, say the interval between 0 and 1, contains a non-countable infinity of members. ↩︎