Google
 

Trailing-Edge - PDP-10 Archives - decuslib20-01 - decus/20-0020/statea.dem
There are 2 other files named statea.dem in the archive. Click here to see a list.
100'  NAME--STATEACH
110'
120'  DESCRIPTION--TEACH PROGRAM ON COMPUTATION OF BASIC STATISTICS
130'
140'  SOURCE--UNKNOWN
150'
160'  INSTRUCTIONS--TYPE "LIST" AND FOLLOW INSTRUCTIONS.
170'
180'
190'  *  *  *  *  *  *  MAIN PROGRAM  *  *  *  *  *  *  *  *  *
200'
210     NOW THAT YOU UNDERSTAND HOW TO WORK WITH DATA ARRAYED IN
220 MATRICES, CONSIDER SOME OF THE INFORMATION HOU MIGHT LIKE TO KNOW
230 ABOUT THE COLLECTION OF NUMBERS IN A GIVEN ROW OR COLUMN.  YOU HAVE 
240 ALREADY FOUND THE BIGGEST ELEMENT IN A ROW AND THE AVERAGE OF
250 THE ROW.  WHAT ELSE WOULD BE INTERESTING TO KNOW?
260   
270  
280   ***MEAN, VARIANCE, AND STANDARD DEVIATION***
290 
300     ONE OF THE MOST INTERESTING PIECES OF INFORMATION TO KNOW ABOUT
310 A SET OF NUMBERS IS THE STANDARD DEVIATION--A MEASURE  OF HOW THE 
320 NUMBERS DEVIATE OR SPREAD OUT AROUND THE AVRAGE NUMBER(THE MEAN)
330  
340     FOR EXAMPLE, IF THE SET OF DATA IS (3,3,3,3,3) THEN THE MEAN
350 IS  3 AND THE STANDARD DEVIATION IS ZERO BECAUSE THE NUMBERS DON'T
360 DEVIATE AT ALL FROM THE MEAN.  THE NUMBERS HAVE NO SPREAD ABOUT THE
370 MEAN.  HOWEVER, IF YOUR SET OF DATA IS (1,2,3,4,5) THE STANDARD
380 DEVIATION IS  1.41421  . . . BUT WHAT IS THE STANDARD DEVIATION?
390  
400     WELL THE STANDARD DEVIATION IS THE SQUARE ROOT OF THE VARIANCE.
410 BUT WHAT IS THE VARIANCE?  
420  
430     WELL THE VARIANCE IS SORT OF THE AVERAGE DISTANCE AWAY FROM THE
440 MEAN.  WE WILL CONCENTRATE ON FINDING OUT HOW IT IS COMPUTED AND
450 SEE WHAT THE PHRASE "SORT OF AVERAGE DISTANCE" REALLY MEANS.
460  
470     TO COMPUTE THE VARIANCE YOU MUST:
480  
490          FIRST COMPUTE THE MEAN OF THE DATA.
500  
510         SECOND FIND THE DIFFERENCE BETWEEN EACH ELEMENT AND THE
520         MEAN.  IF OUR DATA IS (1,2,3,4,5) THEN THE MEAN IS  3 AND
530         THE FIRST DIFFERENCE IS (3-1).  BUT NOTICE THE LAST 
540         DIFFERENCE IS (3-5).  NOW (3-1)=2  BUT (3-5)= -2.
550         WE DON'T REALLY CARE ABOUT THE MINUS SIGN, WE JUST WANT TO
560         KNOW HOW FAR AWAY THE ELEMENT IS.  SO WE COULD TAKE THE
570         ABSOLUTE VALUE OF (3-1) AND (3-5).  BOTH OF THESE ARE
580         2  .  OR WE COULD SIMPLY SQUARE THE VALUE (3-1) AND
590         (3-5)--REMEMBERING THAT ANY NUMBER SQUARED IS POSITIVE--.
600         SO  INSTEAD OF GETTING THE DISTANCE AWAY FROM THE MEAN WE
610         GET THE SQUARE OF THE DISTANCE AWAY; IN THE EXAMPLE
620         4 INSTEAD OF 2 .
630          FOR HISTORICAL REASONS THE SQUARE OF THE DISTANCES IS
640         THE MEASURE THAT WE WILL USE IN SPITE OF THE FACT THAT
650         ABSOLUTE VALUE MAY APPEAR MORE NATURAL RIGHT NOW.
660     
670         THIRD, WE ADD THE SQUARES OF THE DISTANCE FROM THE MEAN.  
680         THAT IS WE SUM THE SQUARE OF THE DIFFERENCES.  IN OUR 
690         EXAMPLE WE ADD (3-1)^2+(3-2)^2+(3-3)^2+(3-4)^2+(3-5)^2 =
700                         4    +   1    +   0   +   1   +  4= 10
710  
720          FORTH, WE DIVIDE BY THE NUMBER OF ELEMENTS IN OUR SAMPLE
730          TO GET THE  AVERAGE SQUARE DIFFERENCE.  IN OUR EXAMPLE
740          WITH FIVE ELEMENTS  10/5 = 2. AND  2 IS THE VARIANCE
750          OF THE SAMPLE.  
760  
770  
780     REMEMBER WE SAID THE SQUARE ROOT OF THE VARIANCE IS  THE 
790 STANDARD DEVIATION.  SO IN OUR EXAMPLE THE STANDARD 
800 DEVIATION IS THE SQUARE ROOT OF 2, THAT IS 1.41421   .
810  *   *   *   *   *   *   *
820  
830    TRY TO WRITE A PROGRAM THAT READS  FIRST  N, THE NUMBER OF PIECES 
840 OF DATA AND THEN THE DATA AND THEN COMPUTES M, THE MEAN;  V, THE
850 VARIANCE AND D, THE STANDARD DEVIATION.
860 
870    THERE IS A TEST ROUTINE YOU CAN USE IF YOU LABEL YOUR VARIABLES 
880 AS LISTED ABOVE AND IF YOU NAME YOUR PROGRAM  STATBGIN  .
890 IN OTHER WORDS AFTER YOU HAVE EXPERIMENTED WITH VARIOUS SETS
900 OF DATA, YOU MAY TYPE  TEST  TO BE SURE YOUR PROGRAM IS CORRECT.
910   
920 GOOD LUCK. . . . .