


                    DOCUMENTATION FOR PROGRAM

			    TBF.BAS





		JAMES FENNESSEY AND SUSAN RADIUS

		DEPARTMENT OF SOCIAL RELATIONS
		THE JOHNS HOPKINS UNIVERSITY
		BALTIMORE, MARYLAND  21218


		VERSION:  17 NOVEMBER 1975













	DISCLAIMER:
	ALTHOUGH THIS PROGRAM HAS BEEN CHECKED WITH REASONABLE
	CARE, IT IS NOT GUARANTEED.  THE USER EMPLOYS IT BASED
	ON HIS OWN INSPECTION OF THE CODE.  NEITHER THE AUTHORS
	NOR JOHNS HOPKINS UNIVERSITY IS LIABLE FOR INCIDENTAL
	OR CONSEQUENTIAL DAMAGES.



1.	IMPLEMENTATION

THIS PROGRAM IS WRITTEN IN THE LANGUAGE 'BASIC', AND IS AT
PRESENT AVAILABLE FOR THREE DIFFERENT MACHINES: THE DECSYSTEM-10,
THE DARTMOUTH TIME-SHARING SYSTEM HONEYWELL 635, AND THE WANG
LABORATORIES 2200B SYSTEM.

THE PROGRAM IS DISTRIBUTED FOR USE WITH A 72 CHARACTER LINE
TERMINAL.  HOWEVER, THIS PARAMETER CAN BE CHANGED BY MAKING  
MINOR MODIFICATIONS IN THE 'BASIC' CODE.  SEE NOTE #1.   

THIS DOCUMENTATION REFERS SPECIFICALLY TO THE DECSYSTEM-10
VERSION, BUT IS GENERALLY APPLICABLE TO THE OTHER VERSIONS
AS WELL.





2.	PURPOSE

THE PROGRAM EVALUATES THE PROBABILITY DENSITY VALUES AND THE
CUMULATIVE PROBABILITIES OF EITHER A STUDENT'S -T- DISTRIBUTION
OR A BEHRENS DISTRIBUTION, FOR A LIST OF INPUT VALUES SPECIFIED
BY THE USER.  THE PROGRAM ALSO PROVIDES A PLOTTED GRAPH OF THE
DENSITY CURVE.  





3.	DESCRIPTION

AT THE BEGINNING OF THE RUN, THE USER IS ASKED TO TYPE "2" IF ONLY
A PLOT OF THE DENSITY CURVE IS WANTED.  OTHERWISE, THE USER 
TYPES "1" TO OBTAIN A TABULATION.  IN THE LATTER CASE, THE
PROGRAM WILL LATER ASK THE USER IF A PLOT IS ALSO WANTED.

NEXT, THE USER IS ASKED TO SPECIFY EITHER: (1) THE STUDENT'S -T-
DISTRIBUTION TO DESCRIBE A SINGLE MEAN; (2) THE STUDENT'S -T-
DISTRIBUTION TO DESCRIBE THE DIFFERENCE OF TWO MEANS; OR (3) THE
BEHRENS DISTRIBUTION TO DESCRIBE THE DIFFERENCE OF TWO MEANS, 
WHERE THE TWO VARIANCES ARE UNKNOWN AND NOT ASSUMED EQUAL.

IN EACH OF THE ABOVE THREE ANALYSIS SITUATIONS, THE PROGRAM THEN
ASKS THE USER TO TYPE IN THE NUMBER(S) OF CASES, THE SAMPLE MEAN(S),
AND THE SAMPLE VARIANCE(S).  THE SAMPLE VARIANCE ( "V" ) IS
DEFINED AS FOLLOWS:
   V = SUM( (X(I) - M)^2 ) / (N-1)
WHERE M IS THE SAMPLE MEAN,  N IS THE NUMBER OF CASES IN THE
SAMPLE, AND X(I) DENOTES THE I-TH CASE  (I RANGES FROM 1 TO N).
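
AS AN ILLUSTRATION OF THE DEFINITION ABOVE, THE FOLLOWING PYTHON
SKETCH COMPUTES N, M, AND V FROM RAW DATA.  THESE ARE THE THREE
QUANTITIES THE PROGRAM ASKS FOR.  THE FUNCTION NAME AND THE DATA
VALUES ARE HYPOTHETICAL AND ARE NOT PART OF TBF.BAS.

```python
def sample_summary(x):
    """Return (n, m, v): case count, sample mean, and sample variance."""
    n = len(x)
    if n < 2:
        raise ValueError("at least two cases are needed for the N-1 divisor")
    m = sum(x) / n
    # Sum of squared deviations from the mean, divided by N-1.
    v = sum((xi - m) ** 2 for xi in x) / (n - 1)
    return n, m, v


# Hypothetical data, for illustration only.
n, m, v = sample_summary([2.0, 4.0, 4.0, 6.0])
print(n, m, v)
```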

THE PROGRAM ACCEPTS THE DATA INPUT (EITHER A SET OF N, M, AND V
OR TWO SETS OF N, M, AND V, AS APPROPRIATE TO THE TYPE OF PROBLEM
BEING DONE), AND THEN PROVIDES THE USER WITH A TRIAL LOWER
BOUND, DENSITY, AND CUMULATIVE PROBABILITY.  IF THE USER WISHES
TO EMPLOY THIS LOWER BOUND AND A SYMMETRIC UPPER BOUND, HE
RESPONDS BY TYPING THE SAMPLE MEAN, AS INSTRUCTED BY THE PROGRAM.
IF THE USER WISHES TO EMPLOY A DIFFERENT LOWER OR UPPER BOUND, HE
TYPES THE LOWER BOUND HE WISHES TO EMPLOY.  THE PROGRAM NEXT  
ASKS FOR THE UPPER BOUND DESIRED, AND THEN SHOWS THE DENSITY AND
CUMULATIVE PROBABILITY FOR THE LATEST LOWER BOUND.  IF THE USER
IS SATISFIED WITH THIS SET OF BOUNDS, HE TYPES THE SAMPLE MEAN, AS
INSTRUCTED BY THE PROGRAM.  THE PROGRAM NEXT ASKS FOR THE WIDTH
OF THE INTERVAL BETWEEN POINTS.  MOST FREQUENTLY THE INTERVAL
WILL BE CHOSEN SO THAT THERE ARE APPROXIMATELY 20 TO 40 POINTS
TABULATED AND APPROXIMATELY 40 TO 50 POINTS PLOTTED.  THE 
INTERVAL ALSO IS USUALLY CHOSEN SO THAT THE RESULTING POINTS
ARE SIMPLE NUMBERS.
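
THE CHOICE OF BOUNDS AND INTERVAL DESCRIBED ABOVE CAN BE CHECKED
BEFORE A RUN WITH A SHORT PYTHON SKETCH.  IT ASSUMES THAT THE
SYMMETRIC UPPER BOUND LIES AS FAR ABOVE THE SAMPLE MEAN AS THE
LOWER BOUND LIES BELOW IT, AND IT APPLIES THE 200-POINT LIMIT
STATED IN SECTION 4.  THE FUNCTION NAMES AND THE EXAMPLE VALUES
ARE HYPOTHETICAL; THIS IS NOT THE CODE USED IN TBF.BAS.

```python
import math


def symmetric_upper(lower, mean):
    """Upper bound placed as far above the mean as `lower` is below it."""
    return 2.0 * mean - lower


def grid_points(lower, upper, step, max_points=200):
    """List the evaluation points from lower to upper in increments of step."""
    if step <= 0 or upper <= lower:
        raise ValueError("need step > 0 and upper > lower")
    # Small tolerance guards against floating-point round-off in the division.
    count = int(math.floor((upper - lower) / step + 1e-9)) + 1
    if count > max_points:
        raise ValueError("too many points: %d > %d" % (count, max_points))
    return [lower + i * step for i in range(count)]


# Hypothetical example: sample mean 50, trial lower bound 44, step 0.5.
upper = symmetric_upper(44.0, 50.0)
points = grid_points(44.0, upper, 0.5)
print(upper, len(points))
```

WITH THESE VALUES THE GRID FALLS INSIDE THE 20 TO 40 POINT RANGE
SUGGESTED FOR A TABULATION, AND EVERY POINT IS A MULTIPLE OF 0.5.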

IF A TABULATION HAS BEEN REQUESTED, THE PROGRAM TYPES IT ON
THE TERMINAL.  ALSO, A DISK FILE OF THE OUTPUT IS CREATED
FOR SUBSEQUENT TYPING ON THE TERMINAL OR PRINTING ON A LINE
PRINTER.

THE USER IS THEN ASKED TO SPECIFY THE LOWER AND UPPER BOUNDS AND 
THE STEP WIDTH FOR THE DENSITY PLOT.  IF BOTH THE TABULATION
AND THE DENSITY PLOT ARE BEING OBTAINED, THESE PLOTTING
LIMITS AND STEP WIDTH ARE INDEPENDENT OF THOSE USED FOR THE
TABULATION.  THIS ALLOWS THE USER TO OBTAIN THE TABULATION
ONLY FOR A PORTION OF THE RANGE, OR AT WIDELY SPACED
POINTS, AND THEN TO OBTAIN THE DENSITY PLOT ACROSS THE
ENTIRE RANGE, OR AT MORE NARROWLY SPACED POINTS.

THE PROGRAM THEN TYPES THE DENSITY PLOT ON THE TERMINAL
AND ADDS THE PLOTTING OUTPUT TO THE DISK FILE SO THAT IT
MAY LATER BE TYPED ON THE TERMINAL OR PRINTED ON A LINE
PRINTER.



4.	OPERATING LIMITS AND ACCURACY

THERE ARE SEVERAL RESTRICTIONS ON THE INPUT ALLOWED, BUT THESE
DO NOT CONSTITUTE A LIMITATION ON THE PROGRAM'S UTILITY.

THE NUMBER OF CASES ENTERED AND THE VARIANCES ENTERED MUST BE
GREATER THAN ZERO.

THE MAXIMUM DEGREES OF FREEDOM IN THE STUDENT'S -T- DISTRIBUTION
THAT IS ACTUALLY EVALUATED BY THE CUMULATIVE PROBABILITY ROUTINE
IS 2812.  THIS NUMBER IS SO LARGE THAT THE NORMAL APPROXIMATION
TO THE STUDENT'S -T- DISTRIBUTION IS PERFECTLY ADEQUATE IF THE
DEGREES OF FREEDOM IS GREATER THAN 2812.
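
THE CLAIM ABOUT THE NORMAL APPROXIMATION CAN BE CHECKED NUMERICALLY.
THE PYTHON SKETCH BELOW COMPARES THE STANDARD STUDENT'S -T- DENSITY
AT 2812 DEGREES OF FREEDOM WITH THE STANDARD NORMAL DENSITY.  IT
USES THE TEXTBOOK DENSITY FORMULAS AND THE PYTHON LIBRARY ROUTINE
'math.lgamma'; IT IS NOT THE CODE USED IN TBF.BAS, AND THE FUNCTION
NAMES ARE HYPOTHETICAL.

```python
import math


def t_density(t, df):
    """Density of the standard Student's t distribution with df degrees of freedom."""
    log_const = (math.lgamma((df + 1) / 2.0) - math.lgamma(df / 2.0)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_const - (df + 1) / 2.0 * math.log1p(t * t / df))


def normal_density(z):
    """Density of the standard normal distribution."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


# Print the two densities side by side at a few points.
for t in (0.0, 1.0, 2.0, 3.0):
    print(t, t_density(t, 2812), normal_density(t))
```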

THE MAXIMUM NUMBER OF POINTS THAT MAY BE TABULATED OR PLOTTED
IN A SINGLE PROBLEM IS 200.  THIS LIMIT IS ARBITRARY AND MAY  
BE ALTERED BY MINOR ADJUSTMENT TO THE BASIC CODE IN LINES
#2330 AND #3000,  CHANGING 'W1<=200' TO 'W1<=' <THE 
DESIRED VALUE>.

THE ACCURACY OF THE ALGORITHM FOR THE CUMULATIVE STUDENT'S -T-
DISTRIBUTION IS APPROXIMATELY 4 DECIMAL PLACES FOR ALL VALUES
OF THE DEGREES OF FREEDOM AND OF -T- TABULATED IN TABLE 9 OF
PEARSON AND HARTLEY (1966).  SEE FENNESSEY AND RADIUS (1975).
THE ALGORITHM USED FOR THE CUMULATIVE -T- PROBABILITY DOES NOT
DEPEND UPON ANY PRIOR CALCULATIONS.  THUS, ONE MAY SPECIFY
THAT CUMULATIVE PROBABILITIES BEGIN AT ANY POINT.

THE ACCURACY OF THE APPROXIMATION USED FOR THE BEHRENS DISTRIB-
UTION IS SATISFACTORY FOR MOST PURPOSES PROVIDED
THE DEGREES OF FREEDOM FOR EACH OF THE TWO SAMPLES IS AT
LEAST 7.  IF ONE OR BOTH OF THE DEGREES OF FREEDOM IS LESS THAN
7,  THE DENSITIES AND/OR CUMULATIVE PROBABILITIES IN THE TAILS  
ARE LIKELY TO BE IN ERROR BY MORE THAN 20 PERCENT.  SEE
FENNESSEY AND RADIUS (1975) FOR MORE DETAILED DISCUSSION.





5.	ALGORITHMS

THE ALGORITHMS USED ARE AS FOLLOWS:

	FOR THE APPROXIMATION OF THE BEHRENS DISTRIBUTION,
        A FORMULA DUE TO PATIL IS USED.  THIS FORMULA
        CREATES A "DILATED" STUDENT'S -T- AS THE APPROXI-
        MATION OF THE BEHRENS DISTRIBUTION. SEE V. K. PATIL,
        APPROXIMATIONS TO THE BEHRENS-FISHER DISTRIBUTION,
	BIOMETRIKA, 1965, VOL. 52, PP. 267-271.  SEE ALSO
	G. E. P. BOX AND G. C. TIAO, BAYESIAN INFERENCE IN
	STATISTICAL ANALYSIS (ADDISON-WESLEY, 1973) P. 107.
	SEE ALSO V. K. PATIL, THE BEHRENS-FISHER DISTRIBUTIONS,
	(UNPUBLISHED PH. D. DISSERTATION, THE UNIVERSITY OF
	MICHIGAN, 1963)  MICROFILM ORDER #63-6937.
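
THE "DILATED" STUDENT'S -T- DESCRIBED ABOVE CAN BE SKETCHED IN
PYTHON AS FOLLOWS.  THE SKETCH ASSUMES THE MOMENT-MATCHING FORM OF
THE APPROXIMATION AS IT IS COMMONLY PRESENTED (SEE BOX AND TIAO,
1973): THE STANDARDIZED DIFFERENCE OF MEANS IS TREATED AS A TIMES
A STUDENT'S -T- WITH B DEGREES OF FREEDOM, WHERE A AND B ARE CHOSEN
TO MATCH THE SECOND AND FOURTH MOMENTS.  IT IS AN ASSUMPTION ABOUT,
NOT A TRANSCRIPTION OF, THE CODE IN TBF.BAS, AND THE FUNCTION AND
VARIABLE NAMES ARE HYPOTHETICAL.

```python
import math


def patil_scale_and_df(n1, v1, n2, v2):
    """Return (a, b) so that the Behrens-Fisher variable is treated as a * t(b).

    Hedged sketch of a moment-matching approximation; it requires more
    than 4 degrees of freedom in each sample.
    """
    nu1, nu2 = n1 - 1, n2 - 1
    if nu1 <= 4 or nu2 <= 4:
        raise ValueError("each sample needs more than 4 degrees of freedom")
    w1, w2 = v1 / n1, v2 / n2          # squared standard errors of the means
    sin2 = w1 / (w1 + w2)              # share of the variance due to sample 1
    cos2 = w2 / (w1 + w2)              # share of the variance due to sample 2
    f1 = nu1 / (nu1 - 2.0) * sin2 + nu2 / (nu2 - 2.0) * cos2
    f2 = (nu1 ** 2 / ((nu1 - 2.0) ** 2 * (nu1 - 4.0)) * sin2 ** 2
          + nu2 ** 2 / ((nu2 - 2.0) ** 2 * (nu2 - 4.0)) * cos2 ** 2)
    b = 4.0 + f1 ** 2 / f2             # matched degrees of freedom
    a = math.sqrt(f1 * (b - 2.0) / b)  # dilation factor applied to t(b)
    return a, b


# Hypothetical input: two samples of 11 cases, each with variance 4.
a, b = patil_scale_and_df(11, 4.0, 11, 4.0)
print(a, b)
```

IN THIS SKETCH THE DENSITY AND CUMULATIVE PROBABILITY WOULD THEN BE
OBTAINED FROM AN ORDINARY STUDENT'S -T- WITH B DEGREES OF FREEDOM,
EVALUATED AT THE STANDARDIZED VALUE DIVIDED BY A.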

	FOR THE CUMULATIVE PROBABILITY OF THE STUDENT'S -T- DIS-
        TRIBUTION, AN ALGORITHM BY A. SCHLEIFER BASED UPON
        THE INCOMPLETE BETA FUNCTION RATIO IS USED.  IT
        WAS CODED ORIGINALLY IN 'FORTRAN' AS PART OF THE
        'MANECON' PACKAGE.  'MANECON' WAS DEVELOPED BY
        R. SCHLAIFER AT THE HARVARD BUSINESS SCHOOL.
        THIS ALGORITHM IS INCORPORATED IN THE 'MANECON' ROUTINES:
	'DBETCU' AND 'SPCASE'.  THE FORMULAS ARE DESCRIBED IN
	THE BOOK 'COMPUTER PROGRAMS FOR ELEMENTARY DECISION
	ANALYSIS' (HARVARD, 1971), PP. 200-205.  THE ADAPTATION WAS  
        MADE WITH THE PERMISSION OF THE PRESIDENT AND FELLOWS
	OF HARVARD COLLEGE.
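
THE RELATION BETWEEN THE CUMULATIVE STUDENT'S -T- PROBABILITY AND
THE INCOMPLETE BETA FUNCTION RATIO CAN BE ILLUSTRATED WITH THE
PYTHON SKETCH BELOW.  IT USES THE STANDARD IDENTITY
P(T <= t) = 1/2 + (SIGN OF t)/2 * I(y; 1/2, DF/2), WITH
y = t^2/(DF + t^2), AND EVALUATES THE RATIO BY SIMPSON'S RULE.
THE NUMERICAL METHOD AND THE FUNCTION NAME ARE ASSUMPTIONS MADE FOR
ILLUSTRATION; THIS IS NOT THE 'MANECON' ALGORITHM AND IT IS NOT
THE CODE USED IN TBF.BAS.

```python
import math


def t_cumulative(t, df, panels=200):
    """P(T <= t) for Student's t, via the incomplete beta function ratio.

    The substitution u = s^2 removes the endpoint singularity, leaving
    I_y = (2 / B(1/2, df/2)) * integral_0^sqrt(y) (1 - s^2)^(df/2 - 1) ds,
    which is evaluated here with Simpson's rule (panels must be even).
    """
    if t == 0.0:
        return 0.5
    y = t * t / (df + t * t)
    top = math.sqrt(y)
    h = top / panels
    power = df / 2.0 - 1.0
    total = 0.0
    for i in range(panels + 1):
        s = i * h
        # Simpson weights: 1 at the ends, 4 at odd nodes, 2 at even nodes.
        weight = 1 if i in (0, panels) else (4 if i % 2 else 2)
        total += weight * (1.0 - s * s) ** power
    integral = total * h / 3.0
    log_beta = (math.lgamma(0.5) + math.lgamma(df / 2.0)
                - math.lgamma((df + 1) / 2.0))
    ratio = 2.0 * integral / math.exp(log_beta)
    return 0.5 + math.copysign(0.5 * ratio, t)


print(t_cumulative(1.0, 2), t_cumulative(2.228, 10))
```

BECAUSE EACH CALL INTEGRATES FROM ZERO, THE RESULT AT ONE POINT DOES
NOT DEPEND ON ANY OTHER POINT.  THE SKETCH IS NOT INTENDED FOR VERY
SMALL DEGREES OF FREEDOM COMBINED WITH EXTREME VALUES OF -T-.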

        FOR LN(GAMMA(X)), AN ALGORITHM USING A STIRLING ASYMPTOTIC
        APPROXIMATION IS USED.  SEE ABRAMOWITZ AND STEGUN,
        P. 257, EQ. 6.1.41.
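
A MINIMAL PYTHON SKETCH OF A STIRLING ASYMPTOTIC APPROXIMATION FOR
LN(GAMMA(X)) FOLLOWS.  IT USES THE LEADING TERMS OF THE SERIES IN
ABRAMOWITZ AND STEGUN, EQ. 6.1.41.  THE STEP THAT SHIFTS SMALL
ARGUMENTS UPWARD WITH THE RECURRENCE GAMMA(X+1) = X*GAMMA(X), THE
NUMBER OF TERMS KEPT, AND THE FUNCTION NAME ARE ASSUMPTIONS MADE
FOR ILLUSTRATION AND MAY DIFFER FROM THE CODE IN TBF.BAS.

```python
import math


def ln_gamma(x, shift_to=10.0):
    """Natural log of Gamma(x) for x > 0 using the Stirling asymptotic series."""
    if x <= 0:
        raise ValueError("x must be positive")
    # Recurrence Gamma(x + 1) = x * Gamma(x): move x up to where the
    # asymptotic series is accurate, and remember the correction.
    correction = 0.0
    z = float(x)
    while z < shift_to:
        correction += math.log(z)
        z += 1.0
    series = ((z - 0.5) * math.log(z) - z + 0.5 * math.log(2.0 * math.pi)
              + 1.0 / (12.0 * z)
              - 1.0 / (360.0 * z ** 3)
              + 1.0 / (1260.0 * z ** 5))
    return series - correction


# Compare with the Python library routine at a few arguments.
for x in (0.5, 1.0, 2.5, 10.0, 1406.0):
    print(x, ln_gamma(x), math.lgamma(x))
```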





6.	SAMPLE PROBLEMS

SAMPLE OUTPUT FOR PROBLEMS INVOLVING EACH OF THE THREE KINDS OF
EVALUATION (I.E., SINGLE -T-;  DUAL -T-; AND BEHRENS) IS PROVIDED
IN A SEPARATE DOCUMENT.





7.	INSTRUCTIONS FOR USE ON THE DECSYSTEM-10
        AT THE JOHNS HOPKINS UNIVERSITY

  1.	LOG INTO THE DECSYSTEM-10
  2.	TYPE "R BASIC" TO INITIATE BASIC MONITOR PROGRAM
  3.	WHEN BASIC TYPES "READY", TYPE "OLD TBF"
  4.	WHEN BASIC TYPES "READY", TYPE "RUN"
  5.	ANSWER QUESTIONS AS THEY ARE ASKED.  BE PREPARED TO
	SUPPLY LOWER AND UPPER LIMITS BETWEEN WHICH THE
	VALUES TO BE EVALUATED LIE.  NOTE THAT OFTEN IT IS
	DESIRABLE TO MAKE THE PLOT SYMMETRIC ABOUT THE
	MEAN, BUT THAT FOR THE TABULATION IT IS OFTEN 
	SUFFICIENT TO EVALUATE VALUES ON ONE OR THE OTHER
	SIDE OF THE MEAN.





8.	REFERENCES


M. ABRAMOWITZ AND I.A. STEGUN (EDS.), HANDBOOK OF
MATHEMATICAL FUNCTIONS  (NATIONAL BUREAU OF STANDARDS,
APPLIED MATHEMATICS SERIES, 55;  U.S. GOVERNMENT
PRINTING OFFICE, 1968).


G.E.P. BOX AND G.C. TIAO,  BAYESIAN INFERENCE IN STATISTICAL
ANALYSIS. (READING, MASS., ADDISON-WESLEY, 1973).


J. FENNESSEY AND S. RADIUS,  "THE PRESENTATION OF STATIS-
TICAL RESULTS CONCERNING MEANS AND DIFFERENCES OF MEANS",
TECHNICAL REPORT,  DEPARTMENT OF SOCIAL RELATIONS, THE
JOHNS HOPKINS UNIVERSITY,  NOVEMBER 1975.


V. K. PATIL, "APPROXIMATIONS TO THE BEHRENS-FISHER DISTRIBUTION",
BIOMETRIKA, VOL. 52, 1965, PP. 267-271.


V. K. PATIL, THE BEHRENS-FISHER DISTRIBUTIONS. (UNPUBLISHED
PH. D. DISSERTATION, THE UNIVERSITY OF MICHIGAN, 1963)
MICROFILM ORDER #63-6937.


R. SCHLAIFER,  COMPUTER PROGRAMS FOR ELEMENTARY DECISION
ANALYSIS. (HARVARD UNIVERSITY, 1971).





9.	NOTES

  1.    TO ADAPT THE PRINT LINE TO A DIFFERENT WIDTH,
        CHANGE THE TWO STATEMENTS THAT NOW READ MARGIN 80
        AND MARGIN#1,80 TO THE DESIRED MARGIN.  ALSO,
        CHANGE THE STATEMENTS IN WHICH Y9/50 APPEARS TO
        USE A NEW DIVISOR.  THESE STATEMENTS APPEAR IN THE
        DENSITY PLOT SUBROUTINE.  IN ADDITION, IT IS
        NECESSARY TO CHANGE THE TWO STATEMENTS DIRECTING
        THE PRINTING OF THE DOTTED Y-AXIS.  AS CURRENTLY
        WRITTEN, THE LAST CHARACTER OF THE DOTTED AXIS
        PRINTS IN COLUMN 72.

