Trailing-Edge
-
PDP-10 Archives
-
decuslib20-05
-
decus/20-0149/mulopt.hlp
There are 2 other files named mulopt.hlp in the archive. Click here to see a list.
Multiple Linear Regression Analysis
THE OPTION SPECIFICATION
It is possible to have the program perform some tasks optionally by
providing an option specification in a job. It consists of the keyword
"Options" followed by a list of option names or corresponding option
numbers (the option statement), separated by commas and terminated with a
';' (semicolon). The following ten options are available:
option number option name
1 Transformed data matrix
2 Correlation matrix
3 Residual analysis
4 No regression analysis
5 Process submodels
6 Print input data
7 No input data rewind
8 Save original model
9 Test reduced model
10 Missing values
Options 1, 2, 3 and 6 cause the corresponding piece of information to
be printed. However, option 1 lists only those (possibly transformed)
variables that are present in the model formula in a neat tabular form,
while option 6 lists all the original input data serially (eleven numbers
per line) without any special layout, because the input data consists (by
definition) of an unstructured series of numbers (cf. "Help"/Data).
Option 4 suppresses the regression analysis; it is meant to be used
in combination with option 1 and/or 2.
Option 5 causes the program to process submodels, which are formed by
a form of backward elimination: each time the last term from the right
hand part from the model formula is omitted, by deleting the last column
from the design matrix, and a regression analysis is performed with the
reduced design matrix. Messages are generated about which terms are
omitted, while further processing of the job ceases when the resulting
model formula is of the form: y = c. Moreover a test is made (under the
usual assumptions) whether the omitted terms did contribute significantly
to the regression sum of squares (cf. "Help"/Tests).
To option 5 a specifier list can be appended, to prevent the pro-
duction of waste output for unwanted submodels. In this list the number of
terms to be omitted from the model formula (counting backwards, starting at
the end) must be given enclosed in parentheses. For example the option:
Process submodels (6, 10) instructs the program to process only two
submodels, one with the last six terms omitted and one with the last ten
terms omitted (from the original model formula). If the user asks for more
terms to be omitted than are present in the model formula, an error message
is supplied and the execution of that job is terminated. Moreover, if no
explicit specifier list is appended to option 5, the options 2 and 3 yield
no effect (even if specified), which is also to prevent the production of
waste output for the submodels.
Option 7 gives the user the opportunity to process consecutive pieces
of input data in consecutive jobs. Normally the processing of the input
data for each job starts with the first number in the data specification
(or with the first number in the datastream), and the program gives a
(warning) message if the input formula does not match the input data
precisely. This option disengages the message and causes the program to
continue processing input data where the previous job had finished.
Option 8 causes the residual degrees of freedom and residual sum of
squares from the current job to be saved, in order to be able in the next
job, by means of specifying option 9, to test whether the model under
consideration in that next job, shows a significant increase in residual
sum of squares in comparison with the model in the previous job. In effect
this gives the possibility of testing a hypothesis concerning a linear
combination of the parameters from a model (cf. "Help"/Tests), for
instance:
"Model 1" y = b1 * x1 + b2 * x2 + b3 * x3;
"Options" Save original model;
"Run"
"Model 2" y - 4 * x1 = b2 * (x1 + x2) + b3 * x3;
"Options" Test reduced model;
"Run"
causes the null hypothesis: b1 = b2 + 4 to be tested (in the second job).
Option 10 may be used to identify some observations or repetitions
as 'missing'. In a specifier list, appended to this option, the missing
values must be given enclosed in parentheses. When a repetition equal to a
missing value is encoutered in the input data, the corresponding set of
repetitions for the dependent variable(s) is not included in the design
matrix. When an observation equal to a missing value is encountered in the
input data, or when none of the repetitions are included in the design
matrix, the corresponding set of observations for the independent variables
together with the (possibly empty) set of repetitions for the dependent
variable(s) (i.e. the 'case') is not included in the design matrix.