HASSLE STAT PACK FORTRAN IV - STAT LARS PALMER AB HASSLE FACK 431 20 MOLNDAL 1 SWEDEN HASSLE STAT PACK F IV - STAT A)GENERAL THE COMPONENTS OF THE TOTAL PACKAGE ARE: 1)STAT DESCRIBED HERE 2)OUTLAY CREATES AN INPUT FILE TO STAT 3)KSORT REARRANGES ROWS AND COLUMNS IN THE FILE 4)OUTCHK CHECKS ON UNFORMATTED FILE FOR ERRORS NOT ON DECUS TAPE 5)BIGSTA HANDLES >10 COLUMNS NOT ON DECUS TAPE OPERATING ON OUTLAY FILES ARE ALSO SEVERAL SPECIFIC PROGRAMS E.G. LESQ THAT DOES A NON LINEAR LEAST SQUARES CURVE FIT. B)IMPLEMENTATION AS OF TODAYS DATE THE FOLLOWING ROUTINES ARE IMPLEMENTED: NUMBER PERFORMS -1 LIST 0 NEW DATA FROM SAME FILE 1 MEANS ETC 2 T-TESTS 3 REGRESSION LINES 4 TYPE 2 REGRESSION 5 CORRELATION MATRIX 6 ANOVA ONE SIDED 7 SHEFFE CONTRASTS 8 ANOVA TWO SIDED 13 WILCOXON 14 MANN-WHITNEY 15 KRUSKAL-WALLIS 16 SPEARMAN RANK 17 FRIEDMAN 18 NUMERIC INTEGRATION ON X/Y 19 MEDIANS AND RANK OF COLUMN 20 CROSSTABULATION 21 SCATTER PLOT 22 VARIOUS CONVERSIONS 48 OUTPUT A COMMENT 49 FLAG-SETTING ROUTINE 50 EXIT TO MONITOR F IV - GENERAL-PAGE 1 HASSLE STAT PACK F IV - STAT C)INDATA AND FILE UNIT NUMBERS THE FOLLOWING INDATA FILES ARE USED: UNIT CONTAINS 6 THE DATA - IN OUTLAY FORMAT - TO BE ANALYZED 5 A PARAMETER FILE CONTAINING THE REQUEST FOR THE ANALYSES TO BE PERFORMED. FORMAT SEE BELOW (D). OUTFILES UNIT CONTAINS 3 THE RESULTS NORMALLY TO LINE PRINTER. IF NO LINE PRINTER IS AVAILABLE REASSIGN. LDEVE ERROR INFORMATION. LDEVL TRACE OF PROGRAM FLOW. LDEVE AND LDEVL ARE FURTHER SPECIFIED UNDER CARD 49. F IV - INDATA-PAGE 2 HASSLE STAT PACK F IV - STAT D)THE PARAMETER FILE NOTE THAT AT PROGRAM START THE PROGRAM READS THE PARAMETER FILE AS IF IT HAD JUST SEEN A 0 REQUEST CARD. THE REQUEST CARDS HAVE THE FOLLOWING FORMAT: CT(CARD TYPE),ARG1(ARGUMENT 1),ARG2,.... SOME OF THE REQUEST CARDS MUST BE FOLLOWED BY PARAMETER CARDS. THE MOST COMMONLY USED FORMAT IS: CT, KOL1, KOL2,IOPT, WITH KOL1, KOL2 BEING 2 COLS TO BE COMPARED AND IOPT AN ANALYSIS SPECIFIC PARAMETER. CARD TYPE ARGUMENTS PARAMETER CARDS -1 0-1 1 0 0 1(-4) GROUPS +1 COLUMNS 1 0-2 2 3 3 2-3 4 2-3 0-1 5 0 6 0 7 0-1 0-1 8 0 13 2 14 2 15 0 16 2 17 0-1 18 2-3 19 0-1 20 2-3 21 2-3 22 1-3 48 0-1 AS SPECIFIED BY ARGUMENT. 49 0-4 50 0 ****COMMENTS*** CARD TYPE -1 THE REQUEST CARD -1 ASKS FOR A LISTING OF THE INPUT DATA. IF ARG1=0 IT MUST BE FOLLOWED BY A CARD INDICATING THE FORMAT FOR THE LISTING. THIS MUST BE A STANDARD FORTRAN FORMAT NOTATION WITH AS MANY FORMAT DESCRIPTIONS AS THERE ARE COLUMNS IN THE FILE. IT SHOULD CONTAIN 1H , AS ITS FIRST ELEMENT. FIV-PARAMETERS-PAGE 3 HASSLE STAT PACK F IV - STAT CARD TYPE IF ARG1 <> 0 ONE OF THREE INTERNAL FORMATS IS USED: -1,1 F10.1 -1,2 F10.4 -1,3 G12.4 ****MISSING DATA**** ALL ROUTINES IN THE PROGRAM TREAT A 0-VALUE AS MISSING DATA. (NONMISSING DATA = 0 CAN BE SET AS E.G. 1E-25). FORMAT -1,3 IS NOT SUITABLE FOR MISSING DATA, BUT IF THERE ARE ANY A SPECIFICATION ERROR WILL BE GENERATED. 0 READS NEW DATA. THE PARAMETER FILE ALWAYS BEGINS WITH THE PARAMETER CARD FOLLOWING THE 0 CARD. ****PARAMETER CARD*** 1(-4) NUMBER OF GROUPS TO BE READ INTO CORE AND THEIR NUMBERS. THE GROUP NUMBERS ARE NOT PRESERVED IN CORE. THEY ARE TREATED AS ONE GROUP. TWO SPECIAL FORMS OF THIS INPUT ARE AVAILABLE: 1)IF ONLY THE NUMBER OF GROUPS IS SPECIFIED IT IS TAKEN TO MEAN THE FIRST N SEQUENTIAL GROUPS I.E. 5 IS INTERPRETED EQUIVALENT TO 5,1,2,3,4,5 NOTE: IF N>24 USE SEVERAL CARDS SEE # 2)IF THE FIRST CARD IN THE INPUT FILE IS NEGATIVE IT IMPLIES THAT THE ANALYSES FROM HERE TO THE NEXT 0 CARD, OR TO THE 50 CARD IF NO 0 CARD INTERVENES, ARE TO BE DONE FOR ALL GROUPS IN THE INPUT FILE I.E. -1 5 1 5 0 IMPLIES 'ON ALL GROUPS IN INPUT FILE GENERATE MEAN AND CORRELATION MATRIX FOR THE FIRST 5 COLUMNS'. IF UNIT 5 IS ASSIGNED TO UNIT 4 (E.G. THE REQUESTS ARE GIVEN FROM THE CONSOLE) THIS OPTION SHOULD NOT BE USED. # THESE CARDS ARE IN 25I3 FORMATS AND MUST IF NECESSARY BE CARRIED OVER TO ANOTHER CARD. FIV-PARAMETERS-PAGE 4 HASSLE STAT PACK F IV - STAT CARD TYPE NEXT NUMBER OF COLUMNS TO BE USED AND THEIR NUMBERS. THE COLUMNS CAN BE REORGANIZED IN CORE BY PLACING THE REQUEST IN A NON-SEQUENTIAL MANNER. SPECIAL FORM 1) AS DESCRIBED UNDER ABOVE ALSO APPLIES HERE. IF NUMBER OF COL IS STATED > THAN NUMBER OF COLUMNS IN FILE IT IS ADJUSTED TO THE LATTER.IF A TOTALLY EMPTY COLUMN IS READ THE COLUMN COUNT IS REDUCED BY ANALYSIS NO 1.THEREFORE IT IS POSSIBLE TO ROUTINELY SPECIFY E.G. 10 AS NUMBER OF COLUMNS IF ALL EXISTING COLUMNS ARE WANTED. 1 THE TREATMENT OF THE ARGUMENTS IS: IF ARG1 AND ARG2 BOTH = 0 CALCULATE MEANS AND STANDARD ERRORS. IF ARG1=15 CALCULATE MEANS ONLY ON ROWS THAT ARE COMPLETE I.E. CONTAIN NO MISSING DATA SEE 7. IF ARG1<=NK AND ARG2=0 PRINT SUMS OF SQUARES OF COL(ARG1). IF ARG1 AND ARG2 BOTH <= NK PRINT SUMS OF SQUARES AND CROSSUMS FOR ALL COMPLETE PAIRS. IF ONE DATA IS MISSING SO ARE BOTH SKIPPED. 2 3 TYPES OF T- TESTS ARE IN THE PACKAGE. CALLING: 2,K1,K2,N N IS TYPE OF TEST AND K1 K2 THE 2 COLUMNS TO COMPARE. N=1 STANDARD FORMULA. SIGMA PRESUMED EQUAL. N=2 SIGMA NOT PRESUMED EQUAL. N=3 PAIRED VALUES. 3 REGRESSION LINE - CALLING 3,K1,K2 4 TYPE II REGRESSION (BARTLETTS 3 GROUP) REF S&R PAGE 480.THE NORMAL REGRESSION LINE PRESUMES X TO BE WITHOUT ERROR. IF SO IS NOT THE CASE ANALYSIS 4 IS TO BE USED. CALLING 4,KOLX,KOLY,IOPT OPTIONAL THE LINE (Y=A+BX) IS CALCULATED AND CONFIDENCE LIMITS TO B ARE PRINTED. IF IOPT = 0 THE VALUES P=0.05,0.01,0.001 ARE USED. IF IOPT>0 - IOPT(<4) P VALUES MUST BE GIVEN. THESE ARE IN THE FORM 0MAXXL JOB ABORTED$ YES B)ON READING THE ANALYSIS REQUESTS: NON-EXISTING ANALYSIS REQUESTED IGNORED YES ANALYSIS REQUIRES COL NR THESE ARE WRONG IGNORED # ANALYSES REQUIRES A 3RD ARGUMENT BUT THIS IS MISSING IGNORED # C)IN SPECIFIC ANALYSES: BESIDE THESE THERE ARE SOME SELFEXPLANATORY MESSAGES. NR -1 ARGUMENT OUT OF RANGE IGNORED # BAD FORMAT ON 0 FRTS ABORTS PROGRAM 1 ARG OUT OF RANGE IGNORED # 2 CODE WRONG IGNORED YES 2 TOO FEW DATA IGNORED YES 3 TOO FEW DATA IGNORED YES 4 BAD P VALUE SKIP 4 # 7 " SKIP 7 # 3,6,8 EMPTY COL IGNORED YES 13 N<6 SKIPPED # 15 NK<3 SKIPPED # 16 N<4 SKIPPED # 17 NK<3 SKIPPED # 18 N<3 SKIPPED # 22 ON BAD ARGUMENTS(<0)TO SQUARE ROOTS OR LOGS THE VALUE IS SET TO 0 MANY ROUTINES ON EXCEEDING CAPACITY IGNORED # #ALL THESE ARE SIGNALLED BY THE TEXT: SPECIFICATION ERROR IN STAT## WHERE ## IS THE MODULE NUMBER. FIV - ERROR-PAGE 11 HASSLE STAT PACK F IV - STAT ALL THESE TEXTS ARE SENT TO DEVICE LDEVE. IN SOME PLACES A DIVIDE BY ZERO MIGHT BE PRODUCED WHEN IT IS NOT GUARDED FOR. NOTE THAT LINE NUMBERS ARE IRRELEVANT. $IF THE JOB IS ABORTED THE PROGRAM SEARCHES THE PARAMETER FILE FOR A 50 CARD OR A 0 CARD SO AS TO CONTINUE WITH THE NEXT INDATA REQUEST. H)LOADING THE LOADING INFORMATION IS IN THE FILE STATB.BI FOR USE BY BATCH. THE FILE STATC.BI IS A BATCH FILE FOR COMPILING THE NECESSARY ROUTINES (NOTE: THIS FILE IS OS/8 V 3 FORMAT). NOTE THAT ALL MODULES ARE COMPILED WITHOUT LINE NUMBERS. THE PROGRAM REQUIRES 12K. IF THE PROGRAM IS LOADED WITHOUT OVERLAYS IT TAKES ABOUT 35 K. IN A 16K DECTAPE SYSTEM YOU ARE ADVICED TO MOVE SOME OF THE ROUTINES NOW AT LEVEL 2 TO LEVEL MAIN. THIS SAVES A LOT OF OVERLAYING CORE. IF YOU WANT TO REMOVE ROUTINES FROM THE PROGRAM STUDY THE TRANSFER TABLE IN SORT. YOU CAN INHIBIT CALLS TO THE VARIOUS MODULES BY CHANGING THE VALUES IN THE DATA LIST IN SORT. NOTE THAT THE OVERLAYS ARE VERY COMPACT PACKED. TO INCREASE THE PACKAGE A GOOD DEAL OF THOUGHT WOULD BE REQUIRED. I)TESTDATA TESTDATA TO THE PROGRAM IS IN STATDA.DA(/6) USING STATST.DA AS UNIT 5 WILL TEST ALL ROUTINES IN THE PACKAGE. J)REFERENCES: BESIDES THOSE MENTIONED IN THE GENERAL INTRODUCTION THE FOLLOWING BOOKS HAVE BEEN REFERRED TO IN THIS PACKAGE: D.COLQUHOUN (C) LECTURES ON BIOSTATISTICS 1971 M ABRAMOWITZ & I STEGUN HANDBOOK OF (A) MATHEMATICAL FUNCTIONS 1964 FIV-LOADING-PAGE 12 HASSLE STAT PACK F IV - STAT K)SUBROUTINES THE STATISTICAL PACKAGE USES A NUMBER OF SUBROUTINES THAT ARE OF A MORE GENERAL NATURE AND AS SUCH ARE NOT CONSIDERED PART OF THE PACKAGE. THESE ARE: 1) A NUMBER OF ROUTINES FROM THE SSP LIBRARY (SCIENTIFIC SUBROUTINES) SEE THE DEC-10 LIBRARY FOR FURTHER INFORMATION. THESE ROUTINES ARE: UTEST (PERFORMS MANN WHITNEY UTEST) MPAIR (PERFORMS WILCOXONS MATCHED PAIRS RANK SIGN TEST) SRANK (PERFORMS SPEARMAN RANK) TWOAV (PERFORMS FRIEDMAN ANALYSIS) AND THE GENERAL ROUTINES TIE, NDTR AND RANK. 2)SEVERAL GENERALIZED SUBROUTINES PROGRAMMED FROM VARIOUS SOURCES: NUMINT NUMERICAL INTEGRATION (D&R) PVALUE PROBABILITY FUNCTION (ADAPTED FROM DEC10 LIBRARY) FINV,TINV,ZINV PROBABILITY FUNCTIONS (A) THE OTHER SUBROUTINES ARE SPECIFIC TO THE PACKAGE. ALL ROUTINES SPECIFIC TO THE PACKAGE HAVE NAMES IN THE FORM STAT??.FT. FIV-SUBROUTINES-PAGE 13