next up previous 222
Next: Restarting xcatview after a crash
Up: Browsing and selecting with an X display
Previous: Browsing and selecting with an X display


Statistics computed for individual columns

Statistics can be computed for one or more individual columns. They can be computed from either all the rows in the catalogue or just the subset of rows comprising a selection which has been created previously. Obviously, only non-null rows are used in the calculations. Statistics can be displayed for columns of any data type, though for CHARACTER and LOGICAL columns the only quantity which can be determined is the number of non-null rows.

For each chosen column its name, data type and the number of non-null rows (that is, the number of rows used in the calculation) are displayed and the statistics listed in Table [*] are computed. Though all these quantities are standard statistics there is a remarkable amount of muddle and confusion over their definitions, with textbooks giving divers differing formulæ. For completeness, and to avoid any possible ambiguity, the definitions used in xcatview and catview are given below. These formulæ follow the CRC Standard Mathematical Tables[4] except for the definition of skewness which is taken from Wall[30].


Table: Statistics computed for columns

Minimum
Maximum
Total range
 
First quartile
Third quartile
Interquartile range
 
Median
Mean
Mode (approximate)
 
Standard deviation
Skewness
Kurtosis


In the following the set of rows for which statistics are computed is called the `current selection' and it contains $n$ non-null rows. $x_{i}$ is the value of the column for the $i$th non-null row in the current selection. The definitions of the various statistics are then as follows.



next up previous 222
Next: Restarting xcatview after a crash
Up: Browsing and selecting with an X display
Previous: Browsing and selecting with an X display

CURSA Catalogue and Table Manipulation Applications
Starlink User Note 190
A.C. Davenhall
4th November 2001
E-mail:ussc@star.rl.ac.uk

Copyright © 2001 Council for the Central Laboratory of the Research Councils