Statistics Anyone?




[Photo of the Author]


original in en Jürgen Pohl


Jürgen Pohl works as an R&D Engineer and technical translator on the Pacific Coast of the US.


Most statistics packages seem to be overwhelming in their scope, forcing the user on a very steep learning path, most of them are also very expensive. However, there are a few convenient alternatives for those who need them most: the beginner, who is being initiated to the secrets of the magic world of statistics, as well as the user for whom the elaborate commercial packages would be overkill. SalStat is one of those sought-after alternatives - thanks to its creator it is open source, meaning: free! Another advantage: the program is platform independent. Last but not least, it is very easy to use. The program is written in Python, however it can be used without any prior knowledge of that language.

ArticleIllustration:[Das Titelbild des Artikels]



SalStat - the Statistics Program


Building on his own experience, the creator of SalStat was very well aware of the predicament in which many of those being introduced to statistics find themselves in: in order to expand their newly acquired knowledge (or just to do their homework or project...) an affordable, easy to use statistics program on their own machine is essential. With this in mind Alan James Salmoni developed SalStat and published it under the GNU license. It can be found at its homepage.

What can SalStat do?

Unfortunately this article cannot provide an introduction to statistics. In 'Resources' you will find some information on this topic. Here are the lists of the statistics and tests SalStat is able to generate:

Parametric and non-parametric tests are combined here:

N (count) range
sum number of missing cells
mean geometric mean
variance harmonic mean
standard deviation skewness
standard error kurtosis
sum of squares median
sum of squared deviations median absolute deviation
coefficient of variation mode
minimum interquartile range
maximum number of unique levels of data

Inference Statistics

t test (paired) Pearsons correlation
t test (unpaired) Spearmans rho correlation
1 sample sign test Kendalls tau correlation
2 sample sign test Point biserial r correlation
F test for variance ratio linear regression
Wilcoxon Ranked Sums Test Single factor analysis of variance (between subjects)
Mann-Whitney U Test Single factor analysis of variance (within subjects)
Kolmogorov-Smirnov test Kruskall-Wallis H test
Paired Permutation test Friedman test

How Do We Work With SalStat?

Before you can take a look at SalStat you need to install it on your machine , but we will get to that later. First I would like to give you an idea of what you are getting.
When you open the program these two windows appear on your screen:

SalStat Statistics Main Window

The first window (titled 'SalStat Statistics') in the foreground of the screenshot above shows a data entry grid like most spreadsheets: here we enter the data of our samples to be analyzed. At the top of the window the usual toolbar - clicking on one of the tools brings up a dropdown list of functions to select from.

The second window (titled 'SalStat Statistcs - Output'), which is initially empty, will show the statistics results of the executed test.

Nothing is more frustrating than having a program like this installed without any data to play with - the author was so wise to include a test file (testreport1.txt) with known results. Simply enter the test data into your table and choose what kind of statistics or tests you would like to see and then hit 'Okay' and the result will appear in the output window. The windows below happened in following sequence :

  1. We entered the data in the columns A, B, C of the data grid of our open SalStat Statistics window. The data are from the sample file testreport1.txt.
  2. Next we went to the 'Analyse' tool on the tool bar. Here we chose the 'One Condition Test'
  3. The 'One condition Test' window opened. We marked the specifics of our test:

The previously blank 'SalStat Statistics - Output' window (our screen shot below, left) will show the results of the test.

Test File

Getting and Installing SalStat

The program is waiting to be downloaded from its website. A number of alternatives are offered to accommodate your operating system(s). I have the program on two desktop machines with different operating systems in two locations. The source code is available for downloading as well - maybe you want to show off your (Python) programing skills...? Before trying any installation, please read the 'Basic Users Guide', also available on the SalStat homepage under 'Documentation'. The guide gives clear instructions on how to install the program, we need not to repeat them here - please have a look at the website.

Customizing SalStat

Another very useful part of SalStat is its built-in ability for users to write their own scripts - to automate tasks, build their own tests, etc. In the program's manual, which is the main part of the 'Help' tool, you can find a detailed description on how to do that ('Scripting and Making Your Own Tests'). Have a look, it is a very helpful introduction to scripting. It should encourage the user to utilize the scripting feature. The last tool of the 'Analyse'-dropdown list gives you access to the 'Scripting Window' - here you can enter your scripts: try the samples given in the manual, they could convince you to use this feature.

One minor hang-up for some people: the program is written in Python. In order to do serious program customization you would need to learn the language. Fortunately Python is an open language, meaning free: you can download it with extensive instructions (addressing everyone from beginner to expert) from the Python webpage. If you do not want to deal with Python you can use SalStat as is - but with some Python knowledge you may enhance your benefits from this program.


SalStat was written with ease of use in mind. The user can click his or her way through a wish list of statistics and tests. The manual gives instructions for all the tests, including some hints on the value of their results. In general, however, it is assumed that you have at least basic knowledge of statistics or are in the process of acquiring it.

One word of caution: before you jump in and bank on your career as an up-and-coming scientist by using results generated with this program, listen to the recommendations of its creator and convince yourself of its merits by testing it! Those who are just starting with statistics will find many examples in text books - plug some of the available data into SalStat and see what you are getting.The test file (testreport1.txt), which comes with the download, gives you some result comparisons of tests run with other programs.