Skip to content
Bewa edited this page Jun 27, 2018 · 8 revisions

What was Instat and what is R-Instat?

The original Instat is a general statistics package. It is simple enough to be useful in the teaching of statistics and has the power to assist research in any discipline that requires the statistical analysis of data.

Instat was written by experienced statisticians engaged in teaching, research and consultancy, in collaboration with professional programmers.

Instat is used by many people who have not used a statistics package before. It is easy to learn and to use. It is designed to encourage good statistical practice and to support the teaching of statistics, both in class-room situations and on an individual basis. We hope this will also apply to R-Instat

The Windows version has been developed from Instat around 2000). This was an improvement on our first shareware version released in Autumn 1994. Before then, Instat had been sold commercially. R-Instat is a total rewrite, and is build on the enourmous power of the R statistical system.

Why should I use R-Instat?

Some points are as follows:

  1. It is simple to use.

  2. All the documentation is available on-line.

  3. It encourages good statistical practice. For examples of what we mean by this, see the series of Case Studies or the Introductory Guide.

  4. It is quick and easy to install. Installation usually takes less than 5 minutes.

  5. It has a wide range of facilities. These include:

    • good data manipulation
    • descriptive statistics
    • plotting
    • multi-way tables
    • simple, multiple regression, gelenralised linear models, etc
    • analysis of variance
  6. It can handle quite large data sets, for example

    • 10 variables by 200 cases is no problem
    • 100 variables by 10,000 cases is now easy
    • We are keen to see the limits of R-Instat
  7. R-Instat was free for individual non-commercial use (now it is completely free). R-Instat is both free and open source. It can be downloaded and includes both the program and the full documentation. It can easily be distributed to friends, colleagues, or students.

What about documentation and help?

One problem with the effective use of packages can be the lack of documentation when it is needed. This is particularly the case with shareware packages. This is not the case with R-Instat. There are 4 main guides for R-Instat, as follows

Tutorial, Introductory, Reference, Climatic

This version includes all these guides. They are available online. When you install R-Instat you can automatically install all these guides. This means that if you are teaching a group of 20 students, all of them have easy access to the manuals. The guides are available as help files and in printable pdf format.

Both the program and the Guides can be downloaded from our web site http://www.reading.ac.uk/ssc

So, what is the catch?

There isn't a catch. Instat has been written and used by staff who are interested in improving the teaching of statistics. Statistics is often not a popular subject. This is sometimes because courses are unnecessarily theoretical, particularly when taught to students for whom it is not their main subject.

We are often amazed to see a statistics course still taught separately from computers. In some developing countries, computers are still not plentiful for individual students, but where they are available, their use can make a statistics course much more relevant and more fun.

In the help files on Teaching and in the new Introductory Guide, we describe ways that statistics teaching can develop, once computers and statistical software are available. We also include a set of 11 Case Studies, as well as many examples from R packages, etc.

We believe R-Instat can continue in its role of providing support for the teaching of statistics, and for research that involves statistical analyses, for many years to come. We hope you enjoy using it.

Is R-Instat only for small data sets?

No.

It also depends what you mean by small. Think of your data as one, or more rectangular worksheets, or data frames. In each worksheet the columns are the variables you measured and the rows are the cases. In the old Instat the maximum number of columns in a worksheet is 127. Now, the front-end grid apparently has a limit of 32,000 columns - though the limit in R is higher!

For the maximum number of rows, i.e. cases, lets take some examples. With experimental data there may be 100 rows, and that is low, and so fine. With daily climatic analyses for 50 years there are close to 20,000 rows and that is fine too.

If you have 50 stations, each with 50 years of daily data, then you are close to 1 million rows. That is the limit in many spreadsheets, including the grid we use as the front end to R. However, the grid is just a visual window to the R data frames, where all the work is conducted. Well before having 1 million rows we suggest you don't have all the data displayed in the front-end grid. R itself can cope with much longer data sets.

So, the limits are very high. However, processing the data, with very long columns may take a long time. We suggest the limits now are concerned with processing time and not space. If you have a very large data set - with millions of rows - then we strongly suggest you migrate from using R-Instat to using R itself. And if you really want to know about large data sets, then R has a task view that can help. It is here: https://cran.r-project.org/web/views/HighPerformanceComputing.html

We currently use a spreadsheet (Excel) for our statistical work, so should we move to R-Instat?

No.

Using any statistics package does not mean abandoning Excel. Consider instead adding a statistics package to your use of Excel.

Most statistics packages including Instat can read from Excel workbooks and also write to Excel. So you can still continue doing data manipulation, tabulation and graphics in Excel. If you add a statistics package, then you could just use it for the more advanced statistical work that is not in Excel or is poor in Excel.

Where does R-Instat fit in relation to a spreadsheet?

We are enthusiasts for Excel, even for statistical work. Excel is useful, because of its powerful data manipulation, tabulation (pivot-tables), and graphics. Its dynamic nature (the way it updates automatically) and ease with which you can add to Excel are excellent.

However there are two problems with Excel for statistical work. The first is that Excel does not encourage "good statistical practice". We think that this can be put right, but it does need an add-in. See our web site for further information.

Second, it has weaknesses once you consider more advanced statistics than simple tables and graphs.

So, when you fall off the end of Excel for statistical work, you need to add something. This could be one of the many add-ins. It could be a (free) statistical package like Instat, (which can be downloaded from www.ssc.rdg.ac.uk/), or it could be a commercial statistics package.

R-Instat does encourage good statistical practice. It supports the teaching of statistics to a reasonable level. It may be sufficient for all the analyses you need. If it is not sufficient, it can be a useful "stepping-stone" to encourage you to add one of the commercial statistics packages to your repertoire.

Couldn't R-Instat be an add-in to Excel?

We do have an add-in to Excel, to encourage "good statistical practice" when using Excel. It would have been too difficult to make the current Instat into an add-in.

Maybe if it is very successful, then a future version might be usable as an add-in? For now, R-Instat's importing from Excel is very flexible and it can write data to an Excel workbook.

How does R-Instat compare with Excel add-ins?

Add-ins range from free, to 10's of pounds (dollars) to some at many hundreds of pounds. They are not to be ignored, because they can add useful features, like boxplots or improved regression. We think that R-Instat compares well, with all but the most expensive. And if you are looking to spend 100's of pounds, then we suggest that you are likely to opt for one of the standard commercial statistics packages. They have also become easy to use.

How does R-Instat compare with other free software for statistics?

We are obviously hopelessly biased, and the situation keeps changing. So you should look for yourself. A web site with comprehensive information on statistics packages is http://www.ltsn.gla.ac.uk/ from the old Computers in Teaching (CTI) initiative.

How does R-Instat compare with commercial statistics packages?

The basis for our comparison is that we use many standard packages in our statistics courses. For our own students these include Excel, Minitab, SAS, Genstat and S-Plus. In our Statistical Services Centre short-course programme and commissioned courses we have also used SPSS, JMP, Statgraphics, Statistica, StatExact, Conoco or MLWin.

On the statistical side, with the exception of Excel, R-Instat is not as powerful as these packages.

R-Instat is intended to complement, rather than compete with, the standard statistics packages. For those who currently just use a spreadsheet, it can be a "stepping-stone". Once you are familiar with any Windows based statistics package, you will find it easy to add another.

If you are already using a standard statistics package then consider adding R-Instat, because of its use as a teaching tool. Most standard packages are primarily to process data, though they can be used to support the teaching of statistics. R-Instat is the other way round. It is largely to support the teaching of statistics, though it can be used for data processing.

What do you mean "Not as Powerful?"

Modern statistical methods have advanced particularly in two areas. The first is the processing of non-normal data and the second is the processing of data at "multiple levels".

For example for analysing data from "contingency tables R-Instat includes facilities for log-linear models, that is much more powerful than the traditional simple chi-square test. That is an example of the analysis of data from a non-normal distribution. We even include a guide to explain the comparison in detail, because this sort of development is to be encouraged. The standard statistics packages have log-linear models and much more.

If you have data at different levels (like people within households) you might need software that can analyse these data at the multiple levels. If you have such data, then look first at our analysis guide (see our Help menu), because you don't always need to do the analysis at the different levels. If you do, then you need, not just any commercial package, but a powerful one, like Genstat (with REML), SAS (PROC Mixed), S-Plus or MLWin.

What is your relationship with commercial statistical software producers?

In summary it is "good, but not commercial." We do not act as an agent for any producer, nor to we receive any commission on sales.

We can therefore give unbiased advice on statistical software. As we give R-Instat away to all individuals, we feel that it complements, rather than competes with the standard statistics packages.

Why does R-Instat have a teaching menu?

In this version we have moved the Teaching Menu to be part of the Help system.

We describe what this means in the Teaching help file.

Our main aim is to encourage good teaching practice, rather than R-Instat. We believe imaginative teaching requires the use of a software package, but it doesn't have to be Instat. It also needs other resources. Those from Reading are freely available, whether, or not, you use Instat. We also list web sites where you can find resources elsewhere.

Why does R-Instat have a climatic menu?

This has been a research area at Reading for many years, and R-Instat has included facilities for the analysis of climatic data for 15 years. We believe R-Instat and Instat are currently the only general statistics packages to add these facilities. They were a main reason that the DOS version of Instat continued to be used and led to support from the UK Met Office for the development of the Windows version. We are very grateful to the UK Met Office for this support.

What are you, as producers, hoping to get from R-Instat?

"Hope" is the key word, and we are hoping for a lot. We usually do, until realism sets in. Our hopes include:

  • More imaginative use of the computer to support the teaching of statistics.
  • Improved teaching of statistics (and analysis of data) generally, and particularly in developing countries.
  • Publicity for the skills of the SSC in training, software evaluation and software development.
  • Funds from department and site licences to support further developments of R-Instat.
  • More constructive and effective analyses of climatic data.