August 20, 2014

Getting Started with the R Data Analysis Package

Professor Norm Matloff
Dept. of Computer Science
University of California at Davis
Davis, CA 95616
R is a wonderful programming language for statistics and data management, used widely in industry, business, government, medicine and so on. And it's free, an open source product. The S language, of which R is essentially an open source version, won the ACM Software System Award in 1998.

Downloading R:

R is available for Linux, Windows and Mac systems.
You can download R from its home page.
For Ubuntu Linux or other Debian-related OSs, a more direct method is:
% sudo apt-get install r-base

Learning R:

There is a perception among some that R has a steep learning curve, but I disagree. True, R usage has its advanced aspects, but my recommendation is simply, just get started! Start simple, and then refine gradually.
I'll list a few tutorials below (not necessarily the best, just ones I know of). But first, I wish to make a very important point:
"When in doubt, try it out!" That's a slogan I invented to illustrate the point that R's interactive mode allows you to try your own little experiments, the best way to learn. Keep this in mind when you go through the tutorials listed below and in Google.
Here are some resources that I would recommend for learning R:

Advanced R:

R Programming Tools:

One of the most debated topics in R online discussions is that of programming tools for R, of which there are many.
  • I'm not a fan of integrated development environments, but if you like IDEs, there are a number of open source products available:
    • The most popular is undoubtedly RStudio, RStudio, introduced in 2011, and growing rapidly in functionality.
    • For fans of the Eclipse framework, StatET is available, and includes a debugging tool.
    • More established products include JGR, Rcmdr, RKWard.
    • There are a number of plugins for text editors such as Emacs (for which a debugger is available), Vim, gedit, and so on.
    • In the commercial realm, there is one from Revolution Analytics , which also includes a debugging tool.

People you can talk to:

  • There are various mailing lists (start with R-help) shown on the R home page.
  • There are R user groups in cities around the world. I'm active in the the San Francisco Bay Area group. We hold meetings once a month, with one or two speakers. Many attendees are new to R.
  • Another online place to ask questions is Stack Overflow.
©