background preloader


Facebook Twitter


Excel. Bayesian statistics for dummies. Pvclust: An R package for hierarchical clustering with p-values. An R package for hierarchical clustering with p-values Ryota Suzuki(a, b) and Hidetoshi Shimodaira(a) a) Department of Mathematical and Computing Sciences Tokyo Institute of Technology b) Ef-prime, Inc.

pvclust: An R package for hierarchical clustering with p-values

What is pvclust? Pvclust is an R package for assessing the uncertainty in hierarchical cluster analysis. Pvclust provides two types of p-values: AU (Approximately Unbiased) p-value and BP (Bootstrap Probability) value. Pvclust performs hierarchical cluster analysis via function hclust and automatically computes p-values for all clusters contained in the clustering of original data. An example of analysis on Boston data (in library MASS) is shown in the right figure. 14 attributes of houses are examined and hierarchical clustering has been done. Installation pvclust can be easily installed from CRAN. Install.packages("pvclust") On Windows you can use Packages -> Install package(s) from CRAN... from menu bar.

Download The latest version should be found at the CRAN web site [FAQ] Q. > data(lung) HPC packages for R. This CRAN task view contains a list of packages, grouped by topic, that are useful for high-performance computing (HPC) with R.

HPC packages for R

In this context, we are defining 'high-performance computing' rather loosely as just about anything related to pushing R a little further: using compiled code, parallel computing (in both explicit and implicit modes), working with large objects as well as profiling. Unless otherwise mentioned, all packages presented with hyperlinks are available from CRAN, the Comprehensive R Archive Network. Several of the areas discussed in this Task View are undergoing rapid change. Please send suggestions for additions and extensions for this task view to the task view maintainer . Direct support in R started with release 2.14.0 which includes a new package parallel incorporating (slightly revised) copies of packages multicore and snow.

Parallel computing: Explicit parallelism Several packages provide the communications layer required for parallel computing. Using R for statistical analyses. Help and Documentation My Publications See my books about R at my Publications Page: Statistics for Ecologists using R and Excel.

Using R for statistical analyses

Published December 2011 Beginning R: The Statistical Progreamming Language. The Essential R Reference. Documents There are plenty of sources of help and information regarding R. “Using R for Data Analysis and Graphics - Introduction, Examples and Commentary” by John Maindonald [PDF]. These are available via the 'Contributed Documentation' section. Courses From 2009 I am going to be running a series of short courses in data analyses for conservation biologists.

Help within R The help system within R is comprehensive. Click on the 'Help' menu. If you want help on a specific command you can enter a search directly from the keyboard: help(keyword) A shortcut is to type: ? This is fine if you know the command you want. Apropos("part.word") You type in a part.word and R will list all commands that contain that string of letters. R comes with a number of data sets.