Description: An R package for splitting, applying and combining large problems into simpler problems
Homepage: http://had.co.nz/plyr
Clone URL: git://github.com/hadley/plyr.git
hadley (author)
Thu Oct 29 17:29:50 -0700 2009
commit  aac3c938815787948eb3732967bb4926d8774378
tree    8ca39fd6714561c83a0eb424370b1744f385649c
parent  2686e4f9d64eda02c5dfc3bb17b2dda6dbac15a5
plyr / ANNOUNCE
plyr is a set of tools for a common set of problems: you need to break down a big data structure into manageable pieces, operate on each piece and then put all the pieces back together. For example, you might want to:
 
  * fit the same model to subsets of a data frame
  * quickly calculate summary statistics for each group
  * perform group-wise transformations like scaling or standardising
  * eliminate for-loops in your code
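
As a rough sketch of the tasks listed above, assuming plyr is installed and using R's built-in mtcars data set (the data set, the grouping column and the model formula are illustrative choices, not part of this announcement):

```r
library(plyr)

# Fit the same model to each subset of a data frame:
# one linear model per number of cylinders, returned as a list.
models <- dlply(mtcars, "cyl", function(df) lm(mpg ~ wt, data = df))

# Combine the coefficients of every model back into one data frame.
coefs <- ldply(models, coef)

# Calculate summary statistics for each group.
stats <- ddply(mtcars, "cyl", summarise,
               mean_mpg = mean(mpg), n = length(mpg))

# Group-wise transformation: centre mpg within each cylinder group.
centred <- ddply(mtcars, "cyl", transform,
                 mpg_centred = mpg - mean(mpg))
```

None of these calls needs an explicit for-loop: each one splits the data frame by cyl, applies the function to every piece and combines the results.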
 
It's already possible to do this with built-in functions (like split and the apply functions), but plyr makes it all a bit easier with:
 
  * absolutely consistent names, arguments and outputs
  * input from and output to data.frames, matrices and lists
  * progress bars to keep track of long-running operations
  * built-in error recovery, and informative error messages
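
The features listed above might look roughly like this in practice. This is a minimal sketch that assumes a reasonably recent plyr release; the toy inputs and the helper name safe_log are made up for illustration:

```r
library(plyr)

# Consistent names: the first letter gives the input type and the second
# the output type (a = array, d = data frame, l = list), so llply maps a
# list to a list and ldply maps a list to a data frame.
squares <- llply(1:3, function(x) x^2)
square_df <- ldply(1:3, function(x) data.frame(x = x, square = x^2))

# A text progress bar for a long-running operation.
slow <- llply(1:20, function(i) { Sys.sleep(0.01); i }, .progress = "text")

# Error recovery: failwith() wraps a function so that a failure returns a
# default value instead of stopping the whole computation.
# safe_log is a hypothetical helper defined only for this example.
safe_log <- failwith(NA, function(x) log(x), quiet = TRUE)
results <- llply(list(1, "a", 10), safe_log)
```

Here log("a") would normally raise an error and abort the run; with the failwith() wrapper the second element of results is NA and the other elements are still computed.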
 
Considerable effort has been put into making plyr fast and memory-efficient, and in most cases it is faster than the built-in functions.
 
You can find out more at http://had.co.nz/plyr/, including a 20-page introductory guide, http://had.co.nz/plyr/plyr-intro.pdf. You can ask questions about plyr (and data manipulation in general) on the plyr mailing list. Sign up at http://groups.google.com/group/manipulatr.