As I stand[*] here at Heathrow waiting for my flight back to the States, I thought I'd dash off a few quick reflections of the userR! 2011 conference at University Warwick. It was an outstanding event. There's something about a conference of just a few hundred attendees (there were about 450) that creates a sense of camaraderie and common purpose you just don't get at larger conferences. It was wonderful to re-connect with colleagues, meet long-standing collaborators previously only known via email, and meet many new friends. The event was tremendously well-run, and as a community-run conference proceeded much more smoothly than many professionally-managed conferences I've been to. A big thank-you to the useR 2011 organizing committee (John Aston, Julia Brettschneider, David Firth, Ashley Ford, Ioannis Kosmidis, Tom Nichols, Elke Thönnes and Heather Turner) for such a fantastic conference, and we at Revolution Analytics were proud to have been a sponsor. (Photo courtesy useR! 2011.)
There was some great information shared in the tutorials and invited and contributed sessions, too. Here are a few quick nuggets compiled from my notes:
From Max Kuhn's tutorial:
- The caret package is a powerful yet easy-to-use front end to more than 122 different kinds of predictive models in R. It provides a consident user interface to all models, and makes it easy to tune and compare models to select the one with the best predictive power for your data.
- The partykit package provides some beautiful visualizations of tree models.
From Ulrike Gromping's talk
- The FrF2 function (from the eponymous package) is a complete solution for industrial experimental design.
- George Box (of the Box-Cox transformation) is R.A. Fisher's son-in-law.
From Jonty Rougier's talk
- Nomograms, the lost art of representing equations as combined axes, are pretty cool.
- Moroccan donkeys are adorable, and English donkeys are the biggest charity recipients in the UK.
From Pairach Piboonrungroj's talk:
- Structural Equation Modeling is just a combination of factor modelling and regression
- R packages for SEM include sem, OpenMx and lavaan
From Fabrizio Ortolani's talk:
- The forecast function is a simple yet powerful time series forecasting tool, and out-performs similar functionality in SAS and SAP.
From Patrick Burns' talk:
- Modern portfolio optimization is computationally intensive, and is basically a knapsack problem with many constraints
- The packages testthat and RUnit, and the functions do.call and tryCatch, are useful for testing R code
- Generating random inputs to functions is a good way of testing, especially of error-detecting code which often goes untested.
From Ian Cook's talk:
- R has been validated R for use in the pharmaceutical industry many times
- FDA officers have spoken about the use of R for clinical research: Sue Bell in 2006 (although this link now seems to be broken -- anyone have a copy?) and Matt Soukup in 2007.
From Alexander Kowarik's talk:
- The sparkTable package combines official statistics tables with sparklines
- A cool application of sparklines is win-loss records for football teams
From Olaf Mersmann's talk:
- The microbenchmark package provides nanosecond-scale benchmarking of R functions on some hardware systems.
From Paul Murrell's keynote:
- You can use R to read a PDF file, and extract the vector representation of the images and the text for analysis.
- The R graphics engine now has support for complex shapes (e.g. polygons with holes), which can be drawn with the grid.path and polypath functions
- The gridSVG package will soon make it possible to create interative SVG graphics (e.g. high-quality clickable maps) with R.
From Simon Urbanek's keynote:
- The forthcoming ix package will soon allow for interactive, cross-linked exploratory graphics in R that supports millions of points with high performance.
Anyway, I have to board my plane now. You can see more commentary from useR 2011 at the #user2011 hashtag on Twitter.
[*] Literally, stand. The only power outlets are conveniently placed out of cable reach of chairs. Thanks, BAA!
Re outlets at airports. Listen to this podcast (on headphones if you've got them): http://99percentinvisible.org/post/8155793588/episode-32-design-for-airports
Posted by: Harlan | August 19, 2011 at 05:15
"You can use R to read a PDF file, and extract the vector representation of the images and the text for analysis" - how does this work?
Posted by: RRR | August 20, 2011 at 04:07
Thank you David. I wasn't able to attend some of them. This
Round-up is very helpful. One quick question. What is the package that reads in PDF file?
Posted by: Kelvin Lam | August 22, 2011 at 23:46
Hi David, I found the Sue Bell slides on the Wayback Machine at http://web.archive.org/web/20060926184929/http:/www.fda.gov/cder/Offices/Biostatistics/Bell.pdf
Posted by: Ian Cook | August 23, 2011 at 07:52
Thanks, Ian! Here's a direct link.
Posted by: David Smith | August 25, 2011 at 12:01