In case you missed them, here are some articles from September of particular interest to R users.

A tutorial on using R with Jupyter Notebooks and how to control the size of R graphics therein.

A new version of Revolution R Open is available, featuring multi-threaded computing for R 3.2.2.

One benefit of fitting statistical models to large data sets: learning curves.

Using the AzureML package to publish R functions as web services.

The R Consortium forms a committee to oversee projects, headed by Hadley Wickham.

Functions for interpolation in R.

The EARL London conference (preview here) included many applications of R, from AstraZeneca, Allstate, Douwe Egberts coffee and others.

A new online Data Science and Machine Learning course, featuring R and sponsored by Microsoft.

Reading financial time series data into R with the zoo package.

An update to the checkpoint package brings support for knitr and rmarkdown documents in reproducible projects.

The new Microsoft Data Science User Group Program offers sponsorships for R user groups worldwide.

A series on model validation in R using: basic methods; in-training set measures; out-of-sample procedures; and cross-validation techniques.

BlueSky Statistics, a new open-source GUI for R.

Accessing data in Google spreadsheets with the googlesheets package for R.

Antony Unwin on the care of datasets in R packages.

General interest stories (not related to R) in the past month included: building a scale model of the solar system, a new way to visualize the Discrete Fourier Transform, and a Portal-themed remodel.

As always, thanks for the comments and please send any suggestions to me at david@revolutionanalytics.com. Don't forget you can follow the blog using an RSS reader, via email using blogtrottr, or by following me on Twitter (I'm @revodavid). You can find roundups of previous months here.

In case you missed them, here are some articles from August of particular interest to R users.

Creating interactive time series charts of financial data in R.

Many R books have been translated into Chinese.

A tutorial on visualizing current-events geographic data with choropleths.

Revolution R Enterprise 7.4.1 is now available on Windows and Linux servers and in the Azure Marketplace.

Zillow uses R to estimate the value of houses and rental properties.

There’s a new (and free) online course on edX for R beginners, sponsored by Microsoft and presented by DataCamp.

Mini-reviews of 5 new R packages: AzureML, distcomp, rotationForest, rpca, and SwarmSVM.

The R Consortium’s best practices for secure use of R.

How to extract data from a SQL Server database in Azure to an R client running Linux.

DeployR Open 7.4.1, the open-source server-based framework for simple and secure R integration for application developers, is now available.

R 3.2.2 is now available.

A review of the JSM 2015 conference and the prevalence of R there.

R is available with Cortana Analytics, which you can learn about in upcoming workshops and webinars.

A comparison of the network structure of the CRAN and Bioconductor repositories.

Using R to find signal in noisy data.

I discussed the R Consortium in the inaugural episode of the R Talk podcast.

An exponential random graph model of connections between CRAN packages.

Using the igraph package to simplify a network graph.

An introductory guide to the Bioconductor project.

An animation shows every commit to the R source code over 18 years.

General interest stories (not related to R) in the past month included: Macklemore on mopeds, reconstructed timelapses, a moving visualization of WW2 fatalities and the Magnus Effect.

As always, thanks for the comments and please send any suggestions to me at david@revolutionanalytics.com. Don't forget you can follow the blog using an RSS reader, via email using blogtrottr, or by following me on Twitter (I'm @revodavid). You can find roundups of previous months here.

In case you missed them, here are some articles from July of particular interest to R users.

An alternative to stacked bar charts with the streamgraphs package.

Joseph Rickert shares his process for creating the monthly new and updated packages "spotlight" feature on MRAN.

Using R to analyze data from its API reveals R to be the 8th most popular language by activity on StackOverflow.

On accumulating results in R using looping operations.

In an in-depth profile, Hadley Wickham shares his motivations for creating his many useful R packages.

The latest rankings by IEEE Spectrum puts R as the 6th most popular programming language, rising 3 since 2014.

Revolution R Open 3.2.1 is now available, bringing multi-threaded performance and new platforms to the latest R engine.

A look at trends of questions on StackOverflow for Python and R.

Setting up a Linux VM on Azure, and importing data into R from MySQL and mariaDB.

The winners of the 2015 KDD Cup, and how you can analyze the data with R in Azure ML Studio.

The chair of the local committee shares some statistics from the useR! 2015 conference.

A review of R packages for extreme value statistics.

Using the igraph package to create interactive (and embeddable) network graphs from data in R.

Package author Ari Lamstein shares his tips for creating an email-based R course.

In an EMC-sponsored competition to analyze data generated by a motorcycle racer, both winners used R.

A visualization of the network structure of CRAN packages finds connected communities including one centered the "Hadleyverse".

Using R and A/B testing to evaluate the performance of advertising.

A roundup of press generated by the announcement of the R Consortium.

My reflections on the successful 2015 useR! Conference in Denmark.

Experiences using R Markdown and Github for teaching.

Resources from a tutorial on RHadoop, for using R with Hadoop.

General interest stories (not related to R) in the past month included: a short film about impossible business meetings, filming the motion of guitar strings with just a smartphone, an anthem for R users, a short documentary on the mission to Pluto, a review of "Statistics Done Wrong", and a clever illustration of personal bias.

As always, thanks for the comments and please send any suggestions to me at david@revolutionanalytics.com. Don't forget you can follow the blog using an RSS reader, via email using blogtrottr, or by following me on Twitter (I'm @revodavid). You can find roundups of previous months here.

In case you missed them, here are some articles from June of particular interest to R users.

The R Consortium, a trade group dedicated to the support and growth of the R Community, has launched with the R Foundation, Microsoft, RStudio and others as founding members.

A detailed FAQ for fitting Generalized Linear Models in R.

My presentation on Microsoft’s embrace of R, both in supporting the open-source R community, and connecting R with Microsoft platforms.

Packages for analyzing the RStudio CRAN logs, used to calculate the top 100 R packages by downloads.

Counting the number of packages on CRAN by platform.

Getting data into and out of R applications with DeployR.

A review of the various options for using R with Hadoop.

Using R to search for CRAN packages by topic area.

R code to draw the Archimedes Spiral.

A controversial caution about using only pairwise-complete observations when calculating correlation/covariance matrices in R.

You can use the RBlpapi package to access Bloomberg data with R.

SparkR, a package to use the Spark distributed-computing framework from R, is now part of the Apache Spark project.

An interactive map locates the 160+ R user groups around the world.

R has 64-bit objects, but there are constraints having only 32-bit integers.

R is sometimes called a quirky language, but I argue that these design decisions have directly led to many innovations in statistical computing.

R and BioConductor were featured at “BUILD” (Microsoft’s developer conference in San Francisco), shown being called on-stage from a mobile app.

A review of some of the presentations at R/Finance 2015 in Chicago.

Using the rpud package to calculate distance matrices using the GPU in R.

A tutorial on using Azure as a data source for R.

A comparison of several high-performance computing approaches in R.

General interest stories (not related to R) in the past month included: planning A/B tests, a critique of US state flags, a new type of bearing, a warning about drop bears and a visual comparison of the Game of Thrones books and TV series.

In case you missed them, here are some articles from May of particular interest to R users.

RStudio 0.99 released with improved autocomplete and data viewer features.

A tutorial on the new Naive Bayes classifier in the RevoScaleR package.

R is the most popular Predictive Analytics / Data Mining / Data Science software in the latest KDnuggets poll.

A Shiny application predicts the winner of baseball games mid-game using R.

A list of over 100 open data sources you can use with R.

Revolution R Open 3.2.0 now available, following RRO 8.0.3.

A review of talks at the Extremely Large Databases conference, featuring Stephen Wolfram and John Chambers.

My TechCrunch article on the impact of open source software on business features several R examples.

You can improve performance of R even further by using Revolution R Open with Intel Phi coprocessors.

New features in Revolution R Enterprise 7.4, now available.

The next release of SQL Server will run R in-database.

Create embeddable, interactive graphics in R with htmlwidgets.

Computerworld reviews R packages for data wrangling.

A tutorial on using data stored in the Azure cloud with R.

Using histograms as points in scatterplots, and other embedded plots in R.

A comparison of data frames, data.table, and dplyr with a random walks problem.

A video on using R for human resources optimization.

How to call R and Python from base SAS.

General interest stories (not related to R) in the past month included: a song written by an iPhone, a Facebook algorithm that tells when “like” becomes “love”, a map of light pollution and a machine-learning application that tells you how old you look.

As always, thanks for the comments and please send any suggestions to me at davidsmi@microsoft.com. Don't forget you can follow the blog using an RSS reader, via email using blogtrottr, or by following me on Twitter (I'm @revodavid). You can find roundups of previous months here.

In case you missed them, here are some articles from April of particular interest to R users.

Joseph Rickert reviews the inaugural New York City R User Conference, featuring Andrew Gelman.

Engineer Vineet Abraham compares performance benchmarks for R and Revolution R Open on OS X and Ubuntu.

R was featured in the keynotes for the BUILD developer’s conference.

Mark Malter created a Shiny application to explore baseball statistics.

A curated list of the best packages, add-ons and resources for R according to Qin Wenfeng.

An analysis of paintings from British museums reveals an increasing use of the colour blue over the last two centuries.

Journalists are increasingly referencing source research via DOIs, and packages from rOpenSci allow R users to access that research programmatically.

Microsoft is hiring programmers to work on R-related projects.

Some examples of visualizing the results of hierarchical clustering with a heat map.

The Financial Times published an interactive data visualization based on R to explore European unemployment statistics.

Announcing R 3.2.0.

Recent R user group meetings have covered Shiny, SparkR, htmlwidgets, and dynamic pricing models.

A story about teaching R to archaeologists in Myanmar, and coping with package installation in a low-bandwidth environment with the miniCRAN package.

RPowerLabs allows electrical engineers to experiment on virtual power distribution systems.

Two high-performance packages from RStudio for reading data into R: readr (for text data) and readxl (for Excel data).

A list of the top 25 R user groups in the world by membership.

A guide to association rules and market basket analysis in R.

The choroplethrZip package allows R users to create data maps from US zip codes.

Revolution Analytics is now a subsidiary of Microsoft.

DeployR 7.4, a web-services framework for integrating R code to other applications, is now available for download.

Coarse-grained parallel computing with R on servers and Hadoop with rxExec in Revolution R Enterprise.

Revolution R Open 8.0.2 was released (and RRO 8.0.3 is now available, too).

General interest stories (not related to R) in the past month included: a video travelling at the speed of light, a snowy music video, and visualizing the bassline in a Marvin Gaye classic.

In case you missed them, here are some articles from March of particular interest to R users.

Overview of "Targeted learning" packages for R, including SuperLearner and tmle.

The 7 most common R error messages, by frequency of mentions on StackOverflow.

Slides and a webinar replay on reproducible data analysis with R and the "checkpoint" package.

Joseph Rickert reviews the new book, "Hands-On Programming with R" by Garrett Grolemund.

R users are invited to participate in the 2015 Rexer Data Mining Survey.

Using the "smbinning" package to discretize continuous data for machine learning.

Analyzing the nocturnal activities of New Yorkers via their Instagram posts (yes, NYC does sleep sometimes).

Gradient-boosted trees for big data with the rxBTrees function in Revolution R Enterprise.

New features in the updated "checkpoint" package for reproducible data analysis.

Thoughts on using Vim as an interface to R.

An article on the impact of open source software on data science features R.

A new white paper describes the architecture of a DeployR server for connecting R to other applications.

A surprising simulation to calculate pi.

How to track the progress of parallel and distributed computations with a progress bar.

A brief summary of new features in R 3.1.3.

An overview of the Hadleyverse, the collection of R packages by Hadley Wickham.

Analysis of activity in R user groups shows a recent spike in meetings.

Tools to extract and compare colors from images with R, and finding the true colors of "that dress".

Using Domino's new "R Notebook" to explore data with interactive graphics.

Computerworld's "R for Beginners" hands-on guide is now available as a downloadable PDF.

General interest stories (not related to R) in the past month included: a musical domino cascade, a deep zoom into the Mandelbrot Set, a culinary celebration of Pi Day, and an unusual undersea perspective.

In case you missed them, here are some articles from February of particular interest to R users.

The John M. Chambers Statistical Software Award announcement for 2015.

The new R package "distcomp" allows researchers to collaborate on data spread across multiple sites.

David Smith's interview with theCUBE on R, data science, and Microsoft's acquisition of Revolution Analytics.

R is used to measure impact of climate change, and other Strata keynote presentations.

Some tricks for monitoring the progress on parallel R jobs using foreach.

"Analytics Marketplaces" are all the rage today, but CRAN was there first.

A preview of some of the major R-related conferences and events of 2015.

The checkpoint package has been updated to make it even easier to run R scripts with fixed R package versions.

A tutorial to introduce R to users of Microsoft Excel.

R used to assess the "virality" of posts on new-media sites like Buzzfeed.

A review of the HP Workshop on Distributed Computing in R featuring Luke Tierney, Dirk Eddelbuettel, Martin Morgan, Simon Urbanek and other R luminaries.

R is a top-ranked language on GitHub, according to statistics like number of repositories, code pushes and number of forks.

You can use the rcrunchbase package to access data on startup companies.

The R package "syuzhet" applies sentiment analysis to novels to infer their dramatic arc.

The new "quickcheck" package provides assertion-based testing with random inputs for R.

A theorem for calculating an upper bound for the generalization error of a machine learning classifier.

Some practical advice for sharing Shiny applications with shinyapps.io.

A visualization of Paris's street orientations reveals the history of the city.

General interest stories (not related to R) in the past month included: how our brains trick us into seeing the wrong colors, an hilarious parody of cooking shows, a new ASA website to help journalists with Statistics, the statistical model behind the rules of cricket, and why rivers meander.

In case you missed them, here are some articles from January of particular interest to R users.

Slides on reproducible data analysis with Revolution R Open and the checkpoint package.

A review of a recent Bay Area R User Group meetup, featuring Hadley Wickham, Ryan Hafen and Nick Elprin.

In an article at opensource.com, I explain why now is a great time to learn R and provide some resources to get started.

Norm Matloff reviews the state of the art in parallel programming with GPUs in R.

A tongue-in-cheek R script provides excuses for when your P-values aren't *quite* significant enough.

I explain what Microsoft's acquisition of Revolution Analytics means for Revolution R users and the R community generally, and review the media coverage.

Joe Rickert reviews the state of R integration with Spark.

Tufte's classic weather data visualization recreated in R for Dayton, Chicago and New York City.

A new R-based course, Statistical Computing for Biomedical Data Analytics.

An introductory tutorial for R, aimed at budding econometricians.

Harvard offers a free 5-week online course on R.

A look at, and some resources for using, R's base graphics capabilities.

An update to the "R is Hot" whitepaper with new applications and statistics on R usage.

Interactive R notebooks with Domino Data Lab.

The dplyr package has been updated with new data manipulation commands for filters, joins and set operations.

Kudos to the rapidly-growing BioConductor project, recently featured in Nature.

An online R-based application evaluates your risk of flooding.

Twitter releases an R package for anomaly detection in time series.

A Revolution Analytics consultant describes how he used R to visualize soil attributes using the ggmap package.

Yihui Xie created a voice-controlled R graphics application.

Video of talks by Trevor Hastie (on machine learning) and John Chambers (reminiscing on his time at Bell Labs).

The top 10 posts on the Revolutions blog from 2014.

General interest stories (not related to R) in the past month included: a comeback for real and virtual pinball, a geometry construction game, a typography game and a musical 'tribute' to Shia LeBoeuf.

In case you missed them, here are some articles from December of particular interest to R users.

R was featured in recent articles in Nature News and Mashable.

A recap of the 6th Spanish R Users Conference.

R was the recipient of a 2014 "Bossie" award for best open-source big data tools.

A fractal Christmas tree created with R.

You can use the rgl package to explore a 3-D shapefile of the comet Churyumov–Gerasimenko (no sign of Philae, though).

Using R to solve a probability problem about assigning tasks to people on randomly-selected days.

Looking at how the Queen's Christmas speech has evolved over 50 years.

A list of useful statistical resources from 2014 (many related to R), from Jeff Leek.

Revolution Analytics is offering cash sponsorships for local R user groups for its 2015 program.

The new leadership of the R Foundation.

How to make interactive 2-D and 3-D R plots with Plotly.

Useful examples of cartography with complex survey data from the swmap project.

The ASA publishes new guidelines on undergraduate Statistics programs.

Revolution R Open 8.0.1 is now available for download, based on R 3.1.2.

The latest O'Reilly survey of data scientists indicates prevalent use of R, surprisingly low use of Python stats libraries.

Quandl now publishing new commercial data sources accessible from R.

A graph-based method of clustering CRAN packages into "communities" like "statistical learning".

A webinar on sports analytics with R and Storm.

Cindy Brewer, who created the palettes behind the RColorBrewer package, is profiled in Wired magazine.

Highlights from some recent local R user group presentations on D3 visualizations, Slidify, ggplot2, data.table and more.

The 25 most-referenced R packages, according to the PageRank algorithm.

General interest stories (not related to R) in the past month included: spirally optical illusions, a short film exploring the Solar System, and the top big-data analytics companies.