
August 18, 2014



It's interesting to see something like this in a popular press article (was it also in the print version?). While it's nice to see the recognition, I would argue that the breakdown is closer to 95-5 than 80-20 (plus or minus 10% :).

A number of startups are working on the data cleaning problem. Tamr (http://www.tamr.com/), Paxata (http://www.paxata.com/), and Trifacta (http://www.trifacta.com/) are good examples of tools that ease this pain, especially in an enterprise setting.

It sure is, and unfortunately it's a task where organizations still struggle. According to a recent IDG SAS survey, only 10 percent say they are extremely capable in this area.

Peter Fretty

The comments to this entry are closed.
