Last night, the Chicago Cubs made history by winning the World Series for the first time in 108 years, beating the Cleveland Indians in a nail-biting extra-innings game. As a result, today's blog post is a little late as I, along with most of Chicago, was celebrating into the wee hours of this morning. In recognition of the event, and the fact that simple data analysis is all I can muster today, I thought I'd use the excellent Lahman package, which provides a trove of baseball statistics for R, to have a look at the historical performance of the two teams.
First, let's tidy up the Teams data set to look at only the Cubs and Indians:
Now, let's take a look at the average number of runs each team has hit every year, since the Indians joined the league in 1901:
Historically, the Indians have generally hit more runs than the Cubs. On the other hand, the Cubs' pitching has been better in recent years, allowing fewer hits per game:
Congratulations to the Cubs!!!