Most popular machine learning R packages – part 2

In a previous post: Most popular machine learning R packages, trying to hash out what are the most frequently used machine learning packages, I simply chose few names from my own memory. However, there is a CRAN task views web page which “aims to provide some guidance which packages on CRAN are relevant for tasks related to a certain topic.” So instead of relying on my own experience, in this post I correct for the bias by simply looking at the topic
Machine Learning & Statistical Learning. There are currently around 100 of those packages on CRAN.

Using the cranlogs library I query the number of downloads for all those packages over the 2018 period.

Of course the assumption here is that the number of downloads is a good proxy for (unobserved) popularity, but I think that is a fairly weak assumption.

Here is the result:

Most popular machine learning R packages:


These are the total number of downloads (divided by 10^4 for readability).

At the top of the list, most names look familiar, but not all. There are few undiscovered (by me at least) potentially powerful packages.

Action points: to check
– John Fox, one of those R titans, wrote the effects package. Effect Displays for Linear, Generalized Linear, and Other Models.
arules package provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules).
plotmo package Plot a Model’s Residuals, Response, and Partial Dependence Plots.

Code

The code I used to construct the data is below. Using the CRAN Task Views you can do the same with other tasks you care about, e.g. extreme value estimation, or handling missing data.

2 comments on “Most popular machine learning R packages – part 2”

Leave a Reply

Your email address will not be published. Required fields are marked *