Hi! Welcome to the blog! I came across an interesting problem at work. The company I work for has a dataset full of items, and they needed to find what the optimal combinations of these items would be. This sounds like a simple problem, but when you are dealing with large spreadsheets, this task may be a little daunting. No fear! There are multiple ways to tackle combination problems in R, and today I will showcase a few methods of doing so. Lets first make a sample data set. data <- data.frame( This is obviously not a great set of data, but it is sufficient to make combinations! We are first going to use the expand.grid() function. You can check out the documentation for expand.grid() here. It is a very simple function that gets the job done by taking vectors to make combinations. Let's use it! #To make combinations for each item in each category, we need to filter this data into separate vectors for the expand grid function. So lets make some filters. The benefit of using expand.grid() is that it's really easy to add many vectors (or columns) to use for combinations. As far as I know, something like outer() uses just 2 vectors (without some function). For example; #Let's make another example data frame. The drawback with combn is that it never makes dupes in each combination. So a combination will never be A A A B B B. here is how it works with expand.grid() n2comb <- expand.grid(data2,data2,data2) expand.grid has the advantage of doing this in a really simple way. It might get too large if you add too many columns though. If you have only 2 vectors you can also use outer() data3 <- c("A","B","C","D","E")
0 Comments
|