The popularity on the included bargraph is the average for each genre. The boxplot also shows the median and quartiles. I calculated popularity simply by adding the omnibus ratings count and Google ratings count for each game (not the best metric probably, but it’s what we have). It just so happens that the genres with the most games are also the genres that have the highest average review counts per game.
Also, I’ve done a regression analysis of popularity that includes sub-genres, encoding them in the same way as primary genres (since they’re mostly the same words):
Summary of regression results including sub-genres
Well, the R^2 is a bit higher here, at 0.363 vs 0.351 for the primary genre-only model. So that’s good, indicating that the sub-genre provides some additional information. The coefficients for Word Count and is_free are pretty similar as before.
The interesting thing is that it totally changes the directions of some of the coefficients for the genres:
- Romance is now the best genres, giving 1080 to 5558 additional reviews (it was negative when only considering primary genre; I’m blaming Wayhaven).
- The bonus from Fantasy is no longer significant no longer significant (the range is -408 to 2485).
- Superhero and Supernatural still give significant bonuses.
In order of highest to lowest predicted popularity gain, the genres are:
and these genres are predicted to cause popularity loss:
A * indicates a statistically significant result at p<0.05.
I’ve also done a bit of analysis on the ratings. Here’s an interactive plot of games with ratings by genre.
Summary of regression results for predicting omnibus ratings given genre and word count
- The R^2 is 0.434, which is not bad.
- The “baseline” rating is about a 4.
- Higher word counts give higher ratings.
- Free games have a lowered rating by about 0.24 stars.
Genres sorted by rating bonuses:
Slice of Life*
And here are the predicted “negative” genres:
Again, a * indicates statistical significance at p<0.05.
Interestingly, Adventure and Spy were some of the worst performing genres in terms of popularity. But they’re both literally sample sizes of 1. So is School.