Saturday, August 9, 2025

Tinker, Tailor, Soldier, Poor Man

While we wait for the 2025 survey to run its course, still more to discuss from the 2024 survey.

Today's analysis examines four wargaming traits in the hope of reducing these data in order to produce a few identifiable and meaningful wargaming profiles based upon survey responses.  The statistical technique used to explore relationships and correlations between these four variables is Multiple Correspondence Analysis (MCA).  MCA extends Correspondence Analysis (CA), which is typically used for two categorical variables, to more than two variables.  MCA is a powerful exploratory tool for summarizing and visualizing relationships in datasets with several categorical variables, helping to uncover patterns and associations that might otherwise be hidden.  MCA has been utilized in a number of past analyses.

Very briefly, the key points of MCA are:

  • Purpose: MCA helps to detect and represent underlying structures in complex categorical data, making it easier to interpret relationships between variables and categories.
  • How it works: It transforms categorical data into a numerical format (indicator matrix), then applies dimensionality reduction (similar to Principal Component Analysis for quantitative data) to project the data into a lower-dimensional space.
  • Output: The results are often visualized as maps or plots, where similar categories and individuals are positioned close together, revealing associations and clusters.
  • Applications: Widely used in social sciences, marketing, and survey analysis to explore patterns in responses, profiles, or preferences.
The questions pulled from the survey and used in this study are:

  1. Do you consider yourself mostly a historical, or more a fantasy/sci-fi wargamer on a scale of '0' (pure historical gamer to '6' (purely fantasy/sci-fi gamer)? 
  2. How do you rate yourself as a craftsman on a scale of '1' (terrible) to '5' (great)?  Variable name = CRAFTSMAN with values 1-5.
  3. On a scale of '1' (not interested) to '5' (deeply interested), how much do you research the (fictional or not) background to your game?  Variable name = RESEARCH with values 1-5.
  4. How many painted figures do you have in your collection?
  5. How often do you currently game?
The variables under consideration are Craftsman, Research, Collection Size (Collection_Size), and Gaming Frequency (Game_Freq).  Will any identifiable patterns emerge from these data manipulations?  Well, let's see.

To begin, only survey respondents whose primary interest is historical wargaming ('0' or '1' in question 1) are included.  Total number of respondents in the sample is 1,652.  With that criterion set, the frequency counts of each variable and its values are illustrated in the following four bar graphs:
Craftsman
Research
Collection Size
Game Frequency
While pairwise comparisons between any two variables can be useful, all four of the variables need to be considered simultaneously to extract any meaningful patterns and relationships between wargamers and their tendencies.  To produce enough separation between values, outlier removal can be a useful technique used in an iterative process.  Upon first inspection (see MCA: Initial), many of the data points are compressed into the lower left of the graph.  This is a good candidate to test select outlier removal techinques.
MCA: Initial
In the first iteration of outlier removal, two outliers are removed.  They are Research1 and Collection Size of 20,001-25,000.
Outlier Removal: Research1 and Collection Size 20,001-25,000
After Iteration 1, values are still compacted into the lower left quadrant of the graph without much separation.  In Iteration 2, three variables are removed as outliers.  These are Research2, Craft1, and Collection Size 0-100.  Now, these three could have been removed in Iteration 1 but I kept them in to illustrate the process.
Outlier Removal: Research2, Craft1, Collection Size 0-100
After Iteration 2 of outlier removal, the spread between values is improving but Craft5 emerges as an outlier.  Iteration 3 removes Craft5.
Outlier Removal: Craft5
Having completed three outlier removal iterations, the spread between values is improved with enough separation and distinction to stop and assess the results.  Next step is to move onto the analysis and interpretation of the resulting graph.
MCA: Final 
Using the origins in both dimensions (1 and 2) four quadrants are delineated.  By studying each of the four quadrants, is there any underlying inference that emerges to help classify each of these spaces into any meaningful grouping label?

Overlaying wargaming terminology of troop experience to each of these four quadrants seems to fit the model inference in a reasonable fashion.  I use the terms of Crack, Veteran, Regular, and Green to distinguish the attributes within each quadrant.
MCA: Interpretation
The loadings of variables and values into each quadrant present themselves counterclockwise as shown by the green arrow as follows:
Crack: Identified by high research values (5) and craftsmanship (4) values, gaming more than once a week and having massive painted armies (Poor Man!).
Veteran: Identified by large armies with weekly gaming.  Little distinction with Regulars with respect to research values (3).  No loading on craftsmanship. 
Regular: Identified by high research values (4) and medium craftsman values (2,3).  Gaming tends toward bi-monthly with painted armies 100-500 figures.
Green: Identified by good sized painted armies (501-2,500) and infrequent gaming but with little interest in either research or craftsmanship.

Interesting results and equally interesting groupings between the loadings within each quadrant.  Always a surprise when data reveal their hidden, underlying tendencies.  Where do I fit into this analysis?  Well, I fit into the Crack classification quite closely with the exception that my craftsmanship rank is not likely up to '4' standards.  I need to step up my game!

Where would you fit into this scheme or do you?

I wonder if adding in rules source would add anything worthwhile into this analysis?

6 comments:

  1. I think I might best fit the Veteran classification Jon, although I am more interested in "craft" (terrain building) than I am in research.

    ReplyDelete
    Replies
    1. Craft3 is near the quadrant border separating Veteran and Regular suggesting little separation between the two groups. “Veteran” seems a good fit for you, then.

      Delete
  2. An interesting read Jon, going by the classifications I think I would probably come in the Regular category although I don't game as often as I should!

    ReplyDelete
    Replies
    1. Glad you understood the analysis! Maybe you can squeeze out some more time for gaming?

      Delete
  3. I wasn't sure what the craftsman question was and just assumed making your own stuff? I think I put a 3, but if craftmanship includes spending time painting your miniatures to a standard you like then I should of went higher as I do spend a lot of time doing that I guess. Not really sure what is considered spending a lot of time per miniature is though. I honestly don't know where I would fit, but I suppose leaning towards regular,

    ReplyDelete
  4. I would see my self as Crack for research and craftsmanship (no surprise on that given my former work!), Regular for gaming and a mix of Green/Regular for painted armies size.

    ReplyDelete