Showing posts with label Province. Show all posts
Showing posts with label Province. Show all posts

Tuesday, January 29, 2013

And the Winner is...

At this point I've probably bored you enough with my forays into telling you which tiers and nations are better on one weee 'ittle map visited only by tier 3 and under tanks. But I have one last big reveal, the full listing. Trying to estimate just how good all 54 tier 3 or less tanks are on Province represents a lot of typing, after all, those names are long, but for you, I did it.

Now, to get you ready, I used as my baseline the T1 Cunningham, so positive values indicate it's better to have that tank than a Cunningham, while negative values indicate you're better off with the Cunningham.


Honestly, I rather surprised by just how many of these produce significant results. If you read the previous posts, you'll notice that most of these values fall in line with the findings from their regressions: Artillery is bad, TDs good, Premium Tanks good. The two best tanks to have on your team are German Premiums. Tier 1&2 British tanks are poor shadows of their counterparts.
The best:

PzKpfw II Ausf J: This one really surprises me, since looking through the data I can only find one instance of it appearing in a battle. But the data suggests (with a surprising level of significance) that having one on your side gives you a nearly 30% increase in win chance. Obviously either the tank or those who drive it are extremely good...

PzKpfw S35: Another tier 3 German premium, this one give you an extra ~25% chance of winning compared to having a Cunningham.

M22 Locust: American Premium this time, boasting a respectable +23% win chance.

FCM36 PaK40: The French Tier 3 TD seems almost perfect for this map, long view range and powerful gun. It's slowness is not a problem as there isn't much movement to do. This translates to a +22% win chance.

PzKpfw 38H735.f: The infamous Micro-Maus gets the distinction of being the only Tier 2 to make the top 5. Each one on your side give +21% chance of winning.


Obviously it's easy to find a pattern to this data. Premiums are good, Artillery Bad, the lower the tier the worse the tank. It's pretty simple to understand why this is, most people don't play the Tier 1-3 for more than a couple weeks, and even then, rarely with elited tanks and good crews. Premium tank players on the other hand tend to have more experience and are willing to pay, both gold and credits, for better crews and ammo. Further, premiums have no XP grind, so you're never going to find a premium tank fighting at less that full effect.

In an attempt to verify this theory, let us take a look at two premium tanks in particular, the T1E6, this year's gift tank and the M3 Light, the Lend-Lease M3 Stuart that players could get by finishing the tutorial. One peculiarity of the M3 Light is that when you get it, it comes with a free 100% crew, giving players with little experience a fully qualified crew much sooner than they would hav otherwise.

The T1E6, on the other hand, was gifted to everyone with a 50% crew. If you were like me, you immediately dismissed them to place your trained Chaffee crew members in it. But some people forget that it is a premium tank and can use tankers trained on other vehicles.

Comparing the M3 and the T1E6, it's fairly obvious that the M3 underperforms for it's tier, the fact that  50% crew tier 2 and a 100% crew tier 3 perform similarly is ample evidence of that. Comparing to other tier 3 premiums, the skill differential can be estimated at about 0.8, or 16% win chance. This shows that experience is not an irrelevant factor, but having a good crew is also important.


One thing you may have noticed is that the map bias we noted in the last two post is a bit higher in the full tank run. Also, we have some rather extreme biases toward high skill players in the form of premium tanks. How about we try to cut out that bias by removing the premium tanks.


Ok, I messed up a little. The T1E6 stayed in and I don't access to my regression currently to fix it. But, other than a few top tier tanks, almost every tank was close to 5% significance level. And all the low tier ones were highly significant. Plus the map bias has tweaked a bit down toward where we've had it previously.

One thing that should pop out immediately is that almost all of these tanks have negative coefficents, the M3 Stuart being the sole exception (With about 45 predictors, it's rather amazing that we only had one that truely deviated. Further, the significance level indicates just how random that value truely is, so I am going to ignore it and wait for more data). This tells me that it's not so much a question of who is better, but of who is less bad. Obvious offenders appear again, Artillery tends to be close to -0.9, some TDs are very good, others not quite so good.

The surprise winner among these tanks? The French D2 Medium, though it too is a low significance result. Beyond that, the Chi Ha and Luchs make good showings. The most surprising tank would be the classic LoLTracktor (Leichttraktor) which performs 0.11 points better than it's fellow tier ones, though this may be attributed to nostolgia driving by high skill players.

One could make a whole chart of which tanks are good, bad and balanced. But I will leave that for another time.




As always, if you enjoyed this post, please take a moment and download our Replay Upload Tool. It's simple and fast to use and each replay improves the quality of our data and the results we can show!

Saturday, January 26, 2013

That Stiff Upper Lip won't help you down here!

Since my full spreadsheet is still unfinished (give me a break, I've unlocked three tier 8s over the last three days!) and I still need to see just how much information we can develop from the data, I decided to go ahead and take a closer look at how the tanks of different nations perform on Province.

Before I show you the numbers, I want to give you a better understanding of what they mean. All of my numbers are generated using what's called Logistic Regression, which a fancy way of saying we can assign a value to the factors that effect your chances of winning. We get to choose the factors we think will be important, grab them in a data set and throw them at the little computer daemons in a package called R to work their statistical magic and give us these values, which we must then interpret.

When you go to perform a regression in R (or when you do any statistical regression really), you have to give it a function to regress. So far I have (and for the near term I will continue to) functions that are pretty simple: WinChanceSide1 = a + b*NetArtillery + c*NetTDs being the one from the previous post (you can see this at the top of the image). I had another variable to add there, NetTanks, that I didn't include because it would 'overdetermine' the system (NetTanks+NetArtillery+NetTDs=0, so if you know two, you can always find the third). This meant that Tanks were the baseline for the regression and you can figure out the win chances for each team from the values generated.

Now that that's all explained (and hopefully makes sense) let's take a look at the breakdown by nationality:


Interesting, apparently, at least on province, the British low tiers are REALLY underpowered. In fact, that's almost as bad as the artillery from the last post, but much, much more significant. In fact, down here it seems being anything but German is a Bad Idea(TM). Now, once again, I have to caution that these values are fairly insignificant, except for the British, and nationality is mostly a non-effect. In fact, looking at the American values, despite popping up with a slight negative value in this regression, based on the small size of the effect and the significance, I think it can be safely said that the American sub-tier 3 tanks are almost perfectly balanced. More data is probably necessary to identify if the Soviet and German biases are real or imagined (insert plug for people to send us replays with our Uploader here).

Now there's one last category I want to take a look at before we go ahead and regress on all of the tier 1-3 tanks. Premium tanks are generally intended to be "better than a stock tank, but worse than a same tier elited tank." So, down at these low tiers, versus other low tiers, on a very specific map, the results may surprise. After all, people who play low-tier premiums tend to be very experienced and have a greater likelihood of having a good crew. But let's take a look:


Holy unbalanced Batman! Here is a category of tank that gives a HUGE increase in win rates. Having just one extra on your team gives you a whole 5% better chance of winning. That's pretty damn imbalalanced if you ask me...

So now we have three good predictors for winning on province: What side you are on, how many more British tanks do you have and how many more premiums. So I'm going to make us a simple model for guessing how likely you are to win on province, using these three variables:


This gives us a pretty simple, but generally accurate guess of win likelihood from just those three questions:

Chance of Side 1 Winning = (e -(0.275+0.182*NetPremiums-0.08*NetBritish) +1)-1

Chance of Side 2 Winning = 1 - Chance of Side 1 Winning

In case you forgot which side is which


Again, this makes no allowances for draws, but as we have demonstrated, almost all of the draws come out of side 2's win rate, so we can estimate it as:

Chance of Side 2 Winning = 1 - Chance of Side 1 Winning - 0.03

Next time I plan to perform a regression over the whole of tier 1-3 tanks, which I expect to have a lot of low significance results until more data is available. Then we can move on to bigger and better things...

Please, if you enjoyed this post and want to see more like them, be sure to help by contributing your replays! It's fast and simple, just see how here.

Wednesday, January 23, 2013

What is wrong with Province?

To get us started off on the right foot, I'm going to work with a nice, small subset of the 174k battles that vbaddict has supplied us as a seed.

Why Province? In all honesty, the fact that it is tier limited makes it a very attractive way to keep the data set small(ish), as well as only being availible as standard mode. Being limited to tier 3 or less tanks (tier 2 artillery) means that we are only looking at 53 different tanks that can possibly be in the battle. That means I can set up my spreadsheets before my morning coffee runs out.

Of the 174k battles the vbaddict supplied, 2950 of the took place on Province. I am going to simplify things and assume that vbaddict doesn't capture a large portion of the total battles that take place during the day, thus treating the total battle population as infinite relative to our sample here.
Side 1 starts in the Green Area to the south, Side 2 in the Red to the North

The straight forward and simple:

Side 1 Win Rate: 54.7% with a standard error of 0.9%
Side 2 Win Rate: 41.8% with a standard error of 0.9%
Draw Rate: 3.5% with a standard error of 0.3%

Really, you could have found that on any WoT Statistics site. It's widely known that Province is unbalanced in favor of side 1, anyone who has spent significant time at tier 3 or lower is aware of this. But we're not here to look at things as simple as map biases, we're looking for answers.

One possible explanation for the map bias is issue with the matchmaker when dealing with the tier limitation. Unfortunately  if you look at the distribution of tank tiers and types across the two teams, this theory falls rather flat on its face.
Positive values indicate Side 1 had more of that tank type in battle, Negative side 2
This leaves the bias in the map.

The Logistic Equation: For us, PI is the probability of a win for side 1


Next, since we have the data on the types and tiers of tanks, let's take a look at how they effect your chances of winning. This is going to be a logistic regression on all tiers and types of tanks to see if the proportion of any of them is a significant predictor of success on the battlefield. Note that because of the fact that that we can determine certain variables from the others (ie Tier 1 from Tier 2 and Tier 3 tanks)



The first thing to realize is that we are looking at the data from the point of view of side 1. I set up our predictors as net values, ie the number of artillery on side 1 minus the number of artillery on side 2.

If we look at artillery and tank destroyers, leaving other tanks out to avoid over-determination, there is a slight positive effect from tank destroyers and a bigger negative effect from artillery. So each artillery that you have more than the other team decreases your teams chances by about 10%. Similarly, TDs have a positive effect on win chance, ~1.1%.

The 'Intercept' value tells us what we already know, the base chance of side 1 winning is about 56.7%. As outcomes for the regression can only be absolute win or loss, this is actually the chance of side one winning with out draws as an option. Comparing this to the earlier values, it would seem that introducing draws as an option makes it look like most draws come from Side 2's win chance. This may lead one to a theory that most of the draws on province are from side 2 teams that can't entirely capitalize on a weak side 1.

Now all this sounds good, but there is one more column to take note of: P(>|Z|), also known as the P value. In general, for a result to be statistically significant, you want a P-value of <0.05, meaning a less than 5% chance of obtaining the value by chance, which neither net artillery or TDs manages, each of these having ~20% chance of giving these results even if they had no actual effect on the outcome. However, the Intercept is on the order of 10^-12, which is extremely significant. Thus, from this data set, artillery and TDs are not necessarily good predictors, but the team side is by a large margin.