It comes from MM data, since that's the only data that exists in sufficient quantity to perform any serious analysis on. Even in the limited range of TV difference we see a reasonably strong relationship between TV difference and underdog loss... there's neither a data-based nor logic-based reason to assume that as TV difference increases beyond the scope of the existing data, it will suddenly reverse that trend... in fact, even the so-called design goal of inducements doesn't support such an idea.spubbbba wrote:In R, B and MM then it’s very rare to play matches with big TV differences. So surely those would have to come from league play. By your own standards do we have enough games to show the advantages a higher TV gives, and whether this is mitigated by inducements and if so by how much?
"By my own standards" yes, we have enough data to support the idea that TV difference progressively favours the team with the higher TV, in any format. That doesn't mean that the underdog always loses, it just means that across all games played, we can expect to see the TV underdog losing more often, and losing more the farther below the other team's TV it is.
Would it be great to have a metric shit-ton of additional data from play at higher TV differences? Absolutely. You'll never get it, though, and we know that... so worrying too much about it is like sitting around jerking off to the thought of supermodels. I prefer to remain grounded in reality.
Ok, first off, due to the lack of extensive data on league play, and the additional sources of error involved that mean we'd need comparably more league data to get the same statistical power, we have no actual data on which to compare League and MM play - all of that is based on feelings and suspicions. I fully understand the theory behind what you're saying, but you need to understand that it remains theory until it is supported by something more than anecdotal evidence.spubbbba wrote:Also I’m not sure how you’d factor in the differences in the metagame between TV based matching and league play. The crp rules were written for league play and NTBBL is primarily concerned with this too.
The trouble with matching by TV and open play is that each game is effectively a 1 off. If you play a 2000 vs 2000 game and win but lose a bunch of players so end up 1500 TV your next game won’t be against another 2000 team but someone roughly equal to you. That is why many argue that min-maxing isn’t an issue in leagues.
Maybe you can say "well, everybody thinks this".. and then I can point out that "everybody" thought the earth was flat, and everybody thought the sun revolved around the earth. Objective reality is not a democracy.
Another important point is that the difference between environments has to be so big that we'd expect to see a significantly different result in Leagues than in MM. You (and galak and dode) say that BB was designed for leagues, not MM play, and yet MM play shows win%s reasonably close to the tier expectations. Are we seriously saying that we expect that if we actually had enough league data for similar levels of analysis, we'd really be seeing a large enough difference in league win%s to say that MM data is apples to its oranges? It's very, very unlikely, given what we've seen of the data we actually DO have access to.
House rules really do a lot to hurt the validity of data, yes. Data from a different ruleset is garbage data if we're trying to examine the specific ruleset and rosters of the game as written. However, the assigning of skills thing is more theory than fact, since we have no data on it. I think it sounds reasonable, but likewise, the idea that the earth is flat is pretty reasonable too, from my personal experience.. luckily we have data that lets us know it isn't.spubbbba wrote:It’s why I don’t think tabletop tournaments are much use for looking at low TV balance, not only do they reset every game but also have lots of house rules. Things like being able to assign skills really makes a huge difference to some races, Lizardmen instantly spring to mind.
It's not a similar issue at all. Compositional issues in leagues are much more focused on the fact that certain rosters will not be represented at all, and when there's only one or two teams, the coach's own ability is a much bigger factor in the resulting data. In open leagues you have a large number of teams of every roster, and in MM you can't pick and choose which rosters you play against. The random matching results in less error.spubbbba wrote:Another potential issue is that the open leagues have imbalances of teams just like leagues do. If you look at B it is dominated by bashers, so how would that be factored in?
There are statistical methods to compensate for the different number of teams of certain rosters. If you go back to older postings you'll see I've previously posted the win%s both in total, and controlling for the roster compositional imbalances. If you had a large enough number of leagues, the same controls could be applied there, but like I said, it's the OTHER sources of error that we can't control for, that will make the league data less statistically powerful.
I am not saying that League data is inherently useless. What I am saying is that you'd need, say, 3000 League games to get the same statistical resolution as 1000 MM games.. and we actually have many multiples LESS League data than MM data... and even with the large amount of MM data we have, we find our statistical power gets pretty low as we try to break things down into certain TV ranges and racial combinations. Now, 3000 vs 1000 is just me picking arbitrary numbers (its not actually 3:1... the ratio is unknown since we don't have enough League data to calculate it) but we do know that there are more sources of error in League data than MM data, and more sources of error always create that sort of scenario.
Maybe not, but I'm not trying to move those particular things. If I can get some people to understand better then the community as a whole is better for it. Plenty of people don't really care (including people involved in these arguments) but that's how most debates work... you're not actually expecting the other side to change their mind, you're presenting two sides of an argument so that people who care to listen have more on which to base their own resulting opinions, etc etc.Koadah wrote:There's is probably a little bit more understanding now but even so, I don't think that things have moved on a whole lot from where I came in.