##### Analysis

# 2014 Minnesota Vikings: Game Theory and quarterback competitions

By now, you’ve probably seen the news that Christian Ponder isn’t taking too many snaps at quarterback compared to Matt Cassel and Teddy Bridgewater. In most reports, Ponder takes a back seat to Cassel and Bridgewater. Bridgewater has passed the ball a few more times, but it seems clear that Cassel is the presumed starter heading into camp.

All quarterbacks have been given impressive control at the line of scrimmage, which is nice but perhaps irrelevant to the battle at hand given the fact that the system is new for everybody.

In talking with Dusty on my podcast at the Daily Norseman, he stumbled upon an idea that I didn’t think too critically about until I listened to the podcast again: the process by which coaches assign reps for quarterbacks.

The way that we instinctively compare the distribution of snaps in the QB competition is against a simple 33-33-33 setup, where each quarterback would get a third of the snaps and it would be decided upon from there. That makes a lot of sense, and it’s pretty simple.

But “fair” doesn’t mean “balanced,” and Ponder can be given “enough” snaps to see if he can fit before determining that he deserves more snaps or isn’t starter-quality.

Dusty brought up two points that together make the case that it’s possible Christian Ponder is in a “fair” quarterback competition with the other two quarterbacks while still receiving fewer snaps. The first is that because Matt Cassel is the presumed starter, he should be given more snaps because chemistry will be very important in the season.

As a result, the other quarterbacks should get fewer snaps, but enough to evaluate them for their fit. If at some point, they demonstrate ability that exceeds the presumed starter, they’ll get more snaps.

The second point is that because Christian Ponder is a known quantity, at least relative to someone like Teddy Bridgewater, he should get fewer snaps because there’s less needed in terms of evaluation. This isn’t as important for Cassel, because as the presumed starter, evaluation isn’t a priority; chemistry is.

This is important, because people often know that both evaluation and preparation are important parts of training camp, but don’t often see them as things that trade off with each other. This is why I haven’t been as gung-ho about quarterback competitions (or competitions at other positions) as others have in the past; it’s fine and dandy that you have some idea of who the best at the position is, but what does it matter if they aren’t comfortable in the system with the players they’re supposed to play with?

That doesn’t mean competition is a bad thing, only that it trades off with other things.

The more you look at it, the more complex it gets—which is why it may be best expressed in the decision-making strategies encompassed in game theory. At the most complex level, there’s a bit of calculus involved (as it is an optimization problem), but I’ve forgotten how to do that and we can skip it anyway, because precision is not important.

First, to understand the assumptions I’m working off of you can briefly glance at the first part of a training camp notebook I put together from last year, but this graph does most of the work:

Every individual rep will provide a different amount of value in whichever function you’re looking at. For a generic player, the first rep will provide a lot of help in learning the plays and scheme, as well as developing talent.

The next rep will offer a little less, and each rep will provide a diminishing amount of learning value thereafter.
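As a rough sketch of that diminishing-returns idea, the snippet below models the learning value of each rep as an exponential decay. The function names, the starting value and the decay rate are all assumptions made up for illustration; none of them come from practice data.

```python
import math

# Hypothetical parameters: neither number comes from real practice data.
FIRST_REP_VALUE = 1.0   # learning value of the very first rep (arbitrary units)
DECAY_PER_REP = 0.01    # how quickly each additional rep loses value


def learning_value(rep: int) -> float:
    """Marginal learning value of the rep-th rep (1-indexed)."""
    return FIRST_REP_VALUE * math.exp(-DECAY_PER_REP * (rep - 1))


def total_learning(reps: int) -> float:
    """Cumulative learning value after `reps` reps."""
    return sum(learning_value(n) for n in range(1, reps + 1))


for n in (1, 2, 100, 300):
    print(f"rep {n:>3}: marginal value {learning_value(n):.3f}")
```

Any curve that starts high and flattens out would make the same point; the exponential is just the easiest one to write down.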

But the other function of plays on the field is that it allows a coach to see how good a player is. The first rep will tell someone basically nothing about a player for a few reasons. The first and most obvious reason is that it’s simply not that much information. How do you know that the block he missed was because he’s a bad player and not just bad luck?

The second reason is because the player doesn’t know what he’s supposed to do, because he hasn’t taken any reps yet!

So each consecutive rep will provide more information. A larger sample of reps gets more and more valuable, as does seeing how the player performs once he knows what he needs to do, as he theoretically will on Sunday.

After a point, of course, there’s only so much more information that a rep can provide. Think about seeing the 600th snap after seeing 599 previous snaps. You won’t be changing your opinion of that player because of that snap. In fact, it might take 75 more snaps before you start changing your conclusions about what you saw and 50 more after that before you confirm that changed opinion.
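One way to picture the evaluation side is an S-shaped curve: early reps tell you little, the middle reps tell you the most, and late reps barely move the needle. The sketch below uses a logistic curve for that shape. The midpoint and scale are hypothetical placeholders, not measured values.

```python
import math

# Hypothetical shape parameters for the evaluation curve.
MIDPOINT = 150   # rep count where each new rep is most informative
SCALE = 40       # how gradually confidence builds around the midpoint


def evaluation_confidence(reps: int) -> float:
    """Rough share (0 to 1) of what can be known about a player after `reps` reps."""
    return 1.0 / (1.0 + math.exp(-(reps - MIDPOINT) / SCALE))


def marginal_information(rep: int) -> float:
    """Extra confidence gained from the rep-th rep alone."""
    return evaluation_confidence(rep) - evaluation_confidence(rep - 1)


for n in (1, 150, 600):
    print(f"rep {n:>3}: adds {marginal_information(n):.6f}")
```

With these placeholder numbers the 600th rep adds almost nothing, which is the same point as the 599-snap example above.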

For quarterbacks, this can briefly be expressed as a probability for how well we can evaluate how good they are. The more reps we give a quarterback, the more certain we are of their ability. For veterans, we know a lot more than rookies; there’s a definite amount of film and history that can be applied to figure out what their talent level is. Nevertheless, a new system and terminology can be massively beneficial or detrimental, so the amount of knowledge isn’t extraordinary.

Further, with a vet we’ll know a lot of what we’ll need to know as soon as bullets fly: which tendencies remain and which ones have been eliminated. That’s invaluable information if you know what to check against. You have a history of information to grade the veteran with, and you’ll get a good idea of what you see by the end of installation.

For a rookie, you won’t know if tendencies are even there until you have a significant number of snaps. For the most part, you won’t know what you need to know even if they get all the reps.

The crux is that a rookie is going to improve a lot more with reps than a vet is. You can create a competing set of charts to model this (insert your own numbers, the concept remains the same):

There are a lot of reasons to give the rookie reps, but you really don’t know if they’re worth it for a while. The veteran can earn reps and use them, but you’ll be chasing marginal gains. The problems with two different models of knowledge and development are pretty stark, especially in a three-quarterback race.

A better way to think about it is wins. If you set floors and ceilings for how many wins you get from a quarterback, then translate that into bands of uncertainty and improvement metrics, you’re in a good spot.

A bad quarterback is going to limit your wins (here, let’s say the win floor is three games because of the massive talent on the offense and the improving defense) and a good quarterback is going to push you to unexpected wins (the ceiling should be 10 wins if Bridgewater starts and puts in an RGIII-level performance with a few more lucky bounces and a better defense than Washington).

None of that is true if you don’t give them reps, though.

From the perspective of average wins from the percentage of reps given, you might be able to model it like such (assuming that they get reps after being named the starter after the third preseason game or so):

But that doesn’t tell you enough. We already know that Matt Cassel is a better quarterback than Christian Ponder, and that an arbitrary first-round quarterback (in this case, Teddy Bridgewater), on average is worse than an average starter (which Matt Cassel is not, but is close enough to for the purposes of this exercise).

We know this is incomplete because we know that the point of practice isn’t only to get better. It’s to know who is better. So, if we encompass uncertainty bands around the quarterbacks based on available knowledge, it looks like this:

Here, you can see the uncertainty bands decrease for Bridgewater, but not by much (and yes, it looks like the total win differential increases, but the total amount of information about his available talent level decreases relative to what we can know, i.e., the range of possible wins as a percentage of possible wins is much lower). We know that his floor is three wins, which is valuable information. If Ponder or Cassel have a lower floor, then it’s a no-brainer.

We also know that we don’t know much about rookies, which is why his band is so very wide for his first year (seven games of difference). Both Matt Cassel and Christian Ponder have smaller degrees of uncertainty at the beginning and they don’t decrease by much.

Both Cassel and Ponder see the absolute differences in their floor and ceiling decrease with reps, and that’s because there’s less variability in how they can perform: they are who they are and have had a lot of snaps in the NFL to prove it. We know their floor is higher than Teddy’s because rookies wash out, but these two have not. We also know their ceiling is lower because they have shown limited talent level, while Bridgewater is definitely an unknown.

Note that the average of the floor and ceiling is not the expected win total; think of that as the median amount of wins a quarterback is going to give you (and therefore the probability curve of all possible wins is skewed in a direction like so):

Functionally, we are 95% confident that Teddy Bridgewater will bring us between three and 10 wins, but the likeliest win total is closer to six than seven despite the fact that 6.5 is the average of the floor and ceiling. Five wins are more likely than eight and so on.
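To make the skew concrete, here is a small sketch with made-up probabilities for each win total inside the band. The numbers are hypothetical and are not the ones behind the charts; they only show how the likeliest total and the expected total can both sit below the midpoint of the floor and ceiling.

```python
# Hypothetical probabilities for each win total, conditional on landing
# inside the three-to-10-win band. Illustrative only, not the article's data.
win_probs = {3: 0.05, 4: 0.14, 5: 0.22, 6: 0.24, 7: 0.16, 8: 0.10, 9: 0.06, 10: 0.03}

# Midpoint of the floor and ceiling (the "average" of the band).
midpoint = (min(win_probs) + max(win_probs)) / 2

# Probability-weighted win total.
expected_wins = sum(wins * p for wins, p in win_probs.items())

# Single most probable win total.
most_likely = max(win_probs, key=win_probs.get)

print(f"midpoint of band: {midpoint}")
print(f"expected wins:    {expected_wins:.2f}")
print(f"most likely:      {most_likely}")
```

Because more of the probability sits on the low side of the band, five wins come out more likely than eight even though both are the same distance from 6.5.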

Obviously all these numbers are hypothetical, but it does demonstrate the challenge of determining who gets what reps. Optimizing the reps using all of those probability curves would mean finding the spot where all three quarterbacks combine for the most wins. The biggest issue is that it’s a three-dimensional optimization problem, which requires more calculus than I remember practicing.
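If the calculus is out of reach, a brute-force grid search gets close enough: list every way to split the reps in 10-point steps, score each split, and keep the best one. The sketch below does that with a placeholder scoring function. The weights and the square-root shape in `toy_score` are assumptions for illustration only, not the formula used for the numbers in this piece, so it will not reproduce those splits.

```python
import math
from itertools import product

STEP = 10  # search in 10-point increments of rep share


def candidate_splits(n_qbs: int = 3, step: int = STEP):
    """Yield every split of 100 percent of the reps among n_qbs quarterbacks."""
    shares = range(0, 101, step)
    for combo in product(shares, repeat=n_qbs):
        if sum(combo) == 100:
            yield combo


# Hypothetical weights: how much each quarterback's reps are worth overall
# (learning plus evaluation). Placeholder numbers, not measured values.
REP_VALUE_WEIGHTS = (3.0, 2.0, 1.0)


def toy_score(split) -> float:
    """Placeholder objective with diminishing returns on each quarterback's share."""
    return sum(w * math.sqrt(s / 100) for w, s in zip(REP_VALUE_WEIGHTS, split))


def best_split(score=toy_score, n_qbs: int = 3, step: int = STEP):
    """Return the candidate split with the highest score."""
    return max(candidate_splits(n_qbs, step), key=score)


print(best_split())
```

To use your own assumptions, pass a different `score` function, for example one that turns each share of reps into expected wins from your own floor, ceiling and uncertainty curves.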

Roughly, assuming all of the assumptions have been correct, you can create close optimizations. With crudely similar distributions, a QB battle consisting only of Cassel and Ponder has an ideal split at about 70-30. That would give the Vikings a 16 percent chance of five wins, a 15 percent chance of six wins and a 14 percent chance of seven wins.

Between Teddy and Cassel, the optimal split is 70-30 in favor of Teddy, with a 33 percent chance of six wins, a 26 percent chance of five wins and a 21 percent chance of seven wins. Between Teddy and Ponder, the split moves to 80-20 in favor of Teddy, where there’s a 25 percent chance of five wins, a 23 percent chance of six wins and a 21 percent chance of four wins.

The distribution among all three looks to be split evenly between 70-20-10 for Teddy-Cassel-Ponder and 60-20-20. Given the assumption that veteran quarterbacks progress similarly both in terms of knowledge provided and improvement with reps, it’s not a huge surprise. The first distribution gives the Vikings a 27 percent chance of five wins, a 27 percent chance of six wins, and a 17 percent chance of four wins. With that comes a 13 percent chance of seven wins and a seven percent chance of three wins.

The second distribution gives a 29 percent chance of five wins, a 26 percent chance of six wins and a 19 percent chance of four wins.

The assumptions here are a little off and, if I’m being truthful, the math also assumes the quarterbacks split time in the season (there’s a more complicated formula that will search for maximum wins instead of averaging them), but it functionally provides an appropriate proxy for determining how to split reps given certain functions and assumptions.

Naturally, the projected wins and distribution change based on the information you have and the assumptions you take, but the functional model for determining how to distribute reps remains the same.

Even if we assume that a veteran will be hard pressed to find wins early in the season as a starter if he doesn’t take many or any reps with the first team offense (and remember how elite veteran quarterbacks opened the lockout), it’s still likely more valuable to **give the rookie more reps** in general because the value of information is high; there’s a limited amount of information to be gained from Cassel and Ponder that we don’t already know and a lot of information about Bridgewater still to be discovered.

We can further complicate this by arguing that second-team reps are good at evaluation and instruction even if they’re not as good and price that fact in as well, but for now let’s keep this as simple as we can while still demonstrating the complexity of the concept.

That said, Mike Zimmer and his staff did short-circuit a lot of this by implementing a decision time and sticking to it partway through the process, which by itself will almost always increase the win total. Nevertheless, it’s a good exercise to think about heading into an offseason.

Why is it all the media is in such a hurry to proclaim who is 1, 2, 3. Do you get points on the best media outlet site??? Relax, enjoy the show, the cream will rise to the top. No pushing prodding, or reading tea leaves will make Zim declare before he wants to. In todays vernacular – Chillax.

Outstanding read. Love the graphs for QB evaluation and learning utility based on reps for novice vs vet. As a coach I wonder could these concepts be more instructive if you added 1,2,3 rep values. In today’s game of WR & QB adjustments for coverage and space and the corresponding changes in how to throw them open, WR route running & adjustment errors (3rd team) can greatly destroy instruction and evaluation values for QBs in a new system. Uncertainty gets QBs into bad habits of locking on, checking down too early/often and into poor pocket management. There is also the all important buy in factor from coaches & teammates that results from seeing repeated rep success vs defense (1st team) in practice. I’m not sure of the math but might a low tech weighting system where 1st team reps = 1.25, 2nds = 1, 3rds = .5 adjustments be a better use of data to plan reps for QB evaluation & learning and be more reflective of what really happens in practice? Truly interested in this process.

Thanks!

I think it would be instructive, and I’m sure the value of the reps changes by amount (for example, my guess is 100 reps with the second team and 100 reps with the first team have a proportional difference that is not the same as the difference between 200 reps with the second team and 200 reps with the first team), but those weights seem OK. I guess I value first team reps a little more, but that’s quibbling over details. Interesting point about how there’s a greater chance of MISevaluation on the third team (and to some extent misinstruction) which might be important to value, too.

Thanks for reply – Bad reps help the person learning from the mistake made but really muddy up QB training.

I disagree with one piece of this. Cassel has reached his peak- he’s had every opportunity to show who he is- a solid vet to sit behind for his first three years, multiple coordinators and head coaches, many different lines and receivers. He’s still marginally better than Ponder, but Ponder is a kid who was thrust into duty too soon with no mentors, no receivers, and a coordinator who was in over his head. We still don’t know what Ponder will be when he is 30- but I’d bet better than Cassel was (with a decent supporting cast) the first half in London or against Cincinnati.

Not too many offensive lines or offensive coordinators (certainly not like Alex Smith in San Francisco), but I think the odds of him being an average to just-below average QB are actually fairly low. People forget the quarterbacks that burn out, or that Ponder’s career has more closely matched the burnout/backup QBs than it has the late bloomers/spot starters.

Regardless, these win projections are for next season only and are fairly arbitrary anyway.

I would love to see an article describing what a “late bloomer” QB’s early days look like vs a “lifetime backup” QB’s early career.

Arif –

Thanks for the breakdown. It should be noted, however, that you’re missing one important variable in your calculations – namely that more than one “team” gets reps. Just because Cassel, for example, is getting first team reps, doesn’t mean that it’s at the expense of Teddy’s knowledge. He’s still taking both physical and mental reps with the 2nd team. The first team reps for Cassel are targeted more at the chemistry building, the reps with the second team are about knowledge-gaining. Similarly with third team reps.

Second to last paragraph: “We can further complicate this by arguing that second-team reps are good at evaluation and instruction even if they’re not as good and price that fact in as well, but for now let’s keep this as simple as we can while still demonstrating the complexity of the concept.”

Everyone who does not get first team reps will theoretically be getting second-team reps. I imagine that the benefit won’t be flat for each player taking second-team reps, but my guess is that the difference is not substantial. At any rate, this was a demonstration with a small argument at the end, not a definitive opinion piece.

Arif, thanks for the great article. I never thought about applying an economic framework to help determine QB reps. Were you an economics major in college? You clearly have much more of an analytical background than most sports reporters.

Thank you. I’ve been interested in decision theory in a lot of frameworks, but was only an economics major for about two semesters before deciding I didn’t like the classical approach at the U of M. I ended up declaring a different major four or five times in college. Most of my analytical approach doesn’t come from classes I took in college but my experience in high school and college debate.

Arif, why dont teams give players equal reps with the first team and see how the qbs perform against first team offenses and defenses then base their decision off that

Because reps matter for a lot of things, and if you’re reasonably sure that your player is not the best player at your position, that’s taking away prime opportunities to improve the player you think will start.

Also, you can’t do this for every position. Think about it: there are 11 starters and 3 or 4 other players that will play the vast majority of snaps throughout the season. If you rotate everyone at relatively random intervals, that’s over 500,000 possible combinations of players (assuming 11 positions and the normal training camp breakdown of 3 QBs, 5 RBs, 3 FBs, 5 TEs and so on and so forth) on offense. You won’t get everybody with everybody (not that you technically need to do that) to see who works best with who and still train everyone together with enough reps to develop chemistry.

Basically, give most of your reps to those who need it most, and then give some reps to other people just in case they’re better. If they are, increase their reps.

What i meant was for the positions of weakness like lb and secondary and qb every thing else is great because this is a new regime and alot of these players played in the old one or in a different one so why not see how they react with an opportunity like that and coaches can evaluate based on what they saw because players arent playing the 2nd or 3rd team during games so this way some overlooked players can show off unknown or hidden talent

What planet am I on ?

The Planet of Purple Pocket Protectors.

I listened to the vikingsftw podcast and something has been bothering me. Do you have any graphs and charts on why you chose creamy peanut butter? I mean it tastes good but lacks the excitement of crunchy