Originally Posted by Interesting Ian :
However I think that the only relevant figures here are the 44% and 56% figures for the 1st and 2nd half respectively.
Art Vandelay
If you want to interpolate to five minute intervals, you'll need to know how lambda changes.
I have stated what I want to do and it has absolutely nothing whatsoever to do with considering how the expectation of goals decreases as the match progresses. If I'm interested in how many goals will be scored in a 45 minute period of time then it doesn't matter whatsoever about each 5 minute interval of time. It's only the average for the 45 minutes which can possibly be relevant.
Art Vandelay
Of course, there's a lot of information you're not taking into account, such as yellowcards, field position, etc.
II
I don't need to take them into account since they are implicit in the determination of the probabilities I am using eg the 1x 2 percentages for full time and half time, and the pre-match total goals expectation and superiority.
Originally Posted by Interesting Ian :
What does "having a score of a to b" mean?
Art Vandelay
The first team has a goals, the second team has b goals. The score is a to b.
Eh? The score is a-b, not a
to b.
II
What is a function?
Art Vandelay
A function is a relation between one set and another.
This conveys no meaning to me. What is a set? What is meant by a "relation" in this context?
II
You're not speaking in English. How is a person with no background in mathematics supposed to understand this?? Speak in plain English please.
Art Vandelay
If these concepts could be easily expressed in plain English, the mathematical terrms never would have been invented.
Right. So out of the countless billions of human beings that have ever lived and who have reasoned about mathematics and have built upon other peoples' insights, I'm the one poor sod who has to work everything out for himself from first principles because it is difficult to communicate mathematical reasoning in English.
That's just great . . .
If you assume that x and y have particular values, and calculate, based on that assumption, the probability of getting each particular score, and then sum the resultant probabilities for all scores which result in the first team winning, and then compare the result to the known probability of the first team winning, they should be the same. If they are not, you know your assumption was wrong. By adjusting x and y (not too hard to do, with Excel), you can determine which values are consistent with the known data..
I done all this months ago in drawing up a poisson distribution for correct scores. And obviously that was one thing I checked out, although I had to decide at what point the scores became sufficiently high, and thereby sufficiently unlikely, that they could be safely ignored.
OK, I had to do everything from first principles. Here's how I see it. And note my ability to communicate in plain English!
Let's consider the case where we're calculating the probability that team A will be leading both at half-time and full-time (and similiar reasoning will apply to the other possibilities).
It seems we have to sum the following probabilities up:
The prob that Team A wins 1st half * the prob that Team A wins 2nd half
+
The prob that Team A wins 1st half * the Prob the 2nd half is drawed
+
The prob that Team A wins 1st half * the Prob Team B wins second half by less than Team A's first half lead.
Obviously the only difficulty here is the calculation of Team B wins second half by less than Team A's first half lead.
But I think I've worked out how to do it.
Obtain the total goals pre-match expectation and also the supremacy rating of team A over team B from spreadfair (the supremacy rating is the expectation in numbers of goals of how much team A will defeat team B. Obviously this could be negative).
Now, armed with this information one can then generate the average expected score for the entire match eg 1.5-1.1
If we assume the likelihood of a goal is unaffected by how many goals have been scored so far (which I've read is apparently a pretty reasonable assumption), and taking into account the fact that 44% of all goals will be scored in the 1st half and 56% in the 2nd half, then the number of
additional goals by each team in the 2nd half will be the poisson distribution of 0.56*average expected score for team A for whole match and 0.56*average expected score for team B for whole match.
We can then generate probabilities from the beginning of the 2nd half for team A scoring a specifiable number of gaols
and team B scoring a specifiable number of goals. In otehr words we can generate probabiliites for all possible correct scores for the 2nd half
And of course, using similar reasoning we can do this for the first half too. So what in effect I've done here is to split the whole game into 2 games, one played in each half with the probabilities for the correct scores in each of these halves.
Now to remind ourselves of what we have to do:
To find the probability of Team B winning the second half by less than Team A's first half lead.
If team B is to win by
less in the second half, it must therefore follow that team A necessarily has to win the first half by a goal differential of at least 2 goals.
So team A could win the first half by 2-0 which means that B would have to win the 2nd half by 1-0.
And 3-0/1-0, 2-0, 2-1, 3-2 etc etc
And 3-1/1-0, 2-1, 3-2 etc etc
And etc etc.
Obviously the further I go the less probability the scenarios will be so when the scores become sufficiently big I can safely ignore them.
Right!
So the probability that Team B wins the second half by less than Team A's first half lead will be (the prob that Team A wins 2-0 in first half * the prob that Team B wins 1-0 in 2nd half) + (the prob that Team A wins 3-0 in first half * the prob that Team B wins 1-0 in 2nd half) + (the prob that Team A wins 3-0 in first half * the prob that Team B wins 2-1 in 2nd half) etc etc etc
So anyway that's how I was thinking of doing it. Not entirely sure if it's all correct; will need to think more about it tomorrow.
Night.