I'm sad that I wasn't invited to this party.
This is right up my alley.
You are very cordially invited - don't forget to bring a bottle.
What do you folks think about the priming the data with say equivalent to one 'true' sample length. I think this will, on average, pull the result toward the 'true' mean similar to Bayes but it is really very easy to achieve.
Here is an example of a sample count of a stat that was generated randomly with a 'true' 10% frequency.
{0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,3,3,3,3,3,3,3,3,3,4,4,5,5}
this would produce a running standard Hud stat of:
{ 0.0, 50.0, 33.3, 25.0, 20.0, 16.7, 14.3, 12.5, 11.1, 10.0, 9.1, 8.3, 7.7, 7.1, 6.7, 12.5, 11.8, 11.1, 10.5, 10.0, 9.5, 9.1, 8.7, 12.5, 12.0, 11.5, 11.1, 10.7, 10.3, 10.0, 9.7, 9.4, 9.1, 8.8, 8.6, 8.3, 10.8, 10.5, 12.8, 12.5}
If we prime the Hud stat with the equivalent of one sample length of data with one success we get:
{ 9.1, 16.7, 15.4, 14.3, 13.3, 12.5, 11.8, 11.1, 10.5, 10.0, 9.5, 9.1, 8.7, 8.3, 8.0, 11.5, 11.1, 10.7, 10.3, 10.0, 9.7, 9.4, 9.1, 11.8, 11.4, 11.1, 10.8, 10.5, 10.3, 10.0, 9.8, 9.5, 9.3, 9.1, 8.9, 8.7, 10.6, 10.4, 12.2, 12.0}
This looks much better to me and I would prefer this.
In the above the 'player' happened to have the true mean for this stat, if I generate a 'player' with twice this frequency ie, a player with a 20% stat, but I still prime it with the same 'true' population 10%, I get results that look like this:
{1,2,2,2,3,3,4,4,4,4,4,5,5,5,5,5,6,6,6,6,6,6,6,6,6,7,7,7,7,7,7,7,7,7,7,8,8,8,9,10} << about 20% now
this would produce a running standard Hud stat of:
{ 100.0, 100.0, 66.7, 50.0, 60.0, 50.0, 57.1, 50.0, 44.4, 40.0, 36.4, 41.7, 38.5, 35.7, 33.3, 31.3, 35.3, 33.3, 31.6, 30.0, 28.6, 27.3, 26.1, 25.0, 24.0, 26.9, 25.9, 25.0, 24.1, 23.3, 22.6, 21.9, 21.2, 20.6, 20.0, 22.2, 21.6, 21.1, 23.1, 25.0}
If we prime the Hud stat with the equivalent of one success in sample length of the population 'true' data, exactly the same priming as above, we get:
{ 18.2, 25.0, 23.1, 21.4, 26.7, 25.0, 29.4, 27.8, 26.3, 25.0, 23.8, 27.3, 26.1, 25.0, 24.0, 23.1, 25.9, 25.0, 24.1, 23.3, 22.6, 21.9, 21.2, 20.6, 20.0, 22.2, 21.6, 21.1, 20.5, 20.0, 19.5, 19.0, 18.6, 18.2, 17.8, 19.6, 19.1, 18.8, 20.4, 22.0}
This still looks much better to me and I would prefer this.
All you would need to do this is to 'know' the mean of the population (if you use your own db the median may be better), to do Bayes well you would really need to know more about the underlying distrubtion and it would get a lot more complicated.
I haven't seen this done before for poker so can I call it the "BaseMetal Approach" and sell it to PT or HEM
. Doh! I suppose DC owns it now I've posted.