CORRELATION RESULTS FOR GSF'S COLLECTION
The collection referred to here is not the last extended file but the version that contained 4820 puzzles. I didn't take the last version because it contains many puzzles for which the SER and SER-times are not computed (I guess this is what the value 0 means)
gsf provides the Q1, SER and XR ratings and the Q1 computation times (he also provides the SER computation times, but their format is not useable without transformation: e.g. 1h4m6s).
--------------------------------------------------------------------------------
SER vs Q1 : 0.41
SER vs sqr(Q1) : 0.53
SER vs sqr4(Q1) : 0.62
SER vs sqr6(Q1) : 0.65
SER vs sqr8(Q1) : 0.67
SER vs sqr16(Q1) : 0.69
SER vs log(Q1) : 0.72
This shows that SER is not correlated to Q1 but to log(Q1)
This is indeed the impression I had but this is now confirmed by statistics.
--------------------------------------------------------------------------------
XR vs Q1 : 0.66
XR vs sqr(Q1) : 0.73
XR vs sqr4(Q1) : 0.77
XR vs sqr6(Q1) : 0.77
XR vs sqr8(Q1) : 0.77
XR vs sqr16(Q1) : 0.77
XR vs log(Q1) : 0.77
This shows that XR is less correlated to Q1 than to sqr4(Q1),..., sqr16(Q1) or log(Q1)
--------------------------------------------------------------------------------
ER vs XR : 0.64
ER vs sqr(XR) : 0.73
ER vs sqr4(XR) : 0.78
ER vs sqr6(XR) : 0.80
ER vs sqr8(XR) : 0.81
ER vs sqr16(XR) : 0.82
ER vs log(XR) : 0.83
This shows that ER is less correlated to XR than to sqr4(XR),..., sqr16(XR) or log(XR)
--------------------------------------------------------------------------------
Comments:
We can ask: how is it possible that SER is correlated to log(Q1), XR is correlated to log(Q1) and SER is more correlated to log(XR) than to XR?
Don't forget that we are in a high dimensional space (here R^4820) and there are many ways several unit vectors can be close to each other.
The Q1 scaling is best considered as exponential wrt to the other usual scalings.
--------------------------------------------------------------------------------
Finallly, what can be said of the computation times?
Q1-times vs Q1 : 0.38
Q1-times vs log(Q1) : 0.23
As Q1-times may be 0, one can't compute Q1 vs log(Q1-times), but
Q1 vs sqr16(Q1-times) : 0.50
There doesn't seem to be any correlation between Q1 and Q1-times. But this may be due to the imprecision in Q1-times (the unit is the second and many values are 0).