An interesting partial result.
This is for about 2.7 million puzzles of the file.
I find a significant redundancy, (likely more than 1000 puzzles at the end) .
I assume that this comes from auto morphs in the solution grids.
More interesting is the new ER (skfr rating) distribution
in the table, the first column is the ER rating x10 so 45 is skfr 4.5
The second column is the number of puzzles in the data base havbing this rating
The third column is a progressive sum of the second colum.
- Code: Select all
45 1963 1963
46 12 1975
47 1 1976
56 68 2044
66 2472 4516
67 135 4651
68 3 4654
71 675 5329
72 88 5417
73 26 5443
76 136 5579
77 72 5651
78 145 5796
79 2 5798
80 14 5812
83 553 6365
84 374 6739
85 177 6916
87 6 6922
88 18 6940
89 655 7595
90 8277 15872
91 10647 26519
92 32549 59068
93 5585 64653
94 288 64941
95 481 65422
96 605 66027
97 988 67015
98 239 67254
99 910 68164
100 60200 128364
101 219650 348014
102 661611 1009625
103 326249 1335874
104 236870 1572744
105 61154 1633898
106 12134 1646032
107 1143 1647175
108 105172 1752347
109 288256 2040603
110 65629 2106232
111 84864 2191096
112 3822 2194918
113 773 2195691
114 2455 2198146
115 7388 2205534
116 360833 2566367
117 126500 2692867
118 70 2692937
If I use the same cutoff as in my current work, ER 105, 1 572 744 puzzles (58% of the total) are ignored.
I assume that most of them are downgraded due to the uniqueness rules, but this is in line with my choice to ignore puzzles easy to solve by a player.
So far, the max ER seen by skfr is 11.8, same as in my current work, and the number of 11.8 is very small.
I wait for the final result for more comments, but if this reflects the final status, my new high ratings are very similar to this distribution.