this thread is about the idea of generalizing the Two-Sector Disjoint Subsets (Sue De Coq) technique to more than 2 sectors.
The rule can be put very simple:
A subset of N cells sharing only N different digits is disjoint, if all occurrences of the same digit share one common sector.
This means that none of the N digits can be true more than once in the subset, because all instances of this digit within the subset can "see" each other.
Every candidate outside the subset that shares a sector with all occurrences of the same digit within the subset can be eliminated.
This follows because if such a candidate was true, it would eliminate this digit completely from the subset, leaving us with only N-1 digits for N cells.
This idea was given by Pep in the above mentioned thread, unfortunately he could give only theoretical examples.
Pep wrote:Now all I have to do is find an efficient algorithm to spot these cases....
AFAIK he never accomplished this task, but ronk found a practical example with 3 sectors.
ronk wrote:I found one of these ... a Three-Sector Disjoint Subset pattern ... in puzzle #461 of the top1465. If the pattern wasn't very rare, I would've started a new thread on the topic. As it is rare, I'm only posting this here for those who might have an academic interest.
With basic techniques, the puzzle can be advanced to ...
- Code: Select all
19 3 256 | 24579 24789 4579 | 4568 45678 1678
8 7 25 | 6 24 1 | 9 C45 3
19 4 56 | 3579 3789 3579 | 568 2 1678
----------------------+----------------------+---------------------
2 15 1347 | 8 3479 6 | 345 A4579 B79
357 D68 9 | 2347 2347 347 | 1 A45678 A678
37 68 347 | 1 5 3479 | 23468 46789 26789
----------------------+----------------------+---------------------
4 19 137 | 37 367 2 | 68 689 5
357 59 37 | 347 3467 8 | 26 1 269
6 2 8 | 59 1 59 | 7 3 4
Sets: A = {r5c8,r6c8,r6c9} = {456789}
B = {r4c9} = {79}
C = {r2c8} = {45}
D = {r5c2} = {68}
Excepting the cells of sets A, B, C, and D:
7,9 may be eliminated from box 6 (4 elims)
4,5 may be eliminated from col 8 (3 elims)
6,8 may be eliminated from row 5 (0 elims)
Set A is an almost-almost-almost-disjoint set with three cells and six candidates. It shares two candidates with set B, two different candidates with set C, and an additional two different candidates with set D.
I guess I have an academic interest, but fortunately these Distributed Disjoint Subsets aren't as rare as I feared due to Ron's comment.
Now I finally managed to code a searching routine for Distributed Disjoint Subsets into my solver and I was able to find over 300 puzzles within 100000 randomly generated sudokus, that can't be solved with more basic techniques only but can be solved with this new technique.
Here's a Four-Sector Disjoint Subset for a start:
- Code: Select all
.3...9..5..........8.6...3.4..2...........1..219....6.6.3...72....3....1...45.8..
After SSTS, an XYZ-Wing and a Sue De Coq we get:
.------------------.---------------------.------------------.
| 17 3 6 | 178 *24 9 | 24 18 5 |
| 59 4 127 | 1578 378-12 37-25 | 29 18 6 |
| 59 8 12 | 6 *124 *45 | 249 3 7 |
:------------------+---------------------+------------------:
| 4 6 58 | 2 *18 13-58 | 35 7 9 |
| 3 7 58 | 9 6 *58 | 1 4 2 |
| 2 1 9 | 57 347 347-5 | 35 6 8 |
:------------------+---------------------+------------------:
| 6 5 3 | 18 9 18 | 7 2 4 |
| 8 9 4 | 3 7-2 27 | 6 5 1 |
| 17 2 17 | 4 5 6 | 8 9 3 |
'------------------'---------------------'------------------'
5 cells (*) sharing 5 different digits {12458}.
Digit 1 only appears in column 5, so it can be eliminated from the remaining cells in this column.
Digit 2 only appears in the intersection of box 2 and column 5, so it can be eliminated from the remaining cells of both this box and this column.
Digit 4 only appears in box 2, but there are no eliminations.
Digit 5 appears only in column 6, so it can be eliminated from the remaining cells in this column.
Digit 8 only appears in box 5, so it can be eliminated from the remaining cells in this box.
4 sectors are involved (c56b25).
And another Four-Sector Disjoint Subset:
- Code: Select all
8..9..3....5.....69...78....1....94..7...........2..57..4..6..2.5.2.7.........41.
After SSTS we get:
.------------------.---------------------.------------------.
| 8 246 126 | 9 56 245 | 3 7 14 |
| 7 234 5 | 134 13 124-3 | 8 9 6 |
| 9 346 16 | 346 7 8 | 5 2 14 |
:------------------+---------------------+------------------:
|*256 1 236 | 7 368-5 *35 | 9 4 38 |
| 45 7 39 | 13458 13589 145-39 | 2 6 38 |
|*46 8 369 | 36-4 2 *349 | 1 5 7 |
:------------------+---------------------+------------------:
| 3 9 4 | 15 15 6 | 7 8 2 |
| 1 5 8 | 2 4 7 | 6 3 9 |
|*26 26 7 | 38 389 *39 | 4 1 5 |
'------------------'---------------------'------------------'
6 cells (*) sharing 6 different digits {234569}.
Digits 2 and 6 only appear in column 1, but there are no eliminations.
Digit 3 and 9 appear only in column 6, so they can be eliminated from the remaining cells in this column.
Digit 4 appears only in row 6, so it can be eliminated from the remaining cells in this row.
Digit 5 appears only in row 4, so it can be eliminated from the remaining cells in this row.
4 sectors are involved (r46c16).
And a Five-Sector Disjoint Subset
- Code: Select all
.7.......9.....1.41.3.7.92...9...3..2..1....5.1..28........5.9.7...4..5..8......6
After SSTS we get:
.------------------.-------------------.------------------.
| 4 7 *28 | 29-8 *189 129 | 5 6 3 |
| 9 *256 68-2 | 23568 3568 236 | 1 7 4 |
| 1 *56 3 | 456 7 46 | 9 2 8 |
:------------------+-------------------+------------------:
| 8 *46 9 | 4567 56 467 | 3 1 2 |
| 2 *34 7 | 1 *39 49-3 | 6 8 5 |
| 36 1 5 | 36 2 8 | 7 4 9 |
:------------------+-------------------+------------------:
| 36 23-6 14 | 268 68-1 5 | 248 9 7 |
| 7 9 26 | 2368 4 236 | 28 5 1 |
| 5 8 14 | 279 *19 1279 | 24 3 6 |
'------------------'-------------------'------------------'
8 cells (*) sharing 8 different digits {12345689}.
Digit 1 only appears in column 5, so it can be eliminated from the remaining cells in this column.
Digit 2 only appears in box 1, so it can be eliminated from the remaining cells in this box.
Digit 3 only appears in row 5, so it can be eliminated from the remaining cells in this row.
Digit 4 only appears in the intersection of box 4 and column 2, but there are no eliminations.
Digit 5 only appears in the intersection of box 1 and column 2, but there are no eliminations.
Digit 6 only appears in column 2, so it can be eliminated from the remaining cells in this column.
Digit 8 only appears in row 1, so it can be eliminated from the remaining cells in this row.
Digit 9 only appears in column 5, but there are no eliminations.
5 sectors are involved (r15c25b1).
I know that Ron doesn't appreciate this idea, because it's almost impossible for a human solver to spot. But at least for me that applies to ALS chains, too. Once spotted, I think the Distributed Disjoint Subset is easy to understand, so I deem it worth to investigate it further, because maybe it can explain eliminations easier than long chains, even if it's hard to spot. After all it only requires counting.
Greetings, Stefan