You are given an array \$A\$, which may contain duplicate elements. In each swap, you may swap the value of any two indices \$i, j\$ (i.e. switch the values of \$A_i\$ and \$A_j\$). What is the least amount of swaps needed to sort the array, and what are the corresponding swapped indices?

Scoring

This is code-golf, so shortest code wins. However, your program must terminate in reasonable time (less than 10 seconds) for any array \$A\$ with less than 1000 elements.

Input

The array \$A\$, in any necessary form.

Output

A list of swaps, with each swap being a pair of numbers, in sequential order - first pair in list is swapped first, in any necessary form.
In the output, the numbers should all represent indices. You may output the answer either one-indexed or zero-indexed, but my samples will use one-indexing.
The answer might not be unique. However, your answer should still have the same length as the optimal sequence of swaps.

Test cases

[4,3,2,1] => [(1,4),(2,3)]
[1,3,2,3] => [(2,3)]
[1,2,1,3,2,3] => [(2,3),(4,5)]
[1,2,3,4] => []
[4,2,5,1,3,3] => [(2,6),(1,5),(1,4),(2,3)]

Baaing Cow

Posted 2019-12-28T03:12:11.003

Reputation: 349

Yes, yes, and yes (although there may be more than one optimal sequence, yours should still have the same length as the optimal sequence) – Baaing Cow – 2019-12-28T03:23:27.943

1Can you please add more and longer test cases? – Jonah – 2019-12-28T08:07:59.450

1@Jonah Done. Although the samples may be quite a bit suggestive... – Baaing Cow – 2019-12-28T08:37:52.670

1Per @Neil's comment on my answer, can you please add the test case 4,5,2,1,3,3 with expected length 4. (1 5) (0 4) (0 3) (1 2) is one possible (0-indexed) solution. This test case ensures that you can't just use the same algorithm that works for the "no repeats" case. – Jonah – 2019-12-28T18:48:00.003

1@BaaingCow are you sure this is possible within 10 seconds when there are multiple identical numbers, as per comments below the J and Jelly answers? – Nick Kennedy – 2019-12-28T18:48:11.457

On second thought, it seems that my reference solution is wrong - whoops :| My code also returns an swap sequence of length 5 for 4,5,2,1,3,3. Are challenges with no reference solution allowed? – Baaing Cow – 2019-12-29T03:52:51.823

An more readable algorithm is available at https://stackoverflow.com/questions/15152322/compute-the-minimal-number-of-swaps-to-order-a-sequence/15152602#15152602 However, it doesn't handle arrays with duplicates well. So it could perhaps be tweaked to do that. Or another approach is necessary.

– Kjetil S. – 2019-12-29T06:48:51.077

@BaaingCow You don't need a reference solution, but you do need correct test cases. I believe my edited answer is now correct, and I included a reference version of the algorithm. It found shorter solutions for both of the longer test cases you had previously posted. – Jonah – 2019-12-30T08:57:50.220

Answers

Jelly, 51 43 bytes

ṪµṢ,$Ḣ©€E¬a="ʋṚƊ×\ẸÞṪTḢ,®ṛ¦Ẹ}¡@¥
WÇẸÐ¿FĖẠƇÄ

Try it online!

A pair of links which when called as a monad returns a list of pairs of 1-indexed indices to swap. Based on the clever J algorithm developed by @Jonah, so be sure to upvote that one too!

Explanation

Helper link

Takes a list containing a list of the remaining items to be sorted, optionally preceded by the most recent swap. Takes full advantage of the way that Ḣ (Head) and Ṫ (tail) pop items from a list, which for someone from an R background was something I found took some getting used to when I started using Jelly!

Ṫ                                 | Tail (yields the list of remaining items, and also removes this from the list that Ð¿ is collecting up)
 µ                                | Start a new monadic chain
  Ṣ,$                             | Pair the sorted list with the unsorted remaining items
     Ḣ©€                          | Take the head of each, removing from the list and storing each in turn in the register (so the register is left holding the head of the unsorted list)
               Ɗ                  | Following as a monad
             ʋṚ                   | - Following as a dyad, using a list of the remaining sorted and unsorted lists (in that order) as right argument
        E                         |   - Equal (i.e. the head of the unsorted and sorted lists matches)
         ¬                        |   - Not
          a                       |   - And:
              ="                  |     - Equal to zipped right argument (effectively a logical list indicating which of the remainder of the sorted list matched the head of the unsorted and vice-versa)
                ×\                | Cumulative product
                  ẸÞ              | Sort by whether any true
                    Ṫ             | Tail
                     T            | Convert to list of truthy indices
                      Ḣ           | Head (will be 0 if the head of the unsorted and sorted lists matches)
                               ¥  | Following as a dyad, using the remaining unsorted list as right argument
                       ,          | Pair the index to swap with:
                              @   | - Following with arguments reversed:
                           Ẹ}¡    |   - Run following once if index to swap is nonzero:
                        ®ṛ¦       |     - Replace item at index to swap with the original head of the unsorted list that was stored in the register

Main link, takes a list to be sorted

W           | Wrap in a further list
  ẸÐ¿       | While any non-zero, do the following collecting up intermediate results
 Ç          | - Call helper link
     F      | Flatten (will remove the now empty initial list)
      Ė     | Enumerate (pair the index of each with the item)
       ẠƇ   | Keep only those without zeros
         Ä  | Cumulative sum of each pair

Original algorithm that fails with repeated values, 16 bytes

ỤịƬ€`Ṣ€ŒQTịƲṡ€2Ẏ

Try it online!

A monadic link taking a list of integers and returning a list of lists of integers representing the 1-indexed swaps.

Explanation

Ụ                 | Grade the list z up, i.e., sort its indices by their corresponding values.
  Ƭ€`             | For each member of the resulting list, do the following until no new values, using the list of indices itself as the right argument. Keep all intermediate values. 
 ị                | - Index into list
           Ʋ      | Following as a monad:
     Ṣ€           | - Sort each list
       ŒQ         | - Logical list, true for original copy of each unique value
         T        | - Indices of truthy values
          ị       | - Index into list
            ṡ€2   | Split each into overlapping sets of 2
               Ẏ  | Join outer together

Inefficient algorithm that is too slow for long lists with lots of repeats, 27 bytes

ĠŒ!€ŒpµFịƬ€`Ṣ€ŒQTịƲṡ€2Ẏ)LÞḢ

Try it online!

Nick Kennedy

Posted 2019-12-28T03:12:11.003

Reputation: 11 829

Looks like this fails on the same one that mine does: 4 5 2 1 3 3 (courtesy of Neil) – Jonah – 2019-12-28T17:13:42.320

1@Jonah indeed. I’ve tried testing the Cartesian product of all permutations of identical ones but it fails for the last two examples because there are too many possibilities. I’ll give it some more thought. – Nick Kennedy – 2019-12-28T18:45:02.737

Yep. I'm stuck on exactly the same thing. – Jonah – 2019-12-28T18:45:36.940

@Jonah what do you think of what I’ve done? I noticed that the problem arose when the items that ultimately needed swapping with the duplicates were not in ascending order from left-to-right, so sort the duplicates on that basis. I’m not sure it will always work though. – Nick Kennedy – 2019-12-28T19:18:36.053

I believe this one 4,5,2,1,3,3,10,9,7,6,8,8 can be done in 8. Ie, sometimes you need it to go one way, sometimes the other way. Try it online!

– Jonah – 2019-12-28T19:48:46.930

@Jonah quite right. I’ll leave my two previous answers up for now; one works for any lists without duplicates, the other works for any list but becomes very slow with multiple repeated values. – Nick Kennedy – 2019-12-28T19:54:28.447

My only other idea, which I don't have time to explore atm, is to first remove the "clumped dups", then solve, then add back in the swaps for the clumped dups, because (I think) the clumped dups always move as a chunk of swaps. – Jonah – 2019-12-28T19:56:20.567

@Jonah I haven’t checked, but would this work even for multiple sets of clumped dups which partially swap with each other? – Nick Kennedy – 2019-12-28T20:13:45.917

Okay I believe I have a correct algorithm that is fast in all cases. See my (very ungolfed) edit. In addition, note that the longer test case anwers provided in the OP were incorrect: My algorithm found shorter solutions. – Jonah – 2019-12-29T22:19:15.107

1@Jonah Very clever! I’ve rewritten your algorithm in Jelly. The implementation is different in places, but the overall idea is the same. – Nick Kennedy – 2019-12-30T17:44:41.363

1Thanks Nick! I was kinda hoping you'd do a Jelly one -- was curious how compact it would be. My J solution could be golfed further but I think it will be somewhat long no matter what because these procedural algorithms aren't a great fit for J. – Jonah – 2019-12-30T17:48:29.443

J, 126 bytes

[:>[:{:"1@}./:~(({.@I.@:~:>@{.)({:@](|.@:{`[`]}~;])[,[:(0{*/,&I.{.)(<0{i.@$)*"1{"0 1=|.@])(,:>@{.))^:(-.@-:>@{.)^:a:;&(0 2$'')

Try it online!

Here is the same code in a more procedural form, if you're looking for an explanation of algorithm.

f=. 3 :0
sorting=.y NB. copy of input we'll mutate
sorted=./:~y
swaps=.0 2$''
for_j. i.#y do.
  correct=.j{sorted
  cur=.j{sorting
  if. cur = correct do.
    continue.
  end.
  NB. correctly fill the current idx j
  valid_idxs=.(j<i.#sorting)
  good_j=. valid_idxs * correct = sorting 
  NB. will any of these be the correct desitnation for the current elm?
  good_dest=. valid_idxs * cur = sorted
  both=.good_j * good_dest
  use=.(1 e.both) { good_j ,: both
  swap_with=. {.I.use
  swaps=.swaps,(j, swap_with)
  sorting=.(sorting {~ swap_with, j) (j, swap_with)} sorting
end.
swaps
)

Try it online!

original answer, doesn't work for repeats

J, 18 bytes

[:;(2]\])&.>@C.@/:

Try it online!

NOTE: Thanks to Neil for pointing out this currently fails on the test case 4 5 2 1 3 3.

/: Grade up, ie, return the permutation that puts the list into order
C.@ Convert that to cyclic permutation form, which uses boxes
&.> For each of those boxed lists, ie, permutation cycles...
(2]\]) Return all consecutive pairs of two elements. This gives us the swaps, and it works because J gives the cycle list in descending order.
[:; raze the list of boxes. This gives us all the swap pairs as a single list of pairs, ie, an N x 2 matrix.

Jonah

Posted 2019-12-28T03:12:11.003

Reputation: 8 729

1Wow. Can you explain how it works, and what strategy you used to solve this? – Baaing Cow – 2019-12-28T10:06:31.440

@BaaingCow Done. – Jonah – 2019-12-28T10:14:01.067

@BaaingCow Btw, my swaps with this method are all listed in descending order (ie, 3 1 for swapping 1 and 3). lmk if that's not acceptable, and I'll reverse them. It only adds a few bytes. – Jonah – 2019-12-28T10:23:54.823

Suppose your input is 4,5,2,1,3,3 (or whatever the ambiguous equivalent would be for your code). What's to stop the code outputting 1,4;4,6;6,2;2,3;3,5? – Neil – 2019-12-28T11:25:19.740

Excellent catch, @Neil . The problem is there are multiple valid grade ups bc of the repeats, and they can lead to solns of different length. I could brute force all of them, which seems excessive. I think it would be enough to brute force the endpoints of runs of repeats. Is that the best I could do? – Jonah – 2019-12-28T16:24:23.663

Is this also why the test echo 1 test 3 3 2 3 fails? It should be just 1,3 shouldn't it? Not 1,3 ; 1,2. – Kjetil S. – 2019-12-29T01:20:05.730

@KjetilS. Yep, basically the algorithm is unreliable for any list with repeats. – Jonah – 2019-12-29T02:23:08.153

@LuisMendo Indeed. I have edited it with a now much uglier but at least correct solution. I will re-golf it later. – Jonah – 2019-12-29T22:22:21.303

That was some byte count increase :-) – Luis Mendo – 2019-12-29T22:28:18.060

Wolfram Language (Mathematica), 219 188 184 bytes

F@A_:=MinimalBy[##&@@@f/@#[[1]]&@*(FindPermutation@A~PermutationProduct~#&)/@GroupElements@PermutationGroup[Cycles@*List/@Join@@((f=Partition[#,2,1]&)/@Values@PositionIndex@A)],Length]

Try it online!

-31 bytes thanks to @Chris

-4 bytes by making the code much faster (no more GroupSetwiseStabilizer slowness)

Thanks to @NickKennedy for pointing out that the first version failed on non-unique list entries. The present version is a tour-de-force that calculates the permutation group that leaves the list unchanged, then tries each element of this group to see if it can be used to shorten the result.

This code still goes very slowly on massively duplicated lists.

Explicit code:

A = {4,5,2,1,3,3,10,9,7,6,8,8}
  (* compute the indices of equal elements *)
Q = Values[PositionIndex[A]]
  (* compute the permutation group that leaves A invariant *)
G = PermutationGroup[Cycles@*List /@ Join @@ (Partition[#, 2, 1] & /@ Q)]
  (* find the permutation that sorts A and combine it with every element of G *)
F = PermutationProduct[FindPermutation[A], #] & /@ GroupElements[G]
  (* convert every resulting permutation to the desired form *)
J = Join @@ (Partition[#, 2, 1] & /@ #[[1]]) & /@ F
  (* pick the shortest solutions *)
MinimalBy[J, Length]

Roman

Posted 2019-12-28T03:12:11.003

Reputation: 1 190

2Fails the same edge cases that Jonah’s and mine do - see comments above. – Nick Kennedy – 2019-12-28T20:08:10.233

Thanks @NickKennedy! Fixed. – Roman – 2019-12-29T19:41:17.950

well done. However, you’ve now run into the same issue that Jonah and I had with using permutations - the runtime becomes very long for longer inputs with duplicates such as the last two sets in the original question. This answer certainly won’t handle a 1000 length list with lots of duplicates. I’m still waiting to hear from the OP about this, but my suspicion is you can choose between duplicates and a quick run time but can’t have both. – Nick Kennedy – 2019-12-29T19:45:40.713

@NickKennedy my suspicion is that this problem needs to be solved in GAP by someone who is well versed in group theory. Maybe there are some group-theoretical shortcuts to be had. Not an easy problem.

– Roman – 2019-12-29T21:03:04.027

1not sure if you’ve seen, but there are now working solutions in J and Jelly – Nick Kennedy – 2019-12-31T01:01:33.220

Swaps to Sort an Array

Scoring

Input

Output

Test cases

Answers

Jelly, 51 43 bytes

Explanation

Helper link

Main link, takes a list to be sorted

Original algorithm that fails with repeated values, 16 bytes

Explanation

Inefficient algorithm that is too slow for long lists with lots of repeats, 27 bytes

J, 126 bytes

original answer, doesn't work for repeats

J, 18 bytes

Wolfram Language (Mathematica), 219 188 184 bytes