Python 3.8 (on Unix), 429 bytes

Since nobody else has posted an answer yet, I thought I'd have a go, though I not 100% certain I haven't missed something important (and I am 100% certain the code could be shrunk down significantly).

As noted in the question, evaluating arbitrary Python strings is dangerous. Fortunately, it's easy to come up with a safe subset of strings whose evaluation is still Turing complete, by embedding the Turing complete While Programming Language, summarised below:

This can be embedded into Python using only strings that match the following regex:

^([0-9 \n():_=+-]|while)+$

To reduce the number of necessary characters, we use a few embedding hacks:

variable names are built only of underscores
if B then C1 else C2 is emulated using while loops (and temporary variables x and y, which hang around afterwards but that's ok):

x=B;y=not x
while x:C1;x=False
while y:C2;y=False

skip is emulated using any constant expression, e.g. 0
true is emulated using e.g. 0==0
false is emulated using e.g. 0==1
B & C is emulated using a pre-evaluated if-then-else (and another temporary variable x):

if B then x=C else x=False

¬ B is emulated using 1-B

To make our life easier, we also allow [ and ], as this permits direct operations on lists (including slicing and concatenation) which avoids having to translate the problem from one on lists into one on integers.

Update: as noted by @ØrjanJohansen, it's also necessary to include < to ensure basic arithmetic remains polynomial.

I claim that any program with just those characters will either execute, or (far more likely) fail to execute, or raise an exception, but will not crash the interpreter or change any global state. Please correct me if I'm wrong.

The rest of the solution is now similar to the prototype presented in the question. We generate all strings of increasing lengths n and execute them, prepopulating the input in variable _ and checking and validating any output from the same variable. We catch any raised exceptions and continue.

The main tricky bit is ensuring polynomial complexity (and termination). We do this by breaking out of each execution after n seconds, using Unix signals and a handler that raises an exception (which the string executions themselves have no way of catching). Since every program will appear repeatedly (eg with trailing whitespace), we don't need to worry about breaking out too soon from a correct program. One tricky bit here is to make sure that the program doesn't crash if the signal is received after the execution completed (successfully or not) but before the signal has been reset; this explains the nested try-catch.

from itertools import*
from re import*
from signal import*
I=eval(input())
signal(SIGALRM,0)
for n in count():
 for p in product(*[set(map(chr,range(128)))]*n):
  if match("([0-9 \n()[\]:_=<+-]|while)+$",(s:="".join(p))):
   try:
    try:
     g={'_':I[::]};alarm(n);exec(s,g);h=g['_']
     if h and all(h.count(x)<=I.count(x)for x in h)and sum(h)==0:
      print(h);n=-1;break
    except:0
    alarm(0)
   except:0
 if n<0:break

Try it online! (though don't expect an actual result; it may be P but it's obscenely slow)

Uri Granta

Posted 2019-04-05T16:12:08.207

Reputation: 2 675

This looks like a great effort. Now..... is it correct? :) – Anush – 2019-11-02T12:46:51.660

@Anush The three things to be convinced of are (1) it doesn't crash (2) it eventually finds a polynomial time program (3) it does so in polynomial time. I'm pretty sure 2 is right since While is Turing complete; I'm fairly sure 1 is right, since I can't see how to do anything bad without . or alphabetic characters (apart from while); and I'm mostly sure 3 is right, as I can't imagine anything I've used in the program being non-polynomial. Still, for a formal proof you'd need to prove the CPython implementation of the parser and various built ins! – Uri Granta – 2019-11-02T12:54:23.010

1I think While alone is not enough. The problem is that the language has so primitive arithmetic that although it is Turing-complete, it requires exponential time to do stuff, e.g. testing A < B requires something like brute force incrementing until you find the smallest number. However, I think your Python fragment works, because you also have efficient mutable lists. – Ørjan Johansen – 2019-11-02T22:35:17.653

You may have to change the input encoding, though. I don't see a way for the fragment to decode large input numbers in polynomial time. (Then again, I'm not a Python expert.) – Ørjan Johansen – 2019-11-02T22:53:36.437

1Actually, I think adding < to the fragment might fix that, assuming that doesn't make it unsafe. (I think that's also generally enough to make While not exponentially-blowing-up.) – Ørjan Johansen – 2019-11-02T23:12:38.093

@ØrjanJohansen Good points I hadn't thought of. I'll add <, which doesn't make it unsafe as far as I can see. Presumably this also makes multiplication polynomial, via repeated doubling? – Uri Granta – 2019-11-03T07:48:45.380

1Yeah I think everything is polynomial that should be. My reasoning is a bit quirky: using < and repeated doubling you can search for the largest bit in a number, or a difference of numbers. Repeat until you have all the bits. And repeated doubling allows you to turn a list of bits back into a number. – Ørjan Johansen – 2019-11-03T21:04:46.413

3@EmbodimentofIgnorance Why do you think this a duplicate? There is a restriction here that makes pretty much none of the answers on the other question valid here. – Post Rock Garf Hunter – 2019-04-05T18:04:32.830

If someone claims she has an algorithm (i.e. constructive proof) and publishes a non-constructive proof instead, no one will believe her – Luis Mendo – 2019-04-05T18:08:06.073

1This isn't even close to a duplicate. How do I get this reopened? – Lopsy – 2019-04-05T21:18:32.570

1@Lopsy Only if 5 people vote for it to be reopened. I don't think this is a duplicate, but I don't think it's a particularly well-formed prompt. There are glaring issues with it that I think required more time in the Sandbox. – Don Thousand – 2019-04-05T22:17:32.450

11I hammered this open. This isn't at all a dupe of the general subset sum challenge since the complexity restriction means none of the answers on the other question come even close to working, nor can they possibly unless P=NP. Answers to this will be very very different. – xnor – 2019-04-05T22:20:30.370

Solve Subset-Sum in polynomial time (...if P = NP)

The Problem

Answers

Python 3.8 (on Unix), 429 bytes