Why can't you match orders in parallel using a logarithmic reduction, the same way you would sort in parallel? Is it that there isn't enough computation to parallelize beyond sorting by time and price?
I think that's allowed, but this is where my meagre expertise runs out. Normally you have to process orders serially, or at least use an algorithm that yields exactly the same outcome that serial execution would give. That constraint only applies within a single order book, though.
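To make the "serial within a book" point concrete, here is a rough Python sketch, not how any real exchange does it. It assumes plain limit orders and price-time priority, and that each symbol has its own book. The names `Order`, `match_book` and `match_all` are made up for illustration. Each book is matched strictly in arrival order, while separate books are handed to separate workers, since nothing in one book depends on another.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass
class Order:
    seq: int      # arrival sequence number (time priority)
    symbol: str   # which order book the order belongs to
    side: str     # "buy" or "sell"
    price: int    # limit price
    qty: int


def match_book(orders):
    """Match one book's orders serially, in arrival order."""
    bids, asks, trades = [], [], []   # resting entries: [price, seq, qty]
    for o in sorted(orders, key=lambda x: x.seq):
        if o.side == "buy":
            book, opposite = bids, asks
            crosses = lambda rest_price: rest_price <= o.price
            priority = lambda r: (r[0], r[1])    # lowest ask first, then oldest
        else:
            book, opposite = asks, bids
            crosses = lambda rest_price: rest_price >= o.price
            priority = lambda r: (-r[0], r[1])   # highest bid first, then oldest
        remaining = o.qty
        opposite.sort(key=priority)
        while remaining > 0 and opposite and crosses(opposite[0][0]):
            rest = opposite[0]
            fill = min(remaining, rest[2])
            # trade record: (resting seq, incoming seq, price, quantity)
            trades.append((rest[1], o.seq, rest[0], fill))
            remaining -= fill
            rest[2] -= fill
            if rest[2] == 0:
                opposite.pop(0)
        if remaining > 0:
            book.append([o.price, o.seq, remaining])  # rest the unfilled part
    return trades


def match_all(orders, workers=4):
    """Run each symbol's book on its own worker; books share no state."""
    by_symbol = defaultdict(list)
    for o in orders:
        by_symbol[o.symbol].append(o)
    symbols = sorted(by_symbol)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(match_book, (by_symbol[s] for s in symbols))
        return dict(zip(symbols, results))
```

Threads are used here only to show that the books are independent. In CPython they won't give a real speedup for CPU-bound work like this. The loop inside `match_book` is the part that resists parallelizing, because whether an order fills or rests depends on what every earlier order in that book left behind.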