implement basic scheduler #145

wanderingbort · 2017-08-08T15:12:45Z

This PR addresses the features of #127

A transaction scheduler is now its own concept extracted from block generation. Producers can select a different scheduling algorithm for any block/test/etc.

A few basic schedulers are provided as are some meta-schedulers which can be used to compose fuzzy test cases. As new requirements emerge the set of schedulers and test cases should grow.

In addition, basic plumbing for scheduling transactions generated as an output of processing other transactions is included however, we currently have no capacity for generating/processing that type of transaction. Those will come as part of a later feature pull.

add basic tests for scheduler

NB: we do not have the facilities to create/process these yet but they can be scheduled scheduling is generated/signed agnostic though the block format still segregates them chain_controllers _db now has indices for generated transactions. tests using all signed transactions for now. closes EOSIO#139

… the producer plugin

changed the output of the schedulers to be just pending_transactions as those are tagged unions of pointers making them small and easily copyable without needing the additional indirection implemented basic test cases to stochastically verify that order of transaction delivery does not affect schedulability

nathanielhourt · 2017-08-08T17:21:29Z

I haven't followed the conversations on scheduling that closely, so I probably need to be filled in. It looks like the overall strategy used here is to collect pending transactions in a list the same way as always, and then when it comes time to produce a block, switch that list over several cycles/threads. Is that correct?

That strategy doesn't make sense to me, but perhaps it's because I don't know what all the requirements of scheduling are... As I see it right now, transactions roll in from the network one at a time, but we are saving them up and scheduling them all at once at the end? Why not build up the schedule one trx at a time as they come in so that when it comes time to produce a block (crunch time, when we're on a deadline), we already have them all scheduled because all that work was done earlier?

So is there some necessary information we don't know until we have a full list of (hopefully-valid) transactions we'd like to schedule in the block? So what are the requirements for a valid schedule? I know we can't put messages with the same code account in different threads of the same cycle... What other requirements are there?

wanderingbort · 2017-08-08T17:57:22Z

@bytemaster and I talked about pipe-lining scheduling so that we can push the cost of that entirely out of the quantum a producer has to produce a new block once it finally has knowledge of the preceding block. It is an open area of discussion.

However, without some commitment scheme that provides some guarantees that you are not pre-scheduling transactions another producer will commit to a block before your next quantum, pre-scheduling has limited benefits. Best case scenario, you have built a pre-schedule that you must drop some transactions from once you finally learn the preceding block's contents and the resulting schedule is still performant. Worst case, the transactions have "high-degree" scopes and the resulting schedule after there removal is unnecessarily synchronous without them.

I think more than likely, a set of well behaving producers will only have to schedule transactions broadcast during the previous producer-quantum and/or generated by the previous block. The discovered transactions may come in at the very end of the previous quantum and the block certainly will. In this case, pre-scheduling is largely busy-work as a majority of transactions will end up in blocks prior to the block a producer would get to generate.

nathanielhourt · 2017-08-08T18:23:56Z

What are the possible conflicts between transactions? I know of a couple:

Trxs A and B cannot be in separate threads of the same cycle if A and B both contain a message with code account Q
Trxs A and B cannot be in separate threads of the same cycle if A contains a message with code account Q and B contains a message with account Q in scope

Are there any others?

wanderingbort · 2017-08-08T18:46:19Z

as of now, conflicts only arise from these cases (though sheepishly I didnt realize that the code account in the messages may not be referenced in the scope array. if that is the case, I need to respect it in scheduling and I currently do not)

Pre-scheduling shouldn't result in an invalid schedule when transactions are dropped just potentially a poor schedule. Each additional transaction potentially adds restrictions to the scheduler, removing a transaction may reduce the restrictions a scheduler has. Naively, the fewer restrictions the more parallel a schedule can be. Having too many "phantom" restrictions from removed transactions has the same effect on the schedule efficiency for now benefit downstream (eg a retired transaction).

… when scheduling

nathanielhourt · 2017-08-08T21:02:56Z

One thing I think is worth bearing in mind is that "scheduling" does not matter to any node at all, except to the node scheduled to produce block N, between the time that node has received block N-1 and the time it produces block N. Outside that very small window, scheduling is irrelevant and probably shouldn't be done at all, unless we make it so cheap that it's basically free. At all other times, the only thing any node cares about is whether the transaction appears valid enough to propagate it through the P2P network.

During that scheduling window, transactions appear one at a time, except for possibly some pending transactions we had that block N-1 didn't include (these should be rare, so it's probably not worth optimizing for them). So we probably do want to optimize for that incoming trickle of transactions.

Hmm, it's just occurred to me that notifying accounts is hairy... When a message handler notifies account A, to deliver that notification we've got to evaluate a handler with A as the code account... but that makes scheduling pretty much impossible because the scheduler doesn't know that A is going to be notified, so it can't ensure that A isn't already code account in a different thread.... or did I make a mistake somewhere?

wanderingbort · 2017-08-08T22:07:26Z

or did I make a mistake somewhere?

Its my understanding after we had the big scheduling talk a few weeks back that all affected accounts would be listed explicitly and a transaction would fail if it violated this constraint. I would think this would include any notify actions that are delivered synchronously, those accounts must be in the transactions list of scopes.

During that scheduling window, transactions appear one at a time, except for possibly some pending transactions we had that block N-1 didn't include

During the quantum for block N, there is a lot to do that is post-scheduling such as actually executing the transactions in parallel and pruning any that fail during actual block dispatch (for whatever reason) and transmitting the block. I don't know that we can afford to wait for trickles. This may be a case where we have to trade transaction latency for throughput? I'm open to a JIT-scheduler but I do think that will limit the efficiency of scheduling and as a result decrease efficiency for downstream block learners (both partial and full).

That said, If every Producer is doing "one-time-batch" scheduling, then after validating block N-1 we should have a list of pending transactions ready to schedule which roughly resembles those that arrived since the time Producer(N-1) performed its scheduling. The time between these scheduling batches should be roughly the time between blocks. The latency expectation for a transaction would be [bt, 2bt] instead of [0, bt] where "bt" is the time between blocks. Likewise a scheduler has full knowledge of the transaction set which gives us the best shot at efficiency.

bytemaster · 2017-08-10T19:27:57Z

I have been reading the discussion and the "code account" should not be relevant to scheduling, only the scopes. The scopes define which areas of memory can be locked. The same code can execute in parallel in different scopes.

bytemaster · 2017-08-10T19:29:53Z