I want to distribute the processing of large batches. The idea is to use Spring Batch: fire up a bunch of AMQP consumers in the cloud, then load cheap tasks (like item IDs) and submit them to an AMQP exchange. The consumers themselves will write the results.
Is there a ready-made library to accomplish this?
A few thoughts:
- Spring Batch is totally negotiable.
- The batch size is several million items. I don't want to kill my message broker by brute-force submitting all these IDs at once; I'd rather use some kind of throttling.
- I do want to know which items have been processed so I can monitor progress, so the batch-controlling process will have to receive replies from the consumers.
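
To make the last two points concrete, here is a rough sketch of the control flow I have in mind: the controller keeps at most a fixed number of IDs in flight and only submits the next one once a reply has come back. It is only an illustration, not a finished design. The two in-memory `BlockingQueue`s are stand-ins for the AMQP task and reply queues, and every name in it (`ThrottledDispatcher`, `run`, `maxInFlight`, `Result`) is made up for this example.

```java
import java.util.Set;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;

public class ThrottledDispatcher {
    // Poison pill telling a consumer to stop (hypothetical convention).
    private static final long STOP = -1L;

    static final class Result {
        final Set<Long> processed; // IDs acknowledged by consumers
        final int peakInFlight;    // highest in-flight count seen by the producer
        Result(Set<Long> processed, int peakInFlight) {
            this.processed = processed;
            this.peakInFlight = peakInFlight;
        }
    }

    static Result run(long itemCount, int maxInFlight, int consumerCount)
            throws InterruptedException {
        // In-memory stand-ins for the AMQP task queue and reply queue.
        BlockingQueue<Long> tasks = new LinkedBlockingQueue<>();
        BlockingQueue<Long> replies = new LinkedBlockingQueue<>();
        // One permit per message that may be outstanding at the broker.
        Semaphore window = new Semaphore(maxInFlight);
        Set<Long> processed = ConcurrentHashMap.newKeySet();
        ExecutorService pool = Executors.newFixedThreadPool(consumerCount + 1);

        // Consumers: take an ID, do the work, write the result, then reply.
        for (int i = 0; i < consumerCount; i++) {
            pool.execute(() -> {
                try {
                    for (long id = tasks.take(); id != STOP; id = tasks.take()) {
                        // A real consumer would process the item and write its result here.
                        replies.put(id);
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }

        // Reply listener: record progress and free one slot per reply.
        pool.execute(() -> {
            try {
                for (long seen = 0; seen < itemCount; seen++) {
                    processed.add(replies.take());
                    window.release();
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });

        // Producer: blocks in acquire() whenever maxInFlight IDs are outstanding.
        int peak = 0;
        for (long id = 0; id < itemCount; id++) {
            window.acquire();
            peak = Math.max(peak, maxInFlight - window.availablePermits());
            tasks.put(id);
        }
        for (int i = 0; i < consumerCount; i++) {
            tasks.put(STOP);
        }
        pool.shutdown();
        pool.awaitTermination(1, TimeUnit.MINUTES);
        return new Result(processed, peak);
    }

    public static void main(String[] args) throws InterruptedException {
        Result r = run(1000, 50, 4);
        System.out.println("processed=" + r.processed.size()
                + " peakInFlight=" + r.peakInFlight);
    }
}
```

With a real broker, the semaphore would be acquired before each publish and released in the reply-queue listener, so the number of unanswered messages never exceeds `maxInFlight` no matter how many millions of IDs are waiting to be loaded.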