如果您愿意重新设计您的架构以适应它的消息传递基础架构,MPI 很可能会很好地解决这个问题。
Theoretically, at least when hosted on a single server, you may be able to do something faster if you wrap your own library, just because you won't have to do the transition into and out of the MPI message structures. That being said, MPI is very efficient (esp. MPI-2, which Open MPI supports), and very, very robust. You'd have a difficult time getting the same flexibility, configurability, and robustness out of your own library.
If you're going to be broadcasting between multiple machines, MPI is probably a better approach than trying to roll your own method.
Also, MPI supports quite a few modes of communication. It does support shared memory for very fast, single machine communication, as well as TCP for inter-machine communication (plus some commercial, faster options).