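I am training an FL model and select 5 clients per round. For each client, I want to get the gap between its number of examples and the largest number of examples held by any of the selected clients. Can the server broadcast the maximum number of examples among the 5 clients to the other clients during that round? Is that legitimate?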
1 Answer
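In TFF, broadcasting additional information is definitely achievable. The API is tff.federated_broadcast, and if you want to extend a federated averaging algorithm with it, you could reuse the simple_fedavg implementation and add the extra broadcast there.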
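To make the data flow concrete, here is a minimal plain-Python sketch of one round. It is not TFF code: the function name `run_round`, the client ids, and the example counts are all made up for illustration. In TFF, the broadcast step would be where tff.federated_broadcast comes in.

```python
from typing import Dict


def run_round(client_example_counts: Dict[str, int]) -> Dict[str, int]:
    """Simulate one round: server takes the max, broadcasts it, clients compute gaps."""
    # Server side: every selected client reports its example count and the
    # server keeps only the maximum.
    max_examples = max(client_example_counts.values())

    # Broadcast step: every selected client receives the same value.
    broadcast = {client_id: max_examples for client_id in client_example_counts}

    # Client side: each client compares the broadcast value with its own count.
    return {
        client_id: broadcast[client_id] - count
        for client_id, count in client_example_counts.items()
    }


# Hypothetical counts for the 5 clients selected in one round.
counts = {
    "client_0": 120,
    "client_1": 80,
    "client_2": 200,
    "client_3": 45,
    "client_4": 160,
}
gaps = run_round(counts)
print(gaps)
```

Note that the client with a gap of 0 learns that it holds the largest dataset in the round, and every other client learns the exact size of some other client's dataset.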
Regarding whether something is an invasion of privacy, it may be useful to ask "what do other participants learn about each other?" and "should the information learned be considered sensitive?". A very strict interpretation of privacy might be "other participants learn nothing" and "all information is sensitive".
We can imagine a scenario where the server picks a maximum number of examples to process, not based on any data from the clients, and broadcasts this number to each client. It seems unlikely that the server or other participants would be able to learn anything about an individual participant, let alone anything sensitive, since the number is not derived from client data.
Alternatively, the server might first learn how many examples each client has, and then broadcast the maximum of those numbers back to all the clients. This is definitely sharing something about one client with all other participants. It might also be sensitive, in particular if each client has a different number of examples, since that number might then be used to uniquely identify a client.
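The difference between the two scenarios can be sketched in plain Python (again not TFF code; the function names, the cap of 256, and the counts are hypothetical). The point is only that a server-chosen cap does not change when a client's data changes, while the observed maximum does.

```python
from typing import List


def broadcast_fixed_cap(client_counts: List[int], cap: int) -> int:
    """Scenario 1: the server picks the value without looking at client data."""
    return cap


def broadcast_observed_max(client_counts: List[int]) -> int:
    """Scenario 2: the server derives the value from what the clients report."""
    return max(client_counts)


# Two hypothetical rounds that differ only in the third client's dataset size.
counts_a = [120, 80, 200, 45, 160]
counts_b = [120, 80, 300, 45, 160]

# The fixed cap is identical in both rounds, so it carries no client information.
fixed_a = broadcast_fixed_cap(counts_a, cap=256)
fixed_b = broadcast_fixed_cap(counts_b, cap=256)

# The observed maximum follows the third client's data, so it leaks that count.
observed_a = broadcast_observed_max(counts_a)
observed_b = broadcast_observed_max(counts_b)

print(fixed_a, fixed_b)
print(observed_a, observed_b)
```

With a fixed cap, clients can still compute a gap relative to the cap, which may be enough if the goal is only to equalize the amount of work per client.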