libraries for multiple machine NN training?

Question

As detailed here, the way to go to break NN training over multiple machines/threads, is decompose training data set on multiple chunks and send to each node, then sum results back in main node.

There is some library who already implements these techniques? Agents to install on each node?

score 3 · Answer 1 · edited Jan 18 '21 at 16:13

3

TensorFlow and PyTorch both support distributed training.

edited Jan 18 '21 at 16:13

Rogelio Triviño

113
4

answered Jan 17 '21 at 21:22

Brian Spiering

21,136
2
26
109

libraries for multiple machine NN training?

1 Answers1