View source on GitHub |
Always do reduction to one device first and then do broadcasting.
Inherits From: CrossDeviceOps
tf.distribute.ReductionToOneDevice( reduce_to_device=None, accumulation_fn=None )
Batch reduction is done by reduction on each element one by one.
mirrored_strategy = tf.distribute.MirroredStrategy( cross_device_ops=tf.distribute.ReductionToOneDevice())
Args | |
---|---|
reduce_to_device | the intermediate device to reduce to. If None, reduce to the first device in destinations of the reduce() method. |
accumulation_fn | a function that does accumulation. If None, then tf.math.add_n is used. |
batch_reduce
batch_reduce( reduce_op, value_destination_pairs, experimental_hints=None )
Reduce PerReplica objects in a batch.
Reduce each first element in value_destination_pairs
to each second element which indicates the destinations.
This can be faster than multiple individual reduce
s because we can fuse several tensors into one or multiple packs before reduction.
Args | |
---|---|
reduce_op | An instance of tf.distribute.ReduceOp that indicates how the per_replica_value will be reduced. |
value_destination_pairs | A list or a tuple of PerReplica objects (or tensors with device set if there is one device) and destinations. |
experimental_hints | A tf.distrbute.experimental.CollectiveHints . Hints to perform collective operations. |
Returns | |
---|---|
a list of Mirrored objects. |
Raises | |
---|---|
ValueError | if value_destination_pairs is not an iterable of tuples of PerReplica objects and destinations. |
broadcast
broadcast( tensor, destinations )
Broadcast the tensor
to destinations.
Args | |
---|---|
tensor | the tensor to broadcast. |
destinations | the broadcast destinations. |
Returns | |
---|---|
a Mirrored object. |
reduce
reduce( reduce_op, per_replica_value, destinations, experimental_hints=None )
Reduce per_replica_value
to destinations
.
It runs the reduction operation defined by reduce_op
and put the result on destinations
.
Args | |
---|---|
reduce_op | An instance of tf.distribute.ReduceOp that indicates how per_replica_value will be reduced. |
per_replica_value | A tf.distribute.DistributedValues object or a tensor with device set. |
destinations | the reduction destinations. |
experimental_hints | A tf.distrbute.experimental.CollectiveHints . Hints to perform collective operations. |
Returns | |
---|---|
a Mirrored object. |
Raises | |
---|---|
ValueError | if per_replica_value can't be converted to a PerReplica object or if destinations aren't strings, Variables or DistributedValues |
© 2020 The TensorFlow Authors. All rights reserved.
Licensed under the Creative Commons Attribution License 3.0.
Code samples licensed under the Apache 2.0 License.
https://www.tensorflow.org/versions/r2.3/api_docs/python/tf/distribute/ReductionToOneDevice