Outputs a tensor containing the reduction across all input tensors.
tf.raw_ops.NcclAllReduce(input, reduction, num_devices, shared_name, name=None)
Outputs a tensor containing the reduction across all input tensors passed to ops that use the same `shared_name`.
The graph should be constructed so that if one op runs with `shared_name` value `c`, then `num_devices` ops will run with `shared_name` value `c`. Failure to do so will cause the graph execution to fail to complete.
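The constraint above can be checked before a graph is built. The following is a minimal sketch in plain Python, not part of TensorFlow: the helper `check_nccl_groups` and its `(shared_name, num_devices)` input format are hypothetical and exist only to illustrate the counting rule.

```python
from collections import Counter


def check_nccl_groups(ops):
    """Check planned all-reduce ops against the num_devices rule.

    `ops` is an iterable of (shared_name, num_devices) pairs, one pair per
    op that is planned to run (a hypothetical format for this sketch).
    Returns a dict mapping each inconsistent shared_name to a description
    of the problem; an empty dict means every group is consistent.
    """
    declared = {}        # shared_name -> num_devices seen first
    counts = Counter()   # shared_name -> number of ops using it
    problems = {}
    for shared_name, num_devices in ops:
        counts[shared_name] += 1
        # All ops in one group must agree on num_devices.
        if declared.setdefault(shared_name, num_devices) != num_devices:
            problems[shared_name] = "ops disagree on num_devices"
    for shared_name, expected in declared.items():
        # Exactly num_devices ops must run with each shared_name.
        if shared_name not in problems and counts[shared_name] != expected:
            problems[shared_name] = (
                f"expected {expected} ops, found {counts[shared_name]}"
            )
    return problems


planned = [("c", 2), ("c", 2), ("d", 3), ("d", 3)]
print(check_nccl_groups(planned))  # {'d': 'expected 3 ops, found 2'}
```

Here group `c` is complete, while group `d` declares three devices but has only two ops, which is the situation the documentation says will prevent graph execution from completing.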
- input: the input to the reduction.
- data: the value of the reduction across all `num_devices` devices.
- reduction: the reduction operation to perform.
- num_devices: the number of devices participating in this reduction.
- shared_name: identifier that is shared between ops of the same reduction.
Args | |
---|---|
input | A Tensor. Must be one of the following types: half, float32, float64, int32, int64. |
reduction | A string from: "min", "max", "prod", "sum". |
num_devices | An int. |
shared_name | A string. |
name | A name for the operation (optional). |
Returns | |
---|---|
A Tensor. Has the same type as input. |
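To make the semantics concrete, here is a minimal pure-Python sketch of what an all-reduce computes. It does not call NCCL or TensorFlow. The function `simulate_all_reduce` is hypothetical, each "tensor" is modeled as a flat list of numbers, and the requirement that all inputs have the same length is an assumption of this sketch rather than something stated above.

```python
from functools import reduce
import operator

# The four reduction strings accepted by the op, mapped to binary functions.
_REDUCERS = {
    "min": min,
    "max": max,
    "prod": operator.mul,
    "sum": operator.add,
}


def simulate_all_reduce(inputs, reduction):
    """Simulate an all-reduce over per-device inputs.

    `inputs` holds one flat list of numbers per participating device, so
    len(inputs) plays the role of num_devices. Returns one output list per
    device; every device receives the same element-wise reduced values.
    """
    if reduction not in _REDUCERS:
        raise ValueError(f"unsupported reduction: {reduction!r}")
    if not inputs:
        raise ValueError("at least one input is required")
    # Assumption of this sketch: all devices contribute equally sized inputs.
    if len({len(values) for values in inputs}) != 1:
        raise ValueError("all inputs must have the same length")
    combine = _REDUCERS[reduction]
    # zip(*inputs) groups the i-th element from every device together.
    reduced = [reduce(combine, column) for column in zip(*inputs)]
    # Each device gets its own copy of the reduced result.
    return [list(reduced) for _ in inputs]


per_device = [[1.0, 2.0], [3.0, 4.0]]
print(simulate_all_reduce(per_device, "sum"))   # [[4.0, 6.0], [4.0, 6.0]]
print(simulate_all_reduce(per_device, "prod"))  # [[3.0, 8.0], [3.0, 8.0]]
```

Returning one copy per device mirrors the fact that every op in the group outputs the full reduction, not just a single designated op.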
© 2020 The TensorFlow Authors. All rights reserved.
Licensed under the Creative Commons Attribution License 3.0.
Code samples licensed under the Apache 2.0 License.
https://www.tensorflow.org/versions/r2.4/api_docs/python/tf/raw_ops/NcclAllReduce