W3cubDocs

tf.raw_ops.UnicodeEncode

Encode a tensor of ints into unicode strings.

View aliases

Compat aliases for migration

tf.compat.v1.raw_ops.UnicodeEncode

tf.raw_ops.UnicodeEncode(
    input_values, input_splits, output_encoding, errors='replace',
    replacement_char=65533, name=None
)

Returns a vector of strings, where output[i] is constructed by encoding the Unicode codepoints in input_values[input_splits[i]:input_splits[i+1]] using output_encoding.

Example:

input_values = [72, 101, 108, 108, 111, 87, 111, 114, 108, 100]
input_splits = [0, 5, 10]
output_encoding = 'UTF-8'

output = ['Hello', 'World']

Args
`input_values`	A `Tensor` of type `int32`. A 1D tensor containing the unicode codepoints that should be encoded.
`input_splits`	A `Tensor`. Must be one of the following types: `int32`, `int64`. A 1D tensor specifying how the unicode codepoints should be split into strings. In particular, `output[i]` is constructed by encoding the codepoints in the slice `input_values[input_splits[i]:input_splits[i+1]]`.
`output_encoding`	A `string` from: `"UTF-8", "UTF-16-BE", "UTF-32-BE"`. Unicode encoding of the output strings. Valid encodings are: `"UTF-8", "UTF-16-BE", and "UTF-32-BE"`.
`errors`	An optional `string` from: `"ignore", "replace", "strict"`. Defaults to `"replace"`. Error handling policy when there is invalid formatting found in the input. The value of 'strict' will cause the operation to produce a InvalidArgument error on any invalid input formatting. A value of 'replace' (the default) will cause the operation to replace any invalid formatting in the input with the `replacement_char` codepoint. A value of 'ignore' will cause the operation to skip any invalid formatting in the input and produce no corresponding output character.
`replacement_char`	An optional `int`. Defaults to `65533`. The replacement character codepoint to be used in place of any invalid formatting in the input when `errors='replace'`. Any valid unicode codepoint may be used. The default value is the default unicode replacement character is 0xFFFD (U+65533).
`name`	A name for the operation (optional).

Returns
A `Tensor` of type `string`.

© 2020 The TensorFlow Authors. All rights reserved.
Licensed under the Creative Commons Attribution License 3.0.
Code samples licensed under the Apache 2.0 License.
https://www.tensorflow.org/versions/r2.4/api_docs/python/tf/raw_ops/UnicodeEncode