Hook for hash_pandas_object.
The default implementation hashes the values returned by _values_for_factorize.
Parameters
encoding : str
    Encoding for data and key when they are strings.
hash_key : str
    Hash key used to encode string keys.
categorize : bool
    Whether to first categorize object arrays before hashing. This is more efficient when the array contains duplicate values.
Examples
>>> pd.array([1, 2])._hash_pandas_object(encoding='utf-8',
...                                      hash_key="1000000000000000",
...                                      categorize=False
...                                      )
array([ 6238072747940578789, 15839785061582574730], dtype=uint64)
© 2008–2011, AQR Capital Management, LLC, Lambda Foundry, Inc. and PyData Development Team
© 2011–2025, Open source contributors
Licensed under the 3-clause BSD License.
https://pandas.pydata.org/pandas-docs/version/2.3.0/reference/api/pandas.api.extensions.ExtensionArray._hash_pandas_object.html