pandas.util.hash_pandas_object

pandas.util.hash_pandas_object(obj, index=True, encoding='utf8', hash_key=None, categorize=True)[source]

Return a data hash of the Index/Series/DataFrame

New in version 0.19.2.

Parameters:
index : boolean, default True

include the index in the hash (if Series/DataFrame)

encoding : string, default ‘utf8’

encoding for data & key when strings

hash_key : string key to encode, default to _default_hash_key
categorize : bool, default True

Whether to first categorize object arrays before hashing. This is more efficient when the array contains duplicate values.

New in version 0.20.0.

Returns:
Series of uint64, same length as the object
Scroll To Top