pandas.factorize

pandas.factorize(values, sort=False, order=None, na_sentinel=-1)

Encode input values as an enumerated type or categorical variable

Parameters :

values : ndarray (1-d)

Sequence

sort : boolean, default False

Sort by values

order : deprecated

na_sentinel: int, default -1

Value to mark “not found”

Returns :

labels : the indexer to the original array

uniques : ndarray (1-d) or Index

the unique values. Index is returned when passed values is Index or Series

note: an array of Periods will ignore sort as it returns an always sorted PeriodIndex