What’s new in 2.0.2 (May 29, 2023)#
These are the changes in pandas 2.0.2. See Release notes for a full changelog including other versions of pandas.
Fixed regressions#
Fixed performance regression in GroupBy.apply() (GH53195)
Fixed regression in merge() on Windows when dtype is np.intc (GH52451)
Fixed regression in read_sql() dropping columns with duplicated column names (GH53117)
Fixed regression in DataFrame.loc() losing MultiIndex name when enlarging object (GH53053)
Fixed regression in DataFrame.to_string() printing a backslash at the end of the first row of data, instead of headers, when the DataFrame doesn’t fit the line width (GH53054)
Fixed regression in MultiIndex.join() returning levels in wrong order (GH53093)
Bug fixes#
Bug in arrays.ArrowExtensionArray incorrectly assigning dict instead of list for .type with pyarrow.map_ and raising a NotImplementedError with pyarrow.struct (GH53328)
Bug in api.interchange.from_dataframe() raising IndexError on empty categorical data (GH53077)
Bug in api.interchange.from_dataframe() returning DataFrames of incorrect sizes when called on slices (GH52824)
Bug in api.interchange.from_dataframe() unnecessarily raising on bitmasks (GH49888)
Bug in merge() when merging on datetime columns with different resolutions (GH53200)
Bug in read_csv() raising OverflowError for engine="pyarrow" and parse_dates set (GH53295)
Bug in to_datetime() inferring format to contain "%H" instead of "%I" if date contained “AM” / “PM” tokens (GH53147)
Bug in DataFrame.convert_dtypes() ignoring convert_* keywords set to False when dtype_backend="pyarrow" (GH52872)
Bug in DataFrame.convert_dtypes() losing timezone for tz-aware dtypes and dtype_backend="pyarrow" (GH53382)
Bug in DataFrame.sort_values() raising for PyArrow dictionary dtype (GH53232)
Bug in Series.describe() treating pyarrow-backed timestamps and timedeltas as categorical data (GH53001)
Bug in Series.rename() not making a lazy copy when Copy-on-Write is enabled and a scalar is passed to it (GH52450)
Bug in pd.array() raising for NumPy array and pa.large_string or pa.large_binary (GH52590)
Bug in DataFrame.__getitem__() not preserving dtypes for MultiIndex partial keys (GH51895)
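The to_datetime() entry above (GH53147) can be illustrated with a short sketch. The sample strings are invented for this example and are not taken from the issue. Because "%H" is a 24-hour directive, a format inferred with it would not be expected to apply the “PM” token, whereas "%I" is the 12-hour directive that pairs with “AM” / “PM”.

```python
import pandas as pd

# Hypothetical 12-hour timestamps with AM/PM tokens and no explicit format,
# so pandas has to infer how to parse them.
raw = ["01/02/2023 01:30 PM", "01/03/2023 11:45 AM"]

parsed = pd.to_datetime(raw)

# With the fix, the PM value should land in the afternoon (hour 13).
hours = parsed.hour.tolist()
print(hours)
```

Passing an explicit format such as "%m/%d/%Y %I:%M %p" sidesteps inference entirely, which is a reasonable choice when the input layout is known in advance.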
Other#
Raised a better error message when calling Series.dt.to_pydatetime() with ArrowDtype with pyarrow.date32 or pyarrow.date64 type (GH52812)
Contributors#
A total of 18 people contributed patches to this release. People with a “+” by their names contributed a patch for the first time.
Gianluca Ficarelli +
Guillaume Lemaitre
Joris Van den Bossche
Julian Badillo +
Luke Manley
Lumberbot (aka Jack) +
Marc Garcia
Marco Edward Gorelli
MarcoGorelli
Matt Richards
Matthew Roeschke
MeeseeksMachine
Pandas Development Team
Patrick Hoefler
Simon Høxbro Hansen +
Thomas Li
Yao Xiao +
dependabot[bot]