Table Of Contents
- What’s New
- v0.22.0 (December 29, 2017)
- v0.21.1 (December 12, 2017)
- v0.21.0 (October 27, 2017)
- New features
- Integration with Apache Parquet file format
infer_objects
type conversion- Improved warnings when attempting to create columns
drop
now also accepts index/columns keywordsrename
,reindex
now also accept axis keywordCategoricalDtype
for specifying categoricalsGroupBy
objects now have apipe
methodCategorical.rename_categories
accepts a dict-like- Other Enhancements
- Backwards incompatible API changes
- Dependencies have increased minimum versions
- Sum/Prod of all-NaN or empty Series/DataFrames is now consistently NaN
- Indexing with a list with missing labels is Deprecated
- NA naming Changes
- Iteration of Series/Index will now return Python scalars
- Indexing with a Boolean Index
PeriodIndex
resampling- Improved error handling during item assignment in pd.eval
- Dtype Conversions
- MultiIndex Constructor with a Single Level
- UTC Localization with Series
- Consistency of Range Functions
- No Automatic Matplotlib Converters
- Other API Changes
- Deprecations
- Removal of prior version deprecations/changes
- Performance Improvements
- Documentation Changes
- Bug Fixes
- New features
- v0.20.3 (July 7, 2017)
- v0.20.2 (June 4, 2017)
- v0.20.1 (May 5, 2017)
- New features
agg
API for DataFrame/Seriesdtype
keyword for data IO.to_datetime()
has gained anorigin
parameter- Groupby Enhancements
- Better support for compressed URLs in
read_csv
- Pickle file I/O now supports compression
- UInt64 Support Improved
- GroupBy on Categoricals
- Table Schema Output
- SciPy sparse matrix from/to SparseDataFrame
- Excel output for styled DataFrames
- IntervalIndex
- Other Enhancements
- Backwards incompatible API changes
- Possible incompatibility for HDF5 formats created with pandas < 0.13.0
- Map on Index types now return other Index types
- Accessing datetime fields of Index now return Index
- pd.unique will now be consistent with extension types
- S3 File Handling
- Partial String Indexing Changes
- Concat of different float dtypes will not automatically upcast
- Pandas Google BigQuery support has moved
- Memory Usage for Index is more Accurate
- DataFrame.sort_index changes
- Groupby Describe Formatting
- Window Binary Corr/Cov operations return a MultiIndex DataFrame
- HDFStore where string comparison
- Index.intersection and inner join now preserve the order of the left Index
- Pivot Table always returns a DataFrame
- Other API Changes
- Reorganization of the library: Privacy Changes
- Deprecations
- Removal of prior version deprecations/changes
- Performance Improvements
- Bug Fixes
- New features
- v0.19.2 (December 24, 2016)
- v0.19.1 (November 3, 2016)
- v0.19.0 (October 2, 2016)
- New features
merge_asof
for asof-style time-series joining.rolling()
is now time-series awareread_csv
has improved support for duplicate column namesread_csv
supports parsingCategorical
directly- Categorical Concatenation
- Semi-Month Offsets
- New Index methods
- Google BigQuery Enhancements
- Fine-grained numpy errstate
get_dummies
now returns integer dtypes- Downcast values to smallest possible dtype in
to_numeric
- pandas development API
- Other enhancements
- API changes
Series.tolist()
will now return Python typesSeries
operators for different indexesSeries
type promotion on assignment.to_datetime()
changes- Merging changes
.describe()
changesPeriod
changes- Index
+
/-
no longer used for set operations Index.difference
and.symmetric_difference
changesIndex.unique
consistently returnsIndex
MultiIndex
constructors,groupby
andset_index
preserve categorical dtypesread_csv
will progressively enumerate chunks- Sparse Changes
- Indexer dtype changes
- Other API Changes
- Deprecations
- Removal of prior version deprecations/changes
- Performance Improvements
- Bug Fixes
- New features
- v0.18.1 (May 3, 2016)
- v0.18.0 (March 13, 2016)
- New features
- Window functions are now methods
- Changes to rename
- Range Index
- Changes to str.extract
- Addition of str.extractall
- Changes to str.cat
- Datetimelike rounding
- Formatting of Integers in FloatIndex
- Changes to dtype assignment behaviors
- to_xarray
- Latex Representation
pd.read_sas()
changes- Other enhancements
- Backwards incompatible API changes
- Performance Improvements
- Bug Fixes
- New features
- v0.17.1 (November 21, 2015)
- v0.17.0 (October 9, 2015)
- New features
- Datetime with TZ
- Releasing the GIL
- Plot submethods
- Additional methods for
dt
accessor - Period Frequency Enhancement
- Support for SAS XPORT files
- Support for Math Functions in .eval()
- Changes to Excel with
MultiIndex
- Google BigQuery Enhancements
- Display Alignment with Unicode East Asian Width
- Other enhancements
- Backwards incompatible API changes
- Changes to sorting API
- Changes to to_datetime and to_timedelta
- Changes to Index Comparisons
- Changes to Boolean Comparisons vs. None
- HDFStore dropna behavior
- Changes to
display.precision
option - Changes to
Categorical.unique
- Changes to
bool
passed asheader
in Parsers - Other API Changes
- Deprecations
- Removal of prior version deprecations/changes
- Performance Improvements
- Bug Fixes
- New features
- v0.16.2 (June 12, 2015)
- v0.16.1 (May 11, 2015)
- v0.16.0 (March 22, 2015)
- v0.15.2 (December 12, 2014)
- v0.15.1 (November 9, 2014)
- v0.15.0 (October 18, 2014)
- v0.14.1 (July 11, 2014)
- v0.14.0 (May 31 , 2014)
- v0.13.1 (February 3, 2014)
- v0.13.0 (January 3, 2014)
- v0.12.0 (July 24, 2013)
- v0.11.0 (April 22, 2013)
- v0.10.1 (January 22, 2013)
- v0.10.0 (December 17, 2012)
- v0.9.1 (November 14, 2012)
- v0.9.0 (October 7, 2012)
- v0.8.1 (July 22, 2012)
- v0.8.0 (June 29, 2012)
- v.0.7.3 (April 12, 2012)
- v.0.7.2 (March 16, 2012)
- v.0.7.1 (February 29, 2012)
- v.0.7.0 (February 9, 2012)
- v.0.6.1 (December 13, 2011)
- v.0.6.0 (November 25, 2011)
- v.0.5.0 (October 24, 2011)
- v.0.4.3 through v0.4.1 (September 25 - October 9, 2011)
- Installation
- Contributing to pandas
- Package overview
- 10 Minutes to pandas
- Tutorials
- Cookbook
- Intro to Data Structures
- Essential Basic Functionality
- Working with Text Data
- Options and Settings
- Indexing and Selecting Data
- MultiIndex / Advanced Indexing
- Computational tools
- Working with missing data
- Group By: split-apply-combine
- Merge, join, and concatenate
- Reshaping and Pivot Tables
- Time Series / Date functionality
- Time Deltas
- Categorical Data
- Visualization
- Styling
- IO Tools (Text, CSV, HDF5, ...)
- Remote Data Access
- Enhancing Performance
- Sparse data structures
- Frequently Asked Questions (FAQ)
- rpy2 / R interface
- pandas Ecosystem
- Comparison with R / R libraries
- Comparison with SQL
- Comparison with SAS
- API Reference
- Developer
- Internals
- Release Notes
Search
Enter search terms or a module, class or function name.
What’s New¶
These are new features and improvements of note in each release.
v0.22.0 (December 29, 2017)¶
This is a major release from 0.21.1 and includes a single, API-breaking change. We recommend that all users upgrade to this version after carefully reading the release note (singular!).
Backwards incompatible API changes¶
Pandas 0.22.0 changes the handling of empty and all-NA sums and products. The summary is that
- The sum of an empty or all-NA
Series
is now0
- The product of an empty or all-NA
Series
is now1
- We’ve added a
min_count
parameter to.sum()
and.prod()
controlling the minimum number of valid values for the result to be valid. If fewer thanmin_count
non-NA values are present, the result is NA. The default is0
. To returnNaN
, the 0.21 behavior, usemin_count=1
.
Some background: In pandas 0.21, we fixed a long-standing inconsistency
in the return value of all-NA series depending on whether or not bottleneck
was installed. See Sum/Prod of all-NaN or empty Series/DataFrames is now consistently NaN. At the same
time, we changed the sum and prod of an empty Series
to also be NaN
.
Based on feedback, we’ve partially reverted those changes.
Arithmetic Operations¶
The default sum for empty or all-NA Series
is now 0
.
pandas 0.21.x
In [1]: pd.Series([]).sum()
Out[1]: nan
In [2]: pd.Series([np.nan]).sum()
Out[2]: nan
pandas 0.22.0
In [1]: pd.Series([]).sum()
Out[1]: 0.0
In [2]: pd.Series([np.nan]).sum()