pandas.read_parquet

pandas.read_parquet(path, engine='auto', columns=None, **kwargs)[source]

Load a parquet object from the file path, returning a DataFrame.

Parameters:
path : str, path object or file-like object

Any valid string path is acceptable. The string could be a URL. Valid URL schemes include http, ftp, s3, and file. For file URLs, a host is expected. A local file could be: file://localhost/path/to/table.parquet.

If you want to pass in a path object, pandas accepts any os.PathLike.

By file-like object, we refer to objects with a read() method, such as a file handler (e.g. via builtin open function) or StringIO.

engine : {‘auto’, ‘pyarrow’, ‘fastparquet’}, default ‘auto’

Parquet library to use. If ‘auto’, then the option io.parquet.engine is used. The default io.parquet.engine behavior is to try ‘pyarrow’, falling back to ‘fastparquet’ if ‘pyarrow’ is unavailable.

columns : list, default=None

If not None, only these columns will be read from the file.

**kwargs

Any additional kwargs are passed to the engine.

Returns:
DataFrame
Scroll To Top