pandas.
read_parquet
Load a parquet object from the file path, returning a DataFrame.
Any valid string path is acceptable. The string could be a URL. Valid URL schemes include http, ftp, s3, and file. For file URLs, a host is expected. A local file could be: file://localhost/path/to/table.parquet. A file URL can also be a path to a directory that contains multiple partitioned parquet files. Both pyarrow and fastparquet support paths to directories as well as file URLs. A directory path could be: file://localhost/path/to/tables or s3://bucket/partition_dir
file://localhost/path/to/table.parquet
file://localhost/path/to/tables
s3://bucket/partition_dir
If you want to pass in a path object, pandas accepts any os.PathLike.
os.PathLike
By file-like object, we refer to objects with a read() method, such as a file handler (e.g. via builtin open function) or StringIO.
read()
open
StringIO
Parquet library to use. If ‘auto’, then the option io.parquet.engine is used. The default io.parquet.engine behavior is to try ‘pyarrow’, falling back to ‘fastparquet’ if ‘pyarrow’ is unavailable.
io.parquet.engine
If not None, only these columns will be read from the file.
Any additional kwargs are passed to the engine.