I am trying to read a huge parquet dataset which is partitioned. Is it possible to use ray.data.read_parquet to read this dataset only to read a few selected partitions?
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Ray Dataset Cannot Read Parquet File | 1 | 673 | August 1, 2022 | |
|
Why isn't `ray.data.read_api._get_reader` parallelized?
|
0 | 199 | December 5, 2023 | |
|
Ray Data read Parquet loads all the data in one go
|
4 | 643 | October 21, 2023 | |
|
Hive Partitioned Datasets
|
0 | 472 | July 3, 2023 | |
| Data loading of parquet files is very memory consuming | 2 | 1460 | June 21, 2022 |