Single node, 4x GPU, map_batches only using 1
|
|
3
|
688
|
October 5, 2023
|
Writing one file for each block
|
|
6
|
423
|
October 5, 2023
|
Benchmarks for Ray Data?
|
|
13
|
1021
|
October 5, 2023
|
ray.data.Dataset.add_column / Ray 2.7
|
|
2
|
579
|
September 29, 2023
|
Ray data.read_csv keeps pausing
|
|
3
|
405
|
September 28, 2023
|
Cannot pickle '_thread.lock' object
|
|
2
|
2338
|
September 26, 2023
|
Implementation of sort is not optimal
|
|
1
|
300
|
September 20, 2023
|
Ray fails to pickle preprocess function
|
|
1
|
444
|
August 22, 2023
|
Use list of DAGNode objects to create Dataset
|
|
2
|
335
|
August 20, 2023
|
Ray data experience OOM issue during write_csv or write_parquet
|
|
2
|
496
|
August 2, 2023
|
Does ray dataset support ORC format? Would we support it?
|
|
5
|
356
|
July 31, 2023
|
Understanding distributed data loading and training xgboost ray
|
|
10
|
961
|
July 19, 2023
|
Does xgboost ray supports multi-output, many y labels?
|
|
3
|
475
|
July 17, 2023
|
Recommendational steps for processing big data?
|
|
1
|
559
|
July 7, 2023
|
Java to Python data transfer
|
|
1
|
354
|
July 7, 2023
|
Directory structure dataset help
|
|
0
|
351
|
July 4, 2023
|
Ray datasets streaming block split?
|
|
1
|
644
|
June 27, 2023
|
All tasks in PENDING_NODE_ASSIGNMENT but workers' CPUs are busy
|
|
1
|
377
|
June 14, 2023
|
Error: Can't get attribute 'FromItems'
|
|
1
|
339
|
June 9, 2023
|
Error in HuggingFaceTrainer (Transoformer) v2.4.0
|
|
6
|
826
|
June 9, 2023
|
Unexpected behavior when using generators with Ray Dataset
|
|
1
|
446
|
June 7, 2023
|
How to deal with labeled image datasets?
|
|
11
|
653
|
May 31, 2023
|
Ray Data streaming not streaming smoothly
|
|
8
|
755
|
May 30, 2023
|
Does `ray.data.Dataset.iter_batches` guarantee order of the original file?
|
|
5
|
491
|
May 29, 2023
|
Proper workflow to read local parquet file and use it on remote worker?
|
|
13
|
1467
|
May 24, 2023
|
Custom FileBasedDatasource requires dataset_uuid
|
|
0
|
283
|
May 23, 2023
|
Dataset write_csv AttributeError: 'Worker' object has no attribute 'core_worker'
|
|
2
|
1273
|
May 19, 2023
|
Read parquet files with wildcard/glob similar to Dask DataFrame?
|
|
4
|
1196
|
May 12, 2023
|
How can I set up numpy seed when doing map_batches?
|
|
1
|
306
|
May 4, 2023
|
KeyError: ‘Field “xxxxxyyyyy.png” does not exist in table schema’
|
|
3
|
519
|
April 20, 2023
|