How severe does this issue affect your experience of using Ray?
- Low: It annoys or frustrates me for a moment.
I follow the instructions in dolly_lightning_fsdp_finetuning. After training job started, I got the log message:
(RayTrainWorker pid=863426)
Epoch 0: | | 3/? [00:22<00:00, 0.13it/s, v_num=0, train_loss=3.090]
(RayTrainWorker pid=863426)
Epoch 0: | | 3/? [00:22<00:00, 0.13it/s, v_num=0, train_loss=0.370]
In the message, the total steps is equal to ?
which means unable to get it. So I can’t estimate how long the train job will take. However in the tutorial above shows the expect message:
Epoch 0: 26%|██▌ | 35/134 [10:34<29:54, 18.13s/it, v_num=0, train_loss=0.359]
Epoch 0: 26%|██▌ | 35/134 [10:34<29:54, 18.13s/it, v_num=0, train_loss=0.405]
Epoch 0: 27%|██▋ | 36/134 [10:52<29:35, 18.12s/it, v_num=0, train_loss=0.405]
Any help is welcome, thanks!