After running Ray for a long time, a worker is reported as killed

I am running parallel training. There are no problems in the early stage, but after about 2 days of training an error message appears. The error message is as follows. How can I fix this?