Loading RLlib checkpoints on Google Colab

GallantWood · February 9, 2022, 6:46am

I have some saved checkpoints which I am able to load and test locally. However, when I load these same checkpoints on Google Colab I receive the following error message:

Colab is fixed using Python 3.7 and the error occurs with Ray v1.8, v1.9 and v1.10. The source of the above error is not clear to me. Any suggestions would be greatly appreciated.

sven1977 · February 9, 2022, 8:35am

Yeah, indeed looks strange. Maybe a pickle version issue?
Could you send us more information so we can debug this? Like a small repro script that we could run in colab under the given python and ray versions?

GallantWood · February 17, 2022, 2:06am

@sven1977
Thank you for the quick reply. I have built a reproducible example after trying to understand the problem better. Here is the central issue. I am in a position where I need to analyse checkpoints on Colab (Python = 3.7, Ray = 1.9/1.10) which have been trained and saved with Python 3.8 and Ray 1.9/1.10.

I have attached the most basic cartpole example. If you run cartpole_train with >= Python 3.8 and then run cartpole_test inside Colab, you will receive the error.

NB. I have tried installing pickle5 and “import pickle5 as pickle” in Colab to no avail.

GallantWood · February 28, 2022, 8:26am

@sven1977
Is there any update on this? There is a similar issue here

github.com/RedisGears/redisgears-py

`code() takes at most 15 arguments` when running any command.

opened 08:44PM - 29 Sep 20 UTC

scuml

Getting an error no matter what I run or register through gearsclient. Even n…oop code like: ``` gb = GearsBuilder('KeysOnlyReader', r=conn) gb.run() ``` returns ``` Traceback (most recent call last): File "<input>", line 1, in <module> gb3.run() File "/Users/<user>/Library/Caches/pypoetry/virtualenvs/gears-5k_d3sYO-py3.8/lib/python3.8/site-packages/gearsclient/redisgears_builder.py", line 314, in run ''' % selfBytes File "/Users/<user>/Library/Caches/pypoetry/virtualenvs/gears-5k_d3sYO-py3.8/lib/python3.8/site-packages/redis/client.py", line 901, in execute_command return self.parse_response(conn, command_name, **options) File "/Users/<user>/Library/Caches/pypoetry/virtualenvs/gears-5k_d3sYO-py3.8/lib/python3.8/site-packages/redis/client.py", line 915, in parse_response response = connection.read_response() File "/Users/<user>/Library/Caches/pypoetry/virtualenvs/gears-5k_d3sYO-py3.8/lib/python3.8/site-packages/redis/connection.py", line 756, in read_response raise response redis.exceptions.ResponseError: ['Traceback (most recent call last):\n', ' File "<string>", line 3, in <module>\n', 'TypeError: code() takes at most 15 arguments (16 given)\n'] ``` `conn.execute_command("RG.PYEXECUTE", "GB().run()")`. works fine. Running on python 3.8 on Mac OS 10.15.5 connecting to current RedisGears docker image. I did manually change .dumps() in the library to use pickle protocol 4 since python 3.8 defaults to protocol 5 which doesn't exist in the RedisGear docker image using python 3.7. eg: `selfBytes = cloudpickle.dumps(self.pipe, protocol=4)`

corresponding to pickle incompatibility between Python 3.7 (Colab) and Python 3.8. However, all my attempts to upgrade Colab to use pickle5 failed.

Topic		Replies	Views
Custon Env works in local PC but not in Google Colab RLlib	2	139	February 21, 2024
Pickle Error When Restoring RLLIB Checkpoint RLlib	0	429	August 1, 2022
Weights and Biases with RLLIB error RLlib	2	239	May 26, 2023
Loading pickle file in Mac after writing in linux causing issues	0	200	April 25, 2024
ModuleNotFoundError: No module named 'run_unittests' RLlib	2	756	February 12, 2023

Loading RLlib checkpoints on Google Colab

Related topics