Variable-Sharing between policies

Hello,

I’m trying to share layers of my custom ModelV2 NN model between two policies.
I tried the approach described in the documentation https://docs.ray.io/en/master/rllib-env.html#variable-sharing-between-policies, which says I can just put layers in global variables and directly share those layer objects between policy models.

But already at the first connection between a local layer and a globally shared layer, I get this error:
ValueError: Tensor("hoist1/dense_self_hoist/kernel/Read/ReadVariableOp:0", shape=(512, 256), dtype=float32) must be from the same graph as Tensor("hoist1/embedded_self_hoist/Relu:0", shape=(?, ?, 1, 512), dtype=float32) (graphs are <tensorflow.python.framework.ops.Graph object at 0x7f1e3033d9d0> and <tensorflow.python.framework.ops.Graph object at 0x7f1d2ef7f0a0>).

What am I doing wrong? I share some layers globally in the manner shown in the example:
https://github.com/ray-project/ray/blob/ef944bc5f0d7764cd99d50500e470eac005a3d01/rllib/examples/models/shared_weights_model.py#L20
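
For context, the pattern from that example looks roughly like this (a minimal sketch; the layer sizes and function names here are mine, not from my actual model):

```python
import tensorflow as tf

# A layer object created at module level and called from each policy's
# model, so both route through the same weights (the pattern from the
# shared_weights_model.py example above).
SHARED_DENSE = tf.keras.layers.Dense(256, activation=tf.nn.relu,
                                     name="shared_dense")

def build_policy_net(obs_dim, num_outputs, name):
    inputs = tf.keras.layers.Input(shape=(obs_dim,))
    hidden = SHARED_DENSE(inputs)  # same layer object -> shared variables
    logits = tf.keras.layers.Dense(num_outputs)(hidden)
    return tf.keras.Model(inputs, logits, name=name)

# Both nets now share SHARED_DENSE's kernel and bias.
net_a = build_policy_net(obs_dim=4, num_outputs=2, name="policy_a")
net_b = build_policy_net(obs_dim=4, num_outputs=2, name="policy_b")
```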

I forgot to set framework='tf2', which seems to be mandatory for sharing layers as global variables between models.
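
For reference, this is the relevant config bit (the rest of the config is elided):

```python
config = {
    # ... the rest of the PPO config ...
    "framework": "tf2",  # eager mode; required for sharing layer objects
}
```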
But after fixing that, another ValueError occurs:

ValueError: tf.enable_eager_execution must be called at program startup

I have opened an issue, see “ValueError: tf.enable_eager_execution must be called at program startup” (Issue #14533 · ray-project/ray · GitHub).

Thanks for this issue @klausk55 and for the repro script! Yeah, sharing layers like this is only possible in torch or tf-eager. In static-graph tf, you would have to do this via (old-school) tf scopes (and var re-uses).
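
For illustration, the scope-based sharing pattern looks roughly like this (a sketch with made-up names; it requires both models to be built in the same graph):

```python
import tensorflow.compat.v1 as tf1

def shared_trunk(obs):
    # With AUTO_REUSE, the first call creates the variables; every later
    # call with the same scope and layer name reuses them instead of
    # creating duplicates.
    with tf1.variable_scope("shared_trunk", reuse=tf1.AUTO_REUSE):
        return tf1.layers.dense(obs, 256, activation=tf1.nn.relu,
                                name="fc_shared")
```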
I’ll take a look …

Okay @sven1977, will you take a look at why it works when using the default (results = tune.run("PPO", stop=stop, config=config, verbose=1)) in that example?

As you said, for now I have to use the old-school approach to run the PPOTrainer without including Tune.
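
For completeness, running the trainer directly (without Tune) looks roughly like this (a sketch; env and config are placeholders for the actual repro setup):

```python
import ray
from ray.rllib.agents.ppo import PPOTrainer

ray.init()
config = {
    "env": "CartPole-v0",  # placeholder for the actual multi-agent env
    "framework": "tf",     # static-graph tf, matching the scope workaround
}
trainer = PPOTrainer(config=config)
for i in range(10):
    result = trainer.train()
    print(i, result["episode_reward_mean"])
```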