How severely does this issue affect your experience of using Ray?
- High: It blocks me from completing my task.
I am relatively new to RLlib. I have restored a checkpoint and saved my (non-RNN) custom DQN model for inference using export_model. This produces a TorchScript model (model.pt). One subtlety is that the exported model expects actual tensor inputs for (state, seq_lens), so passing ([], None) throws an error. However, once appropriate dummy inputs are provided, the forward pass runs.
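For reference, the export step looked roughly like this (the checkpoint and export paths are placeholders, and the exact restore API may differ between Ray versions):

```python
from ray.rllib.algorithms.algorithm import Algorithm

# Restore the trained DQN algorithm from a checkpoint (path is a placeholder).
algo = Algorithm.from_checkpoint("/path/to/checkpoint")

# Export the default policy's torch model; this writes model.pt (TorchScript)
# into the given directory.
algo.get_policy().export_model("/tmp/exported_dqn_model")
```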
The problem is that the code:
```python
import numpy as np
import torch

# Load the TorchScript model produced by export_model.
model = torch.jit.load(model_path)
model.eval()

# Dummy observation batch matching the model's input shape.
obs = torch.rand((1, 128, 111))
obs_dict = {"obs": obs}

# Dummy (non-empty) state and seq_lens; the scripted forward rejects ([], None).
state = [torch.from_numpy(np.array([1.0]))]
seq_lens = torch.from_numpy(np.ones((1,)))

model(obs_dict, state, seq_lens)
```
returns an output of the wrong shape, in this case 256 rather than the model's num_outputs. Can anyone point me towards a solution?
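Concretely, I am unpacking the result like this (the two return values follow the Tuple[Tensor, List[Tensor]] signature shown in model.code below):

```python
# The scripted forward returns (output, state_out).
out, state_out = model(obs_dict, state, seq_lens)
print(out.shape)  # last dimension is 256 here, not the expected num_outputs
```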
If it helps, here are the final layers in print(model):
```
(to_logits): RecursiveScriptModule(
  original_name=Sequential
  (0): RecursiveScriptModule(original_name=LayerNorm)
  (1): RecursiveScriptModule(original_name=Linear)
)
(value_branch): RecursiveScriptModule(
  original_name=Sequential
  (0): RecursiveScriptModule(original_name=LayerNorm)
  (1): RecursiveScriptModule(original_name=Linear)
)
(advantage_module): RecursiveScriptModule(
  original_name=Sequential
  (dueling_A_0): RecursiveScriptModule(
    original_name=SlimFC
    (_model): RecursiveScriptModule(
      original_name=Sequential
      (0): RecursiveScriptModule(original_name=Linear)
      (1): RecursiveScriptModule(original_name=ReLU)
    )
  )
  (A): RecursiveScriptModule(
    original_name=SlimFC
    (_model): RecursiveScriptModule(
      original_name=Sequential
      (0): RecursiveScriptModule(original_name=Linear)
    )
  )
)
```
and this is the output of print(model.code):
```python
def forward(self,
    input_dict: Dict[str, Tensor],
    state: List[Tensor],
    seq_lens: Tensor) -> Tuple[Tensor, List[Tensor]]:
  to_logits = self.to_logits
  hidden_layers = self.hidden_layers
  obs = input_dict["obs"]
  _0, = state
  _1 = (to_logits).forward((hidden_layers).forward(obs, ), )
  return (_1, [_0])
```
Since this model is meant for inference, shouldn't the output of
model(obs_dict, state, seq_lens)
have shape num_outputs, where (I assume) the argmax corresponds exactly to policy.compute_single_action(obs)[0]?
In other words, how can I get this single action output from a forward pass of the saved TorchScript model?
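To make the goal concrete, this is roughly the equivalence I am hoping for; it assumes the forward output can be treated as per-action values (which may be exactly the wrong assumption here), ignores exact batching/dtype details, and `policy` is the policy object restored from the same checkpoint:

```python
import torch

# Deterministic action from the restored RLlib policy, for comparison.
action_from_policy = policy.compute_single_action(obs, explore=False)[0]

# What I hoped the exported TorchScript model would give me: per-action
# values whose argmax is the greedy action.
out, _ = model(obs_dict, state, seq_lens)
action_from_torchscript = torch.argmax(out, dim=-1)

# Ideally these two would match.
```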