Resource allocation during Serve deployment

Hi,
I have one model that is around 1 GB,
and a single GPU with 16 GB of memory.
In the Serve deployment I set num_replicas=4 and ray_actor_options={"num_cpus": 2, "num_gpus": 0.25}.
However, I do not see any improvement when I send HTTP requests to this deployment.
Could you tell me what is going wrong?
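
For reference, here is a rough sketch of what the settings above add up to. The option keys mirror the Ray Serve deployment options I set; the variable names and constants are only illustrative, and the memory figure assumes each replica loads its own copy of the model.

```python
# Settings as described in the post (keys mirror Ray Serve deployment options).
deployment_options = {
    "num_replicas": 4,
    "ray_actor_options": {"num_cpus": 2, "num_gpus": 0.25},
}

GPU_MEMORY_GB = 16  # the single GPU on the machine
MODEL_SIZE_GB = 1   # approximate size of the model

replicas = deployment_options["num_replicas"]
actor = deployment_options["ray_actor_options"]

# Totals requested across all replicas.
total_gpus = replicas * actor["num_gpus"]   # 4 * 0.25 GPU
total_cpus = replicas * actor["num_cpus"]   # 4 * 2 CPUs
# Assumption: one model copy per replica.
model_memory_gb = replicas * MODEL_SIZE_GB

print(f"GPUs requested: {total_gpus}")
print(f"CPUs requested: {total_cpus}")
print(f"Model memory: {model_memory_gb} GB of {GPU_MEMORY_GB} GB")
```

So the four replicas together request the whole GPU and 8 CPUs, and the model copies alone should fit in GPU memory with room to spare.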