Hi all,
I’m building a data processing pipeline, and one of my transformations uses `numpy.random`.
How should I set the NumPy seed? Should I set it inside the function I pass to `map_batches`?
If I set it outside, could I run into a race condition?
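
To make the question concrete, here is a minimal sketch of the two options I have in mind. It uses plain NumPy only and leaves the pipeline itself out. The function names `add_noise_global` and `add_noise_local`, the `"x"` column, and the assumption that each batch arrives as a dict of NumPy arrays are all hypothetical, just for illustration.

```python
import numpy as np

# Option A: seed the process-wide NumPy state once, outside the batch function.
np.random.seed(0)

def add_noise_global(batch):
    # Draws from the global numpy.random state, which every caller
    # in the same process shares.
    noise = np.random.normal(size=len(batch["x"]))
    return {"x": batch["x"] + noise}

# Option B: create a seeded generator inside the batch function.
def add_noise_local(batch, seed=0):
    # The generator is private to this call, so nothing is shared.
    rng = np.random.default_rng(seed)
    noise = rng.normal(size=len(batch["x"]))
    return {"x": batch["x"] + noise}
```

As far as I can tell, option B with a fixed seed gives every batch the same draws, so I would presumably have to vary the seed per batch somehow.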
Thanks in advance!