Learning about shared memory

crackcomm · March 7, 2023, 6:57pm

Hi all, I’m trying to learn more about Ray and how it shares memory between processes. I know it used Plasma, but from what I’ve heard that is now deprecated. Where can I learn the details of how Ray allocates shared memory? I’m interested in the most up-to-date information. I tried looking at the Python code, but there are many layers of abstraction that obstruct the view. If someone could point me to the specific part of the Python and/or C++ code where the allocation happens, I would greatly appreciate it. I’m only interested in how it works on Linux.

Thank you all.

rickyyx · March 7, 2023, 10:21pm

Hey @crackcomm welcome to the community!

In short, the Ray plasma store uses mmapped files to share memory between processes.

I think these are the most relevant source files for shared memory allocation in the plasma store:

ObjectStore: for managing the allocated objects
PlasmaAllocator: for the actual memory allocation for objects.
dlmalloc: a fork of dlmalloc for efficient malloc on the mapped files.
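
To make the "mmapped files for shared memory" idea concrete, here is a minimal sketch using only the Python standard library on Linux. It is not Ray's code: the function name `roundtrip_through_mmap` and the use of a temporary file are assumptions made for illustration. It only shows that a write made by one process into a shared file mapping is visible to another process that holds the same mapping.

```python
import mmap
import os
import tempfile


def roundtrip_through_mmap(payload: bytes, size: int = 4096) -> bytes:
    """Have a child process write into a shared mapping; read it in the parent.

    Hypothetical helper for illustration only, not part of Ray or Plasma.
    """
    with tempfile.TemporaryFile() as f:
        # The backing file must be at least as large as the mapping.
        f.truncate(size)
        # On Unix, mmap.mmap defaults to MAP_SHARED, so writes go to pages
        # shared by every process that maps this file.
        shared = mmap.mmap(f.fileno(), size)
        pid = os.fork()
        if pid == 0:
            # Child: write the payload into the shared pages and exit
            # immediately, skipping the parent's cleanup code.
            shared[: len(payload)] = payload
            os._exit(0)
        # Parent: wait for the child, then read what it wrote.
        os.waitpid(pid, 0)
        data = bytes(shared[: len(payload)])
        shared.close()
        return data


if __name__ == "__main__":
    print(roundtrip_through_mmap(b"hello from the child"))
```

No bytes are sent over a pipe or socket here; both processes simply address the same pages, which is the property an object store built on mmapped files relies on.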

crackcomm · March 8, 2023, 8:29am

Thank you for your answer. I have even more questions now.

It seems like a much more complex memory management system than I had imagined.

My main question though is why does dlmalloc use malloc on the mapped files? Maybe you could refer me to some resource where I can read about it.

You referenced the plasma allocator. From what I gathered, it has its origins in the Ray project and was then part of the Arrow project, but is now deprecated. Is plasma only deprecated in the context of Arrow and still a big part of Ray? It would be interesting to see what replaced it in the Arrow project.

I very much appreciate your time. If you are open to it, I have more questions.

rickyyx · March 8, 2023, 8:53pm

My main question though is why does dlmalloc use malloc on the mapped files? Maybe you could refer me to some resource where I can read about it.

Hmm, where do you see malloc being used on the mapped files? Could you provide a pointer to it?

From what I gathered, it has its origins in the Ray project and was then part of the Arrow project, but is now deprecated. Is plasma only deprecated in the context of Arrow and still a big part of Ray? It would be interesting to see what replaced it in the Arrow project.

Probably more of a maintenance issue: [ARROW-17860] [Plasma] Deprecate Plasma - ASF JIRA

crackcomm · March 9, 2023, 1:26am

I took it from your answer. I don’t really understand it.

rickyyx · March 9, 2023, 8:32pm

Oh I see, sorry about the confusion. We used the dlmalloc implementation to provide a malloc interface for managing memory on mmapped files.
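
As a rough illustration of what "a malloc interface on top of a mapped region" means, here is a toy first-fit allocator over a single mapping. This is deliberately not dlmalloc's algorithm and not Ray's PlasmaAllocator: the class `ToyArena`, its methods, and the use of an anonymous mapping in place of a file-backed one are all assumptions made for this sketch. The point is only that the allocator hands out offsets inside memory that was mapped once, up front, rather than requesting fresh memory from the OS for every object.

```python
import mmap


class ToyArena:
    """Toy first-fit allocator over one mapped region (illustration only)."""

    def __init__(self, capacity: int) -> None:
        # Anonymous mapping standing in for a file-backed shared mapping.
        self.buf = mmap.mmap(-1, capacity)
        # Free space is tracked as a sorted list of (offset, size) holes.
        self.holes = [(0, capacity)]

    def malloc(self, size: int) -> int:
        """Return the offset of a block of `size` bytes inside the mapping."""
        for i, (offset, hole_size) in enumerate(self.holes):
            if hole_size >= size:
                if hole_size == size:
                    del self.holes[i]
                else:
                    # Carve the block off the front of the hole.
                    self.holes[i] = (offset + size, hole_size - size)
                return offset
        raise MemoryError(f"no hole large enough for {size} bytes")

    def free(self, offset: int, size: int) -> None:
        """Return a block to the arena (no coalescing in this toy version)."""
        self.holes.append((offset, size))
        self.holes.sort()


if __name__ == "__main__":
    arena = ToyArena(64)
    a = arena.malloc(16)
    arena.buf[a : a + 5] = b"hello"
    print(a, bytes(arena.buf[a : a + 5]))
    arena.free(a, 16)
```

A real allocator such as dlmalloc also keeps its bookkeeping compact, merges neighbouring free blocks, and picks blocks with much smarter strategies; none of that is modelled here.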

crackcomm · March 10, 2023, 2:17pm

I now understand the plasma implementation. Thank you for your support. I will mark your first response as the solution.
