
Properly Implement brk/sbrk and mmap #4

Closed
yzhang71 opened this issue Jul 23, 2024 · 8 comments

Comments

@yzhang71
Contributor

We implemented brk/sbrk in glibc (userspace) by over-allocating to a page-aligned address and exposing a pseudo-break to the caller. This makes malloc fully functional for small chunks using sbrk. For larger chunks, malloc triggers the mmap path, which is not yet handled. Next, we should move brk/sbrk into the runtime as syscalls and guard the pseudo-break with a mutex to prevent race conditions.

We also need to properly handle mmap. The WASI-libc implementation of mmap is an emulation on top of malloc, which, after discussion with Nick and Coulson, is not acceptable for us. Therefore, we need to handle it in the runtime. For now, I have used an ad-hoc solution based on malloc as well.
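A minimal sketch of the pseudo-break scheme described above, assuming clang's `__builtin_wasm_memory_grow` builtin for growing linear memory; the variable names are invented and initialization of the break to the end of the data segment is elided, so this illustrates the over-allocation and the mutex rather than the actual lind-wasm glibc code:

```c
#include <errno.h>
#include <pthread.h>
#include <stddef.h>
#include <stdint.h>

#define WASM_PAGE 65536  /* wasm linear memory page size */

static uintptr_t pseudo_break;  /* break value exposed to callers (hypothetical) */
static uintptr_t real_break;    /* page-aligned end of grown memory */
static pthread_mutex_t brk_lock = PTHREAD_MUTEX_INITIALIZER;

void *sbrk(intptr_t increment) {
    pthread_mutex_lock(&brk_lock);
    uintptr_t old = pseudo_break;
    uintptr_t want = old + increment;
    if (want > real_break) {
        /* Over-allocate to a page boundary so small requests don't
         * grow linear memory on every call. */
        size_t pages = (want - real_break + WASM_PAGE - 1) / WASM_PAGE;
        if (__builtin_wasm_memory_grow(0, pages) == (size_t)-1) {
            pthread_mutex_unlock(&brk_lock);
            errno = ENOMEM;
            return (void *)-1;
        }
        real_break += (uintptr_t)pages * WASM_PAGE;
    }
    pseudo_break = want;
    pthread_mutex_unlock(&brk_lock);
    return (void *)old;
}
```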

@rennergade
Contributor

My opinion here:

  1. Eventually our goal should be to move all the internals for mmap/brk out of glibc. The question then is: where should they go? For both, we'll have to manipulate the linear memory allocation. I think the most intuitive way is to add some functionality to wasmtime to do this, though that works against our goal of portability. We'll eventually have to add some runtime-specific functionality for fork/exec as well, and I'm not sure there's any way around that. I think our best option is probably a single file of runtime-specific implementations for all of these, behind a general API that can eventually be extended to other runtimes.

  2. We need to properly manage memory within each wasm instance so that we can manage mmap and brk separately; otherwise we run into situations where they attempt to use the same resources. We've mentioned a hacky approach where we allocate a large chunk for mmaps at runtime, but that would obviously 1. be very inefficient and 2. still require additional infrastructure to manage. The long-term solution is most likely to implement a virtual memory map (VMMap) similar to NaCl's.
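For reference, a NaCl-style virtual memory map can be as simple as an ordered list of mapped intervals per cage, used both to find holes for new mappings and to validate addresses. A minimal sketch with invented names (not the actual data structure):

```c
#include <stdint.h>

/* One mapped interval; entries are kept sorted by start address. */
struct vmmap_entry {
    uint64_t start, len;
    int prot, flags;              /* MAP_SHARED vs MAP_PRIVATE matters on fork */
    struct vmmap_entry *next;
};

/* Scan the sorted entries for a gap in [lo, hi) large enough to hold
 * `len` bytes; returns 0 when no hole exists (0 is a safe sentinel
 * because lo sits above the NULL page in practice). */
static uint64_t vmmap_find_hole(struct vmmap_entry *head, uint64_t lo,
                                uint64_t hi, uint64_t len) {
    uint64_t cur = lo;
    for (struct vmmap_entry *e = head; e; e = e->next) {
        if (e->start >= cur + len)
            return cur;                  /* gap before this entry fits */
        if (e->start + e->len > cur)
            cur = e->start + e->len;     /* skip past this mapping */
    }
    return (hi >= cur + len) ? cur : 0;  /* tail gap */
}
```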

@JustinCappos
Member

I'd advise against having system calls handled directly by the runtime / caging infrastructure if we can at all avoid it. Even calling from our microvisor back into the runtime to do the memory mapping, etc., seems preferable to me.

@rennergade
Contributor

rennergade commented Jul 24, 2024

Yeah, I agree. I think the three things the runtime currently does that we'd need to move or modify are:

  1. Creating a new instance of a cage (important for fork).

  2. Creating new threads (for pthread_create, currently done by wasi-threads).

  3. Modifying the bounds of linear memory (important for brk/mmap).

Those are the scenarios we probably need to put some thought into.

@rennergade
Contributor

We discussed how to move this forward today in the weekly meeting. Thank you @yizhuoliang for joining us.

Here's how we're breaking this down to proceed:

libc/wasmtime integration
@qianxichen233

  • Add MAKE_SYSCALL stubs to libc to export the mmap/munmap/brk/shmat/shmdt calls to wasmtime/RawPOSIX (see the stub sketch after this list).
  • Figure out how to initialize memory so we can use the entire address space on cage startup. This probably involves growing linear memory to its maximum and then mapping the unused range PROT_NONE.
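The stub shape might look roughly like the following; the MAKE_SYSCALL signature, the syscall number, and the negated-errno convention are assumptions about the eventual interface, not the actual lind-wasm code:

```c
#include <errno.h>
#include <stdint.h>
#include <sys/mman.h>

#define MMAP_SYSCALL_NO 21  /* made-up number for the example */

/* Assume MAKE_SYSCALL(number, name, a1..a6) traps out of the wasm module
 * into the wasmtime/RawPOSIX dispatcher and returns an i64 that is either
 * a value or a negated errno. */
void *mmap(void *addr, size_t len, int prot, int flags, int fd, off_t off) {
    int64_t ret = MAKE_SYSCALL(MMAP_SYSCALL_NO, "mmap",
                               (uint64_t)(uintptr_t)addr, (uint64_t)len,
                               prot, flags, fd, (int64_t)off);
    if (ret < 0) {
        errno = (int)-ret;
        return MAP_FAILED;
    }
    return (void *)(uintptr_t)ret;
}
```

On the startup side, growing linear memory to its maximum and then marking the unused range PROT_NONE would reserve the whole address space up front, so later mmap calls only need to flip protections on pages they actually claim.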

RawPOSIX integration

It's important that any use of the vmmap occurs in the dispatcher step and not in the actual syscalls, i.e., mmap finds a hole address and sends that address with MAP_FIXED into mmap_syscall, or write() checks the address in the dispatcher before calling a valid write_syscall.
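In C-flavored pseudocode (the real dispatcher lives in Rust, and `struct cage`, `vmmap_add`, and `mmap_syscall` are hypothetical names), the rule looks like this:

```c
#include <errno.h>
#include <stdint.h>
#include <sys/mman.h>

/* The dispatcher, not the syscall itself, consults the per-cage vmmap:
 * it picks a free hole and pins the placement with MAP_FIXED. */
int64_t dispatch_mmap(struct cage *c, size_t len, int prot, int flags,
                      int fd, off_t off) {
    uint64_t addr = vmmap_find_hole(c->vmmap, c->map_lo, c->map_hi, len);
    if (addr == 0)
        return -ENOMEM;
    int64_t ret = mmap_syscall(c, addr, len, prot, flags | MAP_FIXED, fd, off);
    if (ret >= 0)
        vmmap_add(c->vmmap, addr, len, prot, flags);  /* record the mapping */
    return ret;
}
```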

@JustinCappos
Member

JustinCappos commented Oct 31, 2024

I spoke with Dennis a bit about mmap yesterday, and our thinking is that there could be a separate memory region for each process for mmaps. We could statically allocate a larger part of the address space than mmap needs and then grow/shrink it in response to requests. Happy to discuss if this isn't clear.
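As a rough illustration of that idea (sizes and names invented, and this is the simple scheme under discussion rather than a settled design), the per-process mmap area could be a statically reserved range with a movable top:

```c
#include <stddef.h>
#include <stdint.h>

/* A statically reserved per-process mmap region: space is handed out at
 * `top`, which grows toward `limit` and can shrink again when the
 * topmost mapping is freed. */
struct mmap_region {
    uint64_t base;   /* start of the reserved area */
    uint64_t top;    /* current high-water mark */
    uint64_t limit;  /* end of the reserved area */
};

static int64_t region_alloc(struct mmap_region *r, size_t len) {
    if (r->top + len > r->limit)
        return -1;   /* request exceeds the static reservation */
    uint64_t addr = r->top;
    r->top += len;
    return (int64_t)addr;
}
```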

Let me know if this is similar to your thoughts. I'm more trying to understand what the options are, rather than to push hard for a specific solution.

@rennergade
Contributor

In theory this is how the VMMap would handle things at a basic level. I think there are several reasons why using the VMMap instead of a "greedy" implementation is preferable/necessary:

  1. It's not very straightforward to manage fragmentation, specifically in scenarios with large mappings. I could see this potentially being a problem with something like postgres.

  2. If we don't track which addresses are valid, we can potentially fault in trusted code, e.g. write() sends an invalid buffer to an in-memory pipe and segfaults. This becomes less of an issue if these things are done in grates, but I believe there are still trusted operations on memory in 3i that could be affected by this.

  3. Managing mappings on fork(). In NaCl, when we copy the address space we have to copy SHARED and unshared mappings separately: we handle shared mappings via mremap, while we copy unshared mappings via process_vm_writev. Without tracking, I'm not sure we'd know which mappings are shared or how to handle them.
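For point 3, the Linux primitives themselves are real (`process_vm_writev` from <sys/uio.h>), though the helper around them is only a sketch: private mappings get copied byte-for-byte into the child at the same address, while shared mappings would instead be re-established against the same shared backing.

```c
#define _GNU_SOURCE
#include <sys/uio.h>
#include <unistd.h>

/* Copy one private (unshared) mapping from the parent into the child at
 * the same address. Shared mappings are handled separately by remapping
 * the shared backing rather than copying pages. */
static int copy_private_mapping(pid_t child, void *addr, size_t len) {
    struct iovec local  = { .iov_base = addr, .iov_len = len };
    struct iovec remote = { .iov_base = addr, .iov_len = len };
    ssize_t n = process_vm_writev(child, &local, 1, &remote, 1, 0);
    return n == (ssize_t)len ? 0 : -1;
}
```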

@rennergade
Contributor

The Rust port of the VMMap is about 95% finished, and it looks like it's implemented in a way that should be much more performant than NaCl's. So I believe we have a path forward here.

@rennergade
Contributor

Added in #56
