ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-11 01:06:01 +03:00

Author	SHA1	Message	Date
Andreas Kling	197e73ee31	Kernel+LibELF: Enable SMAP protection during non-syscall exec() When loading a new executable, we now map the ELF image in kernel-only memory and parse it there. Then we use copy_to_user() when initializing writable regions with data from the executable. Note that the exec() syscall still disables SMAP protection and will require additional work. This patch only affects kernel-originated process spawns.	2020-01-10 10:57:06 +01:00
Andreas Kling	9eef39d68a	Kernel: Start implementing x86 SMAP support Supervisor Mode Access Prevention (SMAP) is an x86 CPU feature that prevents the kernel from accessing userspace memory. With SMAP enabled, trying to read/write a userspace memory address while in the kernel will now generate a page fault. Since it's sometimes necessary to read/write userspace memory, there are two new instructions that quickly switch the protection on/off: STAC (disables protection) and CLAC (enables protection.) These are exposed in kernel code via the stac() and clac() helpers. There's also a SmapDisabler RAII object that can be used to ensure that you don't forget to re-enable protection before returning to userspace code. This patch also adds copy_to_user(), copy_from_user() and memset_user() which are the "correct" way of doing things. These functions allow us to briefly disable protection for a specific purpose, and then turn it back on immediately after it's done. Going forward all kernel code should be moved to using these and all uses of SmapDisabler are to be considered FIXME's. Note that we're not realizing the full potential of this feature since I've used SmapDisabler quite liberally in this initial bring-up patch.	2020-01-05 18:14:51 +01:00
Andreas Kling	ea1911b561	Kernel: Share code between Region::map() and Region::remap_page() These were doing mostly the same things, so let's just share the code.	2020-01-01 19:32:55 +01:00
Andreas Kling	5aeaab601e	Kernel: Move CPU feature detection to Arch/x86/CPU.{cpp,h} We now refuse to boot on machines that don't support PAE since all of our paging code depends on it. Also let's only enable SSE and PGE support if the CPU advertises it.	2020-01-01 12:57:00 +01:00
Andreas Kling	1f31156173	Kernel: Add a mode flag to sys$purge and allow purging clean inodes	2019-12-29 13:16:53 +01:00
Andreas Kling	0d5e0e4cad	Kernel+SystemMonitor: Expose amount of per-process dirty private memory Dirty private memory is all memory in non-inode-backed mappings that's process-private, meaning it's not shared with any other process. This patch exposes that number via SystemMonitor, giving us an idea of how much memory each process is responsible for all on its own.	2019-12-29 12:28:32 +01:00
Conrad Pankoff	17aef7dc99	Kernel: Detect support for no-execute (NX) CPU features Previously we assumed all hosts would have support for IA32_EFER.NXE. This is mostly true for newer hardware, but older hardware will crash and burn if you try to use this feature. Now we check for support via CPUID.80000001[20].	2019-12-26 10:05:51 +01:00
Andreas Kling	ce5f7f6c07	Kernel: Use the CPU's NX bit to enforce PROT_EXEC on memory mappings Now that we have PAE support, we can ask the CPU to crash processes for trying to execute non-executable memory. This is pretty cool! :^)	2019-12-25 13:35:57 +01:00
Andreas Kling	ae2d72377d	Kernel: Enable the x86 WP bit to catch invalid memory writes in ring 0 Setting this bit will cause the CPU to generate a page fault when writing to read-only memory, even if we're executing in the kernel. Seemingly the only change needed to make this work was to have the inode-backed page fault handler use a temporary mapping for writing the read-from-disk data into the newly-allocated physical page.	2019-12-21 16:21:13 +01:00
Andreas Kling	b6ee8a2c8d	Kernel: Rename vmo => vmobject everywhere	2019-12-19 19:15:27 +01:00
Andreas Kling	1d4d6f16b2	Kernel: Add a specific-page variant of Region::commit()	2019-12-18 22:43:32 +01:00
Andreas Kling	931e4b7f5e	Kernel+SystemMonitor: Prevent userspace access to process ELF image Every process keeps its own ELF executable mapped in memory in case we need to do symbol lookup (for backtraces, etc.) Until now, it was mapped in a way that made it accessible to the program, despite the program not having mapped it itself. I don't really see a need for userspace to have access to this right now, so let's lock things down a little bit. This patch makes it inaccessible to userspace and exposes that fact through /proc/PID/vm (per-region "user_accessible" flag.)	2019-12-15 20:11:57 +01:00
Andreas Kling	05a441afb2	Kernel: Don't turn private read-only regions into shared ones on fork Even if they are read-only now, they can be mprotect(PROT_WRITE)'d in the future, so we have to make sure they are CoW mapped.	2019-12-15 16:53:46 +01:00
Andreas Kling	3fbc50a350	Kernel+SystemMonitor: Expose the number of set CoW bits in each Region This number tells us how many more pages in a given region will trigger a CoW fault if written to.	2019-12-15 16:53:00 +01:00
Andreas Kling	dbb644f20c	Kernel: Start implementing purgeable memory support It's now possible to get purgeable memory by using mmap(MAP_PURGEABLE). Purgeable memory has a "volatile" flag that can be set using madvise(): - madvise(..., MADV_SET_VOLATILE) - madvise(..., MADV_SET_NONVOLATILE) When in the "volatile" state, the kernel may take away the underlying physical memory pages at any time, without notifying the owner. This gives you a guilt discount when caching very large things. :^) Setting a purgeable region to non-volatile will return whether or not the memory has been taken away by the kernel while being volatile. Basically, if madvise(..., MADV_SET_NONVOLATILE) returns 1, that means the memory was purged while volatile, and whatever was in that piece of memory needs to be reconstructed before use.	2019-12-09 19:12:38 +01:00
Andreas Kling	05c65fb4f1	Kernel: Don't CoW non-writable pages A page fault in a page marked for CoW should not trigger a CoW if the page is non-writable. I think this makes sense.	2019-12-02 19:20:09 +01:00
Andreas Kling	f41ae755ec	Kernel: Crash on memory access in non-readable regions This patch makes it possible to make memory regions non-readable. This is enforced using the "present" bit in the page tables. A process that hits a not-present page fault in a non-readable region will be crashed.	2019-12-02 19:18:52 +01:00
Andreas Kling	5b8cf2ee23	Kernel: Make syscall counters and page fault counters per-thread Now that we show individual threads in SystemMonitor and "top", it's also very nice to have individual counters for the threads. :^)	2019-11-26 21:37:38 +01:00
Andreas Kling	9a157b5e81	Revert "Kernel: Move Kernel mapping to 0xc0000000" This reverts commit `bd33c66273`. This broke the network card drivers, since they depended on kmalloc addresses being identity-mapped.	2019-11-23 17:27:09 +01:00
Jesse Buhagiar	bd33c66273	Kernel: Move Kernel mapping to 0xc0000000 The kernel is now no longer identity mapped to the bottom 8MiB of memory, and is now mapped at the higher address of `0xc0000000`. The lower ~1MiB of memory (from GRUB's mmap), however is still identity mapped to provide an easy way for the kernel to get physical pages for things such as DMA etc. These could later be mapped to the higher address too, as I'm not too sure how to go about doing this elegantly without a lot of address subtractions.	2019-11-22 16:23:23 +01:00
Andreas Kling	794758df3a	Kernel: Implement some basic stack pointer validation VM regions can now be marked as stack regions, which is then validated on syscall, and on page fault. If a thread is caught with its stack pointer pointing into anything that's not a Region with its stack bit set, we'll crash the whole process with SIGSTKFLT. Userspace must now allocate custom stacks by using mmap() with the new MAP_STACK flag. This mechanism was first introduced in OpenBSD, and now we have it too, yay! :^)	2019-11-17 12:15:43 +01:00
Andreas Kling	a6e9119537	Kernel: Tweak some outdated kprintfs in Region	2019-11-04 00:48:45 +01:00
Andreas Kling	d67c6a92db	Kernel: Move page fault handling from MemoryManager to Region After the page fault handler has found the region in which the fault occurred, do the rest of the work in the region itself. This patch also makes all fault types consistently crash the process if a new page is needed but we're all out of pages.	2019-11-04 00:47:03 +01:00
Andreas Kling	0e8f1d7cb6	Kernel: Don't expose a region's page directory to the outside world Now that region manages its own mapping/unmapping, there's no need for the outside world to be able to grab at its page directory.	2019-11-04 00:26:00 +01:00
Andreas Kling	6ed9cc4717	Kernel: Remove Region API's for setting/unsetting the page directory This is done implicitly by mapping or unmapping the region.	2019-11-04 00:24:20 +01:00
Andreas Kling	e3dda4e87b	Kernel: Fix weird Region constructor that took nullable RefPtr<Inode> It's never valid to construct a Region with a null Inode pointer using this constructor, so just take a NonnullRefPtr<Inode> instead.	2019-11-04 00:21:08 +01:00
Andreas Kling	9b2dc36229	Kernel: Merge MemoryManager::map_region_at_address() into Region::map()	2019-11-04 00:05:57 +01:00
Andreas Kling	4bf1a72d21	Kernel: Teach Region how to remap itself Now remapping (i.e. flushing kernel metadata to the CPU page tables) is done by simply calling Region::remap().	2019-11-03 21:11:08 +01:00
Andreas Kling	3dce0f23f4	Kernel: Regions should be mapped into a PageDirectory, not a Process This patch changes the parameter to Region::map() to be a PageDirectory since that matches how we think about the memory model: Regions are views onto VMObjects, and are mapped into PageDirectories. Each Process has a PageDirectory. The kernel also has a PageDirectory.	2019-11-03 21:11:08 +01:00
Andreas Kling	2cfc43c982	Kernel: Move region map/unmap operations into the Region class The more Region can take care of itself, the better.	2019-11-03 21:11:08 +01:00
Andreas Kling	a221cddeec	Kernel: Clean up a bunch of wrong-looking Region/VMObject code Since a Region is merely a "window" onto a VMObject, it can both begin and end at a distance from the VMObject's boundaries. Therefore, we should always be computing indices into a VMObject's physical page array by adding the Region's "first_page_index()". There was a whole bunch of code that forgot to do that. This fixes many wrong behaviors for Regions that start part-way into a VMObject.	2019-11-03 15:44:13 +01:00
Andreas Kling	fe455c5ac4	Kernel: Move page remapping into Region::remap_page(index) Let Region deal with this, instead of everyone calling MemoryManager.	2019-11-03 15:32:11 +01:00
Andreas Kling	d481ae95b5	Kernel: Defer creation of Region CoW bitmaps until they're needed Instead of allocating and populating a Copy-on-Write bitmap for each Region up front, wait until we actually clone the Region for sharing with another process. In most cases, we never need any CoW bits and we save ourselves a lot of kmalloc() memory and time.	2019-10-01 19:58:41 +02:00
Andreas Kling	7f9a33dba1	Kernel: Make Region single-owner instead of ref-counted This simplifies the ownership model and makes Region easier to reason about. Userspace Regions are now primarily kept by Process::m_regions. Kernel Regions are kept in various OwnPtr<Regions>'s. Regions now only ever get unmapped when they are destroyed.	2019-09-27 14:25:42 +02:00
Andreas Kling	bf43d94a2f	Kernel: Disable interrupts throughout ~Region() We don't want an interrupt handler to access the VM data structures while their internal consistency is broken.	2019-09-05 11:15:05 +02:00
Andreas Kling	e25ade7579	Kernel: Rename "vmo" to "vmobject" everywhere	2019-09-04 11:27:14 +02:00
Andreas Kling	f5d779f47e	Kernel: Never forcibly page in entire executables We were doing this for the initial kernel-spawned userspace process(es) to work around instability in the page fault handler. Now that the page fault handler is more robust, we can stop worrying about this. Specifically, the page fault handler was previously not able to handle getting a page fault in anything but the currently executing task's page directory.	2019-08-26 13:20:01 +02:00
Andreas Kling	e29fd3cd20	Kernel: Display virtual addresses as V%p instead of L%x The L was a leftover from when these were called linear addresses.	2019-08-26 11:31:58 +02:00
Andreas Kling	6bdb81ad87	Kernel: Split VMObject into two classes: Anonymous- and InodeVMObject InodeVMObject is a VMObject with an underlying Inode in the filesystem. AnonymousVMObject has no Inode. I'm happy that InodeVMObject::inode() can now return Inode& instead of VMObject::inode() return Inode*. :^)	2019-08-07 18:09:32 +02:00
Andreas Kling	3364da388f	Kernel: Remove VMObject names The VMObject name was always either the owning region's name, or the absolute path of the underlying inode. We can reconstitute this information if wanted, no need to keep copies of these strings around.	2019-08-07 16:14:08 +02:00
Andreas Kling	5b2447a27b	Kernel: Track user accessibility per Region. Region now has is_user_accessible(), which informs the memory manager how to map these pages. Previously, we were just passing a "bool user_allowed" to various functions and I'm not at all sure that any of that was correct. All the Region constructors are now hidden, and you must go through one of these helpers to construct a region: - Region::create_user_accessible(...) - Region::create_kernel_only(...) That ensures that we don't accidentally create a Region without specifying user accessibility. :^)	2019-07-19 16:11:52 +02:00
Andreas Kling	5254a320d8	Kernel: Remove use of copy_ref() in favor of regular RefPtr copies. This is obviously more readable. If we ever run into a situation where ref count churn is actually causing trouble in the future, we can deal with it then. For now, let's keep it simple. :^)	2019-07-11 15:40:04 +02:00
Andreas Kling	27f699ef0c	AK: Rename the common integer typedefs to make it obvious what they are. These types can be picked up by including <AK/Types.h>: * u8, u16, u32, u64 (unsigned) * i8, i16, i32, i64 (signed)	2019-07-03 21:20:13 +02:00
Andreas Kling	90b1354688	AK: Rename RetainPtr => RefPtr and Retained => NonnullRefPtr.	2019-06-21 18:37:47 +02:00
Andreas Kling	77b9fa89dd	AK: Rename Retainable => RefCounted. (And various related renames that go along with it.)	2019-06-21 15:30:03 +02:00
Conrad Pankoff	aee9317d86	Kernel: Refactor MemoryManager to use a Bitmap rather than a Vector This significantly reduces the pressure on the kernel heap when allocating a lot of pages. Previously at about 250MB allocated, the free page list would outgrow the kernel's heap. Given that there is no longer a page list, this does not happen. The next barrier will be the kernel memory used by the page records for in-use memory. This kicks in at about 1GB.	2019-06-12 15:38:17 +02:00
Andreas Kling	de65c960e9	Kernel: Tweak some String&& => const String&. String&& is just not very practical. Also return const String& when the returned string is a member variable. The call site is free to make a copy if he wants, but otherwise we can avoid the retain count churn.	2019-06-07 20:58:12 +02:00
Andreas Kling	e42c3b4fd7	Kernel: Rename LinearAddress => VirtualAddress.	2019-06-07 12:56:50 +02:00
Andreas Kling	bc951ca565	Kernel: Run clang-format on everything.	2019-06-07 11:43:58 +02:00
Andreas Kling	ba58b4617d	VM: Don't remap each Region page twice in page_in(). page_in_from_inode() will map the page after reading it from disk, so we don't need to remap it once again.	2019-06-01 15:45:50 +02:00
Andreas Kling	baaede1bf9	Kernel: Make the Process allocate_region* API's understand "int prot". Instead of having to inspect 'prot' at every call site, make the Process API's take care of that so we can just pass it through.	2019-05-30 16:14:37 +02:00
Andreas Kling	87b54a82c7	Kernel: Let Region keep a Range internally.	2019-05-17 04:32:08 +02:00
Andreas Kling	01ffcdfa31	Kernel: Encapsulate the Region's COW map a bit better.	2019-05-14 17:31:57 +02:00
Andreas Kling	b9738fa8ac	Kernel: Move VM-related files into Kernel/VM/. Also break MemoryManager.{cpp,h} into one file per class.	2019-04-03 15:13:07 +02:00
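
Commit a221cddeec above describes a Region as a "window" onto a VMObject and says indices into the VMObject's physical page array must always be offset by the Region's first_page_index(). The following is only a minimal sketch of that arithmetic, not the kernel's code: ToyVMObject, ToyRegion, and their members are hypothetical stand-ins invented for illustration, and a plain int stands in for a physical page.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a VMObject: just an array of "physical pages".
struct ToyVMObject {
    std::vector<int> physical_pages; // int is a placeholder for a real page handle
};

// Hypothetical stand-in for a Region: a window that may begin and end
// part-way into the VMObject it views.
struct ToyRegion {
    ToyVMObject& vmobject;
    std::size_t first_page_index; // offset of the window inside the VMObject
    std::size_t page_count;       // number of pages the window covers

    // Translate a region-relative page index into a VMObject page index.
    // Forgetting the first_page_index offset is the bug class the commit fixes.
    std::size_t vmobject_page_index(std::size_t page_index_in_region) const
    {
        assert(page_index_in_region < page_count);
        return first_page_index + page_index_in_region;
    }

    // Access the page behind a region-relative index.
    int& page(std::size_t page_index_in_region) const
    {
        return vmobject.physical_pages[vmobject_page_index(page_index_in_region)];
    }
};
```

With a six-page ToyVMObject and a ToyRegion starting at page 2, region-relative page 0 resolves to VMObject page 2. Indexing the array directly with the region-relative index would read page 0 instead.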

154 Commits
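
Commit 17aef7dc99 above checks for no-execute support via CPUID.80000001[20]. As a rough sketch of that test, assuming the caller has already executed CPUID with leaf 0x80000001 and holds the resulting EDX value (where the NX capability bit is reported), the check reduces to a single bit mask. The names nx_feature_bit and cpu_supports_nx are hypothetical and are not taken from the kernel source.

```cpp
#include <cstdint>

// NX (no-execute) capability is reported in bit 20 of EDX for CPUID leaf 0x80000001.
constexpr std::uint32_t nx_feature_bit = 1u << 20;

// Hypothetical helper: takes the EDX value returned by CPUID.80000001
// and reports whether the CPU advertises NX support.
constexpr bool cpu_supports_nx(std::uint32_t extended_edx)
{
    return (extended_edx & nx_feature_bit) != 0;
}
```

Per the commit message, IA32_EFER.NXE should only be enabled when a check like this succeeds, since older hardware lacks the feature.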