ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-11 01:06:01 +03:00

Author	SHA1	Message	Date
Andreas Kling	84725ef3a5	Kernel+UserspaceEmulator: Add sys$emuctl() system call This returns ENOSYS if you are running in the real kernel, and some other result if you are running in UserspaceEmulator. There are other ways we could check if we're inside an emulator, but it seemed easier to just ask. :^)	2021-03-09 08:58:26 +01:00
Brian Gianforcaro	5f6ab77352	Kernel: Add bitwise operators for Thread::FileBlocker::BlockFlags enum Switch to using type-safe bitwise operators for the BlockFlags class, this cleans up a lot of boilerplate casts which are necessary when the enum is declared as `enum class`.	2021-03-08 18:47:40 +01:00
Ben Wiederhake	501952852c	Kernel: Fix pointer over/underflow in create_thread The expression (u8*)params.m_stack_location + stack_size … causes UBSan to spit out the warning KUBSAN: addition of unsigned offset to 0x00000002 overflowed to 0xb0000003 … even though there is no actual overflow happening here. This can be reproduced by running: $ syscall create_thread 0 [ 0 0 0 0 0xb0000001 2 ] Technically, this is a true-positive: The C++-reference is incredibly strict about pointer-arithmetic: > A pointer to non-array object is treated as a pointer to the first element > of an array with size 1. […] [A]ttempts to generate a pointer that isn't > pointing at an element of the same array or one past the end invoke > undefined behavior. https://en.cppreference.com/w/cpp/language/operator_arithmetic Frankly, this feels silly. So let's just use FlatPtr instead. Found by fuzz-syscalls. Undocumented bug. Note that FlatPtr is an unsigned type, so user_esp.value() - 4 is defined even if we end up with a user_esp of 0 (this can happen for example when params.m_stack_size = 0 and params.m_stack_location = 0). The result would be a Kernelspace-pointer, which would then be immediately flagged by 'MM.validate_user_stack' as invalid, as intended.	2021-03-07 17:31:25 +01:00
Andreas Kling	a819eb5016	Kernel: Skip TLB flushes while cloning regions in sys$fork() Since we know for sure that the virtual memory regions in the new process being created are not being used on any CPU, there's no need to do TLB flushes for every mapped page.	2021-03-03 22:57:45 +01:00
Andreas Kling	d96a44a738	Kernel: Avoid transient kmalloc heap allocations in sys$select() Dynamic Vector allocations in sys$select() were showing up in the full-system profile and since there will never be more than FD_SETSIZE file descriptors to worry about, we can confidently add enough inline capacity to this Vector that it never has to kmalloc. To compensate for the increased stack usage, reduce the size of the FDInfo struct while we're here. :^)	2021-03-03 20:37:23 +01:00
Andreas Kling	5e7abea31e	Kernel+Profiler: Capture metadata about all profiled processes The perfcore file format was previously limited to a single process since the pid/executable/regions data was top-level in the JSON. This patch moves the process-specific data into a top-level array named "processes" and we now add entries for each process that has been sampled during the profile run. This makes it possible to see samples from multiple threads when viewing a perfcore file with Profiler. This is extremely cool! :^)	2021-03-02 22:38:06 +01:00
Andreas Kling	ea500dd3e3	Kernel: Start work on full system profiling :^) The superuser can now call sys$profiling_enable() with PID -1 to enable profiling of all running threads in the system. The perf events are collected in a global PerformanceEventBuffer (currently 32 MiB in size.) The events can be accessed via /proc/profile	2021-03-02 22:38:06 +01:00
Andreas Kling	b425c2602c	Kernel: Better handling of allocation failure in profiling If we can't allocate a PerformanceEventBuffer to store the profiling events, we now fail sys$profiling_enable() and sys$perf_event() with ENOMEM instead of carrying on with a broken buffer.	2021-03-02 22:38:06 +01:00
Ben Wiederhake	5c15ca7b84	Kernel: Make sockets use AK::Time	2021-03-02 08:36:08 +01:00
Ben Wiederhake	336303bda4	Kernel: Make kgettimeofday use AK::Time	2021-03-02 08:36:08 +01:00
Ben Wiederhake	c040e64b7d	Kernel: Make TimeManagement use AK::Time internally I don't dare touch the multi-threading logic and locking mechanism, so it stays timespec for now. However, this could and should be changed to AK::Time, and I bet it will simplify the "increment_time_since_boot()" code.	2021-03-02 08:36:08 +01:00
Ben Wiederhake	2b6546c40a	Kernel: Make Thread use AK::Time internally This commit is very invasive, because Thread likes to take a pointer and write to it. This means that translating between timespec/timeval/Time would have been more difficult than just changing everything that hands a raw pointer to Thread, in bulk.	2021-03-02 08:36:08 +01:00
Ben Wiederhake	8598240193	Kernel: Sanitize all user-supplied timeval's/timespec's This also removes a bunch of unnecessary EINVAL. Most of them weren't even recommended by POSIX.	2021-03-02 08:36:08 +01:00
Andreas Kling	4d006de2b9	Kernel: Fix build with IO_DEBUG	2021-03-01 16:07:50 +01:00
Andreas Kling	272c2e6ec5	Kernel: Use Userspace<T> in sys${munmap,mprotect,madvise,msyscall}()	2021-03-01 15:53:33 +01:00
Andreas Kling	bebceaa32c	Kernel: Use Userspace<T> in sys$select()	2021-03-01 15:07:01 +01:00
Andreas Kling	a1a82c1d95	Kernel: Use Userspace<T> in sys$get_dir_entries()	2021-03-01 15:04:31 +01:00
Andreas Kling	b5f32be577	Kernel: Use Userspace<T> in sys$get_stack_bounds()	2021-03-01 14:50:36 +01:00
Andreas Kling	122c7b6cbb	Kernel: Use Userspace<T> in sys$write()	2021-03-01 14:35:06 +01:00
Andreas Kling	6a6eb8844a	Kernel: Use Userspace<T> in sys$sigaction() fuzz-syscalls found a bunch of unaligned accesses into struct sigaction via this syscall. This patch fixes that issue by porting the syscall to Userspace<T> which we should have done anyway. :^) Fixes #5500.	2021-03-01 14:06:20 +01:00
Andreas Kling	ac71775de5	Kernel: Make all syscall functions return KResultOr<T> This makes it a lot easier to return errors since we no longer have to worry about negating EFOO errors and can just return them flat.	2021-03-01 13:54:32 +01:00
Andreas Kling	4aa58aaab5	Kernel: Don't disable interrupts while exiting a thread or process This was another vestige from a long time ago, when exiting a thread would mutate global data structures that were only protected by the interrupt flag.	2021-02-25 19:36:36 +01:00
Andreas Kling	8eeb8db2ed	Kernel: Don't disable interrupts while dealing with a process crash This was necessary in the past when crash handling would modify various global things, but all that stuff is long gone so we can simplify crashes by leaving the interrupt flag alone.	2021-02-25 19:36:36 +01:00
Andreas Kling	8129f3da52	Kernel: Move SMAP disabler RAII helper to its own file Added this in a new directory called Kernel/Arch/x86/ where stuff that applies to both i386 and x86_64 can live.	2021-02-25 17:25:34 +01:00
Andreas Kling	8f70528f30	Kernel: Take some baby steps towards x86_64 Make more of the kernel compile in 64-bit mode, and make some things pointer-size-agnostic (by using FlatPtr.) There's a lot of work to do here before the kernel will even compile.	2021-02-25 16:27:12 +01:00
Andreas Kling	c11511a0ab	Kernel: Move sys$sigaction() implementation inside ARCH(i386)	2021-02-25 11:33:06 +01:00
Andreas Kling	53c6c29158	Kernel: Tighten some typing in Arch/i386/CPU.h Use more appropriate types for some things.	2021-02-25 11:32:27 +01:00
Brian Gianforcaro	303620ea85	Kernel: Fix pointer overflow in create_thread KUBSAN found this overflow from syscall fuzzing. Fixes #5498	2021-02-24 15:14:13 +01:00
Andreas Kling	ce1775d81d	Kernel: Oops, fix broken sys$uname() function definition	2021-02-24 14:42:38 +01:00
Andreas Kling	a48d54dfc5	Kernel: Don't dereference untrusted userspace pointer in sys$uname() Instead of writing to the userspace utsname struct one field at a time, build up a utsname on the kernel stack and copy it out to userspace once it's finished. This is both simpler and gets validity checking built-in for free. Found by KUBSAN! :^) Fixes #5499.	2021-02-24 14:37:36 +01:00
Andreas Kling	5d180d1f99	Everywhere: Rename ASSERT => VERIFY (...and ASSERT_NOT_REACHED => VERIFY_NOT_REACHED) Since all of these checks are done in release builds as well, let's rename them to VERIFY to prevent confusion, as everyone is used to assertions being compiled out in release. We can introduce a new ASSERT macro that is specifically for debug checks, but I'm doing this wholesale conversion first since we've accumulated thousands of these already, and it's not immediately obvious which ones are suitable for ASSERT.	2021-02-23 20:56:54 +01:00
Brian Gianforcaro	d934e77522	Kernel: Use copy_n_from_user in sys$setgroups to check for overflow	2021-02-21 17:12:01 +01:00
Brian Gianforcaro	4743afeaf4	Kernel: Use already computed nfds_checked value when copying from user mode. - We've already computed the number of fds * sizeof(pollfd), so use it instead of needlessly doing it again. - Use fds_copy.data() instead off address of indexing the vector.	2021-02-21 17:12:01 +01:00
Brian Gianforcaro	1c0e2947d7	Kernel: Use copy_n_from_user in sys$setkeymap	2021-02-21 17:12:01 +01:00
Brian Gianforcaro	26bba8e100	Kernel: Populate ELF::AuxilaryValue::Platform from Processor object. Move this to the processor object so it can easily be implemented when Serenity is compiled for a different architecture.	2021-02-21 17:06:24 +01:00
Brian Gianforcaro	a977cdd9ac	Kernel: Remove unneeded Thread::set_default_signal_dispositions The `default_signal_action(u8 signal)` function already has the full mapping. The only caveat being that now we need to make sure the thread constructor and clear_signals() method do the work of resetting the m_signal_action_data array, instead or relying on the previous logic in set_default_signal_dispositions.	2021-02-21 12:54:39 +01:00
Andreas Kling	84b2d4c475	Kernel: Add "map_fixed" pledge promise This is a new promise that guards access to mmap() with MAP_FIXED. Fixed-address mappings are rarely used, but can be useful if you are trying to groom the process address space for malicious purposes. None of our programs need this at the moment, as the only user of MAP_FIXED is DynamicLoader, but the fixed mappings are constructed before the process has had a chance to pledge anything.	2021-02-21 01:08:48 +01:00
Andreas Kling	6e83be67b8	Kernel: Release ptrace lock in exec before stopping due to PT_TRACE_ME If we have a tracer process waiting for us to exec, we need to release the ptrace lock before stopping ourselves, since otherwise the tracer will block forever on the lock. Fixes #5409.	2021-02-19 12:13:54 +01:00
Andreas Kling	eb92ec3149	Kernel: Factor out mmap & friends range expansion to a helper function sys$mmap() and related syscalls must pad to the nearest page boundary below the base address and above the end address of the specified range. Since we have to do this in many places, let's make a helper.	2021-02-18 18:04:58 +01:00
Andreas Kling	55a9a4f57a	Kernel: Use KResult a bit more in sys$execve()	2021-02-18 09:37:33 +01:00
Andreas Kling	5a595ef134	Kernel: Use dbgln_if() in sys$fork()	2021-02-17 15:34:32 +01:00
Andreas Kling	575c7ed414	Kernel: Make sys$msyscall() EFAULT on non-user address Fixes #5361.	2021-02-16 11:32:00 +01:00
Ben Wiederhake	fbb85f9b2f	Kernel: Refuse excessively long iovec list, also in readv This bug is a good example why copy-paste code should eventually be eliminated from the code base: Apparently the code was copied from read.cpp before `c6027ed7cc`, so the same bug got introduced here. To recap: A malicious program can ask the Kernel to prepare sys-ing to a huge amount of iovecs. The Kernel must first copy all the vector locations into 'vecs', and before that allocates an arbitrary amount of memory: vecs.resize(iov_count); This can cause Kernel memory exhaustion, triggered by any malicious userland program.	2021-02-15 22:09:01 +01:00
AnotherTest	4519950266	Kernel+LibC: Add the _SC_GETPW_R_SIZE_MAX sysconf enum It just returns 4096 :P	2021-02-15 17:32:56 +01:00
AnotherTest	a3a7ab83c4	Kernel+LibC: Implement readv We already had writev, so let's just add readv too.	2021-02-15 17:32:56 +01:00
Andreas Kling	68e3616971	Kernel: Forked children should inherit the signal trampoline address Fixes #5347.	2021-02-14 18:38:46 +01:00
Andreas Kling	6ee499aeb0	Kernel: Round old address/size in sys$mremap() to page size multiples Found by fuzz-syscalls. :^)	2021-02-14 13:15:05 +01:00
Andreas Kling	e47bffdc8c	Kernel: Add some bits of randomness to the userspace stack pointer This patch adds a random offset between 0 and 4096 to the initial stack pointer in new processes. Since the stack has to be 16-byte aligned, the bottom bits can't be randomized. Yet another thing to make things less predictable. :^)	2021-02-14 11:53:49 +01:00
Andreas Kling	4188373020	Kernel: Fix TOCTOU in syscall entry region validation We were doing stack and syscall-origin region validations before taking the big process lock. There was a window of time where those regions could then be unmapped/remapped by another thread before we proceed with our syscall. This patch closes that window, and makes sys$get_stack_bounds() rely on the fact that we now know the userspace stack pointer to be valid. Thanks to @BenWiederhake for spotting this! :^)	2021-02-14 11:47:14 +01:00
Ben Wiederhake	c0692f1f95	Kernel: Avoid magic number in sys$poll	2021-02-14 10:57:33 +01:00
Andreas Kling	cc341c95aa	Kernel: Panic on sys$get_stack_bounds() in stack-less process	2021-02-14 10:51:18 +01:00
Andreas Kling	781d29a337	Kernel+Userland: Give sys$recvfd() an options argument for O_CLOEXEC @bugaevc pointed out that we shouldn't be setting this flag in userspace, and he's right of course.	2021-02-14 10:39:48 +01:00
Andreas Kling	09b1b09c19	Kernel: Assert if rounding-up-to-page-size would wrap around to 0 If we try to align a number above 0xfffff000 to the next multiple of the page size (4 KiB), it would wrap around to 0. This is most likely never what we want, so let's assert if that happens.	2021-02-14 10:01:50 +01:00
Andreas Kling	1593219a41	Kernel: Map signal trampoline into each process's address space The signal trampoline was previously in kernelspace memory, but with a special exception to make it user-accessible. This patch moves it into each process's regular address space so we can stop supporting user-allowed memory above 0xc0000000.	2021-02-14 01:33:17 +01:00
Andreas Kling	ffdfbf1dba	Kernel: Fix wrong sizeof() type in sys$execve() argument overflow check	2021-02-14 00:15:01 +01:00
Andreas Kling	c877612211	Kernel: Round down base of partial ranges provided to munmap/mprotect We were failing to round down the base of partial VM ranges. This led to split regions being constructed that could have a non-page-aligned base address. This would then trip assertions in the VM code. Found by fuzz-syscalls. :^)	2021-02-13 01:49:44 +01:00
Andreas Kling	62f0f73bf0	Kernel: Limit the number of file descriptors sys$poll() can handle Just slap an arbitrary limit on there so we don't panic if somebody asks us to poll 1 fajillion fds. Found by fuzz-syscalls. :^)	2021-02-13 01:18:03 +01:00
Andreas Kling	7551090056	Kernel: Round up ranges to page size multiples in munmap and mprotect This prevents passing bad inputs to RangeAllocator who then asserts. Found by fuzz-syscalls. :^)	2021-02-13 01:18:03 +01:00
Ben Wiederhake	546cdde776	Kernel: clock_nanosleep's 'flags' is not a bitset This had the interesting effect that most, but not all, non-zero values were interpreted as an absolute value.	2021-02-13 00:40:31 +01:00
Ben Wiederhake	e1db8094b6	Kernel: Avoid casting arbitrary user-controlled int to enum This caused a load-invalid-value warning by KUBSan. Found by fuzz-syscalls. Can be reproduced by running this in the Shell: $ syscall waitid [ 1234 ]	2021-02-13 00:40:31 +01:00
Ben Wiederhake	c6027ed7cc	Kernel: Refuse excessively long iovec list If a program attempts to write from more than a million different locations, there is likely shenaniganery afoot! Refuse to write to prevent kmem exhaustion. Found by fuzz-syscalls. Can be reproduced by running this in the Shell: $ syscall writev 1 [ 0 ] 0x08000000	2021-02-13 00:40:31 +01:00
Ben Wiederhake	987b7f7917	Kernel: Forbid empty and whitespace-only process names Those only exist to confuse the user anyway. Found while using fuzz-syscalls.	2021-02-13 00:40:31 +01:00
Ben Wiederhake	1e630fb78a	Kernel: Avoid creating unkillable processes Found by fuzz-syscalls. Can be reproduced by running this in the Shell: $ syscall exit_thread This leaves the process in the 'Dying' state but never actually removes it. Therefore, avoid this scenario by pretending to exit the entire process.	2021-02-13 00:40:31 +01:00
Andreas Kling	1ef43ec89a	Kernel: Move get_interpreter_load_offset() out of Process class This is only used inside the sys$execve() implementation so just make it a execve.cpp local function.	2021-02-12 16:30:29 +01:00
Andreas Kling	1f277f0bd9	Kernel: Convert all *Builder::appendf() => appendff()	2021-02-09 19:18:13 +01:00
Andreas Kling	4ff0f971f7	Kernel: Prevent execve/ptrace race Add a per-process ptrace lock and use it to prevent ptrace access to a process after it decides to commit to a new executable in sys$execve(). Fixes #5230.	2021-02-08 23:05:41 +01:00
Andreas Kling	4b7b92c201	Kernel: Remove two unused fields from sys$execve's LoadResult	2021-02-08 22:31:03 +01:00
Andreas Kling	0d7af498d7	Kernel: Move ShouldAllocateTls enum from Process to execve.cpp	2021-02-08 22:24:37 +01:00
Andreas Kling	b1c9f93fa3	Kernel: Skip generic region lookup in sys$futex and sys$get_stack_bounds Just ask the process space directly instead of using the generic region lookup that also checks for kernel regions.	2021-02-08 22:23:29 +01:00
Andreas Kling	f39c2b653e	Kernel: Reorganize ptrace implementation a bit The generic parts of ptrace now live in Kernel/Syscalls/ptrace.cpp and the i386 specific parts are moved to Arch/i386/CPU.cpp	2021-02-08 19:34:41 +01:00
Andreas Kling	45231051e6	Kernel: Set the dumpable flag before switching spaces in sys$execve()	2021-02-08 19:15:42 +01:00
Andreas Kling	d746639171	Kernel: Remove outdated code to dump memory layout after exec load	2021-02-08 19:07:29 +01:00
Andreas Kling	f1b5def8fd	Kernel: Factor address space management out of the Process class This patch adds Space, a class representing a process's address space. - Each Process has a Space. - The Space owns the PageDirectory and all Regions in the Process. This allows us to reorganize sys$execve() so that it constructs and populates a new Space fully before committing to it. Previously, we would construct the new address space while still running in the old one, and encountering an error meant we had to do tedious and error-prone rollback. Those problems are now gone, replaced by what's hopefully a set of much smaller problems and missing cleanups. :^)	2021-02-08 18:27:28 +01:00
AnotherTest	09a43969ba	Everywhere: Replace dbgln<flag>(...) with dbgln_if(flag, ...) Replacement made by `find Kernel Userland -name '.h' -o -name '.cpp' \| sed -i -Ee 's/dbgln\b<(\w+)>\(/dbgln_if(\1, /g'`	2021-02-08 18:08:55 +01:00
Andreas Kling	b466ede1ea	Kernel: Make sure we can allocate kernel stack before creating thread Wrap thread creation in a Thread::try_create() helper that first allocates a kernel stack region. If that allocation fails, we propagate an ENOMEM error to the caller. This avoids the situation where a thread is half-constructed, without a valid kernel stack, and avoids having to do messy cleanup in that case.	2021-02-07 19:27:00 +01:00
Andreas Kling	d4dd4a82bb	Kernel: Don't allow sys$msyscall() on non-mmap regions	2021-02-02 20:16:13 +01:00
Andreas Kling	823186031d	Kernel: Add a way to specify which memory regions can make syscalls This patch adds sys$msyscall() which is loosely based on an OpenBSD mechanism for preventing syscalls from non-blessed memory regions. It works similarly to pledge and unveil, you can call it as many times as you like, and when you're finished, you call it with a null pointer and it will stop accepting new regions from then on. If a syscall later happens and doesn't originate from one of the previously blessed regions, the kernel will simply crash the process.	2021-02-02 20:13:44 +01:00
Ben Wiederhake	cbee0c26e1	Kernel+keymap+KeyboardMapper: New pledge for getkeymap	2021-02-01 09:54:32 +01:00
Ben Wiederhake	a2c21a55e1	Kernel+LibKeyboard: Enable querying the current keymap	2021-02-01 09:54:32 +01:00
Andreas Kling	6e4e3a7612	Kernel: Remove pledge exception for sys$getsockopt() with SO_PEERCRED We had an exception that allowed SOL_SOCKET + SO_PEERCRED on local socket to support LibIPC's PID exchange mechanism. This is no longer needed so let's just remove the exception.	2021-01-31 09:29:27 +01:00
Andreas Kling	4d777a9bf4	Kernel: Allow changing thread names with the "stdio" promise It's useful for programs to change their thread names to say something interesting about what they are working on. Let's not require "thread" for this since single-threaded programs may want to do it without pledging "thread".	2021-01-30 23:38:57 +01:00
Andreas Kling	90343eeaeb	Revert "Kernel: Return -ENOTDIR for non-directory mount target" This reverts commit `b7b09470ca`. Mounting a file on top of a file is a valid thing we support.	2021-01-30 13:52:12 +01:00
Andreas Kling	123c37e1c0	Kernel: Fix mix-up between MAP_STACK/MAP_ANONYMOUS in prot validation	2021-01-30 10:30:17 +01:00
Andreas Kling	e55ef70e5e	Kernel: Remove "has made executable exception for dynamic loader" flag As Idan pointed out, this flag is actually not needed, since we don't allow transitioning from previously-executable to writable anyway.	2021-01-30 10:06:52 +01:00
Andreas Kling	d0c5979d96	Kernel: Add "prot_exec" pledge promise and require it for PROT_EXEC This prevents sys$mmap() and sys$mprotect() from creating executable memory mappings in pledged programs that don't have this promise. Note that the dynamic loader runs before pledging happens, so it's unaffected by this.	2021-01-29 18:56:34 +01:00
Andreas Kling	51df44534b	Kernel: Disallow mapping anonymous memory as executable This adds another layer of defense against introducing new code into a running process. The only permitted way of doing so is by mmapping an open file with PROT_READ \| PROT_EXEC. This does make any future JIT implementations slightly more complicated but I think it's a worthwhile trade-off at this point. :^)	2021-01-29 14:52:34 +01:00
Andreas Kling	af3d3c5c4a	Kernel: Enforce W^X more strictly (like PaX MPROTECT) This patch adds enforcement of two new rules: - Memory that was previously writable cannot become executable - Memory that was previously executable cannot become writable Unfortunately we have to make an exception for text relocations in the dynamic loader. Since those necessitate writing into a private copy of library code, we allow programs to transition from RW to RX under very specific conditions. See the implementation of sys$mprotect()'s should_make_executable_exception_for_dynamic_loader() for details.	2021-01-29 14:52:27 +01:00
Linus Groh	dbbc378fb2	Kernel: Return -ENOTBLK for non-block device Ext2FS mount source When mounting an Ext2FS, a block device source is required. All other filesystem types are unaffected, as most of them ignore the source file descriptor anyway. Fixes #5153.	2021-01-29 08:45:56 +01:00
Linus Groh	b7b09470ca	Kernel: Return -ENOTDIR for non-directory mount target The absence of this check allowed silly things like this: # touch file # mount /dev/hda file	2021-01-29 08:45:56 +01:00
Sahan Fernando	6876b9a514	Kernel: Prevent mmap-ing as both fixed and randomized	2021-01-29 07:45:00 +01:00
Jorropo	22b0ff05d4	Kernel: sys$mmap PAGE_ROUND_UP size before calling allocate_randomized (#5154 ) `allocate_randomized` assert an already sanitized size but `mmap` were just forwarding whatever the process asked so it was possible to trigger a kernel panic from an unpriviliged process just by asking some randomly placed memory and a size non alligned with the page size. This fixes this issue by rounding up to the next page size before calling `allocate_randomized`. Fixes #5149	2021-01-28 22:36:20 +01:00
Andreas Kling	b6937e2560	Kernel+LibC: Add MAP_RANDOMIZED flag for sys$mmap() This can be used to request random VM placement instead of the highly predictable regular mmap(nullptr, ...) VM allocation strategy. It will soon be used to implement ASLR in the dynamic loader. :^)	2021-01-28 16:23:38 +01:00
Andreas Kling	e67402c702	Kernel: Remove Range "valid" state and use Optional<Range> instead It's easier to understand VM ranges if they are always valid. We can simply use an empty Optional<Range> to encode absence when needed.	2021-01-27 21:14:42 +01:00
Andreas Kling	5ab27e4bdc	Kernel: sys$mmap() without MAP_FIXED should consider address a hint If we can't use that specific address, it's still okay to put it anywhere else in VM.	2021-01-27 21:14:42 +01:00
asynts	7cf0c7cc0d	Meta: Split debug defines into multiple headers. The following script was used to make these changes: #!/bin/bash set -e tmp=$(mktemp -d) echo "tmp=$tmp" find Kernel $ -name '.cpp' -o -name '.h' $ \| sort > $tmp/Kernel.files find . $ -path ./Toolchain -prune -o -path ./Build -prune -o -path ./Kernel -prune $ -o $ -name '.cpp' -o -name '.h' $ -print \| sort > $tmp/EverythingExceptKernel.files cat $tmp/Kernel.files \| xargs grep -Eho '[A-Z0-9_]+_DEBUG' \| sort \| uniq > $tmp/Kernel.macros cat $tmp/EverythingExceptKernel.files \| xargs grep -Eho '[A-Z0-9_]+_DEBUG' \| sort \| uniq > $tmp/EverythingExceptKernel.macros comm -23 $tmp/Kernel.macros $tmp/EverythingExceptKernel.macros > $tmp/Kernel.unique comm -1 $tmp/Kernel.macros $tmp/EverythingExceptKernel.macros > $tmp/EverythingExceptKernel.unique cat $tmp/Kernel.unique \| awk '{ print "#cmakedefine01 "$1 }' > $tmp/Kernel.header cat $tmp/EverythingExceptKernel.unique \| awk '{ print "#cmakedefine01 "$1 }' > $tmp/EverythingExceptKernel.header for macro in $(cat $tmp/Kernel.unique) do cat $tmp/Kernel.files \| xargs grep -l $macro >> $tmp/Kernel.new-includes \|\|: done cat $tmp/Kernel.new-includes \| sort > $tmp/Kernel.new-includes.sorted for macro in $(cat $tmp/EverythingExceptKernel.unique) do cat $tmp/Kernel.files \| xargs grep -l $macro >> $tmp/Kernel.old-includes \|\|: done cat $tmp/Kernel.old-includes \| sort > $tmp/Kernel.old-includes.sorted comm -23 $tmp/Kernel.new-includes.sorted $tmp/Kernel.old-includes.sorted > $tmp/Kernel.includes.new comm -13 $tmp/Kernel.new-includes.sorted $tmp/Kernel.old-includes.sorted > $tmp/Kernel.includes.old comm -12 $tmp/Kernel.new-includes.sorted $tmp/Kernel.old-includes.sorted > $tmp/Kernel.includes.mixed for file in $(cat $tmp/Kernel.includes.new) do sed -i -E 's/#include <AK\/Debug\.h>/#include <Kernel\/Debug\.h>/' $file done for file in $(cat $tmp/Kernel.includes.mixed) do echo "mixed include in $file, requires manual editing." done	2021-01-26 21:20:00 +01:00
Linus Groh	e7183cc762	Kernel: Don't drop pledge()'d promises/execpromises when passing nullptr When passing nullptr for either promises or execpromises to pledge(), the expected behaviour is to not change their current value at all - we were accidentally resetting them to 0, effectively dropping previously pledge()'d promises.	2021-01-26 18:18:01 +01:00
Andreas Kling	c7858622ec	Kernel: Update process promise states on execve() and fork() We now move the execpromises state into the regular promises, and clear the execpromises state. Also make sure to duplicate the promise state on fork. This fixes an issue where "su" would launch a shell which immediately crashed due to not having pledged "stdio".	2021-01-26 15:26:37 +01:00
Andreas Kling	1e25d2b734	Kernel: Remove allocate_region() functions that don't take a Range Let's force callers to provide a VM range when allocating a region. This makes ENOMEM error handling more visible and removes implicit VM allocation which felt a bit magical.	2021-01-26 14:13:57 +01:00
Linus Groh	629180b7d8	Kernel: Support pledge() with empty promises This tells the kernel that the process wants to use pledge, but without pledging anything - effectively restricting it to syscalls that don't require a certain promise. This is part of OpenBSD's pledge() as well, which served as basis for Serenity's.	2021-01-25 23:22:21 +01:00
Andreas Kling	ab14b0ac64	Kernel: Hoist VM range allocation up to sys$mmap() itself Instead of letting each File subclass do range allocation in their mmap() override, do it up front in sys$mmap(). This makes us honor alignment requests for file-backed memory mappings and simplifies the code somwhat.	2021-01-25 18:57:06 +01:00
asynts	eea72b9b5c	Everywhere: Hook up remaining debug macros to Debug.h.	2021-01-25 09:47:36 +01:00
asynts	8465683dcf	Everywhere: Debug macros instead of constexpr. This was done with the following script: find . $ -name '.cpp' -o -name '.h' -o -name '.in' $ -not -path './Toolchain/' -not -path './Build/' -exec sed -i -E 's/dbgln<debug_([a-z_]+)>/dbgln<\U\1_DEBUG>/' {} \; find . $ -name '.cpp' -o -name '.h' -o -name '.in' $ -not -path './Toolchain/' -not -path './Build/' -exec sed -i -E 's/if constexpr \(debug_([a-z0-9_]+)/if constexpr \(\U\1_DEBUG/' {} \;	2021-01-25 09:47:36 +01:00
asynts	acdcf59a33	Everywhere: Remove unnecessary debug comments. It would be tempting to uncomment these statements, but that won't work with the new changes. This was done with the following commands: find . $ -name '.cpp' -o -name '.h' -o -name '.in' $ -not -path './Toolchain/' -not -path './Build/' -exec awk -i inplace '$0 !~ /\/\/#define/ { if (!toggle) { print; } else { toggle = !toggle } } ; $0 ~/\/\/#define/ { toggle = 1 }' {} \; find . $ -name '.cpp' -o -name '.h' -o -name '.in' $ -not -path './Toolchain/' -not -path './Build/' -exec awk -i inplace '$0 !~ /\/\/ #define/ { if (!toggle) { print; } else { toggle = !toggle } } ; $0 ~/\/\/ #define/ { toggle = 1 }' {} \;	2021-01-25 09:47:36 +01:00
asynts	1a3a0836c0	Everywhere: Use CMake to generate AK/Debug.h. This was done with the help of several scripts, I dump them here to easily find them later: awk '/#ifdef/ { print "#cmakedefine01 "$2 }' AK/Debug.h.in for debug_macro in $(awk '/#ifdef/ { print $2 }' AK/Debug.h.in) do find . $ -name '.cpp' -o -name '.h' -o -name '.in' $ -not -path './Toolchain/' -not -path './Build/*' -exec sed -i -E 's/#ifdef '$debug_macro'/#if '$debug_macro'/' {} \; done # Remember to remove WRAPPER_GERNERATOR_DEBUG from the list. awk '/#cmake/ { print "set("$2" ON)" }' AK/Debug.h.in	2021-01-25 09:47:36 +01:00
Andreas Kling	f5d916a881	Kernel: Make sys$anon_create() fail if size == 0 An empty anonymous file is useless since it cannot be resized anyway, so let's not support creating it.	2021-01-25 09:36:42 +01:00
Luke	50a2cb38e5	Kernel: Fix two error codes being returned as positive in Process::exec This made the assertion on line 921 think it was a successful exec, when it wasn't. Fixes #5084	2021-01-24 01:06:24 +01:00
asynts	1c1e577a5e	Everywhere: Deprecate dbg().	2021-01-23 16:46:26 +01:00
Andreas Kling	c32176db27	Kernel: Don't preserve set-uid bit in open() and bind() modes For some reason we were keeping the bits 04777 in file modes. That doesn't seem right and I can't think of a reason why the set-uid bit should be allowed to slip through.	2021-01-23 16:45:05 +01:00
Andreas Kling	54f421e170	Kernel: Clear coredump metadata on exec() If for some reason a process wants to exec after saving some coredump metadata, we should just throw away the data.	2021-01-23 09:41:11 +01:00
asynts	7b0a1a98d9	Everywhere: Replace a bundle of dbg with dbgln. These changes are arbitrarily divided into multiple commits to make it easier to find potentially introduced bugs with git bisect.	2021-01-22 22:14:30 +01:00
asynts	a348ab55b0	Everywhere: Replace a bundle of dbg with dbgln. These changes are arbitrarily divided into multiple commits to make it easier to find potentially introduced bugs with git bisect.	2021-01-22 22:14:30 +01:00
Andreas Kling	2cd07c6212	Kernel+Userland: Remove "dns" pledge promise alias This was just an alias for "unix" that I added early on back when there was some belief that we might be compatible with OpenBSD. We're clearly never going to be compatible with their pledges so just drop the alias.	2021-01-22 19:39:44 +01:00
Andreas Kling	19d3f8cab7	Kernel+LibC: Turn errno codes into a strongly typed enum ..and allow implicit creation of KResult and KResultOr from ErrnoCode. This means that kernel functions that return those types can finally do "return EINVAL;" and it will just work. There's a handful of functions that still deal with signed integers that should be converted to return KResults.	2021-01-20 23:20:02 +01:00
Linus Groh	2cc3d68615	Kernel+LibC: Add _SC_TTY_NAME_MAX	2021-01-18 22:28:56 +01:00
Tom	1d621ab172	Kernel: Some futex improvements This adds support for FUTEX_WAKE_OP, FUTEX_WAIT_BITSET, FUTEX_WAKE_BITSET, FUTEX_REQUEUE, and FUTEX_CMP_REQUEUE, as well well as global and private futex and absolute/relative timeouts against the appropriate clock. This also changes the implementation so that kernel resources are only used when a thread is blocked on a futex. Global futexes are implemented as offsets in VMObjects, so that different processes can share a futex against the same VMObject despite potentially being mapped at different virtual addresses.	2021-01-17 20:30:31 +01:00
Andreas Kling	992f513ad2	Kernel: Limit exec arguments and environment to 1/8th of stack each This sort-of matches what some other systems do and seems like a generally sane thing to do instead of allowing programs to spawn a child with a nearly full stack.	2021-01-17 18:29:56 +01:00
Andreas Kling	1730c23775	Kernel: Remove a bunch of no-longer-necessary SmapDisablers We forgot to remove the automatic SMAP disablers after fixing up all this code to not access userspace memory directly. Let's lock things down at last. :^)	2021-01-17 15:03:07 +01:00
Andreas Kling	bf0719092f	Kernel+Userland: Remove shared buffers (shbufs) All users of this mechanism have been switched to anonymous files and passing file descriptors with sendfd()/recvfd(). Shbufs got us where we are today, but it's time we say good-bye to them and welcome a much more idiomatic replacement. :^)	2021-01-17 09:07:32 +01:00
Andreas Kling	05dbfe9ab6	Kernel: Remove sys$shbuf_seal() and userland wrappers There are no remaining users of this syscall so let it go. :^)	2021-01-17 00:18:01 +01:00
Andreas Kling	b818cf898e	Kernel+Userland: Remove sys$shbuf_allow_all() and userland wrappers Nobody is using globally shared shbufs anymore, so let's remove them.	2021-01-16 22:43:03 +01:00
Ben Wiederhake	ea5825f2c9	Kernel+LibC: Make sys$getcwd truncate the result silently This gives us the superpower of knowing the ideal buffer length if it fails. See also https://github.com/SerenityOS/serenity/discussions/4357	2021-01-16 22:40:53 +01:00
Ben Wiederhake	68416d7293	Kernel: Make realpath return silently truncated data For context, see https://github.com/SerenityOS/serenity/discussions/4357	2021-01-16 22:40:53 +01:00
Brendan Coles	1fa9d9dd68	Kernel: execve: find_elf_interpreter_for_executable: Fix dbgln	2021-01-16 22:36:46 +01:00
Andreas Kling	01c2480eb3	Kernel+LibC+WindowServer: Remove unused thread/process boost mechanism The priority boosting mechanism has been broken for a very long time. Let's remove it from the codebase and we can bring it back the day someone feels like implementing it in a working way. :^)	2021-01-16 14:52:04 +01:00
Andreas Kling	43109f9614	Kernel: Remove unused syscall sys$minherit() This is no longer used. We can bring it back the day we need it.	2021-01-16 14:52:04 +01:00
Andreas Kling	de31e82f97	Kernel: Remove sys$shbuf_set_volatile() and userland wrappers There are no remaining users of this syscall so let's remove it! :^)	2021-01-16 14:52:04 +01:00
Linus Groh	1ccc2e6482	Kernel: Store process arguments and environment in coredumps Currently they're only pushed onto the stack but not easily accessible from the Process class, so this adds a Vector<String> for both.	2021-01-15 23:26:47 +01:00
Andreas Kling	64b0d89335	Kernel: Make Process::allocate_region() return KResultOr<Region> This allows region allocation to return specific errors and we don't have to assume every failure is an ENOMEM.	2021-01-15 19:10:30 +01:00
Andreas Kling	7899e14e72	Kernel: Make sys$anon_create() require the "stdio" promise if pledged	2021-01-15 19:10:30 +01:00
Andreas Kling	a525d0271c	Kernel: Fix bogus negation of alloc_fd() error in sys$anon_create() Thanks to Idan for spotting this!	2021-01-15 15:13:48 +01:00
Andreas Kling	fb4993f067	Kernel: Add anonymous files, created with sys$anon_create() This patch adds a new AnonymousFile class which is a File backed by an AnonymousVMObject that can only be mmap'ed and nothing else, really. I'm hoping that this can become a replacement for shbufs. :^)	2021-01-15 13:56:47 +01:00
Andreas Kling	4fa8435310	Kernel: Use current process EUID in doing profiling access control	2021-01-12 23:34:01 +01:00
Lenny Maiorani	e6f907a155	AK: Simplify constructors and conversions from nullptr_t Problem: - Many constructors are defined as `{}` rather than using the ` = default` compiler-provided constructor. - Some types provide an implicit conversion operator from `nullptr_t` instead of requiring the caller to default construct. This violates the C++ Core Guidelines suggestion to declare single-argument constructors explicit (https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#c46-by-default-declare-single-argument-constructors-explicit). Solution: - Change default constructors to use the compiler-provided default constructor. - Remove implicit conversion operators from `nullptr_t` and change usage to enforce type consistency without conversion.	2021-01-12 09:11:45 +01:00
Andreas Kling	f03800cee3	Kernel: Add dedicated "ptrace" pledge promise The vast majority of programs don't ever need to use sys$ptrace(), and it seems like a high-value system call to prevent a compromised process from using. This patch moves sys$ptrace() from the "proc" promise to its own, new "ptrace" promise and updates the affected apps.	2021-01-11 22:32:59 +01:00
asynts	723effd051	Everywhere: Replace a bundle of dbg with dbgln. These changes are arbitrarily divided into multiple commits to make it easier to find potentially introduced bugs with git bisect.Everything:	2021-01-11 11:55:47 +01:00
asynts	5931758dbc	Everywhere: Replace a bundle of dbg with dbgln. These changes are arbitrarily divided into multiple commits to make it easier to find potentially introduced bugs with git bisect.Everything:	2021-01-11 11:55:47 +01:00
Andreas Kling	5dafb72370	Kernel+Profiler: Make profiling per-process and without core dumps This patch merges the profiling functionality in the kernel with the performance events mechanism. A profiler sample is now just another perf event, rather than a dedicated thing. Since perf events were already per-process, this now makes profiling per-process as well. Processes with perf events would already write out a perfcore.PID file to the current directory on death, but since we may want to profile a process and then let it continue running, recorded perf events can now be accessed at any time via /proc/PID/perf_events. This patch also adds information about process memory regions to the perfcore JSON format. This removes the need to supply a core dump to the Profiler app for symbolication, and so the "profiler coredump" mechanism is removed entirely. There's still a hard limit of 4MB worth of perf events per process, so this is by no means a perfect final design, but it's a nice step forward for both simplicity and stability. Fixes #4848 Fixes #4849	2021-01-11 11:36:00 +01:00
Itamar	f259d96871	Kernel: Avoid collision between dynamic loader and main program When loading non position-independent programs, we now take care not to load the dynamic loader at an address that collides with the location the main program wants to load at. Fixes #4847.	2021-01-10 22:04:43 +01:00
Itamar	40a8159c62	Kernel: Plumb the elf header of the main program down to Process::load This will enable us to take the desired load address of non-position independent programs into account when randomizing the load address of the dynamic loader.	2021-01-10 22:04:43 +01:00
asynts	938e5c7719	Everywhere: Replace a bundle of dbg with dbgln. These changes are arbitrarily divided into multiple commits to make it easier to find potentially introduced bugs with git bisect.Everything: The modifications in this commit were automatically made using the following command: find . -name '.cpp' -exec sed -i -E 's/dbg << ("[^"{]");/dbgln$\1$;/' {} \;	2021-01-09 21:11:09 +01:00
Andreas Kling	8ff0afd829	Kernel: Defer switching the paging scope in ptrace(PT_POKE) a little If we can fail with EFAULT early, might as well avoid switching the paging scope.	2021-01-09 15:42:03 +01:00
Davide Carella	ca9e0a70f5	Syscall: Changed 'setkeymap' to take also the Shift+AltGr map.	2021-01-06 09:32:08 +01:00
Andreas Kling	d991658794	Kernel+LibC: Tidy up assertion failures with a dedicated syscall This patch adds sys$abort() which immediately crashes the process with SIGABRT. This makes assertion backtraces a lot nicer by removing all the gunk that otherwise happens between __assertion_failed() and actually crashing from the SIGABRT.	2021-01-04 21:57:30 +01:00
Tom	f98ca35b83	Kernel: Improve ProcFS behavior in low memory conditions When ProcFS could no longer allocate KBuffer objects to serve calls to read, it would just return 0, indicating EOF. This then triggered parsing errors because code assumed it read the file. Because read isn't supposed to return ENOMEM, change ProcFS to populate the file data upon file open or seek to the beginning. This also means that calls to open can now return ENOMEM if needed. This allows the caller to either be able to successfully open the file and read it, or fail to open it in the first place.	2021-01-03 22:12:19 +01:00
William Marlow	747e8de96a	Kernel+Loader.so: Allow dynamic executables without an interpreter Commit `a3a9016701` removed the PT_INTERP header from Loader.so which cleaned up some kernel code in execve. Unfortunately it prevents Loader.so from being run as an executable	2021-01-03 19:45:16 +01:00
Tom	e3190bd144	Revert "Kernel: Allocate shared memory regions immediately" This reverts commit `fe6b3f99d1`.	2021-01-02 20:56:35 +01:00
Andreas Kling	fe6b3f99d1	Kernel: Allocate shared memory regions immediately Lazily committed shared memory was not working in situations where one process would write to the memory and another would only read from it. Since the reading process would never cause a write fault in the shared region, we'd never notice that the writing process had added real physical pages to the VMObject. This happened because the lazily committed pages were marked "present" in the page table. This patch solves the issue by always allocating shared memory up front and not trying to be clever about it.	2021-01-02 16:57:31 +01:00
Andreas Kling	5dae85afe7	Kernel: Pass "shared" flag to Region constructor Before this change, we would sometimes map a region into the address space with !is_shared(), and then moments later call set_shared(true). I found this very confusing while debugging, so this patch makes us pass the initial shared flag to the Region constructor, ensuring that it's in the correct state by the time we first map the region.	2021-01-02 16:57:31 +01:00
Andreas Kling	9ec9d20e84	Kernel: Fix bad VMObject iteration in sys$purge() We were fooling ourselves into thinking all VMObjects are anonymous and then tried to call purge() on them as if they were.	2021-01-02 13:34:29 +01:00
Tom	e87eaf5df0	Kernel: Fix memory corruption when rolling back regions in execve We need to free the regions before reverting the paging scope to the original one when rolling back changes due to an error. This fixes silent memory corruption.	2021-01-01 23:43:44 +01:00
Tom	2f429bd2d5	Kernel: Pass new region owner to Region::clone	2021-01-01 23:43:44 +01:00
Tom	bf9be3ec01	Kernel: More gracefully handle out-of-memory when creating PageDirectory	2021-01-01 23:43:44 +01:00
Tom	476f17b3f1	Kernel: Merge PurgeableVMObject into AnonymousVMObject This implements memory commitments and lazy-allocation of committed memory.	2021-01-01 23:43:44 +01:00
Tom	b2a52f6208	Kernel: Implement lazy committed page allocation By designating a committed page pool we can guarantee to have physical pages available for lazy allocation in mappings. However, when forking we will overcommit. The assumption is that worst-case it's better for the fork to die due to insufficient physical memory on COW access than the parent that created the region. If a fork wants to ensure that all memory is available (trigger a commit) then it can use madvise. This also means that fork now can gracefully fail if we don't have enough physical pages available.	2021-01-01 23:43:44 +01:00
Tom	e21cc4cff6	Kernel: Remove MAP_PURGEABLE from mmap This brings mmap more in line with other operating systems. Prior to this, it was impossible to request memory that was definitely committed, instead MAP_PURGEABLE would provide a region that was not actually purgeable, but also not fully committed, which meant that using such memory still could cause crashes when the underlying pages could no longer be allocated. This fixes some random crashes in low-memory situations where non-volatile memory is mapped (e.g. malloc, tls, Gfx::Bitmap, etc) but when a page in these regions is first accessed, there is insufficient physical memory available to commit a new page.	2021-01-01 23:43:44 +01:00
Tom	c3451899bc	Kernel: Add MAP_NORESERVE support to mmap Rather than lazily committing regions by default, we now commit the entire region unless MAP_NORESERVE is specified. This solves random crashes in low-memory situations where e.g. the malloc heap allocated memory, but using pages that haven't been used before triggers a crash when no more physical memory is available. Use this flag to create large regions without actually committing the backing memory. madvise() can be used to commit arbitrary areas of such regions after creating them.	2021-01-01 23:43:44 +01:00
Tom	bc5d6992a4	Kernel: Memory purging improvements This adds the ability for a Region to define volatile/nonvolatile areas within mapped memory using madvise(). This also means that memory purging takes into account all views of the PurgeableVMObject and only purges memory that is not needed by all of them. When calling madvise() to change an area to nonvolatile memory, return whether memory from that area was purged. At that time also try to remap all memory that is requested to be nonvolatile, and if insufficient pages are available notify the caller of that fact.	2021-01-01 23:43:44 +01:00
Andreas Kling	7c3b6b10e4	Kernel: Remove the limited use of AK::TypeTraits we had in the kernel This was only used for VMObject and we can do without it there. This is preparation for migrating to dynamic_cast-based helpers in userspace.	2021-01-01 15:32:44 +01:00
Andrew Kaster	a3a9016701	DynamicLoader: Tell the linker to not add a PT_INTERP header Use the GNU LD option --no-dynamic-linker. This allows uncommenting some code in the Kernel that gets upset if your ELF interpreter has its own interpreter.	2021-01-01 02:12:28 +01:00
Linus Groh	91332515a6	Kernel: Add sys$set_coredump_metadata() syscall This can be used by applications to store information (key/value pairs) likely useful for debugging, which will then be embedded in the coredump.	2020-12-30 16:28:27 +01:00
Andreas Kling	af28a8ad11	Kernel: Hold InodeVMObject reference while inspecting it in sys$mmap()	2020-12-29 15:43:35 +01:00
Andreas Kling	30dbe9c78a	Kernel+LibC: Add a very limited sys$mremap() implementation This syscall can currently only remap a shared file-backed mapping into a private file-backed mapping.	2020-12-29 02:20:43 +01:00
Liav A	247517cd4a	Kernel: Introduce the DevFS The DevFS along with DevPtsFS give a complete solution for populating device nodes in /dev. The main purpose of DevFS is to eliminate the need of device nodes generation when building the system. Later on, DevFS will assist with exposing disk partition nodes.	2020-12-27 23:07:44 +01:00
Andreas Kling	0e2b7f9c9a	Kernel: Remove the per-process icon_id and sys$set_process_icon() This was a goofy kernel API where you could assign an icon_id (int) to a process which referred to a global shbuf with a 16x16 icon bitmap inside it. Instead of this, programs that want to display a process icon now retrieve it from the process executable instead.	2020-12-27 01:16:56 +01:00
AnotherTest	7b5aa06702	Kernel: Allow 'elevating' unveil permissions if implicitly inherited from '/' This can happen when an unveil follows another with a path that is a sub-path of the other one: ```c++ unveil("/home/anon/.config/whoa.ini", "rw"); unveil("/home/anon", "r"); // this would fail, as "/home/anon" inherits // the permissions of "/", which is None. ```	2020-12-26 16:10:04 +01:00
AnotherTest	a9184fcb76	Kernel: Implement unveil() as a prefix-tree Fixes #4530.	2020-12-26 11:54:54 +01:00
Andreas Kling	1cfdaf96c4	Kernel: Reset the process dumpable flag on successful non-setid exec Once we've committed to a new memory layout and non-setid credentials, we can reset the dumpable flag.	2020-12-26 01:31:24 +01:00
Andreas Kling	82f86e35d6	Kernel+LibC: Introduce a "dumpable" flag for processes This new flag controls two things: - Whether the kernel will generate core dumps for the process - Whether the EUID:EGID should own the process's files in /proc Processes are automatically made non-dumpable when their EUID or EGID is changed, either via syscalls that specifically modify those ID's, or via sys$execve(), when a set-uid or set-gid program is executed. A process can change its own dumpable flag at any time by calling the new sys$prctl(PR_SET_DUMPABLE) syscall. Fixes #4504.	2020-12-25 19:35:55 +01:00
Andreas Kling	ed5c26d698	AK: Remove custom %w format string specifier This was a non-standard specifier alias for %04x. This patch replaces all uses of it with new-style formatting functions instead.	2020-12-25 17:05:05 +01:00
Andreas Kling	89d3b09638	Kernel: Allocate new main thread stack before committing to exec If the allocation fails (e.g ENOMEM) we want to simply return an error from sys$execve() and continue executing the current executable. This patch also moves make_userspace_stack_for_main_thread() out of the Thread class since it had nothing in particular to do with Thread.	2020-12-25 16:22:01 +01:00
Andreas Kling	2f1712cc29	Kernel: Move ELF auxiliary vector building out of Process class Process had a couple of members whose only purpose was holding on to some temporary data while building the auxiliary vector. Remove those members and move the vector building to a free function in execve.cpp	2020-12-25 15:23:35 +01:00
Andreas Kling	40e9edd798	LibELF: Move AuxiliaryValue into the ELF namespace	2020-12-25 14:48:30 +01:00
Andreas Kling	6c9a6bea1e	Kernel+LibELF: Abort ELF executable load sooner when something fails Make it possible to bail out of ELF::Image::for_each_program_header() and then do exactly that if something goes wrong during executable loading in the kernel. Also make the errors we return slightly more nuanced than just ENOEXEC.	2020-12-25 14:42:42 +01:00
Andreas Kling	791b32e3c6	Kernel: Remove an unnecessary cast in sys$execve()	2020-12-25 14:16:35 +01:00
Andreas Kling	9c640e67ac	Kernel: Don't fetch full inode metadata in sys$execve() We only need the size, so let's not fetch all the metadata.	2020-12-25 14:15:33 +01:00
Andreas Kling	c3eddbcb49	Kernel: Add back missing ELF::Image validity check If the image is not a valid ELF we should just fail ASAP.	2020-12-25 14:13:44 +01:00
Andreas Kling	4986f268a5	Kernel: Convert dbg() => dbgln() in sys$execve()	2020-12-25 12:51:35 +01:00
Andreas Kling	09129782de	Kernel: Simplify ELF loading logic in sys$execve() somewhat Get rid of the lambda functions and put the logic inline in the program header traversal loop instead. This makes the code quite a bit shorter and hopefully makes it easier to see what's going on.	2020-12-25 02:33:57 +01:00
Andreas Kling	1e4c010643	LibELF: Remove ELF::Loader and move everyone to ELF::Image This commit gets rid of ELF::Loader entirely since its very ambiguous purpose was actually to load executables for the kernel, and that is now handled by the kernel itself. This patch includes some drive-by cleanup in LibDebug and CrashDaemon enabled by the fact that we no longer need to keep the ref-counted ELF::Loader around.	2020-12-25 02:14:56 +01:00
Andreas Kling	7551a66f73	Kernel+LibELF: Move sys$execve()'s loading logic from LibELF to Kernel It was really weird that ELF loading was performed by the ELF::Loader class instead of just being done by the kernel itself. This patch moves all the layout logic from ELF::Loader over to sys$execve(). The kernel no longer cares about ELF::Loader and instead only uses an ELF::Image as an interpreting wrapper around executables.	2020-12-25 01:22:55 +01:00
Itamar	0cb636078a	Kernel+LibELF: Allow Non ET_DYN executables to have an interpreter	2020-12-24 21:34:51 +01:00
Itamar	d64d0451e5	Kernel: Fix mmap with specific address for file backed mappings	2020-12-24 21:34:51 +01:00
Andreas Kling	1e21d49e86	Kernel: Fix wrong-looking overflow check in sys$execve() This was harmless since sizeof(length) and sizeof(strings) are both 4 on x86 but let's check the right things regardless.	2020-12-23 20:34:22 +01:00
Andreas Kling	6bfbc5f5f5	Kernel: Don't allow modifying IOPL via sys$ptrace() or sys$sigreturn() It was possible to overwrite the entire EFLAGS register since we didn't do any masking in the ptrace and sigreturn syscalls. This made it trivial to gain IO privileges by raising IOPL to 3 and then you could talk to hardware to do all kinds of nasty things. Thanks to @allesctf for finding these issues! :^) Their exploit/write-up: https://github.com/allesctf/writeups/blob/master/2020/hxpctf/wisdom2/writeup.md	2020-12-22 19:38:25 +01:00
Andreas Kling	2dfe5751f3	Kernel: Abort core dump generation if any substep fails And make an effort to propagate errors out from the inner parts. This fixes an issue where the kernel would infinitely loop in coredump generation if the TmpFS filled up.	2020-12-22 10:09:41 +01:00
Tom	5f51d85184	Kernel: Improve time keeping and dramatically reduce interrupt load This implements a number of changes related to time: * If a HPET is present, it is now used only as a system timer, unless the Local APIC timer is used (in which case the HPET timer will not trigger any interrupts at all). * If a HPET is present, the current time can now be as accurate as the chip can be, independently from the system timer. We now query the HPET main counter for the current time in CPU #0's system timer interrupt, and use that as a base line. If a high precision time is queried, that base line is used in combination with quering the HPET timer directly, which should give a much more accurate time stamp at the expense of more overhead. For faster time stamps, the more coarse value based on the last interrupt will be returned. This also means that any missed interrupts should not cause the time to drift. * The default system interrupt rate is reduced to about 250 per second. * Fix calculation of Thread CPU usage by using the amount of ticks they used rather than the number of times a context switch happened. * Implement CLOCK_REALTIME_COARSE and CLOCK_MONOTONIC_COARSE and use it for most cases where precise timestamps are not needed.	2020-12-21 18:26:12 +01:00
Lenny Maiorani	765936ebae	Everywhere: Switch from (void) to [[maybe_unused]] (#4473 ) Problem: - `(void)` simply casts the expression to void. This is understood to indicate that it is ignored, but this is really a compiler trick to get the compiler to not generate a warning. Solution: - Use the `[[maybe_unused]]` attribute to indicate the value is unused. Note: - Functions taking a `(void)` argument list have also been changed to `()` because this is not needed and shows up in the same grep command.	2020-12-21 00:09:48 +01:00
Andreas Kling	34e9df3c5e	Kernel: Randomize memory location of the dynamic loader :^) This should make it a little bit harder for those who would mess with our loader.	2020-12-20 18:49:24 +01:00
Andreas Kling	02ef3f6343	Kernel: Ptrace should not assert on poke in non-mapped tracee memory	2020-12-20 18:49:24 +01:00
Andreas Kling	9bf02c32c0	Kernel: Activate SUID/SGID credentials earlier in sys$execve() Switch on the new credentials before loading the new executable into memory. This ensures that attempts to ptrace() the program from an unprivileged process will fail. This covers one bug that was exploited in the 2020 HXP CTF: https://hxp.io/blog/79/hxp-CTF-2020-wisdom2/ Thanks to yyyyyyy for finding the bug! :^)	2020-12-20 18:49:18 +01:00
Andreas Kling	5505159a94	Kernel: Silence debug spam about select() being interrupted	2020-12-20 16:06:52 +01:00
Andreas Kling	e5eda151b4	Kernel: Silence debug spam when running dynamically linked programs	2020-12-20 16:06:39 +01:00
Andreas Kling	8e79bde2b7	Kernel: Move KBufferBuilder to the fallible KBuffer API KBufferBuilder::build() now returns an OwnPtr<KBuffer> and can fail. Clients of the API have been updated to handle that situation.	2020-12-18 19:22:26 +01:00
Tom	c4176b0da1	Kernel: Fix Lock race causing infinite spinning between two threads We need to account for how many shared lock instances the current thread owns, so that we can properly release such references when yielding execution. We also need to release the process lock when donating.	2020-12-16 23:38:17 +01:00
Andreas Kling	4befc2c282	Kernel: Avoid null dereference in sys$profiling_disable() If we can't create a profiling coredump object, we shouldn't try to call write() on it.	2020-12-15 11:25:51 +01:00
Andreas Kling	28c042e46f	Kernel: Make CoreDump::m_num_program_headers const This makes it an error to assign to it after construction.	2020-12-15 11:24:46 +01:00
Andreas Kling	ff8bf4db8d	Kernel: Don't take LexicalPath as argument LexicalPath is a big and heavy class that's really meant as a helper for extracting parts of a path, not for storage or passing around. Instead, pass paths around as strings and use LexicalPath locally as needed.	2020-12-15 11:17:01 +01:00
Itamar	1efbbf3ac3	Kernel: Don't generate a backtrace when a process exists with non-zero ..status	2020-12-14 23:05:53 +01:00
Itamar	5392f42731	Kernel: Generate coredumps for profiled processes These coredumps will be used by the Profile Viewer to symbolicate the profiling samples.	2020-12-14 23:05:53 +01:00
Itamar	39890af833	Kernel: Pass full path of output coredump file to CoreDump	2020-12-14 23:05:53 +01:00
Itamar	b4842d33bb	Kernel: Generate a coredump file when a process crashes When a process crashes, we generate a coredump file and write it in /tmp/coredumps/. The coredump file is an ELF file of type ET_CORE. It contains a segment for every userspace memory region of the process, and an additional PT_NOTE segment that contains the registers state for each thread, and a additional data about memory regions (e.g their name).	2020-12-14 23:05:53 +01:00
Itamar	efe4da57df	Loader: Stabilize loader & Use shared libraries everywhere :^) The dynamic loader is now stable enough to be used everywhere in the system - so this commit does just that. No More .a Files, Long Live .so's!	2020-12-14 23:05:53 +01:00
Itamar	9ca1a0731f	Kernel: Support TLS allocation from userspace This adds an allocate_tls syscall through which a userspace process can request the allocation of a TLS region with a given size. This will be used by the dynamic loader to allocate TLS for the main executable & its libraries.	2020-12-14 23:05:53 +01:00
Itamar	5b87904ab5	Kernel: Add ability to load interpreter instead of main program When the main executable needs an interpreter, we load the requested interpreter program, and pass to it an open file decsriptor to the main executable via the auxiliary vector. Note that we do not allocate a TLS region for the interpreter.	2020-12-14 23:05:53 +01:00
Tom	c455fc2030	Kernel: Change wait blocking to Process-only blocking This prevents zombies created by multi-threaded applications and brings our model back to closer to what other OSs do. This also means that SIGSTOP needs to halt all threads, and SIGCONT needs to resume those threads.	2020-12-12 21:28:12 +01:00
Tom	4bbee00650	Kernel: disown should unblock any potential waiters This is necessary because if a process changes the state to Stopped or resumes from that state, a wait entry is created in the parent process. So, if a child process does this before disown is called, we need to clear those entries to avoid leaking references/zombies that won't be cleaned up until the former parent exits. This also should solve an even more unlikely corner case where another thread is waiting on a pid that is being disowned by another thread.	2020-12-12 21:28:12 +01:00
Tom	da5cc34ebb	Kernel: Fix some issues related to fixes and block conditions Fix some problems with join blocks where the joining thread block condition was added twice, which lead to a crash when trying to unblock that condition a second time. Deferred block condition evaluation by File objects were also not properly keeping the File object alive, which lead to some random crashes and corruption problems. Other problems were caused by the fact that the Queued state didn't handle signals/interruptions consistently. To solve these issues we remove this state entirely, along with Thread::wait_on and change the WaitQueue into a BlockCondition instead. Also, deliver signals even if there isn't going to be a context switch to another thread. Fixes #4336 and #4330	2020-12-12 21:28:12 +01:00
Andreas Kling	97d789c75b	Kernel: Fix null dereference when execve'ing ELF without PT_TLS header Fixes #4387.	2020-12-11 22:59:46 +01:00
Tom	12cf6f8650	Kernel: Add CLOCK_REALTIME support to the TimerQueue This allows us to use blocking timeouts with either monotonic or real time for all blockers. Which means that clock_nanosleep() now also supports CLOCK_REALTIME. Also, switch alarm() to use CLOCK_REALTIME as per specification.	2020-12-02 13:02:04 +01:00
Tom	4c1e27ec65	Kernel: Use TimerQueue for SIGALRM	2020-12-02 13:02:04 +01:00
Andrew Kaster	3f808b0dda	LibELF+Kernel: Validate program headers in Image::parse This should catch more malformed ELF files earlier than simply checking the ELF header alone. Also change the API of validate_program_headers to take the interpreter_path by pointer. This makes it less awkward to call when we don't care about the interpreter, and just want the validation.	2020-12-01 09:58:21 +01:00
Tom	9e32d79e02	Kernel: Fix leaking a reference on thread creation New Thread objects should be adopted into a RefPtr upon creation. If creating a thread failed (e.g. out of memory), releasing the RefPtr will destruct the partially created object, but in the successful case the thread will add an additional reference that it keeps until it finishes execution. Adopting will drop it to 1 when returning from create_thread, or 0 if the thread could not be fully constructed.	2020-12-01 09:26:37 +01:00
Tom	046d6855f5	Kernel: Move block condition evaluation out of the Scheduler This makes the Scheduler a lot leaner by not having to evaluate block conditions every time it is invoked. Instead evaluate them as the states change, and unblock threads at that point. This also implements some more waitid/waitpid/wait features and behavior. For example, WUNTRACED and WNOWAIT are now supported. And wait will now not return EINTR when SIGCHLD is delivered at the same time.	2020-11-30 13:17:02 +01:00
Tom	6a620562cc	Kernel: Allow passing a thread argument for new kernel threads This adds the ability to pass a pointer to kernel thread/process. Also add the ability to use a closure as thread function, which allows passing information to a kernel thread more easily.	2020-11-30 13:17:02 +01:00
Tom	6cb640eeba	Kernel: Move some time related code from Scheduler into TimeManagement Use the TimerQueue to expire blocking operations, which is one less thing the Scheduler needs to check on every iteration. Also, add a BlockTimeout class that will automatically handle relative or absolute timeouts as well as overriding timeouts (e.g. socket timeouts) more consistently. Also, rework the TimerQueue class to be able to fire events from any processor, which requires Timer to be RefCounted. Also allow creating id-less timers for use by blocking operations.	2020-11-30 13:17:02 +01:00
Tom	68abd1cb29	Kernel: Fix SharedBuffer reference counting on fork We need to not only add a record for a reference, but we need to copy the reference count on fork as well, because the code in the fork assumes that it has the same amount of references, still. Also, once all references are dropped when a process is disowned, delete the shared buffer. Fixes #4076	2020-11-24 21:26:39 +01:00
Sergey Bugaev	098070b767	Kernel: Add unveil('b') This is a new "browse" permission that lets you open (and subsequently list contents of) directories underneath the path, but not regular files or any other types of files.	2020-11-23 18:37:40 +01:00
Andreas Kling	086522537e	Kernel: Don't leak ref on executable inode in sys$execve() We were leaking a ref on the executed inode in successful calls to sys$execve(). This meant that once a binary had ever been executed, it was impossible to remove it from the file system. The execve system call is particularly finicky since the function does not return normally on success, so extra care must be taken to ensure nothing is kept alive by stack variables. There is a big NOTE comment about this, and yet the bug still got in. It would be nice to enforce this, but I'm unsure how.	2020-11-23 16:08:42 +01:00
Tom	a89648e159	Kernel: Inherit shared buffers when forking We need to create a reference for the new PID for each shared buffer that the process had a reference to. If the process subsequently get replaced through exec, those references will be dropped again. But if exec for some reason fails then other code, such as global destructors could still expect having access to them. Fixes #4076	2020-11-23 09:39:32 +01:00
Andreas Kling	94ff04b536	Kernel: Make CLOCK_MONOTONIC respect the system tick frequency The time returned by sys$clock_gettime() was not aligned with the delay calculations in sys$clock_nanosleep(). This patch fixes that by taking the system's ticks_per_second value into account in both functions. This patch also removes the need for Thread::sleep_until() and uses Thread::sleep() for both absolute and relative sleeps. This was causing the nesalizer emulator port to sleep for a negative amount of time at the end of each frame, making it run way too fast.	2020-11-22 17:20:58 +01:00
Tom	75f61fe3d9	AK: Make RefPtr, NonnullRefPtr, WeakPtr thread safe This makes most operations thread safe, especially so that they can safely be used in the Kernel. This includes obtaining a strong reference from a weak reference, which now requires an explicit call to WeakPtr::strong_ref(). Another major change is that Weakable::make_weak_ref() may require the explicit target type. Previously we used reinterpret_cast in WeakPtr, assuming that it can be properly converted. But WeakPtr does not necessarily have the knowledge to be able to do this. Instead, we now ask the class itself to deliver a WeakPtr to the type that we want. Also, WeakLink is no longer specific to a target type. The reason for this is that we want to be able to safely convert e.g. WeakPtr<T> to WeakPtr<U>, and before this we just reinterpret_cast the internal WeakLink<T> to WeakLink<U>, which is a bold assumption that it would actually produce the correct code. Instead, WeakLink now operates on just a raw pointer and we only make those constructors/operators available if we can verify that it can be safely cast. In order to guarantee thread safety, we now use the least significant bit in the pointer for locking purposes. This also means that only properly aligned pointers can be used.	2020-11-10 19:11:52 +01:00
Nico Weber	323e727a4c	Kernel+LibC: Add adjtime(2) Most systems (Linux, OpenBSD) adjust 0.5 ms per second, or 0.5 us per 1 ms tick. That is, the clock is sped up or slowed down by at most 0.05%. This means adjusting the clock by 1 s takes 2000 s, and the clock an be adjusted by at most 1.8 s per hour. FreeBSD adjusts 5 ms per second if the remaining time adjustment is >= 1 s (0.5%) , else it adjusts by 0.5 ms as well. This allows adjusting by (almost) 18 s per hour. Since Serenity OS can lose more than 22 s per hour (#3429), this picks an adjustment rate up to 1% for now. This allows us to adjust up to 36s per hour, which should be sufficient to adjust the clock fast enough to keep up with how much time the clock currently loses. Once we have a fancier NTP implementation that can adjust tick rate in addition to offset, we can think about reducing this. adjtime is a bit old-school and most current POSIX-y OSs instead implement adjtimex/ntp_adjtime, but a) we have to start somewhere b) ntp_adjtime() is a fairly gnarly API. OpenBSD's adjfreq looks like it might provide similar functionality with a nicer API. But before worrying about all this, it's probably a good idea to get to a place where the kernel APIs are (barely) good enough so that we can write an ntp service, and once we have that we should write a way to automatically evaluate how well it keeps the time adjusted, and only then should we add improvements ot the adjustment mechanism.	2020-11-10 19:03:08 +01:00
Jesse Buhagiar	940380c986	Kernel: Prevent `unveil` returning ENOENT with cpath permissions This addresses the issue first enountered in #3644. If a path is first unveiled with "c" permissions, we should NOT return ENOENT if the node does not exist on the disk, as the program will most likely be creating it at a later time.	2020-11-10 09:53:18 +01:00
Tom	1e2e3eed62	Kernel: Fix a few deadlocks with Thread::m_lock and g_scheduler_lock g_scheduler_lock cannot safely be acquired after Thread::m_lock because another processor may already hold g_scheduler_lock and wait for the same Thread::m_lock.	2020-10-26 08:57:25 +01:00
Itamar	26b430bee7	Kernel: Fix sys$join_thread Previously, when we unblocked because the joinee has died, we didn't copy its exit value back to the user.	2020-10-16 11:42:20 +02:00
Andreas Kling	1d96ecf148	Everywhere: Add missing <AK/TemporaryChange.h> includes Don't rely on HashTable.h pulling this in.	2020-10-15 23:49:53 +02:00
Linus Groh	bcfc6f0c57	Everywhere: Fix more typos	2020-10-03 12:36:49 +02:00
Andreas Kling	b058852c62	Kernel: Fix overly eager fd closing in sys$execve() When obeying FD_CLOEXEC, we don't need to explicitly call close() on all the FileDescriptions. We can just clear them out from the process fd table. ~FileDescription() will call close() anyway. This fixes an issue where TelnetServer would shut down accepted sockets when exec'ing a shell for them. Since the parent process still has the socket open, we should not force-close it. Just let go.	2020-09-28 22:40:44 +02:00
Andreas Kling	0930e2323b	Kernel: Remove unnecessary capture in sys$execve()	2020-09-28 22:24:27 +02:00
Tom	838d9fa251	Kernel: Make Thread refcounted Similar to Process, we need to make Thread refcounted. This will solve problems that will appear once we schedule threads on more than one processor. This allows us to hold onto threads without necessarily holding the scheduler lock for the entire duration.	2020-09-27 19:46:04 +02:00
Luke	721788943d	Kernel: Implement _SC_OPEN_MAX	2020-09-27 01:02:11 +02:00
Tom	1727b2d7cd	Kernel: Fix thread joining issues The thread joining logic hadn't been updated to account for the subtle differences introduced by software context switching. This fixes several race conditions related to thread destruction and joining, as well as finalization which did not properly account for detached state and the fact that threads can be joined after termination as long as they're not detached. Fixes #3596	2020-09-26 13:03:13 +02:00
Ben Wiederhake	64cc3f51d0	Meta+Kernel: Make clang-format-10 clean	2020-09-25 21:18:17 +02:00
Nico Weber	47b3e98af8	Kernel+LibC+UserspaceEmulator: Add SO_TIMESTAMP, and cmsg definitions When SO_TIMESTAMP is set as an option on a SOCK_DGRAM socket, then recvmsg() will return a SCM_TIMESTAMP control message that contains a struct timeval with the system time that was current when the socket was received.	2020-09-17 17:23:01 +02:00
Nico Weber	416d470d07	Kernel: Plumb packet receive timestamp from NetworkAdapter to Socket::recvfrom Since the receiving socket isn't yet known at packet receive time, keep timestamps for all packets. This is useful for keeping statistics about in-kernel queue latencies in the future, and it can be used to implement SO_TIMESTAMP.	2020-09-17 17:23:01 +02:00
Nico Weber	b36a2d6686	Kernel+LibC+UserspaceEmulator: Mostly add recvmsg(), sendmsg() The implementation only supports a single iovec for now. Some might say having more than one iovec is the main point of recvmsg() and sendmsg(), but I'm interested in the control message bits.	2020-09-17 17:23:01 +02:00
Andreas Kling	219c0fbea9	Kernel: Unbreak sys$pledge() We were dropping all the incoming pledge promise strings and parsing "" instead. Fixes #3519.	2020-09-17 15:07:20 +02:00
Luke	68b361bd21	Kernel: Return ENOMEM in more places There are plenty of places in the kernel that aren't checking if they actually got their allocation. This fixes some of them, but definitely not all. Fixes #3390 Fixes #3391 Also, let's make find_one_free_page() return nullptr if it doesn't get a free index. This stops the kernel crashing when out of memory and allows memory purging to take place again. Fixes #3487	2020-09-16 20:38:19 +02:00
Andreas Kling	d1445cee6d	Kernel: Handle Thread::State::Dead in sys$waitid() I'm not sure how it happened, but it looks like I caught a thread in this state so let's just handle it the same way we do Dying.	2020-09-16 16:37:28 +02:00
Nico Weber	c9a3a5b488	Kernel: Use Userspace<> for sys$writev	2020-09-15 20:20:38 +02:00
Tom	c8d9f1b9c9	Kernel: Make copy_to/from_user safe and remove unnecessary checks Since the CPU already does almost all necessary validation steps for us, we don't really need to attempt to do this. Doing it ourselves doesn't really work very reliably, because we'd have to account for other processors modifying virtual memory, and we'd have to account for e.g. pages not being able to be allocated due to insufficient resources. So change the copy_to/from_user (and associated helper functions) to use the new safe_memcpy, which will return whether it succeeded or not. The only manual validation step needed (which the CPU can't perform for us) is making sure the pointers provided by user mode aren't pointing to kernel mappings. To make it easier to read/write from/to either kernel or user mode data add the UserOrKernelBuffer helper class, which will internally either use copy_from/to_user or directly memcpy, or pass the data through directly using a temporary buffer on the stack. Last but not least we need to keep syscall params trivial as we need to copy them from/to user mode using copy_from/to_user.	2020-09-13 21:19:15 +02:00
Tom	0fab0ee96a	Kernel: Rename Process::is_ring0/3 to Process::is_kernel/user_process Since "rings" typically refer to code execution and user processes can also execute in ring 0, rename these functions to more accurately describe what they mean: kernel processes and user processes.	2020-09-10 19:57:15 +02:00
Tom	92bfe40954	Kernel: Keep signal state in sync In `c3d231616c` we added the atomic variable m_have_any_unmasked_pending_signals tracking the state of pending signals. Add helper functions that automatically update this variable as needed.	2020-09-09 12:43:56 +02:00
asynts	ec1080b18a	Refactor: Replace usages of FixedArray with Vector.	2020-09-08 14:01:21 +02:00
Tom	c3d231616c	Kernel: Fix crash when delivering signal to barely created thread We need to wait until a thread is fully set up and ready for running before attempting to deliver a signal. Otherwise we may not have a user stack yet. Also, remove the Skip0SchedulerPasses and Skip1SchedulerPass thread states that we don't really need anymore with software context switching. Fixes the kernel crash reported in #3419	2020-09-07 16:49:19 +02:00
Nico Weber	e8131f503d	Kernel: Let TimeManagement keep epoch time as timespec Previously, it was kept as just a time_t and the sub-second offset was inferred from the monotonic clock. This means that sub-second time adjustments were ignored. Now that `ntpquery -s` can pass in a time with sub-second precision, it makes sense to keep time at that granularity in the kernel. After this, `ntpquery -s` immediately followed by `ntpquery` shows an offset of 0.02s (that is, on the order of network roundtrip time) instead of up to 0.75s previously.	2020-09-07 11:22:48 +02:00
Andreas Kling	5444cabd39	Kernel: Rename FileDescription::fstat() => stat()	2020-09-06 18:17:07 +02:00
Andreas Kling	57dd3b66c5	Kernel+LibC+UE: Implement sleep() via sys$clock_nanosleep() This doesn't need to be its own syscall either. :^)	2020-08-30 13:21:24 +02:00
Andreas Kling	f857f3ce4c	Kernel+LibC+UE: Implement usleep() via sys$clock_nanosleep() This doesn't need to be its own syscall. Thanks @BenWiederhake for the idea. :^)	2020-08-30 10:45:51 +02:00
Luke	453affb101	Kernel: Add shutdown commands for other virtualizers Source: https://wiki.osdev.org/Shutdown	2020-08-30 10:31:39 +02:00

... 3 4 5 6 7 ...

541 Commits