ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-09-21 18:37:58 +03:00

Author	SHA1	Message	Date
Andreas Kling	5c73c1bff8	Kernel: Don't dump perfcore for non-dumpable processes Fixes #4904	2021-01-11 18:53:45 +01:00
Andreas Kling	603147f47a	Kernel: Fix perfcore filename generation build error	2021-01-11 11:37:14 +01:00
Andreas Kling	5dafb72370	Kernel+Profiler: Make profiling per-process and without core dumps This patch merges the profiling functionality in the kernel with the performance events mechanism. A profiler sample is now just another perf event, rather than a dedicated thing. Since perf events were already per-process, this now makes profiling per-process as well. Processes with perf events would already write out a perfcore.PID file to the current directory on death, but since we may want to profile a process and then let it continue running, recorded perf events can now be accessed at any time via /proc/PID/perf_events. This patch also adds information about process memory regions to the perfcore JSON format. This removes the need to supply a core dump to the Profiler app for symbolication, and so the "profiler coredump" mechanism is removed entirely. There's still a hard limit of 4MB worth of perf events per process, so this is by no means a perfect final design, but it's a nice step forward for both simplicity and stability. Fixes #4848 Fixes #4849	2021-01-11 11:36:00 +01:00
asynts	019c9eb749	Everywhere: Replace a bundle of dbg with dbgln. These changes are arbitrarily divided into multiple commits to make it easier to find potentially introduced bugs with git bisect.	2021-01-09 21:11:09 +01:00
Andreas Kling	5dae85afe7	Kernel: Pass "shared" flag to Region constructor Before this change, we would sometimes map a region into the address space with !is_shared(), and then moments later call set_shared(true). I found this very confusing while debugging, so this patch makes us pass the initial shared flag to the Region constructor, ensuring that it's in the correct state by the time we first map the region.	2021-01-02 16:57:31 +01:00
Tom	476f17b3f1	Kernel: Merge PurgeableVMObject into AnonymousVMObject This implements memory commitments and lazy-allocation of committed memory.	2021-01-01 23:43:44 +01:00
Tom	b2a52f6208	Kernel: Implement lazy committed page allocation By designating a committed page pool we can guarantee to have physical pages available for lazy allocation in mappings. However, when forking we will overcommit. The assumption is that worst-case it's better for the fork to die due to insufficient physical memory on COW access than the parent that created the region. If a fork wants to ensure that all memory is available (trigger a commit) then it can use madvise. This also means that fork now can gracefully fail if we don't have enough physical pages available.	2021-01-01 23:43:44 +01:00
Tom	e21cc4cff6	Kernel: Remove MAP_PURGEABLE from mmap This brings mmap more in line with other operating systems. Prior to this, it was impossible to request memory that was definitely committed, instead MAP_PURGEABLE would provide a region that was not actually purgeable, but also not fully committed, which meant that using such memory still could cause crashes when the underlying pages could no longer be allocated. This fixes some random crashes in low-memory situations where non-volatile memory is mapped (e.g. malloc, tls, Gfx::Bitmap, etc) but when a page in these regions is first accessed, there is insufficient physical memory available to commit a new page.	2021-01-01 23:43:44 +01:00
Tom	c3451899bc	Kernel: Add MAP_NORESERVE support to mmap Rather than lazily committing regions by default, we now commit the entire region unless MAP_NORESERVE is specified. This solves random crashes in low-memory situations where e.g. the malloc heap allocated memory, but using pages that haven't been used before triggers a crash when no more physical memory is available. Use this flag to create large regions without actually committing the backing memory. madvise() can be used to commit arbitrary areas of such regions after creating them.	2021-01-01 23:43:44 +01:00
Tom	bc5d6992a4	Kernel: Memory purging improvements This adds the ability for a Region to define volatile/nonvolatile areas within mapped memory using madvise(). This also means that memory purging takes into account all views of the PurgeableVMObject and only purges memory that is not needed by all of them. When calling madvise() to change an area to nonvolatile memory, return whether memory from that area was purged. At that time also try to remap all memory that is requested to be nonvolatile, and if insufficient pages are available notify the caller of that fact.	2021-01-01 23:43:44 +01:00
Lenny Maiorani	b2316701a8	Everywhere: void arguments to C functions Problem: - C functions with no arguments require a single `void` in the argument list. Solution: - Put the `void` in the argument list of functions in C header files.	2020-12-26 10:10:27 +01:00
Andreas Kling	d7ad082afa	Kernel+LibELF: Stop doing ELF symbolication in the kernel Now that the CrashDaemon symbolicates crashes in userspace, let's take this one step further and stop trying to symbolicate userspace programs in the kernel at all.	2020-12-25 01:03:46 +01:00
Andreas Kling	2dfe5751f3	Kernel: Abort core dump generation if any substep fails And make an effort to propagate errors out from the inner parts. This fixes an issue where the kernel would infinitely loop in coredump generation if the TmpFS filled up.	2020-12-22 10:09:41 +01:00
Lenny Maiorani	765936ebae	Everywhere: Switch from (void) to [[maybe_unused]] (#4473 ) Problem: - `(void)` simply casts the expression to void. This is understood to indicate that it is ignored, but this is really a compiler trick to get the compiler to not generate a warning. Solution: - Use the `[[maybe_unused]]` attribute to indicate the value is unused. Note: - Functions taking a `(void)` argument list have also been changed to `()` because this is not needed and shows up in the same grep command.	2020-12-21 00:09:48 +01:00
Andreas Kling	8e79bde2b7	Kernel: Move KBufferBuilder to the fallible KBuffer API KBufferBuilder::build() now returns an OwnPtr<KBuffer> and can fail. Clients of the API have been updated to handle that situation.	2020-12-18 19:22:26 +01:00
Andreas Kling	4232874270	Kernel: Don't dump core when OOM-killing a process Trying to generate a core dump under low memory conditions is not the best idea. Fixes #4428.	2020-12-18 11:22:21 +01:00
Andreas Kling	ff8bf4db8d	Kernel: Don't take LexicalPath as argument LexicalPath is a big and heavy class that's really meant as a helper for extracting parts of a path, not for storage or passing around. Instead, pass paths around as strings and use LexicalPath locally as needed.	2020-12-15 11:17:01 +01:00
Itamar	5392f42731	Kernel: Generate coredumps for profiled processes These coredumps will be used by the Profile Viewer to symbolicate the profiling samples.	2020-12-14 23:05:53 +01:00
Itamar	39890af833	Kernel: Pass full path of output coredump file to CoreDump	2020-12-14 23:05:53 +01:00
Itamar	b4842d33bb	Kernel: Generate a coredump file when a process crashes When a process crashes, we generate a coredump file and write it in /tmp/coredumps/. The coredump file is an ELF file of type ET_CORE. It contains a segment for every userspace memory region of the process, and an additional PT_NOTE segment that contains the registers state for each thread, and a additional data about memory regions (e.g their name).	2020-12-14 23:05:53 +01:00
Tom	c455fc2030	Kernel: Change wait blocking to Process-only blocking This prevents zombies created by multi-threaded applications and brings our model back to closer to what other OSs do. This also means that SIGSTOP needs to halt all threads, and SIGCONT needs to resume those threads.	2020-12-12 21:28:12 +01:00
Tom	4bbee00650	Kernel: disown should unblock any potential waiters This is necessary because if a process changes the state to Stopped or resumes from that state, a wait entry is created in the parent process. So, if a child process does this before disown is called, we need to clear those entries to avoid leaking references/zombies that won't be cleaned up until the former parent exits. This also should solve an even more unlikely corner case where another thread is waiting on a pid that is being disowned by another thread.	2020-12-12 21:28:12 +01:00
Tom	da5cc34ebb	Kernel: Fix some issues related to fixes and block conditions Fix some problems with join blocks where the joining thread block condition was added twice, which lead to a crash when trying to unblock that condition a second time. Deferred block condition evaluation by File objects were also not properly keeping the File object alive, which lead to some random crashes and corruption problems. Other problems were caused by the fact that the Queued state didn't handle signals/interruptions consistently. To solve these issues we remove this state entirely, along with Thread::wait_on and change the WaitQueue into a BlockCondition instead. Also, deliver signals even if there isn't going to be a context switch to another thread. Fixes #4336 and #4330	2020-12-12 21:28:12 +01:00
Tom	4c1e27ec65	Kernel: Use TimerQueue for SIGALRM	2020-12-02 13:02:04 +01:00
Tom	046d6855f5	Kernel: Move block condition evaluation out of the Scheduler This makes the Scheduler a lot leaner by not having to evaluate block conditions every time it is invoked. Instead evaluate them as the states change, and unblock threads at that point. This also implements some more waitid/waitpid/wait features and behavior. For example, WUNTRACED and WNOWAIT are now supported. And wait will now not return EINTR when SIGCHLD is delivered at the same time.	2020-11-30 13:17:02 +01:00
Tom	6a620562cc	Kernel: Allow passing a thread argument for new kernel threads This adds the ability to pass a pointer to kernel thread/process. Also add the ability to use a closure as thread function, which allows passing information to a kernel thread more easily.	2020-11-30 13:17:02 +01:00
Tom	6cb640eeba	Kernel: Move some time related code from Scheduler into TimeManagement Use the TimerQueue to expire blocking operations, which is one less thing the Scheduler needs to check on every iteration. Also, add a BlockTimeout class that will automatically handle relative or absolute timeouts as well as overriding timeouts (e.g. socket timeouts) more consistently. Also, rework the TimerQueue class to be able to fire events from any processor, which requires Timer to be RefCounted. Also allow creating id-less timers for use by blocking operations.	2020-11-30 13:17:02 +01:00
Tom	75f61fe3d9	AK: Make RefPtr, NonnullRefPtr, WeakPtr thread safe This makes most operations thread safe, especially so that they can safely be used in the Kernel. This includes obtaining a strong reference from a weak reference, which now requires an explicit call to WeakPtr::strong_ref(). Another major change is that Weakable::make_weak_ref() may require the explicit target type. Previously we used reinterpret_cast in WeakPtr, assuming that it can be properly converted. But WeakPtr does not necessarily have the knowledge to be able to do this. Instead, we now ask the class itself to deliver a WeakPtr to the type that we want. Also, WeakLink is no longer specific to a target type. The reason for this is that we want to be able to safely convert e.g. WeakPtr<T> to WeakPtr<U>, and before this we just reinterpret_cast the internal WeakLink<T> to WeakLink<U>, which is a bold assumption that it would actually produce the correct code. Instead, WeakLink now operates on just a raw pointer and we only make those constructors/operators available if we can verify that it can be safely cast. In order to guarantee thread safety, we now use the least significant bit in the pointer for locking purposes. This also means that only properly aligned pointers can be used.	2020-11-10 19:11:52 +01:00
Tom	1e2e3eed62	Kernel: Fix a few deadlocks with Thread::m_lock and g_scheduler_lock g_scheduler_lock cannot safely be acquired after Thread::m_lock because another processor may already hold g_scheduler_lock and wait for the same Thread::m_lock.	2020-10-26 08:57:25 +01:00
Andreas Kling	ac8fe3d062	Kernel: Remove FIXME about unsurfaced error and log something If something goes wrong when trying to write out a perfcore file during process finalization, there's nowhere to report an error to, other than the debug log. So write it to the debug log.	2020-10-10 23:47:53 +02:00
Linus Groh	bcfc6f0c57	Everywhere: Fix more typos	2020-10-03 12:36:49 +02:00
Tom	838d9fa251	Kernel: Make Thread refcounted Similar to Process, we need to make Thread refcounted. This will solve problems that will appear once we schedule threads on more than one processor. This allows us to hold onto threads without necessarily holding the scheduler lock for the entire duration.	2020-09-27 19:46:04 +02:00
Tom	1727b2d7cd	Kernel: Fix thread joining issues The thread joining logic hadn't been updated to account for the subtle differences introduced by software context switching. This fixes several race conditions related to thread destruction and joining, as well as finalization which did not properly account for detached state and the fact that threads can be joined after termination as long as they're not detached. Fixes #3596	2020-09-26 13:03:13 +02:00
Andreas Kling	b99eaad693	Kernel: Remove a whole bunch of unnecessary includes in Process.cpp	2020-09-24 10:49:43 +02:00
Tom	c8d9f1b9c9	Kernel: Make copy_to/from_user safe and remove unnecessary checks Since the CPU already does almost all necessary validation steps for us, we don't really need to attempt to do this. Doing it ourselves doesn't really work very reliably, because we'd have to account for other processors modifying virtual memory, and we'd have to account for e.g. pages not being able to be allocated due to insufficient resources. So change the copy_to/from_user (and associated helper functions) to use the new safe_memcpy, which will return whether it succeeded or not. The only manual validation step needed (which the CPU can't perform for us) is making sure the pointers provided by user mode aren't pointing to kernel mappings. To make it easier to read/write from/to either kernel or user mode data add the UserOrKernelBuffer helper class, which will internally either use copy_from/to_user or directly memcpy, or pass the data through directly using a temporary buffer on the stack. Last but not least we need to keep syscall params trivial as we need to copy them from/to user mode using copy_from/to_user.	2020-09-13 21:19:15 +02:00
Tom	0fab0ee96a	Kernel: Rename Process::is_ring0/3 to Process::is_kernel/user_process Since "rings" typically refer to code execution and user processes can also execute in ring 0, rename these functions to more accurately describe what they mean: kernel processes and user processes.	2020-09-10 19:57:15 +02:00
asynts	ec1080b18a	Refactor: Replace usages of FixedArray with Vector.	2020-09-08 14:01:21 +02:00
Ben Wiederhake	081bb29626	Kernel: Unbreak building with extra debug macros, part 2	2020-08-30 09:43:49 +02:00
Andreas Kling	0addcb45b8	Kernel: Make Process::dump_regions() sort the regions before dumping	2020-08-22 18:01:59 +02:00
AnotherTest	688e54eac7	Kernel: Distinguish between new and old process groups with equal pgids This does not add any behaviour change to the processes, but it ties a TTY to an active process group via TIOCSPGRP, and returns the TTY to the kernel when all processes in the process group die. Also makes the TTY keep a link to the original controlling process' parent (for SIGCHLD) instead of the process itself.	2020-08-19 21:21:34 +02:00
Ben Wiederhake	42b057b0c9	Kernel: Mark compilation-unit-only functions as static This enables a nice warning in case a function becomes dead code. Also, in case of signal_trampoline_dummy, marking it external (non-static) prevents it from being 'optimized away', which would lead to surprising and weird linker errors. I found these places by using -Wmissing-declarations. The Kernel still shows these issues, which I think are false-positives, but don't want to touch: - Kernel/Arch/i386/CPU.cpp:1081:17: void Kernel::enter_thread_context(Kernel::Thread, Kernel::Thread) - Kernel/Arch/i386/CPU.cpp:1170:17: void Kernel::context_first_init(Kernel::Thread, Kernel::Thread, Kernel::TrapFrame) - Kernel/Arch/i386/CPU.cpp:1304:16: u32 Kernel::do_init_context(Kernel::Thread, u32) - Kernel/Arch/i386/CPU.cpp:1347:17: void Kernel::pre_init_finished() - Kernel/Arch/i386/CPU.cpp:1360:17: void Kernel::post_init_finished() No idea, not gonna touch it. - Kernel/init.cpp:104:30: void Kernel::init() - Kernel/init.cpp:167:30: void Kernel::init_ap(u32, Kernel::Processor) - Kernel/init.cpp:184:17: void Kernel::init_finished(u32) Called by boot.S. - Kernel/init.cpp:383:16: int Kernel::__cxa_atexit(void ()(void), void, void*) - Kernel/StdLib.cpp:285:19: void __cxa_pure_virtual() - Kernel/StdLib.cpp:300:19: void __stack_chk_fail() - Kernel/StdLib.cpp:305:19: void __stack_chk_fail_local() Not sure how to tell the compiler that the compiler is already using them. Also, maybe __cxa_atexit should go into StdLib.cpp? - Kernel/Modules/TestModule.cpp:31:17: void module_init() - Kernel/Modules/TestModule.cpp:40:17: void module_fini() Could maybe go into a new header. This would also provide type-checking for new modules.	2020-08-12 20:40:59 +02:00
Tom	49d5232f33	Kernel: Always return from Thread::wait_on We need to always return from Thread::wait_on, even when a thread is being killed. This is necessary so that the kernel call stack can clean up and release references held by it. Then, right before transitioning back to user mode, we check if the thread is supposed to die, and at that point change the thread state to Dying to prevent further scheduling of this thread. This addresses some possible resource leaks similar to #3073	2020-08-11 14:54:36 +02:00
Ben Wiederhake	083671ef2c	Kernel: Fix PID/TID confusion in send_signal This fixes the issue of a specific type of unkillable processes.	2020-08-10 11:51:45 +02:00
Ben Wiederhake	bee08a4b9f	Kernel: More PID/TID typing	2020-08-10 11:51:45 +02:00
Ben Wiederhake	7bdf54c837	Kernel: PID/PGID typing This compiles, and fixes two bugs: - setpgid() confusion (see previous commit) - tcsetpgrp() now allows to set a non-empty process group even if the group leader has already died. This makes Serenity slightly more POSIX-compatible.	2020-08-10 11:51:45 +02:00
Ben Wiederhake	f5744a6f2f	Kernel: PID/TID typing This compiles, and contains exactly the same bugs as before. The regex 'FIXME: PID/' should reveal all markers that I left behind, including: - Incomplete conversion - Issues or things that look fishy - Actual bugs that will go wrong during runtime	2020-08-10 11:51:45 +02:00
Brian Gianforcaro	c4c6d9367d	Kernel: Fix build break from missing KResult [[nodiscard]] suppressions Missed this somehow in previous change.	2020-08-05 14:06:54 +02:00
Tom	f011c420c1	Kernel: Fix signal delivery when no syscall is made This fixes a regression introduced by the new software context switching where the Kernel would not deliver a signal unless the process is making system calls. This is because the TSS no longer updates the CS value, so the scheduler never considered delivery as the process always appeared to be in kernel mode. With software context switching we can just set up the signal trampoline at any time and when the processor returns back to user mode it'll get executed. This should fix e.g. killing programs that are stuck in some tight loop that doesn't make any system calls and is only pre-empted by the timer interrupt. Fixes #2958	2020-08-02 20:50:29 +02:00
Tom	538b985487	Kernel: Remove ProcessInspectionHandle and make Process RefCounted By making the Process class RefCounted we don't really need ProcessInspectionHandle anymore. This also fixes some race conditions where a Process may be deleted while still being used by ProcFS. Also make sure to acquire the Process' lock when accessing regions. Last but not least, there's no reason why a thread can't be scheduled while being inspected, though in practice it won't happen anyway because the scheduler lock is held at the same time.	2020-08-02 17:15:11 +02:00
Tom	5bbf6ed46b	Kernel: Fix some crashes due to missing locks We need to hold m_lock when accessing m_regions.	2020-08-02 17:15:11 +02:00
Andreas Kling	be7add690d	Kernel: Rename region_from_foo() => find_region_from_foo() Let's emphasize that these functions actually go out and find regions.	2020-07-30 23:52:28 +02:00
Andreas Kling	2e2de125e5	Kernel: Turn Process::FileDescriptionAndFlags into a proper class	2020-07-30 23:50:31 +02:00
Andreas Kling	949aef4aef	Kernel: Move syscall implementations out of Process.cpp This is something I've been meaning to do for a long time, and here we finally go. This patch moves all sys$foo functions out of Process.cpp and into files in Kernel/Syscalls/. It's not exactly one syscall per file (although it could be, but I got a bit tired of the repetitive work here..) This makes hacking on individual syscalls a lot less painful since you don't have to rebuild nearly as much code every time. I'm also hopeful that this makes it easier to understand individual syscalls. :^)	2020-07-30 23:40:57 +02:00
Andreas Kling	b5f54d4153	Kernel+LibC: Add sys$set_process_name() for changing the process name	2020-07-27 19:10:18 +02:00
Ben Wiederhake	76c135ddcf	Kernel: Make clock_nanosleep aware of dynamic tick length On my system, ticks_per_second() returns 1280. So Serenity was very fast at sleeping! :P	2020-07-25 20:21:25 +02:00
Ben Wiederhake	4a5a7b68eb	Kernel: Make usleep aware of dynamic tick length On my system, ticks_per_second() returns 1280. So Serenity was always 20% too fast when sleeping!	2020-07-25 20:21:25 +02:00
Ben Wiederhake	b3472cb4a7	Kernel: Allow process creation during low-entropy condition Fixes #2871. Ignoring the 'securely generated bytes' constraint seems to be fine for Linux, so it's probably fine for Serenity. Note that there might be more bottlenecks down the road if Serenity is started in a non-GUI way. Currently though, loading the GUI seems to generate enough interrupts to seed the entropy pool, even on my non-RDRAND setup. Yay! :^)	2020-07-25 12:34:30 +02:00
Nico Weber	4eb967b5eb	LibC+Kernel: Start implementing sysconf For now, only the non-standard _SC_NPROCESSORS_CONF and _SC_NPROCESSORS_ONLN are implemented. Use them to make ninja pick a better default -j value. While here, make the ninja package script not fail if no other port has been built yet.	2020-07-15 00:07:20 +02:00
Tom	419703a1f2	Kernel: Fix checking BlockResult We now have BlockResult::WokeNormally and BlockResult::NotBlocked, both of which indicate no error. We can no longer just check for BlockResult::WokeNormally and assume anything else must be an interruption.	2020-07-07 15:46:58 +02:00
Andrew Kaster	f96b827990	Kernel+LibELF: Expose ELF Auxiliary Vector to Userspace The AT_* entries are placed after the environment variables, so that they can be found by iterating until the end of the envp array, and then going even further beyond :^)	2020-07-07 10:38:54 +02:00
Tom	bc107d0b33	Kernel: Add SMP IPI support We can now properly initialize all processors without crashing by sending SMP IPI messages to synchronize memory between processors. We now initialize the APs once we have the scheduler running. This is so that we can process IPI messages from the other cores. Also rework interrupt handling a bit so that it's more of a 1:1 mapping. We need to allocate non-sharable interrupts for IPIs. This also fixes the occasional hang/crash because all CPUs now synchronize memory with each other.	2020-07-06 17:07:44 +02:00
Tom	2a82a25fec	Kernel: Various context switch fixes These changes solve a number of problems with the software context swithcing: * The scheduler lock really should be held throughout context switches * Transitioning from the initial (idle) thread to another needs to hold the scheduler lock * Transitioning from a dying thread to another also needs to hold the scheduler lock * Dying threads cannot necessarily be finalized if they haven't switched out of it yet, so flag them as active while a processor is running it (the Running state may be switched to Dying while it still is actually running)	2020-07-06 10:00:24 +02:00
Tom	788b2d64c6	Kernel: Require a reason to be passed to Thread::wait_on The Lock class still permits no reason, but for everything else require a reason to be passed to Thread::wait_on. This makes it easier to diagnose why a Thread is in Queued state.	2020-07-06 10:00:24 +02:00
Sergey Bugaev	a8489967a3	Kernel: Add Plan9FS :^) This is an (incomplete, and not very stable) implementation of the client side of the 9P protocol.	2020-07-05 12:26:27 +02:00
Sergey Bugaev	3645b9e2a6	Kernel: Make sure to drop region with interrupts enabled A region can drop an inode if it was mmaped from the inode and held the last reference to it, and that may require some locking.	2020-07-05 12:26:27 +02:00
Sergey Bugaev	6111cfda73	AK: Make Vector::unstable_remove() return the removed value ...and rename it to unstable_take(), to align with other take...() methods.	2020-07-05 12:26:27 +02:00
Andreas Kling	11c4a28660	Kernel: Move headers intended for userspace use into Kernel/API/	2020-07-04 17:22:23 +02:00
Nico Weber	cbbd55bd6b	LibC: Remove a few comments now that we have man pages for this.	2020-07-03 19:37:28 +02:00
Tom	e373e5f007	Kernel: Fix signal delivery When delivering urgent signals to the current thread we need to check if we should be unblocked, and if not we need to yield to another process. We also need to make sure that we suppress context switches during Process::exec() so that we don't clobber the registers that it sets up (eip mainly) by a context switch. To be able to do that we add the concept of a critical section, which are similar to Process::m_in_irq but different in that they can be requested at any time. Calls to Scheduler::yield and Scheduler::donate_to will return instantly without triggering a context switch, but the processor will then asynchronously trigger a context switch once the critical section is left.	2020-07-03 19:32:34 +02:00
Andreas Kling	a98712035c	Kernel: Fix non-blocking write() blocking instead of short-writing If a partial write succeeded, we could then be in an unexpected state where the file description was non-blocking, but we could no longer write to it. Previously, the kernel would block in that state, but instead we now handle this as a proper short write and return the number of bytes we were able to write. Fixes #2645.	2020-07-03 13:54:18 +02:00
Tom	16783bd14d	Kernel: Turn Thread::current and Process::current into functions This allows us to query the current thread and process on a per processor basis	2020-07-01 12:07:01 +02:00
Tom	fb41d89384	Kernel: Implement software context switching and Processor structure Moving certain globals into a new Processor structure for each CPU allows us to eventually run an instance of the scheduler on each CPU.	2020-07-01 12:07:01 +02:00
Sergey Bugaev	6efbbcd4ba	Kernel: Port mounts to reference inodes directly ...instead of going through their identifiers. See the previous commit for reasoning.	2020-06-25 15:49:04 +02:00
Andreas Kling	d4195672b7	Kernel+LibC: Add sys$recvfd() and sys$sendfd() for fd passing These new syscalls allow you to send and receive file descriptors over a local domain socket. This will enable various privilege separation techniques and other good stuff. :^)	2020-06-24 23:08:09 +02:00
Nico Weber	d2684a8645	LibC+Kernel: Implement ppoll ppoll() is similar() to poll(), but it takes its timeout as timespec instead of as int, and it takes an additional sigmask parameter. Change the sys$poll parameters to match ppoll() and implement poll() in terms of ppoll().	2020-06-23 14:12:20 +02:00
Andreas Kling	4dbbe1885f	Kernel: Silence debug spam on exec	2020-06-22 21:18:25 +02:00
Nico Weber	d23e655c83	LibC: Implement pselect pselect() is similar() to select(), but it takes its timeout as timespec instead of as timeval, and it takes an additional sigmask parameter. Change the sys$select parameters to match pselect() and implement select() in terms of pselect().	2020-06-22 16:00:20 +02:00
Nico Weber	dd53e070c5	Kernel+LibC: Remove setreuid() / setregid() again It looks like they're considered a bad idea, so let's not add them before we need them. I figured it's good to have them in git history if we ever do need them though, hence the add/remove dance.	2020-06-18 23:19:16 +02:00
Nico Weber	a38754d9f2	Kernel+LibC: Implement seteuid() and friends! Add seteuid()/setegid() under _POSIX_SAVED_IDS semantics, which also requires adding suid and sgid to Process, and changing setuid()/setgid() to honor these semantics. The exact semantics aren't specified by POSIX and differ between different Unix implementations. This patch makes serenity follow FreeBSD. The 2002 USENIX paper "Setuid Demystified" explains the differences well. In addition to seteuid() and setegid() this also adds setreuid()/setregid() and setresuid()/setresgid(), and the accessors getresuid()/getresgid(). Also reorder uid/euid functions so that they are the same order everywhere (namely, the order that geteuid()/getuid() already have).	2020-06-18 23:19:16 +02:00
Andreas Kling	0609eefd57	Kernel: Add "setkeymap" pledge promise	2020-06-18 22:19:36 +02:00
Andreas Kling	10fd862a55	Kernel: Unbreak sys$setkeymap() This syscall was disabling SMAP too late and would crash every time when trying to set a new keymap.	2020-06-17 20:32:53 +02:00
Sergey Bugaev	47d83800e1	Kernel+LibC: Do not return -ENAMETOOLONG from sys$readlink() That's not how readlink() is supposed to work: it should copy as many bytes as fit into the buffer, and return the number of bytes copied. So do that, but add a twist: make sys$readlink() actually return the whole size, not the number of bytes copied. We fix up this return value in userspace, to make LibC's readlink() behave as expected, but this will also allow other code to allocate a buffer of just the right size. Also, avoid an extra copy of the link target.	2020-06-17 15:02:03 +02:00
Hüseyin ASLITÜRK	174987f930	Kernel: Replace char and u8 data types to u32 for code point Remove character property from event and add code_point property.	2020-06-16 13:15:17 +02:00
Hüseyin ASLITÜRK	f4d14c42d0	Kernel: Process, replace internal data type to CharacterMapData	2020-06-13 12:36:30 +02:00
Sergey Bugaev	31b025fcfc	Kernel: Allow sys$accept(address = nullptr)	2020-06-09 21:12:34 +02:00
Sergey Bugaev	05b7fec517	Kernel: Tighten up some promise checks Since we're not keeping compatibility with OpenBSD about what promises are required for which syscalls, tighten things up so that they make more sense.	2020-05-31 21:38:50 +02:00
Sergey Bugaev	3847d00727	Kernel+Userland: Support remounting filesystems :^) This makes it possible to change flags of a mount after the fact, with the caveats outlined in the man page.	2020-05-29 07:53:30 +02:00
Sergey Bugaev	d395b93b15	Kernel: Misc tweaks	2020-05-29 07:53:30 +02:00
Sergey Bugaev	fdb71cdf8f	Kernel: Support read-only filesystem mounts This adds support for MS_RDONLY, a mount flag that tells the kernel to disallow any attempts to write to the newly mounted filesystem. As this flag is per-mount, and different mounts of the same filesystems (such as in case of bind mounts) can have different mutability settings, you have to go though a custody to find out if the filesystem is mounted read-only, instead of just asking the filesystem itself whether it's inherently read-only. This also adds a lot of checks we were previously missing; and moves some of them to happen after more specific checks (such as regular permission checks). One outstanding hole in this system is sys$mprotect(PROT_WRITE), as there's no way we can know if the original file description this region has been mounted from had been opened through a readonly mount point. Currently, we always allow such sys$mprotect() calls to succeed, which effectively allows anyone to circumvent the effect of MS_RDONLY. We should solve this one way or another.	2020-05-29 07:53:30 +02:00
Sergey Bugaev	b6845de3f6	Kernel: Fix error case in Process::create_user_process() If we fail to exec() the target executable, don't leak the thread (this actually triggers an assertion when destructing the process), and print an error message.	2020-05-29 07:53:30 +02:00
Sergey Bugaev	6627c3ea3a	Kernel: Fix some failing assertions When mounting Ext2FS, we don't care if the file has a custody (it doesn't if it's a device, which is a common case). When doing a bind-mount, we do need a custody; if none is provided, let's return an error instead of crashing.	2020-05-29 07:53:30 +02:00
Sergey Bugaev	f945d7c358	Kernel: Always require read access when mmaping a file POSIX says, "The file descriptor fildes shall have been opened with read permission, regardless of the protection options specified."	2020-05-29 07:53:30 +02:00
Sergey Bugaev	602c3fdb3a	AK: Rename FileSystemPath -> LexicalPath And move canonicalized_path() to a static method on LexicalPath. This is to make it clear that FileSystemPath/canonicalized_path() only perform lexical canonicalization.	2020-05-26 14:35:10 +02:00
Sergey Bugaev	cddaeb43d3	Kernel: Introduce "sigaction" pledge You now have to pledge "sigaction" to change signal handlers/dispositions. This is to prevent malicious code from messing with assertions (and segmentation faults), which are normally expected to instantly terminate the process but can do other things if you change signal disposition for them.	2020-05-26 14:35:10 +02:00
Angel	6137475c39	Kernel: fix assertion on readlink() syscall The is_error() check on the KResultOr returned when reading the link target had a stray ! operator which causes link resolution to crash the kernel with an assertion error.	2020-05-26 12:45:01 +02:00
Brian Gianforcaro	6a74af8063	Kernel: Plumb KResult through FileDescription::read_entire_file() implementation. Allow file system implementation to return meaningful error codes to callers of the FileDescription::read_entire_file(). This allows both Process::sys$readlink() and Process::sys$module_load() to return more detailed errors to the user.	2020-05-26 10:15:40 +02:00
Andreas Kling	dd924b730a	Kernel+LibC: Fix various build issues introduced by ssize_t Now that ssize_t is derived from size_t, we have to	2020-05-23 15:27:33 +02:00
Andreas Kling	b3736c1b1e	Kernel: Use a FlatPtr for the "argument" to ioctl() Since it's often used to pass pointers, it should really be a FlatPtr.	2020-05-23 15:25:43 +02:00
Sergey Bugaev	7541122206	Kernel+LibC: Switch isatty() to use a fcntl() We would want it to work with only stdio pledged.	2020-05-20 08:31:31 +02:00
AnotherTest	8582a06899	Kernel + LibC: Handle running processes in do_waitid()	2020-05-17 11:58:08 +02:00
AnotherTest	9d54f21859	Kernel: wait() should not block if WNOHANG is specified	2020-05-17 11:58:08 +02:00
Andreas Kling	f7a75598bb	Kernel: Remove Process::any_thread() This was a holdover from the old times when each Process had a special main thread with TID 0. Using it was a total crapshoot since it would just return whichever thread was first on the process's thread list. Now that I've removed all uses of it, we don't need it anymore. :^)	2020-05-16 12:40:15 +02:00
Andreas Kling	0e7f85c24a	Kernel: Sending a signal to a process now goes to the main thread Instead of falling back to the suspicious "any_thread()" mechanism, just fail with ESRCH if you try to kill() a PID that doesn't have a corresponding TID.	2020-05-16 12:33:48 +02:00
Andreas Kling	21d5f4ada1	Kernel: Absorb LibBareMetal back into the kernel This was supposed to be the foundation for some kind of pre-kernel environment, but nobody is working on it right now, so let's move everything back into the kernel and remove all the confusion.	2020-05-16 12:00:04 +02:00
Andreas Kling	204fb27333	Kernel: Remove now-unused KernelInfoPage.h	2020-05-16 11:34:54 +02:00
Andreas Kling	2dc051c866	Kernel: Remove sys$getdtablesize() I'm not sure why this was a syscall. If we need this we can add it in LibC as a wrapper around sysconf(_SC_OPEN_MAX).	2020-05-16 11:34:01 +02:00
Andreas Kling	426c4e387d	Kernel: Use copy_to_user() in sys$gettimeofday()	2020-05-16 11:34:01 +02:00
Andreas Kling	3a92d0828d	Kernel: Remove the "kernel info page" used for fast gettimeofday() We stopped using gettimeofday() in Core::EventLoop a while back, in favor of clock_gettime() for monotonic time. Maintaining an optimization for a syscall we're not using doesn't make a lot of sense, so let's go back to the old-style sys$gettimeofday().	2020-05-16 11:33:59 +02:00
Sergey Bugaev	752617cbb2	Kernel: Disallow opening socket files You can still open files that have sockets attached to them from inside the kernel via VFS::open() (and in fact, that is what LocalSocket itslef uses), but trying to do that from userspace using open() will now fail with ENXIO.	2020-05-15 11:43:58 +02:00
Andreas Kling	5bfd893292	Kernel+Userland: Add "settime" pledge promise for setting system time We now require the "settime" promise from pledged processes who want to change the system time.	2020-05-08 22:54:17 +02:00
Andreas Kling	1cddb1055f	Kernel: Only allow superuser to call sys$clock_settime()	2020-05-08 22:47:21 +02:00
Andreas Kling	652b22ee9c	Kernel: Remove SmapDisabler in sys$clock_settime()	2020-05-08 22:47:03 +02:00
Andreas Kling	55f61c0004	Kernel: Add for_each_vmobject_of_type<T> This makes iterating over a specific type of VMObjects a bit nicer.	2020-05-08 22:10:47 +02:00
Andreas Kling	042b1f6814	Kernel: Propagate failure to commit VM regions in more places Ultimately we should not panic just because we can't fully commit a VM region (by populating it with physical pages.) This patch handles some of the situations where commit() can fail.	2020-05-08 21:47:08 +02:00
Andreas Kling	6fe83b0ac4	Kernel: Crash the current process on OOM (instead of panicking kernel) This patch adds PageFaultResponse::OutOfMemory which informs the fault handler that we were unable to allocate a necessary physical page and cannot continue. In response to this, the kernel will crash the current process. Because we are OOM, we can't symbolicate the crash like we normally would (since the ELF symbolication code needs to allocate), so we also communicate to Process::crash() that we're out of memory. Now we can survive "allocate 300 MB" (only the allocate process dies.) This is definitely not perfect and can easily end up killing a random innocent other process who happened to allocate one page at the wrong time, but it's a lot better than panicking on OOM. :^)	2020-05-06 22:28:23 +02:00
Ben Wiederhake	dce3faff08	Kernel: Don't crash on invalid fcntl	2020-05-03 22:46:28 +02:00
Michael Lelli	58a34fbe09	Kernel: Fix pledge syscall applying new pledges when it fails (#2076 ) If the exec promises fail to apply, then the normal promises should not apply either. Add a test for this fixed functionality.	2020-05-03 00:41:18 +02:00
Brian Gianforcaro	25a620a573	Kernel: Enable timeout support for sys$futex(FUTEX_WAIT) Utilize the new Thread::wait_on timeout parameter to implement timeout support for FUTEX_WAIT. As we compute the relative time from the user specified absolute time, we try to delay that computation as long as possible before we call into Thread::wait_on(..). To enable this a small bit of refactoring was done pull futex_queue fetching out and timeout fetch and calculation separation.	2020-04-26 21:31:52 +02:00
Andreas Kling	fb826aa59a	Kernel: Make sys$sethostname() superuser-only Also take the hostname string lock exclusively.	2020-04-26 15:51:57 +02:00
Luke Payne	f191b84b50	Kernel: Added the ability to set the hostname via new syscall Userland/hostname: Now takes parameter to set the hostname LibC/unistd: Added sethostname function	2020-04-26 12:59:09 +02:00
Brian Gianforcaro	0f3990cfa3	Kernel: Support signaling all processes with pid == -1 This is a special case that was previously not implemented. The idea is that you can dispatch a signal to all other processes the calling process has access to. There was some minor refactoring to make the self signal logic into a function so it could easily be easily re-used from do_killall.	2020-04-26 12:54:10 +02:00
Brian Gianforcaro	1f64e3eb16	Kernel: Implement FUTEX_WAKE of arbitrary count. Previously we just woke all waiters no matter how many were requested. Fix this by implementing WaitQueue::wake_n(..).	2020-04-26 12:35:35 +02:00
Drew Stratford	4a37362249	LibPthread: implicitly call pthread_exit on return from start routine. Previously, when returning from a pthread's start_routine, we would segfault. Now we instead implicitly call pthread_exit as specified in the standard. pthread_create now creates a thread running the new pthread_create_helper, which properly manages the calling and exiting of the start_routine supplied to pthread_create. To accomplish this, the thread's stack initialization has been moved out of sys$create_thread and into the userspace function create_thread.	2020-04-25 16:51:35 +02:00
Itamar	edaa9c06d9	LibELF: Make ELF::Loader RefCounted	2020-04-20 17:25:50 +02:00
Sergey Bugaev	54550365eb	Kernel: Use shared locking mode in some places The notable piece of code that remains to be converted is Ext2FS.	2020-04-18 13:58:29 +02:00
Sergey Bugaev	f18d6610d3	Kernel: Don't include null terminator in sys$readlink() result POSIX says, "Conforming applications should not assume that the returned contents of the symbolic link are null-terminated." If we do include the null terminator into the returning string, Python believes it to actually be a part of the returned name, and gets unhappy about that later. This suggests other systems Python runs in don't include it, so let's do that too. Also, make our userspace support non-null-terminated realpath().	2020-04-14 18:40:24 +02:00
Andreas Kling	815b73bdcc	Kernel: Simplify sys$setgroups(0, ...) If we're dropping all groups, just clear the extra_gids and return.	2020-04-14 15:30:25 +02:00
Andreas Kling	9962db5bf8	Kernel: Remove SmapDisablers in {peek,poke}_user_data()	2020-04-14 09:52:49 +02:00
Itamar	3e9a7175d1	Debugger: Add DebugSession The DebugSession class wraps the usage of Ptrace. It is intended to be used by cli & gui debugger programs. Also, call objdump for disassemly	2020-04-13 00:53:22 +02:00
Itamar	aae3f7b914	Process: Fix siginfo for code CLD_STOPPED si_code, si_status where swapped	2020-04-13 00:53:22 +02:00
Itamar	9e51e295cf	ptrace: Add PT_SETREGS PT_SETTREGS sets the regsiters of the traced thread. It can only be used when the tracee is stopped. Also, refactor ptrace. The implementation was getting long and cluttered the alraedy large Process.cpp file. This commit moves the bulk of the implementation to Kernel/Ptrace.cpp, and factors out peek & poke to separate methods of the Process class.	2020-04-13 00:53:22 +02:00
Itamar	0431712660	ptrace: Stop a traced thread when it exists from execve This was a missing feature in the PT_TRACEME command. This feature allows the tracer to interact with the tracee before the tracee has started executing its program. It will be useful for automatically inserting a breakpoint at a debugged program's entry point.	2020-04-13 00:53:22 +02:00
Itamar	b306ac9b2b	ptrace: Add PT_POKE PT_POKE writes a single word to the tracee's address space. Some caveats: - If the user requests to write to an address in a read-only region, we temporarily change the page's protections to allow it. - If the user requests to write to a region that's backed by a SharedInodeVMObject, we replace the vmobject with a PrivateIndoeVMObject.	2020-04-13 00:53:22 +02:00
Itamar	984ff93406	ptrace: Add PT_PEEK PT_PEEK reads a single word from the tracee's address space and returns it to the tracer.	2020-04-13 00:53:22 +02:00
Andreas Kling	c19b56dc99	Kernel+LibC: Add minherit() and MAP_INHERIT_ZERO This patch adds the minherit() syscall originally invented by OpenBSD. Only the MAP_INHERIT_ZERO mode is supported for now. If set on an mmap region, that region will be zeroed out on fork().	2020-04-12 20:22:26 +02:00
Andrew Kaster	61acca223f	LibELF: Move validation methods to their own file These validate_elf_* methods really had no business being static methods of ELF::Image. Now that the ELF namespace exists, it makes sense to just move them to be free functions in the namespace.	2020-04-11 22:41:05 +02:00
Andrew Kaster	21b5909dc6	LibELF: Move ELF classes into namespace ELF This is for consistency with other namespace changes that were made a while back to the other libraries :)	2020-04-11 22:41:05 +02:00
Andreas Kling	dec352dacd	Kernel: Ignore zero-length PROGBITS sections in sys$module_load()	2020-04-10 16:36:01 +02:00
Andreas Kling	c06d5ef114	Kernel+LibC: Remove ESUCCESS There's no official ESUCCESS==0 errno code, and it keeps breaking the Lagom build when we use it, so let's just say 0 instead.	2020-04-10 13:09:35 +02:00
Andreas Kling	871d450b93	Kernel: Remove redundant "ACPI" from filenames in ACPI/	2020-04-09 18:17:27 +02:00
Andreas Kling	4644217094	Kernel: Remove "non-operational" ACPI parser state If we don't support ACPI, just don't instantiate an ACPI parser. This is way less confusing than having a special parser class whose only purpose is to do nothing. We now search for the RSDP in ACPI::initialize() instead of letting the parser constructor do it. This allows us to defer the decision to create a parser until we're sure we can make a useful one.	2020-04-09 17:19:11 +02:00
Andreas Kling	dc7340332d	Kernel: Update cryptically-named functions related to symbolication	2020-04-08 17:19:46 +02:00
Liav A	23fb985f02	Kernel & Userland: Allow to mount image files formatted with Ext2FS	2020-04-06 15:36:36 +02:00
Andreas Kling	9ae3cced76	Revert "Kernel & Userland: Allow to mount image files formatted with Ext2FS" This reverts commit `a60ea79a41`. Reverting these changes since they broke things. Fixes #1608.	2020-04-03 21:28:57 +02:00
Liav A	a60ea79a41	Kernel & Userland: Allow to mount image files formatted with Ext2FS	2020-04-02 12:03:08 +02:00
Itamar	6b74d38aab	Kernel: Add 'ptrace' syscall This commit adds a basic implementation of the ptrace syscall, which allows one process (the tracer) to control another process (the tracee). While a process is being traced, it is stopped whenever a signal is received (other than SIGCONT). The tracer can start tracing another thread with PT_ATTACH, which causes the tracee to stop. From there, the tracer can use PT_CONTINUE to continue the execution of the tracee, or use other request codes (which haven't been implemented yet) to modify the state of the tracee. Additional request codes are PT_SYSCALL, which causes the tracee to continue exection but stop at the next entry or exit from a syscall, and PT_GETREGS which fethces the last saved register set of the tracee (can be used to inspect syscall arguments and return value). A special request code is PT_TRACE_ME, which is issued by the tracee and causes it to stop when it calls execve and wait for the tracer to attach.	2020-03-28 18:27:18 +01:00
Shannon Booth	757c14650f	Kernel: Simplify process assertion checking if region is in range Let's use the helper function for this :)	2020-03-22 08:51:40 +01:00
Liav A	b536547c52	Process: Use monotonic time for timeouts	2020-03-19 15:48:00 +01:00
Liav A	4484513b45	Kernel: Add new syscall to allow changing the system date	2020-03-19 15:48:00 +01:00
Liav A	9db291d885	Kernel: Introduce the new Time management subsystem This new subsystem includes better abstractions of how time will be handled in the OS. We take advantage of the existing RTC timer to aid in keeping time synchronized. This is standing in contrast to how we handled time-keeping in the kernel, where the PIT was responsible for that function in addition to update the scheduler about ticks. With that new advantage, we can easily change the ticking dynamically and still keep the time synchronized. In the process context, we no longer use a fixed declaration of TICKS_PER_SECOND, but we call the TimeManagement singleton class to provide us the right value. This allows us to use dynamic ticking in the future, a feature known as tickless kernel. The scheduler no longer does by himself the calculation of real time (Unix time), and just calls the TimeManagment singleton class to provide the value. Also, we can use 2 new boot arguments: - the "time" boot argument accpets either the value "modern", or "legacy". If "modern" is specified, the time management subsystem will try to setup HPET. Otherwise, for "legacy" value, the time subsystem will revert to use the PIT & RTC, leaving HPET disabled. If this boot argument is not specified, the default pattern is to try to setup HPET. - the "hpet" boot argumet accepts either the value "periodic" or "nonperiodic". If "periodic" is specified, the HPET will scan for periodic timers, and will assert if none are found. If only one is found, that timer will be assigned for the time-keeping task. If more than one is found, both time-keeping task & scheduler-ticking task will be assigned to periodic timers. If this boot argument is not specified, the default pattern is to try to scan for HPET periodic timers. This boot argument has no effect if HPET is disabled. In hardware context, PIT & RealTimeClock classes are merely inheriting from the HardwareTimer class, and they allow to use the old i8254 (PIT) and RTC devices, managing them via IO ports. By default, the RTC will be programmed to a frequency of 1024Hz. The PIT will be programmed to a frequency close to 1000Hz. About HPET, depending if we need to scan for periodic timers or not, we try to set a frequency close to 1000Hz for the time-keeping timer and scheduler-ticking timer. Also, if possible, we try to enable the Legacy replacement feature of the HPET. This feature if exists, instructs the chipset to disconnect both i8254 (PIT) and RTC. This behavior is observable on QEMU, and was verified against the source code: `ce967e2f33` The HPETComparator class is inheriting from HardwareTimer class, and is responsible for an individual HPET comparator, which is essentially a timer. Therefore, it needs to call the singleton HPET class to perform HPET-related operations. The new abstraction of Hardware timers brings an opportunity of more new features in the foreseeable future. For example, we can change the callback function of each hardware timer, thus it makes it possible to swap missions between hardware timers, or to allow to use a hardware timer for other temporary missions (e.g. calibrating the LAPIC timer, measuring the CPU frequency, etc).	2020-03-19 15:48:00 +01:00
Alex Muscar	d013753f83	Kernel: Resolve relative paths when there is a veil (#1474 )	2020-03-19 09:57:34 +01:00
Andreas Kling	ad92a1e4bc	Kernel: Add sys$get_stack_bounds() for finding the stack base & size This will be useful when implementing conservative garbage collection.	2020-03-16 19:06:33 +01:00
Andreas Kling	3803196edb	Kernel: Get rid of SmapDisabler in sys$fstat()	2020-03-10 13:34:24 +01:00
Liav A	0f45a1b5e7	Kernel: Allow to reboot in ACPI via PCI or MMIO access Also, we determine if ACPI reboot is supported by checking the FADT flags' field.	2020-03-09 10:53:13 +01:00
Ben Wiederhake	b066586355	Kernel: Fix race in waitid This is similar to `28e1da344d` and `4dd4dd2f3c`. The crux is that wait verifies that the outvalue (siginfo* infop) is writable before waiting, and writes to it after waiting. In the meantime, a concurrent thread can make the output region unwritable, e.g. by deallocating it.	2020-03-08 14:12:12 +01:00
Ben Wiederhake	d8cd4e4902	Kernel: Fix race in select This is similar to `28e1da344d` and `4dd4dd2f3c`. The crux is that select verifies that the filedescriptor sets are writable before blocking, and writes to them after blocking. In the meantime, a concurrent thread can make the output buffer unwritable, e.g. by deallocating it.	2020-03-08 14:12:12 +01:00
Andreas Kling	b1058b33fb	AK: Add global FlatPtr typedef. It's u32 or u64, based on sizeof(void*) Use this instead of uintptr_t throughout the codebase. This makes it possible to pass a FlatPtr to something that has u32 and u64 overloads.	2020-03-08 13:06:51 +01:00
Andreas Kling	c6693f9b3a	Kernel: Simplify a bunch of dbg() and klog() calls LogStream can handle VirtualAddress and PhysicalAddress directly.	2020-03-06 15:00:44 +01:00
Liav A	85eb1d26d5	Kernel: Run clang-format on Process.cpp & ACPIDynamicParser.h	2020-03-05 19:04:04 +01:00
Liav A	1b8cd6db7b	Kernel: Call ACPI reboot method first if possible Now we call ACPI reboot method first if possible, and if ACPI reboot is not available, we attempt to reboot via the keyboard controller.	2020-03-05 19:04:04 +01:00
Ben Wiederhake	4dd4dd2f3c	Kernel: Fix race in clock_nanosleep This is a complete fix of clock_nanosleep, because the thread holds the process lock again when returning from sleep()/sleep_until(). Therefore, no further concurrent invalidation can occur.	2020-03-03 20:13:32 +01:00
Liav A	0fc60e41dd	Kernel: Use klog() instead of kprintf() Also, duplicate data in dbg() and klog() calls were removed. In addition, leakage of virtual address to kernel log is prevented. This is done by replacing kprintf() calls to dbg() calls with the leaked data instead. Also, other kprintf() calls were replaced with klog().	2020-03-02 22:23:39 +01:00
Andreas Kling	47beab926d	Kernel: Remove ability to create kernel-only regions at user addresses This was only used by the mechanism for mapping executables into each process's own address space. Now that we remap executables on demand when needed for symbolication, this can go away.	2020-03-02 11:20:34 +01:00
Andreas Kling	e56f8706ce	Kernel: Map executables at a kernel address during ELF load This is both simpler and more robust than mapping them in the process address space.	2020-03-02 11:20:34 +01:00
Andreas Kling	678c87087d	Kernel: Load executables on demand when symbolicating Previously we would map the entire executable of a program in its own address space (but make it unavailable to userspace code.) This patch removes that and changes the symbolication code to remap the executable on demand (and into the kernel's own address space instead of the process address space.) This opens up a couple of further simplifications that will follow.	2020-03-02 11:20:34 +01:00
Andreas Kling	0acac186fb	Kernel: Make the "entire executable" region shared This makes Region::clone() do the right thing with it on fork().	2020-03-02 06:13:29 +01:00
Andreas Kling	5c2a296a49	Kernel: Mark read-only PT_LOAD mappings as shared regions This makes Region::clone() do the right thing for these now that we differentiate based on Region::is_shared().	2020-03-01 21:26:36 +01:00
Andreas Kling	ecfde5997b	Kernel: Use SharedInodeVMObject for executables after all I had the wrong idea about this. Thanks to Sergey for pointing it out! Here's what he says (reproduced for posterity): > Private mappings protect the underlying file from the changes made by > you, not the other way around. To quote POSIX, "If MAP_PRIVATE is > specified, modifications to the mapped data by the calling process > shall be visible only to the calling process and shall not change the > underlying object. It is unspecified whether modifications to the > underlying object done after the MAP_PRIVATE mapping is established > are visible through the MAP_PRIVATE mapping." In practice that means > that the pages that were already paged in don't get updated when the > underlying file changes, and the pages that weren't paged in yet will > load the latest data at that moment. > The only thing MAP_FILE \| MAP_PRIVATE is really useful for is mapping > a library and performing relocations; it's definitely useless (and > actively harmful for the system memory usage) if you only read from > the file. This effectively reverts `e2697c2ddd`.	2020-03-01 21:16:27 +01:00
Andreas Kling	bb7dd63f74	Kernel: Run clang-format on Process.cpp	2020-03-01 21:16:27 +01:00
Andreas Kling	687b52ceb5	Kernel: Name perfcore files "perfcore.PID" This way we can trace many things and we get one perfcore file per process instead of everyone trying to write to "perfcore"	2020-03-01 20:59:02 +01:00
Andreas Kling	fee20bd8de	Kernel: Remove some more harmless InodeVMObject miscasts	2020-03-01 12:27:03 +01:00
Andreas Kling	95e3aec719	Kernel: Fix harmless type miscast in Process::amount_clean_inode()	2020-03-01 11:23:23 +01:00
Andreas Kling	e2697c2ddd	Kernel: Use PrivateInodeVMObject for loading program executables This will be a memory usage pessimization until we actually implement CoW sharing of the memory pages with SharedInodeVMObject. However, it's a huge architectural improvement, so let's take it and improve on this incrementally. fork() should still be neutral, since all private mappings are CoW'ed.	2020-03-01 11:23:10 +01:00
Andreas Kling	88b334135b	Kernel: Remove some Region construction helpers It's now up to the caller to provide a VMObject when constructing a new Region object. This will make it easier to handle things going wrong, like allocation failures, etc.	2020-03-01 11:23:10 +01:00
Andreas Kling	4badef8137	Kernel: Return bytes written if sys$write() fails after writing some If we wrote anything we should just inform userspace that we did, and not worry about the error code. Userspace can call us again if it wants, and we'll give them the error then.	2020-02-29 18:42:35 +01:00
Andreas Kling	7cd1bdfd81	Kernel: Simplify some dbg() logging We don't have to log the process name/PID/TID, dbg() automatically adds that as a prefix to every line. Also we don't have to do .characters() on Strings passed to dbg() :^)	2020-02-29 13:39:06 +01:00
Andreas Kling	8fbdda5a2d	Kernel: Implement basic support for sys$mmap() with MAP_PRIVATE You can now mmap a file as private and writable, and the changes you make will only be visible to you. This works because internally a MAP_PRIVATE region is backed by a unique PrivateInodeVMObject instead of using the globally shared SharedInodeVMObject like we always did before. :^) Fixes #1045.	2020-02-28 23:25:00 +01:00
Andreas Kling	aa1e209845	Kernel: Remove some unnecessary indirection in InodeFile::mmap() InodeFile now directly calls Process::allocate_region_with_vmobject() instead of taking an awkward detour via a special Region constructor.	2020-02-28 20:29:14 +01:00
Andreas Kling	651417a085	Kernel: Split InodeVMObject into two subclasses We now have PrivateInodeVMObject and SharedInodeVMObject, corresponding to MAP_PRIVATE and MAP_SHARED respectively. Note that PrivateInodeVMObject is not used yet.	2020-02-28 20:20:35 +01:00
Andreas Kling	07a26aece3	Kernel: Rename InodeVMObject => SharedInodeVMObject	2020-02-28 20:07:51 +01:00
Andreas Kling	5af95139fa	Kernel: Make Process::m_master_tls_region a WeakPtr Let's not keep raw Region* variables around like that when it's so easy to avoid it.	2020-02-28 14:05:30 +01:00
Andreas Kling	b0623a0c58	Kernel: Remove SmapDisabler in sys$connect()	2020-02-28 13:20:26 +01:00
Andreas Kling	dcd619bd46	Kernel: Merge the shbuf_get_size() syscall into shbuf_get() Add an extra out-parameter to shbuf_get() that receives the size of the shared buffer. That way we don't need to make a separate syscall to get the size, which we always did immediately after.	2020-02-28 12:55:58 +01:00
Andreas Kling	f72e5bbb17	Kernel+LibC: Rename shared buffer syscalls to use a prefix This feels a lot more consistent and Unixy: create_shared_buffer() => shbuf_create() share_buffer_with() => shbuf_allow_pid() share_buffer_globally() => shbuf_allow_all() get_shared_buffer() => shbuf_get() release_shared_buffer() => shbuf_release() seal_shared_buffer() => shbuf_seal() get_shared_buffer_size() => shbuf_get_size() Also, "shared_buffer_id" is shortened to "shbuf_id" all around.	2020-02-28 12:55:58 +01:00
Liav A	db23703570	Process: Use dbg() instead of dbgprintf() Also, fix a bad derefernce in sys$create_shared_buffer() method.	2020-02-27 13:05:12 +01:00
Andreas Kling	4997dcde06	Kernel: Always disable interrupts in do_killpg() Will caught an assertion when running "kill 9999999999999" :^)	2020-02-27 11:05:16 +01:00
Andreas Kling	4a293e8a21	Kernel: Ignore signals sent to threadless (zombie) processes If a process doesn't have any threads left, it's in a zombie state and we can't meaningfully send signals to it. So just ignore them. Fixes #1313.	2020-02-27 11:04:15 +01:00
Andreas Kling	0c1497846e	Kernel: Don't allow profiling a dead process Work towards #1313.	2020-02-27 10:42:31 +01:00
Cristian-Bogdan SIRB	05ce8586ea	Kernel: Fix ASSERTION failed in join_thread syscall set_interrupted_by_death was never called whenever a thread that had a joiner died, so the joiner remained with the joinee pointer there, resulting in an assertion fail in JoinBlocker: m_joinee pointed to a freed task, filled with garbage. Thread::current->m_joinee may not be valid after the unblock Properly return the joinee exit value to the joiner thread.	2020-02-27 10:09:44 +01:00
Andreas Kling	d28fa89346	Kernel: Don't assert on sys$kill() with pid=INT32_MIN On 32-bit platforms, INT32_MIN == -INT32_MIN, so we can't expect this to always work: if (pid < 0) positive_pid = -pid; // may still be negative! This happens because the -INT32_MIN expression becomes a long and is then truncated back to an int. Fixes #1312.	2020-02-27 10:02:04 +01:00
Cristian-Bogdan SIRB	717cd5015e	Kernel: Allow process with multiple threads to call exec and exit This allows a process wich has more than 1 thread to call exec, even from a thread. This kills all the other threads, but it won't wait for them to finish, just makes sure that they are not in a running/runable state. In the case where a thread does exec, the new program PID will be the thread TID, to keep the PID == TID in the new process. This introduces a new function inside the Process class, kill_threads_except_self which is called on exit() too (exit with multiple threads wasn't properly working either). Inside the Lock class, there is the need for a new function, clear_waiters, which removes all the waiters from the Process::big_lock. This is needed since after a exit/exec, there should be no other threads waiting for this lock, the threads should be simply killed. Only queued threads should wait for this lock at this point, since blocked threads are handled in set_should_die.	2020-02-26 13:06:40 +01:00
Andreas Kling	ceec1a7d38	AK: Make Vector use size_t for its size and capacity	2020-02-25 14:52:35 +01:00
Andreas Kling	d0f5b43c2e	Kernel: Use Vector::unstable_remove() when deallocating a region Process::m_regions is not sorted, so we can use unstable_remove() to avoid shifting the vector contents. :^)	2020-02-24 18:34:49 +01:00
Andreas Kling	30a8991dbf	Kernel: Make Region weakable and use WeakPtr<Region> instead of Region* This turns use-after-free bugs into null pointer dereferences instead.	2020-02-24 13:32:45 +01:00
Andreas Kling	79576f9280	Kernel: Clear the region lookup cache on exec() Each process has a 1-level lookup cache for fast repeated lookups of the same VM region (which tends to be the majority of lookups.) The cache is used by the following syscalls: munmap, madvise, mprotect and set_mmap_name. After a succesful exec(), there could be a stale Region* in the lookup cache, and the new executable was able to manipulate it using a number of use-after-free code paths.	2020-02-24 12:37:27 +01:00
Liav A	895e874eb4	Kernel: Include the new PIT class in system components	2020-02-24 11:27:03 +01:00
Andreas Kling	fc5ebe2a50	Kernel: Disown shared buffers on sys$execve() When committing to a new executable, disown any shared buffers that the process was previously co-owning. Otherwise accessing the same shared buffer ID from the new program would cause the kernel to find a cached (and stale!) reference to the previous program's VM region corresponding to that shared buffer, leading to a Region* use-after-free. Fixes #1270.	2020-02-22 12:29:38 +01:00
Andreas Kling	ece2971112	Kernel: Disable profiling during the critical section of sys$execve() Since we're gonna throw away these stacks at the end of exec anyway, we might as well disable profiling before starting to mess with the process page tables. One less weird situation to worry about in the sampling code.	2020-02-22 11:09:03 +01:00
Andreas Kling	d7a13dbaa7	Kernel: Reset profiling state on exec() (but keep it going) We now log the new executable on exec() and throw away all the samples we've accumulated so far. But profiling keeps going.	2020-02-22 10:54:50 +01:00
Andreas Kling	2a679f228e	Kernel: Fix bitrotted DEBUG_IO logging	2020-02-21 15:49:30 +01:00

... 2 3 4 5 6 ...

1085 Commits