ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-10 13:00:29 +03:00

Author	SHA1	Message	Date
Daniel Bertalan	0a36cea9dc	Tests: Re-enable UserspaceEmulator tests on the Clang build Now that problems that made UE crash have been fixed, this test should now pass.	2021-08-14 18:42:14 +02:00
Itamar	e57fdb63f8	Tests: Add regression tests for the LibCpp preprocessor Similarly to the LibCpp parser regression tests, these tests run the preprocessor on the .cpp test files under Userland/LibCpp/Tests/preprocessor, and compare the output with existing .txt ground truth files.	2021-08-14 12:40:55 +02:00
Timothy Flynn	df14d11a11	LibRegex: Disallow invalid interval qualifiers in Unicode mode Fixes all remaining 'built-ins/RegExp/property-escapes' test262 tests.	2021-08-11 13:11:01 +02:00
Timothy Flynn	1e91334008	LibUnicode: Handle edge-case script extensions, Common and Inherited These script extensions have some peculiar behavior in the Unicode spec. The UCD ScriptExtension file does not contain these scripts. Rather, it is implied the code points which have these scripts as an extension are the code points that both: 1. Have Common or Inherited as their primary script value 2. Do not have any other script value in their script extension lists Because these are not explictly listed in the UCD, we must manually form these script extensions.	2021-08-11 13:11:01 +02:00
Timothy Flynn	47bb350ebd	LibUnicode: Generate separate tables for scripts and script extensions Notice that unlike the note in populate_general_category_unions(), script extension do indeed have code point ranges which overlap. Thus, this commit adds code to handle that, and hooks it into the GC unions.	2021-08-11 13:11:01 +02:00
Timothy Flynn	5ac23d244d	LibUnicode: Generate separate tables for Unicode properties Similar to General Categories, this generates separate tables for the Property list.	2021-08-11 13:11:01 +02:00
Timothy Flynn	b06c104076	LibUnicode: Include Unassigned code points in the Other General Category Now that the generator parses unassigned General Category properties, it can include Unassigned (Cn) in the Other (C) category.	2021-08-11 13:11:01 +02:00
Timothy Flynn	7dce2bfe23	LibUnicode: Generate separate tables for General Category properties Previously, each code point's General Category was part of the generated UnicodeData structure. This ultimately presented two problems, one functional and one performance related: * Some General Categories are applied to unassigned code points, for example the Unassigned (Cn) category. Unassigned code points are strictly excluded from UnicodeData.txt, so by relying on that file, the generator is unable to handle these categories. * Lookups for General Categories are slower when searching through the large UnicodeData hash map. Even though lookups are O(1), the hash function turned out to be slower than binary searching through a category-specific table. So, now a table is generated for each General Category. When querying a code point for a category, a binary search is done on each code point range in that category's table to check if code point has that category. Further, General Categories are now parsed from the UCD file DerivedGeneralCategory.txt. This file is a normal "prop list" file and contains the categories for unassigned code points.	2021-08-11 13:11:01 +02:00
Mandar Kulkarni	aaf232f903	Tests: Add test for String::bijective_base_from()	2021-08-09 14:14:07 +04:30
Daniel Bertalan	146dcf4856	Tests: Disable UserspaceEmulator tests for Clang builds There seems to be more incorrect assumptions about Clang-built executables' memory layout than expected. These make the CI fail even though the system is functional in all other aspects. While this is being fixed, let's just disable tests for UserspaceEmulator.	2021-08-08 10:55:36 +02:00
Daniel Bertalan	7396e4aedc	LibDebug: Store 64-bit numbers in AttributeValue This helps us avoid weird truncation issues and fixes a bug on Clang builds where truncation while reading caused the DIE offsets following large LEB128 numbers to be incorrect. This removes the need for the separate `LongUnsignedNumber` type.	2021-08-08 10:55:36 +02:00
Daniel Bertalan	5f2f460cc8	Tests: Add Clang pragma for turning off optimizations Clang does not accept `GCC optimize("O0")`, so it fails to build the system with it.	2021-08-08 10:55:36 +02:00
Itamar	4673a517f6	LibCpp: Do lexing in the Preprocessor We now call Preprocessor::process_and_lex() and pass the result to the parser. Doing the lexing in the preprocessor will allow us to maintain the original position information of tokens after substituting definitions.	2021-08-07 21:24:11 +02:00
Lenny Maiorani	8e949c5c91	Tests: Remove unused variables for clang build Problem: - Clang will not build `Tests/LibTLS` due to unused variables. Solution: - Remove the unused variables.	2021-08-06 23:55:27 +02:00
TheFightingCatfish	4e8e1b7b3a	AK: Improve the parsing of data urls Improve the parsing of data urls in URLParser to bring it more up-to- spec. At the moment, we cannot parse the components of the MIME type since it is represented as a string, but the spec requires it to be parsed as a "MIME type record".	2021-08-06 10:45:17 +02:00
Timothy Flynn	484ccfadc3	LibRegex: Support property escapes of Unicode script extensions	2021-08-04 13:50:32 +01:00
Timothy Flynn	06088df729	LibRegex: Support property escapes of the Unicode script property Note that unlike binary properties and general categories, scripts must be specified in the non-binary (Script=Value) form.	2021-08-04 13:50:32 +01:00
Brian Gianforcaro	4df1657898	Tests: Add coverage for sys$alarm() success case	2021-08-03 18:44:01 +02:00
Brian Gianforcaro	ea401fb3c3	Tests: Add coverage for sys$alarm() canceling a stale timer This is a regression test to validate the functionality that was reported broken in #9071, where the kernel would spin attempting to cancel a stale timer.	2021-08-03 18:44:01 +02:00
Timothy Flynn	dc9f516339	LibRegex: Generate negated property escapes as a single instruction These were previously generated as two instructions, Compare [Inverse] and Compare [Property].	2021-08-02 21:02:09 +04:30
Timothy Flynn	4de4312827	LibRegex: Support property escapes of the form \p{Type=Value} Before now, only binary properties could be parsed. Non-binary props are of the form "Type=Value", where "Type" may be General_Category, Script, or Script_Extension (or their aliases). Of these, LibUnicode currently supports General_Category, so LibRegex can parse only that type.	2021-08-02 21:02:09 +04:30
Timothy Flynn	1e10d6d7ce	LibRegex: Support property escapes of Unicode General Categories This changes LibRegex to parse the property escape as a Variant of Unicode Property & General Category values. A byte code instruction is added to perform matching based on General Category values.	2021-08-02 21:02:09 +04:30
Ali Mohammad Pur	85d87cbcc8	LibRegex: Add some tests for Fork{Stay,Jump} performance Without the previous fixes, these will blow up the stack.	2021-08-02 17:22:50 +04:30
Brian Gianforcaro	d1644c26d6	Tests: Remove unused header includes	2021-08-01 08:10:16 +02:00
Brian Gianforcaro	c54ae3afd6	Tests: Fix AK/TestJSON.cpp by not relying on disk resources The following commit broke Tests/AK/TestJSON.cpp as it removed the file that the test loaded from disk to validate JSON parsing. commit `ad141a2286` Author: Andreas Kling <kling@serenityos.org> Date: Sat Jul 31 15:26:14 2021 +0200 Base: Remove "test.frm" from HackStudio test project Instead of restoring the file, lets just embed a bit of JSON in the test case to avoid using external resources, as they obviously are surprising and make the test less portable across environments.	2021-07-31 23:56:40 +02:00
Timothy Flynn	d485cf29d7	LibRegex+LibUnicode: Begin implementing Unicode property escapes This supports some binary property matching. It does not support any properties not yet parsed by LibUnicode, nor does it support value matching (such as Script_Extensions=Latin).	2021-07-30 21:26:31 +01:00
Andreas Kling	bccdc08487	Kernel: Unmapping a non-mapped region with munmap() should be a no-op Not a regression per se from `0fcb9efd86` since we were crashing before that which is obviously worse.	2021-07-30 13:16:55 +02:00
Brian Gianforcaro	c9395d7e9a	Tests: Validate unmapping 0x0 doesn't crash the Kernel Previously unmapping any offset starting at 0x0 would assert in the kernel, add a regression test to validate the fix. Co-authored-by: Federico Guerinoni <guerinoni.federico@gmail.com>	2021-07-30 11:28:55 +02:00
Timothy Flynn	c4bfda7f7f	LibUnicode: Handle code points that are both cased and case-ignorable Apparently, some code points fit both categories, for example U+0345 (COMBINING GREEK YPOGEGRAMMENI). Handle this fact when determining if a code point is a final code point in a string.	2021-07-28 23:42:29 +02:00
Timothy Flynn	7827aede6f	LibUnicode: Check word break when deciding on case-ignorable code points	2021-07-28 23:42:29 +02:00
Timothy Flynn	c45a014645	LibUnicode: Check property list when deciding if a code point is cased	2021-07-28 23:42:29 +02:00
ovf	898b8ffcb6	LibWeb: Avoid assertion failure on parsing numeric character references	2021-07-28 18:32:22 +02:00
Timothy Flynn	39f971e42b	LibUnicode: Begin implementing special Unicode case folding This implements unconditional special case folding, and conditional folding for non-locale cases. Worth noting that the only conditional, non-locale special case is for converting an uppercase sigma to lowercase.	2021-07-27 21:04:36 +01:00
ovf	13c7d55320	LibWeb: Fix parsing of character references in attribute values	2021-07-27 00:03:43 +02:00
Timothy Flynn	4dda3edc9e	LibUnicode: Introduce a Unicode library for interacting with UCD files The Unicode standard publishes the Unicode Character Database (UCD) with information about every code point, such as each code point's upper case mapping. LibUnicode exists to download and parse UCD files at build time and to provide accessors to that data. As a start, LibUnicode includes upper- and lower-case code point converters.	2021-07-26 17:03:55 +01:00
brapru	7e40c17460	AK: Create MACAddress from string Previously there was no way to create a MACAddress by passing a direct address as a string. This will allow programs like the arp utility to create a MACAddress instance by user-passed addresses.	2021-07-25 17:57:08 +02:00
Luke	a00b5fc7b7	Tests: Add tests for the quoted printable decoder	2021-07-24 20:11:28 +04:30
Timothy Flynn	345ef6abba	LibRegex: Support ECMA-262 Unicode escapes of the form "\u{code_point}" When the Unicode flag is set, regular expressions may escape code points by surrounding the hexadecimal code point with curly braces, e.g. \u{41} is the character "A". When the Unicode flag is not set, this should be considered a repetition symbol - \u{41} is the character "u" repeated 41 times. This is left as a TODO for now.	2021-07-23 23:06:57 +01:00
Timothy Flynn	47f6bb38a1	LibRegex: Support UTF-16 RegexStringView and improve Unicode matching When the Unicode option is not set, regular expressions should match based on code units; when it is set, they should match based on code points. To do so, the regex parser must combine surrogate pairs when the Unicode option is set. Further, RegexStringView needs to know if the flag is set in order to return code point vs. code unit based string lengths and substrings.	2021-07-23 23:06:57 +01:00
Brian Gianforcaro	c2282ee28d	Tests: Add test coverage for sys$pledge(..) argument validation	2021-07-23 19:02:25 +02:00
Brian Gianforcaro	fa448456a9	Tests: Add test coverage for sys$unveil(..) argument validation	2021-07-23 19:02:25 +02:00
Ali Mohammad Pur	d40d10aae7	AK: Implement {any,all}_of(IterableContainer&&, Predicate) This is a generally nicer-to-use version of the existing {any,all}_of() that doesn't require the user to explicitly provide two iterators. As a bonus, it also allows arbitrary iterators (as opposed to the hard requirement of providing SimpleIterators in the iterator version).	2021-07-22 22:56:20 +02:00
Ali Mohammad Pur	6c9ef20010	AK: Add a CommonType<Ts...> type trait Also adds a simple-ish test for CommonType.	2021-07-22 22:56:20 +02:00
Timothy Flynn	9b83cd1abf	AK: Add Utf16View for decoding UTF-16 strings Also includes a way to transcode from and to UTF-8 strings.	2021-07-22 09:10:44 +02:00
Andreas Kling	c7d891765c	LibGfx: Use "try_" prefix for static factory functions Also mark them as [[nodiscard]].	2021-07-21 18:02:15 +02:00
Andrew Kaster	64aac345d3	AK: Use new Formatter for each element in Formatter<Vector<T>> The state of the formatter for the previous element should be thrown away for each iteration. This showed up when trying to format a Vector<String>, since Formatter<StringView> was unhappy about some state that gets set when it's called. Add a test for Formatter<Vector>.	2021-07-19 05:17:05 +04:30
Peter Bindels	ef85c4f747	Tests: Make mmap test point to new kernel address too During a recent commit the 64-bit kernel was moved to a different address, breaking this test (unnoticed). This fixes it, so we can turn on breaking x86_64 tests on the CI again.	2021-07-18 22:08:20 +02:00
Ali Mohammad Pur	f364fcec5d	LibRegex+Everywhere: Make LibRegex more unicode-aware This commit makes LibRegex (mostly) capable of operating on any of the three main string views: - StringView for raw strings - Utf8View for utf-8 encoded strings - Utf32View for raw unicode strings As a result, regexps with unicode strings should be able to properly handle utf-8 and not stop in the middle of a code point. A future commit will update LibJS to use the correct type of string depending on the flags.	2021-07-18 21:10:55 +04:30
Ali Mohammad Pur	e5af15a6e9	LibRegex: Don't do out-of-bound match accesses when a test fails	2021-07-18 21:10:55 +04:30
Peter Bindels	b748f11f2d	Tests: Disable test if platform has no UserspaceEmulator Disable test for component with same check that component itself has.	2021-07-18 12:49:33 +01:00

1 2 3 4 5

229 Commits