hledger

mirror of https://github.com/simonmichael/hledger.git synced 2025-01-07 19:28:26 +03:00

Author	SHA1	Message	Date
Alex Chen	3d2584d869	lib: switch to megaparsec 7	2018-09-30 20:15:12 -06:00
Alex Chen	855a8f1985	lib: Re-implement the 'ExceptT' layer of the parser We previously had another parser type, 'type ErroringJournalParser = ExceptT String ...' for throwing parse errors without the possibility of backtracking. This parser type was removed under the assumption that it would be possible to write our parser without this capability. However, after a hairy backtracking bug, we would now prefer to have the option to prevent backtracking. - Define a 'FinalParseError' type specifically for the 'ExceptT' layer - Any parse error can be raised as a "final" parse error - Tracks the stack of include files for parser errors, anticipating the removal of the tracking of stacks of include files in megaparsec 7 - Although a stack of include files is also tracked in the 'StateT Journal' layer of the parser, it seems easier to guarantee correct error messages in the 'ExceptT FinalParserError' layer - This does not make the 'StateT Journal' stack redundant because the 'ExceptT FinalParseError' stack cannot be used to detect cycles of include files	2018-09-29 22:33:34 -06:00
Simon Michael	cd67f8ea68	tests: clear out old boilerplate	2018-08-31 18:12:17 -07:00
Simon Michael	d778a92561	tests: export HUnit/EasyTest from Hledger.Utils.Test; more helpers	2018-08-18 15:19:59 +01:00
Simon Michael	d5430e7ddf	clean up debug helpers (api change)	2018-07-16 15:28:58 +01:00
Simon Michael	0ce9c5728a	switch to base-compat-batteries to fix ghc 7.10 support (#794) base-compat-batteries provides the same API across more ghc versions than base-compat does, at the cost of more dependencies. Eg it exports Prelude.Compat ((<>)) with ghc 7.10/base 4.8, which we expect. My belief is that several of our deps already require it so the added cost is not too great. We should probably go back to base-compat when possible though, eg when we stop supporting ghc 7.10.	2018-06-04 17:32:42 -07:00
Peter Simons	6db7f800ee	hledger-lib: fix doctest suite after recent package updates The new version of our package set apparently contains both base-compat and base-compat-batteries in its transitive closure. This breaks the doctest suite, which just imports everything into scope when the tests are run, thereby making module names like Prelude.Compat ambiguous.	2018-06-04 21:41:15 +02:00
Alex Chen	b245ec7b3d	lib: remove the megaparsec compatability module	2018-05-22 12:16:46 -07:00
Alex Chen	09fd8132b7	lib: refactor: weaken types of comment parsers	2018-05-17 18:15:06 -07:00
Dmitry Astapov	ecf49b1e4b	lib: auto postings generated before amount inference and balance checks (#729)	2018-04-17 14:33:32 -07:00
Moritz Kiefer	d7b68fbd7d	Use skipMany/skipSome for parsing spacenonewline This avoids allocating the list of space characters only to then discard it.	2018-03-25 22:59:05 +01:00
Mykola Orliuk	b7dbe044b0	journal: use decimal sep hint for amount parser Make use of commodity format directive as a hint for parsing amount. Kinda resolves simonmichael/hledger#487	2017-11-27 15:47:56 -08:00
Simon Michael	580ad88dca	timedot: fix parsing of month quantities (Nmo) [ci skip]	2017-09-26 15:11:37 -10:00
Simon Michael	1ebf1fec28	timedot: also provide syntax for seconds, days, weeks, months & years	2017-08-21 17:28:57 -07:00
Simon Michael	5cdb60b69b	timedot: allow minutes to be logged as Nm	2017-08-20 13:00:29 -07:00
Simon Michael	d7d5f8a064	add support for megaparsec 6 (fixes #594) Older megaparsec is still supported. Also cleans up our custom parser types, and some text (un)packing is done in different places (possible performance impact).	2017-07-27 19:20:46 -07:00
Johannes Gerer	74502f7e50	more general parser types enabling reuse outside of IO (#439)	2016-12-09 15:57:17 -08:00
Simon Michael	1f2276c100	lib: mark ledger reader as experimental, don't use automatically	2016-11-20 10:42:12 -08:00
Simon Michael	b6ff170688	lib: simplify format detection, avoid ledger reader by default When we don't know a file's format, instead of choosing a subset of readers based on content sniffing, now we just try them all. Also, LedgerReader is now used only as a last resort, as it's not yet competitive with JournalReader.	2016-11-18 13:24:57 -08:00
Simon Michael	3ddc9d7432	lib: clarify file format detectors	2016-11-16 13:25:33 -08:00
Moritz Kiefer	4141067428	Replace Parsec with Megaparsec (see #289) (#366) * Replace Parsec with Megaparsec (see #289) This builds upon PR #289 by @rasendubi * Revert renaming of parseWithState to parseWithCtx * Fix doctests * Update for Megaparsec 5 * Specialize parser to improve performance * Pretty print errors * Swap StateT and ParsecT This is necessary to get the correct backtracking behavior, i.e. discard state changes if the parsing fails.	2016-07-29 08:57:10 -07:00
Simon Michael	770dcee742	lib: textification: comments and tags No change. hledger -f data/100x100x10.journal stats <<ghc: 42859576 bytes, 84 GCs, 193781/269984 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.001 elapsed), 0.016 MUT (0.020 elapsed), 0.009 GC (0.011 elapsed) :ghc>> <<ghc: 42859576 bytes, 84 GCs, 193781/269984 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.001 elapsed), 0.015 MUT (0.018 elapsed), 0.009 GC (0.013 elapsed) :ghc>> hledger -f data/1000x1000x10.journal stats <<ghc: 349576344 bytes, 681 GCs, 1407388/4091680 avg/max bytes residency (7 samples), 11M in use, 0.000 INIT (0.000 elapsed), 0.124 MUT (0.130 elapsed), 0.047 GC (0.055 elapsed) :ghc>> <<ghc: 349576280 bytes, 681 GCs, 1407388/4091680 avg/max bytes residency (7 samples), 11M in use, 0.000 INIT (0.000 elapsed), 0.126 MUT (0.132 elapsed), 0.049 GC (0.058 elapsed) :ghc>> hledger -f data/10000x1000x10.journal stats <<ghc: 3424030664 bytes, 6658 GCs, 11403359/41071624 avg/max bytes residency (11 samples), 111M in use, 0.000 INIT (0.000 elapsed), 1.207 MUT (1.228 elapsed), 0.473 GC (0.528 elapsed) :ghc>> <<ghc: 3424030760 bytes, 6658 GCs, 11403874/41077288 avg/max bytes residency (11 samples), 111M in use, 0.000 INIT (0.002 elapsed), 1.234 MUT (1.256 elapsed), 0.470 GC (0.520 elapsed) :ghc>> hledger -f data/100000x1000x10.journal stats <<ghc: 34306547448 bytes, 66727 GCs, 76805504/414629288 avg/max bytes residency (14 samples), 1009M in use, 0.000 INIT (0.003 elapsed), 12.615 MUT (12.813 elapsed), 4.656 GC (5.291 elapsed) :ghc>> <<ghc: 34306547320 bytes, 66727 GCs, 76805504/414629288 avg/max bytes residency (14 samples), 1009M in use, 0.000 INIT (0.009 elapsed), 12.802 MUT (13.065 elapsed), 4.774 GC (5.441 elapsed) :ghc>>	2016-05-24 19:00:57 -07:00
Simon Michael	c89c33b36e	lib: textification: parse stream 10% more allocation, but 35% lower maximum residency, and slightly quicker. hledger -f data/100x100x10.journal stats <<ghc: 39327768 bytes, 77 GCs, 196834/269496 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.010 elapsed), 0.020 MUT (0.092 elapsed), 0.014 GC (0.119 elapsed) :ghc>> <<ghc: 42842136 bytes, 84 GCs, 194010/270912 avg/max bytes residency (3 samples), 2M in use, 0.000 INIT (0.009 elapsed), 0.016 MUT (0.029 elapsed), 0.012 GC (0.120 elapsed) :ghc>> hledger -f data/1000x1000x10.journal stats <<ghc: 314291440 bytes, 612 GCs, 2070776/6628048 avg/max bytes residency (7 samples), 16M in use, 0.000 INIT (0.000 elapsed), 0.128 MUT (0.144 elapsed), 0.059 GC (0.070 elapsed) :ghc>> <<ghc: 349558872 bytes, 681 GCs, 1397597/4106384 avg/max bytes residency (7 samples), 11M in use, 0.000 INIT (0.004 elapsed), 0.124 MUT (0.133 elapsed), 0.047 GC (0.053 elapsed) :ghc>> hledger -f data/10000x1000x10.journal stats <<ghc: 3070026824 bytes, 5973 GCs, 12698030/62951784 avg/max bytes residency (10 samples), 124M in use, 0.000 INIT (0.002 elapsed), 1.268 MUT (1.354 elapsed), 0.514 GC (0.587 elapsed) :ghc>> <<ghc: 3424013128 bytes, 6658 GCs, 11405501/41071624 avg/max bytes residency (11 samples), 111M in use, 0.000 INIT (0.001 elapsed), 1.343 MUT (1.406 elapsed), 0.511 GC (0.573 elapsed) :ghc>> hledger -f data/100000x1000x10.journal stats <<ghc: 30753387392 bytes, 59811 GCs, 117615462/666703600 avg/max bytes residency (14 samples), 1588M in use, 0.000 INIT (0.000 elapsed), 12.068 MUT (12.238 elapsed), 6.015 GC (7.190 elapsed) :ghc>> <<ghc: 34306530696 bytes, 66727 GCs, 76806196/414629312 avg/max bytes residency (14 samples), 1009M in use, 0.000 INIT (0.010 elapsed), 14.357 MUT (16.370 elapsed), 5.298 GC (6.534 elapsed) :ghc>>	2016-05-24 19:00:57 -07:00
Simon Michael	0f5ee154c4	lib: simplify parsers; cleanups (#275) The journal/timeclock/timedot parsers, instead of constructing (opaque) journal update functions which are later applied to build the journal, now construct the journal directly (by modifying the parser state). This is easier to understand and debug. It also removes any possibility of the journal updates being a space leak. (They weren't, in fact memory usage is now slightly higher, but that will be addressed in other ways.) Also: Journal data and journal parse info have been merged into one type (for now), and field names are more consistent. The ParsedJournal type alias has been added to distinguish being-parsed and finalised journals. Journal is now a monoid. stats: fixed an issue with ordering of include files journal: fixed an issue with ordering of included same-date transactions timeclock: sessions can no longer span file boundaries (unclocked-out sessions will be auto-closed at the end of the file). expandPath now throws a proper IO error (and requires the IO monad).	2016-05-23 00:44:19 -07:00
Simon Michael	7f5e09096f	lib: rename JournalContext to JournalParseState	2016-05-18 20:57:34 -07:00
Simon Michael	84097b75c7	journal: can now include timeclock/timedot files (#320) journal files can now include journal, timeclock or timedot files (but not yet CSV files). Also timeclock/timedot files no longer support default year directives. The Hledger.Read.* modules have been reorganised for better reuse. Hledger.Read.Utils has been renamed Hledger.Read.Common and holds low-level parsers & utilities; high-level read utilities have moved to Hledger.Read.	2016-05-17 19:46:54 -07:00
Simon Michael	a9afd7bcbe	lib: slightly better journal/time format detection The Journal, Timelog and Timedot readers' detectors now check each line in the sample data, not just the first one. I think the sample data is only about 30 chars right now, but even so this fixed a format detection issue I was seeing.	2016-02-19 23:02:10 -08:00
Simon Michael	70863ae40b	lib: timedot allow indenting	2016-02-19 22:58:08 -08:00
Simon Michael	4b4a4bacf7	lib: timedot parse order fix	2016-02-19 22:57:43 -08:00
Simon Michael	0adcdf21f8	lib: timedot parsing fix	2016-02-19 22:57:06 -08:00
Simon Michael	b26dd3d9b0	lib: fix timedot comments	2016-02-19 22:55:30 -08:00
Simon Michael	06b54bf05e	lib: timedot format, convenient for time logging Timedot is a plain text format for logging dated, categorised quantities (eg time), supported by hledger. It is convenient for approximate and retroactive time logging, eg when the real-time clock-in/out required with a timeclock file is too precise or too interruptive. It can be formatted like a bar chart, making clear at a glance where time was spent.	2016-02-19 17:55:57 -08:00

32 Commits