analytics

mirror of https://github.com/plausible/analytics.git synced 2024-12-28 12:01:39 +03:00

Author	SHA1	Message	Date
Adrian Gruntkowski	2d23db2982	Return warning when imported stats are skipped in Stats API due to unsupported query (#4116 )	2024-05-21 11:45:18 +02:00
Adrian Gruntkowski	7876802d05	Improve CSV importer tests and visit duration query for imported data (#4114 ) * Tidy up breakdown helper functions in CSV importer tests * Fix a typo * Extend city breakdown tests in CSV importer test suite * Make visit duration computation consistent across native and imported data	2024-05-21 09:24:10 +02:00
Uku Taht	b2860d1602	Fix unselecting filter from filter modal (#4109 )	2024-05-21 09:23:57 +02:00
Adrian Gruntkowski	756d9c28ce	Improve test coverage and fix stats API and dashboard CSV export (#4110 ) * Fix broken hostname property and handle missing imported metrics gracefully * Add test for CSV export of imported data * Add extra coverage for property and metric combox which were failing * Compute visit duration and bounce rate for exit pages in imported data * Drop support for breaking down by `event:hostname` property for now	2024-05-20 16:11:35 +01:00
RobertJoonas	df178ea2d5	Introduce :skip_imported_reason field into Query struct (#4115 ) * introduce new query field * improve tests	2024-05-20 15:46:25 +01:00
Karl-Aksel Puulmann	cfdd769984	Fix StatsAPI access for new accounts on business tier (#4105 ) Currently, business tier users created after business tier launch can't access Stats API due to faulty grandfathering logic. This change should fix that.	2024-05-15 11:50:41 +03:00
Adrian Gruntkowski	9374a95cf2	Export and import custom events via CSV (#4096 ) * Export and import custom events via CSV * Add prop support of url for cloaked links and path for 404s in imported queries * Handle custom events with empty URL and path properties gracefully * Make events with properties logic DRY and fix missed cloaked link * Add test for path property breakdown * Update raw CH data fixture and extend CSV importer tests * Fix broken query condition after rebase * Update CHANGELOG.md	2024-05-14 13:50:30 +02:00
Karl-Aksel Puulmann	baa99652f6	Refactor internal Query schema and introduce WhereBuilder (#4082 ) * New struct format for query after parsing * WIP refactoring * WIP: Validations working * WIP: tuple to list * continued refactoring * WIP: parsing defaults * Breakdown tests pass * Window functions fix * Fix default * Remove dead argument * Update filters tests * Update query_test.exs * Fix table_decider * sources tests pass * Filter suggestions fix * revenue/goal filter applied refactor * Update top_stats matching * Get stats_controller tests passing * Update neighbor_aggregate_time_on_page * Refactor Query.remove_event_filters into Query.remove_filters, add new callsites * Move goal where clause building to new WhereBuilder module * Move event:name filters * Move more filters to WhereBuilder * Update fragment to allow non-static meta columns * Build where clause for events table using WhereBuilder * Build sessions table where clause using WhereBuilder * Move time range filtering and site checking to WhereBuilder * WhereBuilder.build_condition method * Remove TODO * _rest pattern for TableDecider, Query pattern matching Future-proofing in a tiny way * Hacky fix to get tests passing for Google API tests * Typespec fix * Merge conflict * refactor special goal filter logic in imported.ex * Docs feedback * put_filter --------- Co-authored-by: Robert Joonas <robertjoonas16@gmail.com>	2024-05-14 11:58:10 +03:00
Mathijs Kadijk	9944b301ec	Add dmg to the list of default tracked files (#4085 ) Co-authored-by: Uku Taht <Uku.taht@gmail.com>	2024-05-14 10:10:18 +03:00
Uku Taht	06e8118dab	Filtering Search Console keywords (#4077 ) * Apply filters in search console request * Remove dead code from search console modal * Remove unimportant information from keyword modal * Show invalid filters from search console * Fix tests * Add/Fix tests * Fix typo * Remove unused variable * Fix typo * Changelog entry * Fix Credo * Display impressions, CTR and position in keyword modal * Undo change that should not have been committed * Fix test * Fix test * filters -> search_console_filters	2024-05-14 09:56:55 +03:00
Cenk Kücük	39cf8c4179	Upgrade Clickhouse version in CI (#4062 ) * Update elixir.yml * Fix error See https://github.com/ClickHouse/ClickHouse/issues/62431#issuecomment-2090283419 * fix order to pass test * Revert "fix order to pass test" This reverts commit `d79fbeb297`. * Update Makefile Update CH version	2024-05-13 21:53:29 +02:00
Karl-Aksel Puulmann	32a16a72fa	Speed up calculating monthly usage for users with many many sites (#4101 ) * Delete dead code * Speed up calculating usage for users with many many sites Currently, the settings page time outs for a user with 14k sites. This PR speeds things up by: 1. Doing the work in parallel (max 10 queries at once) 2. Increasing chunking size (300 -> 1000) Note that the query is relatively lightweight on clickhouse - running these queries manually takes ~70ms. If this becomes slow we can also introduce a PROJECTION to speed up the calculation, but this wasn't a bottleneck currently. On chunking size: ClickHouse can handle even 10k site_ids in a single query fast if run via clickhouse-client , but running the same query via ecto_ch it becomes really slow (60ms vs 1s). Not sure if this is a driver, serialization or networking issue.	2024-05-13 13:36:31 +03:00
ruslandoga	1f4346f4df	Add DATA_DIR (#4100 ) * add DATA_DIR * add test * changelog * fix test in CI where PERSISTENT_CACHE_DIR is always set * consistent fallback	2024-05-13 10:17:56 +02:00
ruslandoga	7af8273702	Handle s3 timeout on settings page (#4036 ) * handle s3 timeout * no-async approach * fewer changes * add tests * make s3 failure test EE-only	2024-05-13 10:17:27 +02:00
ruslandoga	c41e5980fb	Update links to configs (#4080 )	2024-05-10 14:00:51 +02:00
RobertJoonas	a6dcd19ccc	Autoconfigure event goals (#4093 ) * add new goal suggestions API * silence credo * Order suggestions from subqueries explicitly * allow autoconfiguring goals Co-authored-by: Adrian Gruntkowski <adrian.gruntkowski@gmail.com> * Fix form modal tab switching behavior * add test * Remove redundant and invalid action link title --------- Co-authored-by: Adrian Gruntkowski <adrian.gruntkowski@gmail.com>	2024-05-10 11:48:27 +02:00
Adrian Gruntkowski	6bbc8d69a4	Avoid crashing on empty GA4 HTTP report response (#4094 )	2024-05-10 09:12:22 +02:00
RobertJoonas	370aeb4b02	Refactor filter suggestions with a more DRY approach (#4090 ) * refactor filter suggestions with a more DRY approach * Avoid DRYing props string->atom translation --------- Co-authored-by: Adrian Gruntkowski <adrian.gruntkowski@gmail.com>	2024-05-10 09:12:02 +02:00
Adrian Gruntkowski	4e7e932a75	Add support for imported custom events (#4033 ) * Add Ecto schema for imported custom events * Start importing custom events from GA4 * query imported goals * make it possible to query events metric from imported * make it possible to query pageviews in goal breakdown * make it possible to query conversion rate * fix rate limiting test * add CR tests for dashboard API * implement imported link_url breakdown * override special custom event names coming from GA4 * allow specific goal filters in imported_q * update GA4 import tests to use Stats API * Improve tests slightly * Update CHANGELOG.md --------- Co-authored-by: Robert Joonas <robertjoonas16@gmail.com>	2024-05-09 13:13:19 +01:00
Adrian Gruntkowski	d8435f2e01	Remove `imports_exports` and `csv_imports_exports` feature flags (#4089 )	2024-05-09 10:09:24 +02:00
ruslandoga	b1c13617b3	Allow empty city geonames in exports (#4079 )	2024-05-09 09:17:28 +02:00
ruslandoga	84ed7988a8	Drop `local_start_date/1` (#4088 )	2024-05-09 08:59:23 +02:00
ruslandoga	1114686a0e	refactor filtering in export queries (#4086 )	2024-05-09 08:55:12 +02:00
ruslandoga	953af278d8	Fix type in date_range/2 (#4087 )	2024-05-09 08:47:06 +02:00
Cenk Kücük	435d708289	Update join_algorithm (#4078 )	2024-05-07 16:35:22 +02:00
Karl-Aksel Puulmann	0a883f10e7	Refactor: Use common current_visitors code (#4071 ) * Use common module for counting current visitors in external stats controller * Refactor spike notifier, remove now-dead code	2024-05-07 15:03:37 +03:00
Karl-Aksel Puulmann	9f6ea00a72	Clean up deleted sites from clickhouse cron (#4065 )	2024-05-07 15:03:32 +03:00
Karl-Aksel Puulmann	64850cd00f	Remove `maybe_drop_prop_filter` (#4066 ) * Fix event props paygate Previous code wasn't properly omitting event property filters from queries. Discovered while refactoring the code. Extracting fix from refactor for easier reviewability. * a * Drop function	2024-05-07 15:03:16 +03:00
RobertJoonas	c106595be0	Migration: add imported custom events (#4076 ) * Add `imported_custom_events` to CH * remove redundant table setting * add path column --------- Co-authored-by: Adrian Gruntkowski <adrian.gruntkowski@gmail.com>	2024-05-07 11:46:27 +01:00
ruslandoga	02d4709be7	add csv fixture for e2e export/import test (#4037 ) * add inline csv fixture * use new csvs * cleanup csv reading and site_id replacing * perform comparisons between native and imported queries * help help help * help help * help * eh * fin * exclude export/import e2e test when experimental_reduced_joins flag is enabled * adapt to new pageviews * adapt to experimental_reduced_joins * credo is formatter * cleanup * assert bounce rates equal in city breakdown * fix rebase against master * clean-up dataset * update comment * fix typo * apply csv changes to the files * use sessions timestamp for exports' dates --------- Co-authored-by: RobertJoonas <56999674+RobertJoonas@users.noreply.github.com>	2024-05-07 08:48:22 +01:00
Adrian Gruntkowski	62138e0dad	Apply sampling factor multiplication consistently across CSV export queries (#4074 ) * Apply sampling factor multiplication consistently across CSV export queries * Fix casting Co-authored-by: ruslandoga <doga.ruslan@gmail.com> --------- Co-authored-by: ruslandoga <doga.ruslan@gmail.com>	2024-05-07 09:15:43 +02:00
Adrian Gruntkowski	d7ca8d9600	Revert "Debug queries for super-admins (#4010 )" (#4073 ) This reverts commit `dd493fdad2`.	2024-05-06 15:05:17 +02:00
Adrian Gruntkowski	70ed30e871	Add override on CSV exports for super admins (#4070 )	2024-05-06 11:12:47 +02:00
ruslandoga	b0fc527eb2	Run Oban in CE too (#4061 )	2024-05-06 10:40:09 +02:00
ruslandoga	c49500204c	use released ecto_ch v0.3.5 (#4054 )	2024-05-06 10:39:45 +02:00
ruslandoga	6e3556d2aa	Use sessions timestamp for exports' dates (#4063 )	2024-05-06 10:39:00 +02:00
Adrian Gruntkowski	39357150e9	Improve styling of imports list (#4069 )	2024-05-06 10:38:37 +02:00
Karl-Aksel Puulmann	035cb3a379	Use ALIAS columns to simplify querying API (#4059 ) * Cleanup: remove @api_prop_name_to_db-related cruft * Use aliases elsewhere in code	2024-05-05 12:03:00 +03:00
Karl-Aksel Puulmann	17f812443d	Return session in each time bucket its active in for hourly/minute timeseries (#4052 ) * Fix typo in test name * Update test_helper, enable experimental_session_count together with experimental_reduced_joins * Return session in each time bucket its active in for hourly/minute timeseries The behavior is behind experimental_session_count flag This results in more accurate visitor showing compared to previous approach of showing each user only active the _last_ time they did a pageview. Were not doing this for monthly/weekly graphs due to query performance cost and it having a small effect there. See also https://3.basecamp.com/5308029/buckets/35611491/messages/7085680123 * Add tests for new behavior Note the new behavior mimics the old one precisely, these tests fail if only experimental_reduced_joins is on, but not experimental_session_count * Type erasure * Dead comment remove * Expected_plot change	2024-05-05 11:44:43 +03:00
Karl-Aksel Puulmann	850a843d82	Add migration to add ALIAS columns to common session/visit properties (#4058 ) This allows for simplifications in the API code	2024-05-05 11:30:39 +03:00
ruslandoga	972dd5d150	redirect to s3 url when downloading exports (#4002 ) * redirect to s3 url * use new on_ee macro, reduce wait time for email to five seconds	2024-05-02 19:53:12 +01:00
ruslandoga	07cf2b4682	use events_v2 for pageviews export (#4035 )	2024-05-02 14:50:31 +01:00
Karl-Aksel Puulmann	bcde2b8cf2	Add Yesterday as an time range option in the dashboard (#4040 )	2024-05-02 14:57:10 +03:00
ruslandoga	8712e91bcb	drop time on page from exports (#4051 )	2024-05-02 08:29:24 +01:00
RobertJoonas	bfdadc2eee	Include breakdown property in the Query struct (#4053 ) * keep breakdown prop in the query struct * Explicitly ignore property param in aggregate and timeseries Since parameter validation depends on the breakdown property, we need to make sure it doesn't have any unexpected effect in endpoints where it's not expected.	2024-04-30 18:43:46 +01:00
Adrian Gruntkowski	41fef85d29	Implement resumable GA4 imports to work around rate limiting (#4049 ) * Add support for resuming import to GA4 importer * Handle rate limiting gracefully for all remainig GA4 HTTP requests * Show notice tooltip for long running imports * Bump resume job schedule delay to 65 minutes * Fix tooltip styling	2024-04-30 18:06:18 +02:00
hq1	dd493fdad2	Debug queries for super-admins (#4010 ) * Debug queries for super-admins * Fixup * Update lib/plausible/clickhouse_repo.ex Co-authored-by: ruslandoga <doga.ruslan@gmail.com> * Try again with https://github.com/plausible/analytics/pull/3699 It's still clunky 😅 Co-authored-by: Karl-Aksel Puulmann <macobo@users.noreply.github.com> Co-authored-by: ruslandoga <doga.ruslan@gmail.com> * Move headers injection to a separate plug module * Add tests * Update repo test * Format * Add moduledoc * Don't assume order in query_log * Be patient about query_log maybe? * huh? * huh2 * Wait longer * Guard against \x00 in response header - testing on stage * Fixup * fixup * fixup * s/debug_label/label * Include `site_id` and `metadata` in `log_comment` * Tolerate non-serializable log_comment contents --------- Co-authored-by: ruslandoga <doga.ruslan@gmail.com> Co-authored-by: Karl-Aksel Puulmann <macobo@users.noreply.github.com>	2024-04-30 09:57:28 +02:00
hq1	ad9141a9d0	Display tooltips on plan change when limits exceeded (#4048 ) * Reapply "Display upgrade tooltips for exceeded limits (#4032)" This reverts commit `76e910d45c`. * Switch to alpinejs controlled tooltips Co-authored-by: Robert Joonas <robertjoonas16@gmail.com> * Remove unused selector * Refactor plan limits warning and extract tooltip component * Remove redundant check --------- Co-authored-by: Robert Joonas <robertjoonas16@gmail.com> Co-authored-by: Adrian Gruntkowski <adrian.gruntkowski@gmail.com>	2024-04-29 11:16:15 +02:00
hq1	b2009aa158	Rely on con_cache telemetry (#4019 ) * Rely on con_cache telemetry Now that https://github.com/sasa1977/con_cache/pull/76 is released, we don't have to use low-level operations to emit hit/miss events. This PR also wraps cache processes with a function returning appropriate child specs lists. Ideally each cache will have its own supervisor/child specs going forward. This is an intermediate step in that direction. * Update lib/plausible/application.ex Co-authored-by: Adrian Gruntkowski <adrian.gruntkowski@gmail.com> * Declare caches without warmers with plain child specs --------- Co-authored-by: Adrian Gruntkowski <adrian.gruntkowski@gmail.com>	2024-04-29 11:00:53 +02:00
RobertJoonas	726fe2d982	fix test (#4038 )	2024-04-29 09:30:30 +01:00

1 2 3 4 5 ...

2731 Commits