analytics/Plausible.Purge.html

309 lines
11 KiB
HTML
Raw Normal View History

<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta http-equiv="x-ua-compatible" content="ie=edge">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<meta name="generator" content="ExDoc v0.30.3">
<meta name="project" content="Plausible v0.0.1">
<title>Plausible.Purge — Plausible v0.0.1</title>
<link rel="stylesheet" href="dist/html-elixir-P5GXSCHE.css" />
<script src="dist/handlebars.runtime-NWIB6V2M.js"></script>
<script src="dist/handlebars.templates-NBND3S2D.js"></script>
<script src="dist/sidebar_items-72CEEEE8.js"></script>
<script src="docs_config.js"></script>
<script async src="dist/html-CGDDOCMI.js"></script>
</head>
<body data-type="modules" class="page-module">
<script>
try {
var settings = JSON.parse(localStorage.getItem('ex_doc:settings') || '{}');
if (settings.theme === 'dark' ||
((settings.theme === 'system' || settings.theme == null) &&
window.matchMedia('(prefers-color-scheme: dark)').matches)
) {
document.body.classList.add('dark')
}
} catch (error) { }
</script>
<div class="main">
<button class="sidebar-button sidebar-toggle" aria-label="toggle sidebar">
<i class="ri-menu-line ri-lg" title="Collapse/expand sidebar"></i>
</button>
<section class="sidebar">
<form class="sidebar-search" action="search.html">
<button type="submit" class="search-button" aria-label="Submit Search">
<i class="ri-search-2-line" aria-hidden="true" title="Submit search"></i>
</button>
<button type="button" tabindex="-1" class="search-close-button" aria-label="Cancel Search">
<i class="ri-close-line ri-lg" aria-hidden="true" title="Cancel search"></i>
</button>
<label class="search-label">
<p class="sr-only">Search</p>
<input name="q" type="text" class="search-input" placeholder="Search..." aria-label="Input your search terms" autocomplete="off" autocorrect="off" autocapitalize="off" spellcheck="false" />
</label>
</form>
<div class="autocomplete">
<div class="autocomplete-results">
</div>
</div>
<div class="sidebar-header">
<a href="readme.html">
<img src="assets/logo.png" alt="Plausible" class="sidebar-projectImage">
</a>
<div class="sidebar-projectDetails">
<a href="readme.html" class="sidebar-projectName" translate="no">
Plausible
</a>
<div class="sidebar-projectVersion" translate="no">
v0.0.1
</div>
</div>
<ul class="sidebar-listNav">
<li><a id="extras-list-link" href="#full-list">Pages</a></li>
<li><a id="modules-list-link" href="#full-list">Modules</a></li>
<li><a id="tasks-list-link" href="#full-list"><span translate="no">Mix</span> Tasks</a></li>
</ul>
</div>
<div class="gradient"></div>
<ul id="full-list"></ul>
</section>
<section class="content">
<output role="status" id="toast"></output>
<div class="content-outer">
<div id="content" class="content-inner">
<h1>
<button class="icon-action display-settings">
<i class="ri-settings-3-line"></i>
<span class="sr-only">Settings</span>
</button>
<a href="https://github.com/plausible/analytics/blob/main/lib/plausible/purge.ex#L1" title="View Source" class="icon-action" rel="help">
<i class="ri-code-s-slash-line" aria-hidden="true"></i>
<span class="sr-only">View Source</span>
</a>
<span translate="no">Plausible.Purge</span>
<small class="app-vsn" translate="no">(Plausible v0.0.1)</small>
</h1>
<section id="moduledoc">
<p>Deletes data from a site.</p><p>Stats are stored on Clickhouse, and unlike other databases data deletion is
done asynchronously.</p><p>All import tables have MergeTree's deduplication mechanism <em>disabled</em> by setting
<code class="inline">replicated_deduplication_window</code> from default 100 to 0. When enabled, every insert
into a given table is compared against hashes of 100 previous inserts (as complete
parts, not concrete rows) and ignored when match is found. The prupose of that
mechanism is making inserts of exact same batches idempotent when retrying them
shortly after - for instance due to timeout, when the client can't easily tell if
previous insert succeeded or not. Deduplication, however, only considers inserts,
not mutations. Deletions do not affect stored hashes, so further inserts of parts
that were deleted will still be treated as duplicates. That's why this feature
is disabled for import tables.</p><p>Although deletions are asynchronous, the parts to delete are &quot;remembered&quot;, so there's
no risk of overlapping deletion causing problems with import following right after it.</p><p>IMPORTANT: Deletion requires revision if/when import tables get moved to sharded CH
cluster setup. Mutation queries, which have to be run with <code class="inline">ON CLUSTER</code> in such setup,
dispatch independent queries across shards and those queries can start at different
times. This in turn means risk of deletions corrupting data of follow-up inserts
in some edge cases. Ideally, imported entries should be unique for a given import</p><ul><li><p>an extra <code class="inline">import_id</code> column can be introduced, holding identifier. Last processed
import identifier should be stored with other site data and should be used for scoping
imported stats queries. No longer used imports can then be safely removed fully
asynchronously.</p></li><li><p><a href="https://clickhouse.com/docs/en/sql-reference/statements/alter/delete">Clickhouse <code class="inline">ALTER TABLE ... DELETE</code> Statement</a></p></li><li><p><a href="https://clickhouse.com/docs/en/sql-reference/statements/alter/#synchronicity-of-alter-queries">Synchronicity of <code class="inline">ALTER</code> Queries</a></p></li></ul>
</section>
<section id="summary" class="details-list">
<h1 class="section-heading">
<a class="hover-link" href="#summary">
<i class="ri-link-m" aria-hidden="true"></i>
Summary
</a>
</h1>
<div class="summary-functions summary">
<h2>
<a href="#functions">Functions</a>
</h2>
<div class="summary-row">
<div class="summary-signature">
<a href="#delete_imported_stats!/1" translate="no">delete_imported_stats!(site)</a>
</div>
<div class="summary-synopsis"><p>Deletes imported stats from Google Analytics, and clears the
<code class="inline">stats_start_date</code> field.</p></div>
</div>
<div class="summary-row">
<div class="summary-signature">
<a href="#delete_native_stats!/1" translate="no">delete_native_stats!(site)</a>
</div>
<div class="summary-synopsis"><p>Move stats pointers so that no historical stats are available.</p></div>
</div>
<div class="summary-row">
<div class="summary-signature">
<a href="#reset!/1" translate="no">reset!(site)</a>
</div>
</div>
</div>
</section>
<section id="functions" class="details-list">
<h1 class="section-heading">
<a class="hover-link" href="#functions">
<i class="ri-link-m" aria-hidden="true"></i>
Functions
</a>
</h1>
<div class="functions-list">
<section class="detail" id="delete_imported_stats!/1">
<div class="detail-header">
<a href="#delete_imported_stats!/1" class="detail-link" title="Link to this function">
<i class="ri-link-m" aria-hidden="true"></i>
<span class="sr-only">Link to this function</span>
</a>
<h1 class="signature" translate="no">delete_imported_stats!(site)</h1>
<a href="https://github.com/plausible/analytics/blob/main/lib/plausible/purge.ex#L41" class="icon-action" rel="help" title="View Source">
<i class="ri-code-s-slash-line" aria-hidden="true"></i>
<span class="sr-only">View Source</span>
</a>
</div>
<section class="docstring">
<div class="specs">
<pre translate="no"><span class="attribute">@spec</span> delete_imported_stats!(<a href="Plausible.Site.html#t:t/0">Plausible.Site.t</a>()) :: :ok</pre>
</div>
<p>Deletes imported stats from Google Analytics, and clears the
<code class="inline">stats_start_date</code> field.</p>
</section>
</section>
<section class="detail" id="delete_native_stats!/1">
<div class="detail-header">
<a href="#delete_native_stats!/1" class="detail-link" title="Link to this function">
<i class="ri-link-m" aria-hidden="true"></i>
<span class="sr-only">Link to this function</span>
</a>
<h1 class="signature" translate="no">delete_native_stats!(site)</h1>
<a href="https://github.com/plausible/analytics/blob/main/lib/plausible/purge.ex#L56" class="icon-action" rel="help" title="View Source">
<i class="ri-code-s-slash-line" aria-hidden="true"></i>
<span class="sr-only">View Source</span>
</a>
</div>
<section class="docstring">
<div class="specs">
<pre translate="no"><span class="attribute">@spec</span> delete_native_stats!(<a href="Plausible.Site.html#t:t/0">Plausible.Site.t</a>()) :: :ok</pre>
</div>
<p>Move stats pointers so that no historical stats are available.</p>
</section>
</section>
<section class="detail" id="reset!/1">
<div class="detail-header">
<a href="#reset!/1" class="detail-link" title="Link to this function">
<i class="ri-link-m" aria-hidden="true"></i>
<span class="sr-only">Link to this function</span>
</a>
<h1 class="signature" translate="no">reset!(site)</h1>
<a href="https://github.com/plausible/analytics/blob/main/lib/plausible/purge.ex#L62" class="icon-action" rel="help" title="View Source">
<i class="ri-code-s-slash-line" aria-hidden="true"></i>
<span class="sr-only">View Source</span>
</a>
</div>
<section class="docstring">
</section>
</section>
</div>
</section>
<footer class="footer">
<p>
<span class="line">
<button class="a-main footer-button display-quick-switch" title="Search HexDocs packages">
Search HexDocs
</button>
<a href="Plausible.epub" title="ePub version">
Download ePub version
</a>
</span>
</p>
<p class="built-using">
Built using
<a href="https://github.com/elixir-lang/ex_doc" title="ExDoc" target="_blank" rel="help noopener" translate="no">ExDoc</a> (v0.30.3) for the
<a href="https://elixir-lang.org" title="Elixir" target="_blank" translate="no">Elixir programming language</a>
</p>
</footer>
</div>
</div>
</section>
</div>
<script src="https://cdn.jsdelivr.net/npm/mermaid/dist/mermaid.min.js"></script>
<script>mermaid.initialize({startOnLoad: true})</script>
</body>
</html>