
Frequently Asked Questions

Everything you need to know about testing your content with Liftstack, explained in plain language.

Getting Started

What is Liftstack?
Liftstack is a multi-channel A/B testing platform for CRM marketers. It lets you test different versions of content against each other across email, push notifications, SMS, in-app messages, and in-app surfaces (content cards), then uses statistical analysis to tell you which version actually performs best and by how much. It works with Klaviyo, Customer.io, Iterable, and Braze.
What can I test?

You can test any piece of content that you'd swap between recipients across any supported channel. In Liftstack, these are called "snippets." Each snippet belongs to a specific channel (email, push, SMS, in-app message, or in-app surface). The most powerful feature is the ability to test custom HTML code blocks in email and in-app channels, which lets you experiment with virtually any element:

  • Custom HTML blocks. This is where Liftstack really shines. You can test entire sections of email markup: different layouts, content structures, visual treatments, or any HTML that your ESP supports. Examples include:
    • Product recommendation grids (2-column vs 3-column, image sizes, product ordering)
    • Social proof sections (star ratings vs review quotes vs "X people bought this")
    • Header/navigation bar layouts (minimal vs full category links)
    • Footer designs (stacked vs inline, with or without social icons)
    • Countdown timer blocks vs static urgency text
    • Loyalty points callouts and reward tier displays
    • "Why buy from us" trust badge sections
    • Dynamic content cards (editorial style vs product-focused vs testimonial)
    • Shipping and returns policy callout blocks
    • Cross-sell and upsell module formats
  • Subject lines. "Don't miss out!" vs "Your exclusive offer inside"
  • Hero blocks. Different images or headline/subheadline combinations
  • CTAs. "Shop Now" vs "Browse the Collection" vs "Claim Your Discount"
  • Copy blocks. Different tone, length, or messaging strategy
  • Discount framing. "20% off" vs "Save £10" vs no discount

Because Liftstack works at the HTML snippet level, you are not limited to testing simple text swaps. Any section of your email that you can express as an HTML block can become a testable snippet with multiple variants.

What is a "variant"?
A variant is one version of a snippet. If you're testing three different subject lines, each subject line is a variant. You need at least two variants to run a test.
Can I create a variant with blank content?

It depends on the snippet type:

  • Subject lines: content is always required. ESPs reject blank subject lines, and a blank subject line would corrupt your test results.
  • Copy and HTML blocks: content is always required. A blank variant would produce inflated uplift numbers for competing variants (since no one can click or convert on empty content) and poison Thompson Sampling posteriors for future campaigns.
  • Image snippets: text content (alt text) is optional, but you must provide either an uploaded image or an image URL.

These rules are enforced when you create or edit a variant. If you want to test "no content" vs "some content" for a slot, use a minimal placeholder (e.g., a single space or a neutral message) as your control variant instead.

What is a "control" variant?

The control is the version you'd send if you weren't testing. It represents your current standard or "safe" option. Marking a variant as the control lets Liftstack measure uplift: how much better the winning variant performed compared to what you would have done anyway.

You don't have to designate a control, but it's highly recommended. Without one, Liftstack can still find a winner, but the uplift numbers will be less precise.

What is a "slot"?
A slot is a position in your campaign where a snippet is being tested. If you're testing both a subject line and a hero image in the same campaign, that's two slots. Each slot is analysed independently, so you'll get separate results for each.
How does Liftstack assign variants to recipients?

Before your campaign sends, Liftstack randomly assigns each recipient a variant for each slot. These assignments are written to your CRM profiles as a property called lf_assignments. Your email template then uses conditional logic to show each person the content they were assigned.

This is important: the assignment happens before anyone sees anything. This is what makes it a proper experiment, because we know who was shown what before we see the results.
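For a concrete picture of the pre-send step, here is a minimal Python sketch of per-recipient, per-slot random assignment. The assign_variants helper and the data shapes are illustrative, not Liftstack's actual code; the output is the kind of mapping that would be written to each profile as the lf_assignments property.

```python
import random

def assign_variants(recipients, slots, seed=None):
    """Randomly assign one variant per slot to each recipient before send.

    recipients: list of profile IDs
    slots: dict mapping slot name -> list of variant labels
    Returns {profile_id: {slot: variant}}, the shape that would be written
    to the CRM as lf_assignments.
    """
    rng = random.Random(seed)  # a fixed seed makes the assignment reproducible
    return {
        profile_id: {slot: rng.choice(variants) for slot, variants in slots.items()}
        for profile_id in recipients
    }

# Example: two slots tested in the same campaign
slots = {
    "subject_line": ["A", "B"],
    "hero_block": ["A", "B", "C"],
}
assignments = assign_variants(["user_1", "user_2", "user_3"], slots, seed=42)
# assignments["user_1"] -> e.g. {"subject_line": "B", "hero_block": "A"}
```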

Why Liftstack?

My ESP already has A/B testing built in. Why would I pay for this?

Native ESP testing and Liftstack solve different problems. Here's what each does:

What native ESP A/B testing does:

  • Splits your audience into two groups and sends each group a completely different email (or subject line)
  • Picks a winner based on opens or clicks over a short window (typically 1 to 4 hours)
  • Sends the winning version to the remaining audience

What Liftstack does differently:

  • Tests individual content blocks inside a single email, not whole emails against each other. You can test just the hero image, just the CTA, or just the product grid layout while keeping everything else identical. This isolates what actually drives the difference.
  • Runs multiple tests simultaneously in the same campaign. Test a subject line AND a hero block AND a CTA in one send, with independent results for each slot.
  • Uses Bayesian statistics that let you check results at any time without inflating error rates. No more guessing whether 4 hours was long enough.
  • Carries learning across campaigns. Smart Allocation uses historical performance to send more traffic to better-performing variants automatically.
  • Provides revenue attribution, not just click counting. Know which variant actually drives purchases, not just engagement.
  • Detects guardrail violations like unsubscribe spikes, bounce rate increases, and spam complaints that your ESP's A/B test won't flag.
  • Works across ESPs. If you use Klaviyo for lifecycle and Customer.io for transactional, your testing insights live in one place.

In short, native ESP testing tells you which of two emails got more opens in the first few hours. Liftstack tells you which specific content elements drive conversions and revenue over the full attribution window, protects your list health, and accumulates learning over time.

Can Liftstack do things my ESP cannot?

Yes. The core capability gap is in-template content testing. Native ESP tools treat the email as a single unit: you either send Email A or Email B. Liftstack injects conditional logic into your template so that different recipients see different content blocks within the same email. This is how you test a CTA without also changing the subject line, the layout, and the imagery at the same time.

Other things Liftstack does that native tools typically don't:

  • Multi-slot testing in a single send (subject line + hero + CTA, analysed independently)
  • Bayesian analysis with continuous monitoring (no fixed test duration needed)
  • Automatic bot filtering so inflated opens and security-scanner clicks don't corrupt your results
  • Revenue-per-exposure modelling that captures both conversion probability and order value
  • Cross-campaign learning via Thompson Sampling and content insights
  • Safety guardrails (unsubscribe, bounce, complaint) that block winners which damage list health or sender reputation

How Testing Works

How long does a test take?

It depends on your audience size and how different the variants are. As a rough guide:

  • Large audiences (50,000+) with meaningful content differences: often conclusive within a few days
  • Medium audiences (5,000 to 50,000): typically 3 to 7 days
  • Small audiences (under 5,000): may take multiple campaign sends

Liftstack will show you a progress estimate when your test is still collecting data.

Can I check results while the test is running?

Yes. The campaign report updates in real time while your campaign is in tracking mode. You'll see live charts, preliminary numbers, and a confidence progression chart showing how close the test is to reaching a conclusion.

However, during the early data collection period, results will be labelled as preliminary. Liftstack enforces a minimum data threshold before declaring any verdict, which prevents premature conclusions from small, noisy samples.

What's the minimum audience size?

There's no hard minimum, but smaller audiences need larger differences between variants to reach a conclusion. As a planning guide:

Flow/automation campaigns (baseline 1-5%):

| Baseline conversion rate | Min. difference to detect | Audience per variant |
|---|---|---|
| 1% | 0.5 percentage points | ~6,300 |
| 2% | 1.0 percentage point | ~3,100 |
| 3% | 1.0 percentage point | ~4,700 |
| 5% | 2.0 percentage points | ~1,900 |

Broadcast/campaign sends (baseline 0.05-0.2%): conversion rates for one-off campaign sends are typically much lower than for flows. At these rates, Liftstack automatically switches to relative effect sizes rather than fixed percentage point targets:

| Baseline conversion rate | Relative lift to detect | Audience per variant |
|---|---|---|
| 0.05% | 100% (doubling) | ~32,000 |
| 0.10% | 100% (doubling) | ~16,000 |
| 0.20% | 50% | ~12,800 |
| 0.50% | 50% | ~5,100 |

If your audience is too small to detect realistic differences, Liftstack will tell you the test needs more data rather than making a premature call.

When you set up a campaign, Liftstack automatically shows a sample size guidance card after your audience is synced. This tells you whether your audience is large enough for the number of variants you're testing, using your workspace's historical conversion rate (or a 3% default if you have no history). For low conversion rate campaigns (below 0.5%), the guidance automatically uses relative effect sizes and shows an additional warning with advice on alternative metrics or campaign types.
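If you want to sanity-check audience sizes yourself, the classical two-proportion approximation below gives ballpark figures per variant. It is only a planning heuristic: Liftstack's own guidance card uses its Bayesian framework, so the figures in the tables above will not match this formula exactly.

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_variant(baseline, min_detectable_diff, alpha=0.05, power=0.80):
    """Classical two-proportion sample size (normal approximation).

    baseline: control conversion rate, e.g. 0.02 for 2%
    min_detectable_diff: absolute difference to detect, e.g. 0.01 for 1pp
    Returns the approximate audience needed per variant.
    """
    p1 = baseline
    p2 = baseline + min_detectable_diff
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2)

print(sample_size_per_variant(0.02, 0.01))    # flow-style baseline: a few thousand per variant
print(sample_size_per_variant(0.001, 0.001))  # low-rate broadcast: tens of thousands per variant
```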

What is a "primary metric"?

The primary metric is the single measure you're optimising for. You choose it when setting up your campaign, and it cannot be changed once the campaign starts sending. This is deliberate: it prevents cherry-picking whichever metric happens to look best after the fact.

Your options are:

  • Conversion rate (default): what percentage of recipients took the desired action (purchase, sign-up, etc.)
  • Click rate: what percentage of recipients clicked a link in the email
  • Open rate: what percentage of recipients opened the email
  • Revenue per exposure: average revenue generated per recipient

All other metrics are still tracked and shown in your report as secondary/diagnostic metrics, but only the primary metric determines the winner.

Why can't I change the primary metric after sending?
This is a critical safeguard called pre-registration. If you could change the metric after seeing results, you might (even unconsciously) switch to whichever metric makes a particular variant look best. This would inflate your false positive rate, causing you to "find" winners that aren't real winners. Pre-registering the metric keeps the test honest.
What is the attribution window?

The attribution window is the time period after each recipient is assigned during which their engagement events (clicks, conversions, purchases) are credited to the test. The default is 7 days (168 hours).

Critically, the window is per-recipient, not per-campaign. Each person's 7-day clock starts from the moment they were assigned a variant. For a broadcast campaign where everyone is assigned at once, this is effectively the same as "7 days after send." For an automation or flow where new recipients enter over time, each person gets their own independent 7-day window starting from their entry date.

A click that happens 3 days after assignment counts. A purchase 10 days after assignment does not (by default). This prevents distant events, which are influenced by many other factors, from muddying your test results.

If a significant number of conversions are arriving after individual attribution windows close, Liftstack will flag this and suggest extending the window for future campaigns.
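The per-recipient window is simple to express in code. A minimal sketch, assuming the 168-hour default:

```python
from datetime import datetime, timedelta

ATTRIBUTION_WINDOW = timedelta(hours=168)  # default: 7 days

def is_attributable(assigned_at: datetime, event_at: datetime,
                    window: timedelta = ATTRIBUTION_WINDOW) -> bool:
    """An event counts only if it falls inside this recipient's own window."""
    return assigned_at <= event_at <= assigned_at + window

# Each recipient's clock starts at their own assignment time
assigned = datetime(2024, 6, 1, 9, 0)
print(is_attributable(assigned, datetime(2024, 6, 4, 9, 0)))   # True  (3 days later)
print(is_attributable(assigned, datetime(2024, 6, 11, 9, 0)))  # False (10 days later)
```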

How does attribution work for automations and flows?

Automations and flows work the same way as broadcast campaigns, with one important difference: recipients enter the test over time rather than all at once.

Liftstack automatically detects whether a campaign is a broadcast (all messages sent within 48 hours) or an automation (messages sent over days or weeks). The per-recipient attribution window means you do not need to worry about when individual messages are sent. Each person's conversions are measured from their own assignment time, ensuring fair comparison regardless of entry date.

One practical consequence: automations take longer to produce results, because recipients are entering gradually rather than all at once. The campaign report updates as new data arrives, and verdicts may shift as the sample grows.

What is the Test Calculator?

The Test Calculator is a pre-test planning tool available under Analytics > Test Calculator. Enter your channel, the number of variants, and your audience size. Liftstack computes the minimum detectable effect, the required sample size per variant, and the estimated number of days to reach a conclusion.

Use it before launching a campaign to avoid underpowered tests that end up stuck at "Insufficient Data." The calculator uses the same Bayesian framework as the analysis engine, so its estimates are consistent with the verdicts you'll see in your campaign reports.

What happens if a recipient is in multiple active campaigns?

When a recipient is assigned to more than one active Liftstack campaign at the same time, conversion events are attributed to all active campaigns the recipient is assigned to. Each campaign gets full credit for the event.

This is the industry-standard approach. It works because each campaign uses independent randomisation: the treatment effect estimate within each campaign remains statistically valid regardless of what other campaigns are running concurrently.

  • Per-campaign reports are accurate. Each campaign's conversion rate, winner, and uplift reflect the true treatment effect.
  • Revenue may appear in multiple campaign reports. This is correct for per-campaign analysis.
  • Dashboard totals are deduplicated. The cumulative uplift figure on your home dashboard applies a deduplication factor to prevent inflation from shared recipients.

Integration & Setup

How does Liftstack connect to my ESP?

Liftstack connects via your ESP's API using credentials that you provide. The setup process is:

  1. Go to Integrations in Liftstack and select your platform (Klaviyo, Customer.io, Iterable, or Braze)
  2. Enter your credentials. What's required depends on the platform:
    • Klaviyo: a private API key
    • Customer.io: a Site ID, a Tracking API key, and an App API key
    • Iterable: a standard API key
    • Braze: a REST API key and your Braze instance (e.g. US-01, EU-01)
  3. Liftstack validates the connection and confirms access

Your credentials are encrypted at rest using Fernet symmetric encryption. Liftstack never stores them in plain text, and they are only decrypted when making API calls on your behalf.

No developer is required. If you can find your API credentials in your ESP's settings, you can complete setup in under five minutes.
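For the technically curious, the encrypt-at-rest, decrypt-on-use pattern described above looks roughly like this with the cryptography library's Fernet implementation. The key handling here is purely illustrative; in a real deployment the key would come from a secrets manager rather than being generated inline.

```python
from cryptography.fernet import Fernet

key = Fernet.generate_key()   # illustrative only; real keys live in a secrets manager
fernet = Fernet(key)

api_key_plaintext = b"pk_live_example_credential"
encrypted = fernet.encrypt(api_key_plaintext)   # this token is what gets stored at rest

# Decrypted only at the moment an API call is made on your behalf
decrypted = fernet.decrypt(encrypted)
assert decrypted == api_key_plaintext
```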

What API permissions does Liftstack need?

Liftstack needs permission to:

  • Read segments/lists (to sync your audience)
  • Read and write profile properties (to write lf_assignments for variant targeting)
  • Create and update templates (to push the conditional template logic)
  • Read engagement events (clicks, opens, conversions) for attribution

For Klaviyo, this means a private API key with full read/write scope. For Customer.io, an App API key with tracking and API access. For Iterable, a standard API key. For Braze, a REST API key with segment, user export, user track, and template permissions. The exact permissions are documented in the integration setup flow.

Does writing assignments burn through my ESP's API limits?

Liftstack uses batch endpoints wherever available and includes built-in rate limiting that respects each platform's published limits. For a 500,000-person audience:

  • Klaviyo: uses bulk profile import endpoints; typically completes in 10 to 20 minutes
  • Customer.io: uses individual profile identify calls (Customer.io does not offer a bulk endpoint); typically completes in 15 to 30 minutes for large audiences
  • Iterable: uses bulk user update endpoints; typically completes in 10 to 20 minutes
  • Braze: uses /users/track with batches of 75 profiles per request; typically completes in 10 to 20 minutes

These API calls count toward your ESP's rate limits, but the built-in throttling means Liftstack won't spike your usage or trigger overage charges. If your ESP plan has very tight API limits, the writeback will simply take longer (it backs off automatically on 429 responses).
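A rough sketch of what batched writeback with rate-limit backoff can look like. The endpoint URL, payload shape, and the 75-profile batch size are placeholders (the batch size mirrors the Braze example above); this is not Liftstack's actual client code.

```python
import time
import requests

def write_assignments_in_batches(profiles, endpoint, api_key, batch_size=75):
    """Write lf_assignments in batches, backing off when the ESP returns 429.

    profiles: list of dicts like {"external_id": ..., "lf_assignments": {...}}
    endpoint: placeholder URL for the ESP's batch update API
    """
    headers = {"Authorization": f"Bearer {api_key}"}
    for start in range(0, len(profiles), batch_size):
        batch = profiles[start:start + batch_size]
        while True:
            resp = requests.post(endpoint, json={"attributes": batch}, headers=headers)
            if resp.status_code == 429:
                # Respect the platform's limits: wait as instructed, then retry the batch
                wait_seconds = int(resp.headers.get("Retry-After", 5))
                time.sleep(wait_seconds)
                continue
            resp.raise_for_status()
            break
```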

How long do I need to wait between assigning and sending?

The campaign wizard handles this in sequence: it syncs the audience, runs assignment, writes properties to profiles, and pushes the template. You'll see a progress indicator for each step. Once all steps show complete, you can send immediately. There is no additional waiting period.

For large audiences (100,000+), the profile writeback step is the longest part and can take 15 to 30 minutes. Plan accordingly, but you don't need to wait overnight.

What happens if the API fails halfway through assigning?

Liftstack writes profile properties in batches with automatic retry. If a batch fails (network timeout, API error), the system retries with exponential backoff. If it hits a 429 (rate limit) response, it reads the Retry-After header and waits before continuing.

If some batches fail despite retries, the campaign still advances (the writeback is best-effort per batch). The progress indicator will report how many profiles succeeded and how many failed. You can re-trigger the writeback step from the campaign wizard, and since the operation is idempotent (writing the same property value twice is harmless), it will safely re-process all profiles from the beginning. It does not resume from a checkpoint.

Does Liftstack slow down my campaign sending?
No. Liftstack's work happens before you send. The variant assignments are written to CRM profiles as a property, and the conditional template is pushed to your ESP. When you actually hit send in your ESP, the email renders using the pre-written profile property. There is zero additional latency at send time.
Can I connect multiple ESPs to the same workspace?
Yes. Each plan tier allows a set number of platform connections (Starter: 1, Growth: 2, Scale: 3). You might connect Klaviyo for your lifecycle campaigns and Customer.io for transactional, and run tests on both from the same workspace with shared snippet libraries.

Understanding Your Results

What does "X% probability of being best" mean?

This is the single most important number in your report. It answers: "What is the probability that this variant truly has the highest conversion rate?"

For example, "93% probability of being best" means: given all the data we've collected, there's a 93% chance this variant genuinely outperforms all the others. There's a 7% chance one of the other variants is actually better and this one just got lucky in this particular test.

Where is the p-value?

Liftstack uses Bayesian statistics instead of the traditional frequentist approach you might be familiar with from other tools. This means you won't see p-values, and that's a good thing. Here's why:

P-values answer a confusing question: "If there were NO real difference between variants, what's the probability of seeing data this extreme?" That's hard to interpret and easy to misuse.

Probability of being best answers a direct question: "Given the data I have, what's the probability this variant is actually the best?" That's what you really want to know.

Think of it this way:

  • A p-value of 0.03 does NOT mean "there's a 97% chance variant A is better." (This is the most common misinterpretation of p-values.)
  • A "probability of being best" of 97% DOES mean "there's a 97% chance variant A is better." It's exactly what it says.
What about confidence intervals? I'm used to seeing those.

Liftstack shows credible intervals (displayed as "range" in the report), which look similar to confidence intervals but are easier to interpret:

  • A traditional 95% confidence interval means: "If we repeated this experiment many times, 95% of the resulting intervals would contain the true value." (Confusing, right?)
  • A 95% credible interval means: "There's a 95% probability the true value falls within this range." (Much more intuitive.)

You'll see these ranges throughout the report: for conversion rates, uplift estimates, and revenue figures. A narrow range means we're quite certain; a wide range means there's still meaningful uncertainty.

What does "expected loss" mean?

Expected loss answers: "If I pick this variant and it turns out not to be the best, how much conversion rate am I leaving on the table?"

For example, an expected loss of 0.05% means: if you go with this variant and it's not actually the winner, you'd lose about 0.05 percentage points of conversion rate on average. That's tiny, well within the "not worth worrying about" range.

Liftstack uses expected loss as part of its decision criteria. A variant isn't declared a winner just because it's probably best. It also needs to have a very low expected loss, ensuring that even in the unlikely scenario it's wrong, the cost is negligible.
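Continuing the earlier Monte Carlo sketch, expected loss can be estimated from the same kind of posterior draws: in every simulated scenario, compare the chosen variant's rate with the best variant's rate and average the shortfall.

```python
import numpy as np

def expected_loss(draws, variant_index):
    """Average conversion rate given up if the chosen variant is not truly best.

    draws: array of shape (n_samples, n_variants) of posterior samples,
           e.g. Beta draws like those in the earlier sketch.
    """
    best_rate = draws.max(axis=1)
    chosen_rate = draws[:, variant_index]
    # In scenarios where the chosen variant IS best, the loss is zero
    return float(np.mean(best_rate - chosen_rate))

rng = np.random.default_rng(0)
draws = np.column_stack([
    rng.beta(1 + 120, 1 + 4880, size=50_000),   # variant A: 120 / 5,000
    rng.beta(1 + 150, 1 + 4850, size=50_000),   # variant B: 150 / 5,000
])
print(expected_loss(draws, variant_index=1))     # tiny: well below a 0.1pp decision threshold
```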

What does "practical equivalence" mean?

Sometimes variants are so close in performance that the difference doesn't matter in practice. If variant A converts at 3.02% and variant B converts at 3.05%, that 0.03 percentage point difference is real but meaningless for your business.

Liftstack checks whether variants fall within a Region of Practical Equivalence (ROPE): a range around zero (default: 0.5 percentage points) where differences are too small to care about. For campaigns with low conversion rates, the ROPE width is automatically narrowed so it remains meaningful relative to the baseline. If all variants fall within this range with high probability, the verdict is EQUIVALENT, and you're told to pick whichever version you prefer. There's no statistical reason to favour one over another.
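A minimal sketch of the equivalence check, assuming posterior draws like those in the earlier examples: it simply asks how often the difference between two variants lands inside the ROPE.

```python
import numpy as np

def within_rope(draws_a, draws_b, rope_pp=0.5):
    """Probability that the A-vs-B difference falls inside the ROPE.

    draws_a, draws_b: posterior samples of each variant's conversion rate.
    rope_pp: half-width of the region of practical equivalence, in percentage points.
    """
    diff = draws_a - draws_b
    return float(np.mean(np.abs(diff) < rope_pp / 100.0))

rng = np.random.default_rng(0)
a = rng.beta(1 + 604, 1 + 19_396, size=50_000)   # variant A: ~3.02% of 20,000
b = rng.beta(1 + 612, 1 + 19_388, size=50_000)   # variant B: ~3.06% of 20,000
print(within_rope(a, b))   # close to 1.0: the difference almost surely sits inside the ROPE
```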

Reading the Campaign Report

What is the verdict card?

The verdict card is the hero element at the top of each slot's results. It gives you the bottom line in plain language. There are four possible verdicts:

  • Winner (green, trophy icon). A clear winner has been identified. The card shows which variant won, the conversion rates compared, the uplift (additional conversions and revenue), confidence level and probability of being best, and revenue range (best case to worst case).
  • Equivalent (grey, equals icon). All variants performed within a negligible range of each other. Pick whichever fits your brand best. There's no performance-based reason to choose one over another.
  • Insufficient Data (amber, hourglass icon). No conclusion yet. One variant is leading but not decisively. Shows which variant is currently leading, how likely it is that the leader is actually the best, and how many more exposures are estimated before a conclusion can be reached.
  • Guardrail Violation (red, warning icon). A variant triggered a safety guardrail, typically because it caused a meaningful increase in unsubscribe rates compared to the control. Even if it has a high probability of being best on the primary metric, it won't be declared a winner because it's damaging your audience.
What are the confidence levels?
| Probability of Being Best | Confidence Level | What It Means |
|---|---|---|
| 95% or higher | Very High | Extremely likely this is the true best variant. Declare a winner. |
| 85% to 95% | High | Very probably the best, but a small chance you're wrong. Consider collecting more data if the stakes are high. |
| 70% to 85% | Moderate | Leading, but there's meaningful uncertainty. Likely needs more data. |
| Below 70% | Low | Too early to tell. Keep testing. |
What is the uplift callout?

The uplift callout is the key value statement of your test. It answers: "How much more did I get by using the winning variant instead of the control?"

It shows two numbers:

  • Additional conversions: how many extra people converted because of the winning content
  • Additional revenue: the estimated revenue those extra conversions generated

These numbers come with a range (e.g., "+£8,200 to +£16,800") so you know the realistic best and worst case, along with the probability that this is a real improvement (not just noise).

What is the metrics table?

Below each slot's charts, there's an expandable metrics table showing the raw numbers for every variant. This includes exposures, opens, open rate, clicks, CTR, conversions, conversion rate, unsubscribes, bounces, complaints, revenue, and revenue per exposure.

This table is collapsed by default because the verdict card, charts, and uplift callout already tell you everything you need to make a decision.

Understanding the Charts

What is the Variant Comparison Chart (Raincloud Plot)?

A visual comparison of all variants' estimated true conversion rates, shown in the campaign report below the verdict card. Each variant gets a horizontal row with three visual layers:

  • The cloud (top half). A smooth density curve showing the range of likely conversion rates. Where the curve is tall, that rate is more likely. A tight, narrow cloud means more certainty.
  • The line and dot (middle). A horizontal line showing the 95% credible interval, with a dot at the estimated conversion rate.
  • The rain (bottom half). A scatter of small dots representing possible conversion rates drawn from the statistical model.

If the leading variant's cloud is clearly separated from the others (no overlap), it's a strong winner. If clouds overlap substantially, you may need more data.

What is the Chance of Winning chart?

A horizontal bar chart showing each variant's probability of being the best performer. A vertical dashed line marks the decision threshold (default: 90%). A variant needs to cross this line to be declared a winner.

The percentages always add up to 100% across all variants. If one bar dominates and crosses the threshold, you have a clear winner. If bars are close, more data is needed.

What is the Expected Improvement chart?

A density plot of the difference between the winning variant and the control, shown only when a winner has been declared. The area to the right of zero (shaded green) represents scenarios where the winner truly is better. The area to the left (shaded amber) represents scenarios where it's actually worse (unlikely, but possible).

The annotation below the chart (e.g., "92.4% chance of real improvement") tells you exactly how much of the curve is on the positive side.

What is the Confidence Progression chart?

A line chart tracking how the leading variant's probability of being best has evolved over time since the campaign was sent. A horizontal dashed line marks the decision threshold (default: 90%).

Watch for the leading variant's line climbing toward the threshold. A line that's climbing steadily suggests the test is heading toward a conclusion. A line that's flat or bouncing suggests the variants are very close. During live tracking, this chart auto-refreshes every 60 seconds.

What is the Cumulative Revenue Uplift chart?

Shown on the analytics dashboard, this is a running total of the additional revenue generated by all your winning variants across all campaigns over time. A shaded band around the line shows the confidence range.

This line should only go up (each new winner adds to the total). This is the single best chart for demonstrating ROI from your testing programme.

What is the Conversion Rate Sparkline?
Found on the snippet performance page, this small line chart shows how a specific variant's conversion rate has changed across every campaign it's appeared in. A flat line means consistent performance. An upward trend might indicate a primacy effect. A downward trend might indicate a novelty effect.

Verdicts & Decisions

How does Liftstack decide on a winner?

A variant is declared the winner when both of these conditions are met:

  1. Probability of being best is at least 90% (configurable). We're highly confident this variant truly has the highest conversion rate.
  2. Expected loss is at most 0.1% (configurable, automatically scaled down for low conversion rate campaigns). Even if we're wrong, the cost of choosing this variant over the true best is negligible.

Both conditions must hold simultaneously. A variant with 92% probability of being best but an expected loss of 0.3% won't be declared a winner yet because the potential downside is still too large.

How does Liftstack decide variants are equivalent?

Variants are declared equivalent when Liftstack is highly confident (90%+ probability) that the difference between all variants falls within the ROPE width (default: 0.5 percentage points, configurable, automatically narrowed for low conversion rate campaigns). At that point, the differences are real but too small to matter for your business.

When there are many variants (4+), Liftstack can detect partial equivalence. For example: "Variant A is the clear winner. Among the remaining variants, B, C, and D are practically equivalent to each other." This helps you understand the full picture, showing not just who won but which of the remaining variants are interchangeable.

What is a guardrail violation?

Guardrail metrics are safety checks that protect your audience. Even if a variant has a great conversion rate, it won't be declared a winner if it's damaging other important metrics. The specific guardrails depend on the channel:

Email guardrails:

  • Unsubscribe rate. If the variant causes unsubscribes to increase by more than 0.1 percentage points vs the control.
  • Spam complaint rate. If complaints increase by more than 0.05 percentage points vs the control.
  • Bounce rate. If bounces increase by more than 0.5 percentage points vs the control.

Push and SMS guardrails:

  • Opt-out rate. If the variant causes opt-outs to increase beyond the threshold vs the control.

In-app guardrails:

  • Dismiss rate. If the variant causes dismissals to increase beyond the threshold vs the control.

When multiple guardrails are checked simultaneously (e.g., all three email guardrails), Liftstack applies a Bonferroni correction to control the overall false alarm rate. A variant that drives clicks but damages your audience is destroying long-term value. The guardrail catches this and warns you.

What does "insufficient data" mean?

This means no conclusion can be reached yet. One variant is probably leading, but there isn't enough data to be confident. Common reasons:

  • The audience is small
  • The variants perform very similarly (requiring more data to distinguish them)
  • The campaign is still early in its tracking period
  • Not enough conversions have been recorded yet (each variant needs at least 3 conversions before a verdict can be computed)

The report will show an estimate of how many more recipients need to be exposed before a conclusion can be reached.

Can I override the verdict?

The verdict is the system's statistical recommendation. You're free to take a different action, such as continuing to test a variant even after it's been declared equivalent, or choosing a variant other than the winner based on brand considerations.

What you can't do is change the primary metric after seeing results, or retroactively adjust the analysis to favour a particular outcome. These safeguards keep the testing process honest.

Metrics & What They Mean

What are the primary metrics?
| Metric | What It Measures | Best For |
|---|---|---|
| Conversion rate | Percentage of recipients who completed the desired action | Most campaigns (the default) |
| Click rate | Percentage of recipients who clicked any link | Quick-signal tests, smaller audiences |
| Open rate | Percentage of recipients who opened the email | Subject line and preview text testing |
| Revenue per exposure | Average revenue generated per recipient | When variants might influence order size |
What are secondary/diagnostic metrics?
All metrics not selected as primary become diagnostics. They're shown in the metrics table for context. For example, you might optimise for conversion rate but still want to see the click rate and revenue per variant. Diagnostic metrics are never used to determine the winner.
Why is open rate marked with a warning?

Open tracking is unreliable because of Apple Mail Privacy Protection (MPP) and email client pre-fetching. These technologies automatically trigger "opens" for every email, whether or not the recipient actually looked at it.

The good news: this noise affects all variants equally (since recipients are randomly assigned), so relative comparisons remain valid. If Variant A has a higher open rate than Variant B, that ranking is trustworthy. The bad news: absolute open rates are inflated, and the true difference between variants appears smaller than it really is. This means tests using open rate as the primary metric need more data to reach a conclusion.

What is "revenue per exposure"?

Revenue per exposure (RPE) measures the average revenue each recipient generates. It captures two effects:

  1. Conversion probability. Does this variant make people more likely to buy?
  2. Order value. When people do buy, do they spend more?

A variant could win on RPE even if it doesn't have the highest conversion rate, because it might encourage larger orders. Liftstack uses a specialised compound model for RPE that analyses these two components separately and then combines them.
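As an illustration of the two-component idea (not Liftstack's actual model), the sketch below combines a Beta posterior for conversion probability with a simple bootstrap of observed order values to produce revenue-per-exposure samples.

```python
import numpy as np

def revenue_per_exposure_samples(conversions, exposures, order_values,
                                 n_samples=50_000, seed=0):
    """Compound-model sketch: RPE = P(convert) x average order value.

    conversions / exposures: counts for one variant.
    order_values: observed order values for that variant.
    Conversion probability gets a Beta posterior; order value is resampled
    from the observed orders (a bootstrap stand-in for a revenue model).
    """
    rng = np.random.default_rng(seed)
    p_convert = rng.beta(1 + conversions, 1 + exposures - conversions, size=n_samples)
    aov = rng.choice(order_values, size=(n_samples, len(order_values)), replace=True).mean(axis=1)
    return p_convert * aov

rpe = revenue_per_exposure_samples(150, 5000, order_values=[42.0, 55.0, 61.5, 38.0, 90.0])
print(rpe.mean())   # expected revenue per recipient for this variant
```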

What are the safety guardrails?

Even if a variant drives conversions, it might be doing so in a way that damages your audience health. Liftstack monitors guardrail metrics automatically, with channel-specific checks:

Email: unsubscribe rate (threshold: 0.1pp), spam complaint rate (0.05pp), bounce rate (0.5pp)

Push/SMS: opt-out rate

In-app: dismiss rate

Each guardrail checks whether the winning variant's rate exceeds the control's rate by more than the threshold. When multiple guardrails are checked for the same channel (e.g., all three email guardrails), a Bonferroni correction raises the per-test probability threshold so that the combined false alarm rate stays at 10%.

When any guardrail fires, Liftstack shows a red warning and prevents the variant from being declared a winner. This protects you from inadvertently adopting content that's eroding your subscriber base, sender reputation, or app engagement.
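The Bonferroni adjustment mentioned above is a one-liner. For email's three guardrails it raises the per-guardrail probability threshold to roughly 96.7% so the combined false alarm rate stays at 10%:

```python
def guardrail_probability_threshold(n_guardrails, family_alpha=0.10):
    """Per-guardrail probability threshold that keeps the combined false alarm rate at 10%."""
    per_test_alpha = family_alpha / n_guardrails
    return 1 - per_test_alpha

print(guardrail_probability_threshold(3))  # ~0.967 for email's three guardrails
print(guardrail_probability_threshold(1))  # 0.90 for single-guardrail channels (push/SMS, in-app)
```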

Smart Allocation (Thompson Sampling)

What is "Smart Allocation"?
When you've tested the same snippet variants across multiple campaigns, Liftstack can use historical performance data to send more traffic to the variants that have been performing well, while still sending some traffic to underperforming variants to make sure we aren't missing something. This is called Thompson Sampling.
How is it different from an equal split?

With a standard A/B test (equal split), each variant gets the same number of recipients, say 33% each for three variants. This is fair but wasteful: you're sending just as much traffic to a clearly underperforming variant as to the front-runner.

With Smart Allocation, Liftstack might split traffic 60/25/15 based on past performance. The likely winner gets more traffic (fewer wasted exposures), while alternatives still get enough to confirm whether they've improved or the leader has slipped.

Does this bias the test?
No. The system still tracks performance for every variant and runs the full statistical analysis. The unequal allocation actually makes the test more efficient. You reach conclusions faster because more recipients are exposed to the likely best variant, so uplift is captured sooner.
Can I override the smart allocation?
Yes. When Liftstack recommends an allocation, you'll see a transparency panel showing the recommended traffic split and why. You have three options: Accept, Adjust Manually (drag sliders), or Use Equal Split.
What is the "Smart Allocation Uplift"?
When a campaign uses Thompson Sampling, the report shows the additional conversions captured by the smart allocation compared to what an equal split would have produced. This isolates the value of the allocation strategy from the value of testing itself.
How does the system handle a brand-new variant with no history?
New variants (those that have never appeared in a completed campaign) receive a guaranteed minimum of 20% of traffic on their first campaign, regardless of what Thompson Sampling would recommend. This prevents established variants from starving newcomers of exposure.
Does historical data expire?
Yes. Liftstack applies a recency decay to historical data: performance from campaigns 60 days ago counts half as much as recent campaigns, and very old data fades away almost entirely. This ensures the allocation reflects current audience preferences, not stale data.
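Putting the pieces of this section together, here is an illustrative Thompson Sampling allocation sketch with a 60-day recency half-life and a 20% floor for brand-new variants. The data shapes and the exact decay formula are assumptions for the example, not Liftstack's implementation.

```python
import numpy as np

def smart_allocation(history, n_samples=50_000, new_variant_floor=0.20,
                     half_life_days=60, seed=0):
    """Thompson-sampling traffic split with recency decay and a newcomer floor.

    history: one dict per variant, e.g.
        {"conversions": 120, "exposures": 5000, "days_ago": 30, "is_new": False}
    Older evidence is down-weighted: data from 60 days ago counts half as much.
    """
    rng = np.random.default_rng(seed)
    draws = []
    for v in history:
        weight = 0.5 ** (v["days_ago"] / half_life_days)   # recency decay
        c, n = v["conversions"] * weight, v["exposures"] * weight
        draws.append(rng.beta(1 + c, 1 + (n - c), size=n_samples))
    draws = np.column_stack(draws)
    share = np.bincount(draws.argmax(axis=1), minlength=len(history)) / n_samples

    # Brand-new variants are guaranteed a minimum slice of traffic
    for i, v in enumerate(history):
        if v.get("is_new"):
            share[i] = max(share[i], new_variant_floor)
    return share / share.sum()   # renormalise so the split sums to 100%

print(smart_allocation([
    {"conversions": 180, "exposures": 6000, "days_ago": 15, "is_new": False},
    {"conversions": 150, "exposures": 6000, "days_ago": 15, "is_new": False},
    {"conversions": 95,  "exposures": 6000, "days_ago": 75, "is_new": False},
]))
# -> heavily favours the historical front-runner, e.g. roughly [0.94, 0.06, 0.00]
```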

Operational Workflow

Can I fix a typo in a variant after the test starts?

It depends on how far the campaign has progressed:

  • Before sending (DRAFT through TEMPLATE_PUSHED): Yes. You can edit variant content in the snippet editor at any time before you confirm the send. If the template has already been pushed, Liftstack will re-push it with the updated content.
  • After sending (SENT, TRACKING, COMPLETED): No. Once the campaign is sent, the content that recipients saw is fixed. Editing the variant in Liftstack would update it for future campaigns, but it won't change what was already delivered.

If you spot a serious error after sending (like a broken link), the right approach is to fix it in your ESP's template directly. The Liftstack test results for that variant will be affected, and the report will reflect that.

Can I add a variant to a test that is already running?

No. Adding a variant mid-test would mean that variant has a different exposure period and audience size, which makes statistical comparison invalid. If you want to test an additional variant, create a new campaign with all the variants you want to compare (including the new one).

This is a deliberate constraint. Mixed-exposure tests produce unreliable results, and Liftstack prioritises correct conclusions over flexibility.

Can I stop or pause a single variant without killing the whole campaign?
Not currently. The campaign operates as a single unit: it's either tracking or completed. If a variant has a serious problem (offensive content, broken rendering), your best option is to fix the issue in the ESP template directly so recipients no longer see the problematic content. The statistical results for that variant will be affected, but the test continues for the remaining variants.
Can I duplicate a campaign setup?
Not yet, but this is a planned feature. For now, when setting up a new campaign you can select the same snippets and variants from your library, which preserves most of the configuration. If you're using Smart Allocation, historical performance from previous campaigns carries over automatically, so the system remembers what worked.
What happens if I delete a snippet that's active in a campaign?
You can't. Snippets that are referenced by campaign slots are protected at the database level. If you attempt to delete one, the operation will fail. You would need to remove the snippet from all campaign slots first. This prevents accidentally orphaning a running test.
Can I re-run the same test on a different audience?
Yes. Create a new campaign, select the same snippets and variants, and point it at a different segment. Liftstack treats each campaign as an independent experiment with fresh assignments. If Smart Allocation is enabled, the new campaign will benefit from the performance data gathered in the original test.
What is cohort tracking?

After a campaign completes, you can activate Monitoring mode (30, 60, or 90 days). While monitoring is active, Liftstack continues tracking conversions and revenue for the campaign's assigned recipients to measure long-term variant impact. Results are reported at 7, 14, 30, 60, and 90 day intervals post-assignment, giving you a clear picture of whether the winning variant's advantage holds, grows, or fades over time.

This is most valuable for high-AOV brands where repeat purchase behaviour matters, subscription businesses, and loyalty campaigns where long-term engagement is the real goal.

How do I enable monitoring?
From the campaign report of a completed campaign, click "Enable Monitoring" and select a duration (30, 60, or 90 days). The campaign status changes to Monitoring. Liftstack will continue collecting conversion and revenue data for the duration you selected. When the monitoring window expires, the campaign automatically returns to Completed status with the extended results available on the report.

Segmentation & Audience

Does Liftstack work with my existing ESP segments?
Yes. When you set up a campaign, you select a segment (or list) from your ESP. Liftstack syncs the audience from that segment via the API. Whatever targeting, filtering, or segmentation logic you've built in your ESP applies as normal. Liftstack doesn't bypass or override your segmentation; it tests content within the audience you've already defined.
Can I see results broken down by segment?

The standard campaign report shows results for the full audience. For deeper breakdowns, see Segment Analysis below.

You can also achieve segment-level insights in two additional ways:

  1. Run separate campaigns per segment. Send the same snippets to your VIP segment and your non-VIP segment as separate campaigns. Each gets its own independent analysis, and you can compare winners across the two.
  2. Stratified Thompson Sampling (Scale plan). When using the stratified assignment strategy, Liftstack maintains separate performance estimates per segment. While the report still shows aggregate results, the allocation engine uses per-segment data, which means variants that work better for specific segments get more traffic within those segments.
What is Segment Analysis?

Available on campaign reports for Growth plan and above, Segment Analysis breaks down variant performance by audience profile properties such as city, country, or region. Liftstack automatically detects properties that have between 2 and 10 distinct values and at least 50 profiles per group, then shows per-segment conversion rates for each variant.

Segment analysis is observational, not causal. Differences between segments may reflect audience composition rather than variant effectiveness. Use it to generate hypotheses for future targeted tests, not to draw firm conclusions.

Is there a way to have a global holdout group?

Yes. When creating a campaign, you can set a holdout percentage (up to 20% of the audience). Holdout recipients are randomly selected and do not receive any HTML snippet content for that campaign. The template conditional for those slots falls through and renders nothing, so the email arrives without the tested HTML blocks.

This is an advanced feature for HTML content snippets only. Subject lines and copy slots are unaffected by holdout (you cannot send a blank subject line or empty button text). The holdout group answers: "Does having this HTML content in the email at all improve outcomes vs not having it?"

Key details:

  • The holdout percentage is set during campaign creation and cannot be changed after assignments are made.
  • Your campaign must have at least one HTML content type slot for holdout to take effect. If all slots are subject lines or copy, the holdout setting is silently ignored.
  • No control variant is required to use holdout.

How it appears in the report: After the campaign completes, the report includes a holdout comparison card for each HTML slot. This shows the holdout group's conversion rate (no snippet content) alongside the optimised group's rate, with the percentage improvement. This tells you the total value of having that HTML content in the email.

Can I run a test targeting only mobile users or only desktop users?
Not directly within Liftstack. However, you can achieve this by creating a segment in your ESP that filters by device type (most ESPs support this), and then running your Liftstack campaign against that segment. The test results will then reflect only that device audience.
Can I see if Variant A won for one demographic but Variant B won for another?

Not as a built-in report split. Liftstack analyses each campaign as a single audience. If you suspect a variant performs differently across demographics, the recommended approach is to run separate campaigns against demographic-specific segments. This gives you statistically rigorous per-segment results, rather than post-hoc slicing which is prone to false positives.

The Content Insights feature (Growth and Scale plans) does detect patterns across campaigns, which can surface observations like "urgency messaging tends to outperform for your promotional segments." These are observational hints, not segment-level A/B test results, but they can guide your testing strategy.

Dashboard & Insights

What do the dashboard stat cards show?

The four cards at the top of the dashboard give you a monthly snapshot:

| Card | What It Shows |
|---|---|
| Campaigns This Month | How many campaigns you've sent with Liftstack |
| Snippets Tested | How many unique content variants were tested |
| Clear Winners | Percentage of tested slots where a clear winner was found |
| Est. Revenue Uplift | Total estimated additional revenue from choosing winning variants |
What are Content Insights?

Content Insights are patterns the system detects across your historical campaigns. For example: "Urgency tone tends to outperform your average by approximately 1.2%." These are surfaced with confidence levels:

  • High confidence. Pattern supported by substantial data (10,000+ exposures across many campaigns).
  • Moderate confidence. Suggestive pattern worth investigating, but based on less data.

Important: Insights are observational, not causal. A pattern like "urgency outperforms" is a correlation. It could be influenced by the specific copy, audience, timing, or other factors that happened to accompany that tone. The insight is a hypothesis to test deliberately, not a guaranteed rule.

Every insight includes hedging language to remind you of this, and a disclaimer at the bottom reads: "These insights are based on historical patterns and may be influenced by factors beyond the content attribute itself. Use them as hypotheses to test, not as rules to follow."

Why don't I see any insights?

Insights require a meaningful history to detect patterns. They won't appear until:

  • You've completed at least 5 campaigns with the same snippet attributes
  • At least 3 variants share the attribute being analysed
  • The pattern passes a statistical threshold (adjusted for the number of attributes being tested simultaneously)

Snippet Performance

What is the Snippet Performance page?
This page aggregates how each variant has performed across all the campaigns it's appeared in. Instead of looking at one campaign at a time, you can see the big picture: which variants consistently win, which are reliable, and which are inconsistent.
What do the performance verdicts mean?
| Verdict | Criteria | What It Means |
|---|---|---|
| Strong performer | Won 60%+ of its campaigns, across 4+ campaigns | Reliably outperforms. Consider making it your default. |
| Consistent | Won 40%+ with low variability | Reliable middle-of-the-road performer |
| Variable | High variability across campaigns | Sensitive to audience or timing. Unpredictable. |
| Needs more data | Fewer than 3 campaigns | Too early to judge. Keep testing. |
What does the sparkline show?
The sparkline chart on each variant's detail page shows its conversion rate across every campaign. A flat line is good (consistent performer). A downward trend suggests novelty effects wore off. An upward trend suggests the audience is warming to it.
What is a temporal trend warning?

If a variant's performance is clearly trending up or down across campaigns (tested in 3 or more campaigns), Liftstack surfaces a warning. This helps you catch two temporal biases:

  • Novelty effects: a new content style gets a temporary engagement boost simply because it is different from what recipients are used to. The boost decays as the novelty wears off, meaning the current estimate may overstate long-term performance.
  • Primacy effects: recipients are habituated to the existing style and initially resist the change. The new variant underperforms at first but improves over time, meaning the current estimate may understate long-term performance.

Liftstack detects these by computing the Spearman rank correlation between campaign send order and conversion rate. When a strong monotonic trend is found, the warning tells you the direction and includes both the most recent conversion rate and the average rate across all campaigns so you can judge the likely long-term performance yourself.
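A minimal sketch of that check using scipy's Spearman correlation. The ±0.7 cutoff for a "strong" trend is an assumption for the example, not Liftstack's published threshold.

```python
from scipy.stats import spearmanr

def temporal_trend(conversion_rates_by_send_order, threshold=0.7):
    """Flag a strong monotonic trend across a variant's campaigns.

    conversion_rates_by_send_order: rates in the order campaigns were sent,
    e.g. [0.034, 0.031, 0.029, 0.027] for a possible novelty effect.
    """
    if len(conversion_rates_by_send_order) < 3:
        return None  # needs 3 or more campaigns
    rho, _ = spearmanr(range(len(conversion_rates_by_send_order)),
                       conversion_rates_by_send_order)
    if rho >= threshold:
        return "upward trend (possible primacy effect)"
    if rho <= -threshold:
        return "downward trend (possible novelty effect)"
    return None

print(temporal_trend([0.034, 0.031, 0.029, 0.027]))  # downward trend (possible novelty effect)
```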

Data Quality & Warnings

What is a Sample Ratio Mismatch (SRM)?

An SRM means the actual traffic split between variants doesn't match what was intended. For example, you set up a 50/50 split but actually got 53/47. This is a serious issue because it suggests something went wrong in the delivery pipeline. If the problem correlates with the variants, all the statistical results become untrustworthy.

Common causes: partial failures when writing assignments to your CRM, recipients unsubscribing between assignment and send, template rendering errors for one variant, or platform-side content filtering.

When SRM is detected, Liftstack blocks the verdict and shows a red warning explaining the mismatch. You should investigate the root cause before trusting any results.
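SRM checks are typically done with a chi-squared goodness-of-fit test against the intended split. A sketch follows; the 0.001 significance cutoff is a common convention, assumed here rather than taken from Liftstack's documentation.

```python
from scipy.stats import chisquare

def check_srm(observed_counts, intended_split, alpha=0.001):
    """Chi-squared test of observed exposures against the intended split.

    A very small p-value means the mismatch is unlikely to be chance,
    so results should not be trusted until the cause is found.
    """
    total = sum(observed_counts)
    expected = [total * share for share in intended_split]
    stat, p_value = chisquare(observed_counts, f_exp=expected)
    return p_value < alpha, p_value

# Intended 50/50, but delivery came out 53/47 across 100,000 recipients
print(check_srm([53_000, 47_000], [0.5, 0.5]))   # (True, tiny p-value) -> SRM flagged
```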

What are data quality checks?

Before running any analysis, Liftstack automatically checks:

  • Assignment completeness. Were all audience members actually assigned a variant?
  • Sample ratio mismatch. Does the actual split match the intended split?
  • Zero-event variants. Does any variant have zero engagement events despite having recipients? (May indicate a tracking issue.)
  • Minimum data threshold. Has each variant accumulated enough data for meaningful analysis?

Issues are flagged directly on the campaign report with severity levels (critical warnings block analysis; minor warnings are informational).

What about bot traffic?

Email engagement metrics are polluted by bots. Liftstack automatically filters these out during event ingestion by detecting:

  • Known bot user agents (Googlebot, link scanners, headless browsers, etc.)
  • Known email security scanners (Barracuda, Proofpoint, Mimecast, etc.)
  • Impossibly fast clicks (within 1 second of delivery)

The campaign report shows what percentage of traffic was classified as bot activity and excluded. Typical campaigns see 5 to 15% bot traffic.
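Conceptually, the filter is a pair of simple checks per event. A sketch, with an illustrative (not exhaustive) marker list:

```python
from datetime import datetime, timedelta

BOT_UA_MARKERS = ("googlebot", "headlesschrome", "barracuda", "proofpoint", "mimecast")

def is_bot_click(user_agent: str, delivered_at: datetime, clicked_at: datetime) -> bool:
    """Flag clicks from known scanners, or clicks within 1 second of delivery."""
    ua = (user_agent or "").lower()
    if any(marker in ua for marker in BOT_UA_MARKERS):
        return True
    return (clicked_at - delivered_at) < timedelta(seconds=1)
```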

What does "interaction detected" mean?

When your campaign tests multiple slots (e.g., subject line AND hero image), Liftstack checks whether the combination matters. An interaction means: Variant A in the subject line slot performs differently when paired with Variant X vs Variant Y in the hero slot.

Interactions are flagged with cautious language: "We detected a possible interaction... This may warrant investigation but could also be coincidental." The per-slot results remain valid. The interaction is additional context, not a change to the verdict.

Multi-Channel Testing

What channels does Liftstack support?
Liftstack supports five messaging channels: email, push notifications, SMS, in-app messages, and in-app surfaces (content cards). Each snippet and campaign is scoped to a single channel.
How do I create a push notification or SMS test?
The workflow is the same as email. When creating a snippet, select the channel (e.g., "Push Notification"). The available placements and content types adjust automatically. For push, you'll see placements like "title", "body", "image", and "deep_link". For SMS, you'll see "body". Then create a campaign, select the same channel, and the slot form will only show snippets matching that channel.
Are there character limits for push and SMS?
Liftstack shows soft character warnings: push titles at 65 characters, push body at 240 characters, and SMS body at 160 characters. These are advisory. Liftstack does not truncate your content, but the receiving platform or device may.
How does attribution work for push and in-app channels?

Attribution varies by channel:

  • Email and SMS use URL-based attribution for clicks: Liftstack embeds a tracking parameter (lf_cid) in links and matches click events back to the specific variant and slot. Other metrics (opens, conversions, revenue) use profile-based matching.
  • Push and in-app channels use profile-based attribution entirely: since Liftstack writes variant assignments to user profiles, any engagement event from that user is matched to their assigned variant via their profile ID.

In all cases, the attribution window is per-recipient (measured from each person's assignment time, not from a single campaign-wide timestamp). The default window is 7 days.

What are campaign groups?
Campaign groups let you organise related campaigns across channels for cross-channel reporting. For example, you might group an email campaign and a push campaign that both test the same product launch messaging. The group detail page shows a per-channel comparison table.
What is shared assignment?
When "shared assignment" is enabled on a campaign group, all campaigns in the group assign the same variant label to each user. If a user is assigned "Variant B" in the email campaign, they also see "Variant B" content in the push campaign. This enables true cross-channel experimentation where you can measure the combined effect of consistent messaging.
Can I mix channels within a single campaign?
No. Each campaign tests one channel. Mixing channels within a campaign would create incomparable metrics (email open rates and push tap rates have different base rates). Use campaign groups for cross-channel coordination instead.

Common Questions About the Statistics

Is Bayesian analysis as rigorous as traditional statistics?

Yes, and arguably more so for this use case. The Bayesian approach used in Liftstack:

  • Produces the same quality of conclusions as frequentist methods (p-values, confidence intervals)
  • Provides answers that are easier to interpret correctly ("93% probability this is the best" vs "p < 0.05")
  • Handles continuous monitoring naturally, so you can check results at any time without inflating error rates
  • Does not require pre-determined sample sizes. It reports the current state of evidence regardless of how much data has arrived.
  • Includes built-in protection against the winner's curse (extreme results are naturally pulled toward realistic values)
Why 50,000 Monte Carlo samples?
Behind the scenes, Liftstack uses a simulation technique called Monte Carlo sampling: it draws 50,000 random scenarios from the statistical model to estimate probabilities. This is more than sufficient for stable, reproducible results. Increasing beyond 50,000 wouldn't meaningfully change any number you see in the report.
What is the "prior" and does it affect my results?

In Bayesian statistics, the prior represents your starting assumption before seeing any data. Liftstack defaults to an uninformative prior, meaning it starts with no assumptions about what the conversion rate should be. This is conservative and lets the data speak for itself.

After you've completed 5+ campaigns, Liftstack can automatically switch to an adaptive prior that encodes your workspace's typical conversion rate range (e.g., "our campaigns usually convert between 1% and 4%"). This helps small tests converge faster without biasing toward any particular variant, because it applies the same prior to all variants equally.

You can also manually set the prior if you have specific domain knowledge, but most users never need to touch this.

Won't the prior bias my results?

No, for two important reasons:

  1. The same prior is applied to every variant in the test. It shifts all estimates equally and doesn't favour one variant over another.
  2. The prior's influence shrinks rapidly as data arrives. After a few hundred exposures per variant, the data overwhelms the prior entirely.

The prior mainly matters in the early stages of a test (under 300 exposures per variant), where it prevents extreme estimates from tiny samples.
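
To see both points in numbers, here is a small Beta-Binomial sketch; the prior parameters, conversion counts, and exposure counts are illustrative assumptions, not Liftstack's actual defaults:

```python
# Posterior mean for a Beta-Binomial model:
#   (alpha + conversions) / (alpha + beta + exposures)
# The prior contributes a fixed number of "pseudo-observations", so its
# influence fades as real exposures accumulate.
uninformative_prior = (1, 1)   # flat: no assumption about the rate
adaptive_prior = (2, 98)       # illustrative: "usually converts in the low single digits"

datasets = {"early (400 exposures)": (6, 400),
            "mature (20,000 exposures)": (300, 20_000)}

for d_label, (conv, expo) in datasets.items():
    for p_label, (a, b) in [("uninformative", uninformative_prior),
                            ("adaptive", adaptive_prior)]:
        mean = (a + conv) / (a + b + expo)
        print(f"{d_label:>26} | {p_label:>13}: posterior mean ≈ {mean:.3%}")
```

At 400 exposures the two priors give noticeably different estimates; at 20,000 exposures they are essentially identical, because the data has overwhelmed the prior.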

What is ROPE and why does it matter?

ROPE (Region of Practical Equivalence) is how Liftstack determines whether a difference is too small to care about. The default ROPE width is 0.5 percentage points, meaning if two variants are within half a percentage point of each other, they're treated as functionally equivalent. For low conversion rate campaigns (below ~2%), the ROPE width is automatically scaled down relative to the observed rate so that it remains a meaningful comparison threshold.

This prevents the system from declaring a "winner" that only beats the control by 0.02 percentage points. Technically better, but practically meaningless.
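
A minimal sketch of how a ROPE check can be computed from posterior draws, using the default 0.5 percentage-point width from above and made-up data:

```python
import numpy as np

rng = np.random.default_rng(7)
N = 50_000
ROPE = 0.005  # 0.5 percentage points, the default width described above

# Hypothetical results: control (2.10%) vs challenger (2.22%) on 10,000 sends each.
control = rng.beta(1 + 210, 1 + 10_000 - 210, size=N)
challenger = rng.beta(1 + 222, 1 + 10_000 - 222, size=N)

diff = challenger - control
print(f"P(|difference| < 0.5pp) ≈ {np.mean(np.abs(diff) < ROPE):.2f}")
print(f"P(challenger > control) ≈ {np.mean(diff > 0):.2f}")
# A high probability of sitting inside the ROPE means the variants are
# practically equivalent, even if the challenger is nominally "better".
```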

How does Liftstack handle multiple comparisons?

When you test many variants across many slots, the chance of finding a false positive increases. Liftstack handles this differently for each metric tier:

  • Primary metric. The Bayesian framework already accounts for all variants simultaneously. Probability of being best is computed jointly, so no additional correction is needed within a slot.
  • Guardrail metrics. Bonferroni correction is applied across guardrails within each channel. For email (3 guardrails: unsubscribe, complaint, bounce), the per-test probability threshold is raised from 90% to ~96.7% so that the combined false alarm rate stays at 10%. For channels with a single guardrail (push/SMS: opt-out, in-app: dismiss), no correction is needed.
  • Diagnostic metrics. No correction. They're explicitly labelled as exploratory context, not decision-drivers.
  • Cross-slot uplift. When summing uplift across multiple slots, Bonferroni-adjusted confidence intervals are computed. Each slot's CI uses a per-slot confidence level so the combined interval maintains 95% coverage.
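
The arithmetic behind those corrections is straightforward. Here is a short sketch using the 10% guardrail false-alarm budget and 95% combined coverage mentioned above; the helper functions are illustrative, not part of Liftstack's API:

```python
def guardrail_threshold(false_alarm_budget: float, n_guardrails: int) -> float:
    """Per-guardrail probability threshold so the combined
    false-alarm rate stays at the budget (Bonferroni)."""
    return 1 - false_alarm_budget / n_guardrails

print(guardrail_threshold(0.10, 3))  # email, 3 guardrails -> ~0.967
print(guardrail_threshold(0.10, 1))  # push/SMS/in-app, 1  -> 0.90

def per_slot_confidence(combined_coverage: float, n_slots: int) -> float:
    """Per-slot CI level so the summed cross-slot interval keeps
    roughly the combined coverage (Bonferroni-adjusted)."""
    return 1 - (1 - combined_coverage) / n_slots

print(per_slot_confidence(0.95, 4))  # 4 slots -> 98.75% per-slot CIs
```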
Is the uplift number real? Can I trust it?

The uplift estimate ("+X additional conversions, +£Y additional revenue") is the system's best estimate based on the data, with several safeguards against overestimation:

  1. It uses the posterior mean (not the raw observed difference), which naturally shrinks extreme estimates toward realistic values
  2. It always includes a credible interval (range) so you can see the best and worst case
  3. It reports the probability this is a real improvement (e.g., "94% chance of real improvement")

That said, all estimates have uncertainty. The true uplift could be at the high end of the range, the low end, or anywhere in between. The headline number is the most likely value, and the range gives you the realistic spread.
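
As a simplified illustration of how a headline uplift figure, its range, and the improvement probability can come out of posterior draws (the audience size, order value, and conversion counts below are made up):

```python
import numpy as np

rng = np.random.default_rng(3)
N = 50_000
audience_per_variant = 25_000
revenue_per_conversion = 42.0  # hypothetical average order value

# Hypothetical results: control 300/25,000 vs winner 360/25,000.
control = rng.beta(1 + 300, 1 + 25_000 - 300, size=N)
winner = rng.beta(1 + 360, 1 + 25_000 - 360, size=N)

extra_conversions = (winner - control) * audience_per_variant
extra_revenue = extra_conversions * revenue_per_conversion

lo, hi = np.percentile(extra_revenue, [2.5, 97.5])
print(f"Estimated additional revenue ≈ £{extra_revenue.mean():,.0f}")
print(f"95% credible interval: £{lo:,.0f} to £{hi:,.0f}")
print(f"P(real improvement) ≈ {np.mean(extra_conversions > 0):.0%}")
```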

What is the winner's curse?

When you test many variants and declare the best-performing one the "winner", its observed performance tends to be slightly inflated by luck. The variant that happened to get favourable randomness in this particular test looks better than it truly is.

Liftstack mitigates this automatically through the Bayesian model (which shrinks extreme estimates) and by always reporting credible intervals alongside point estimates. You should interpret the range, not just the headline number.

Commercial, Privacy & Administration

How is Liftstack priced?

Liftstack offers three paid tiers, billed monthly or annually (with ~17% discount for annual billing). Billing is per-organization, with pooled limits across all workspaces:

| Feature | Starter (£249/mo) | Growth (£549/mo) | Scale (£999/mo) |
| --- | --- | --- | --- |
| Workspaces | 1 | 5 | 20 |
| Audience profiles (pooled) | 250,000 | 2,000,000 | 10,000,000 |
| Campaigns per month (pooled) | 15 | 60 | Unlimited |
| Slots per campaign | 2 | 4 | Unlimited |
| Variants per slot | 3 | 5 | 5 |
| Connections per workspace | 1 | 1 | 3 |
| Team members (org-level) | 3 | 15 | Unlimited |
| SSO/SAML | Yes | Yes | Yes |
| Smart Allocation | No | Yes | Yes |
| Revenue modelling | No | Yes | Yes |
| Content Insights | No | Yes | Yes |
| Stratified TS | No | No | Yes |
| Interaction detection | No | No | Yes |
| Adaptive priors | No | No | Yes |

Add-on profile packs are available: +250K (£79/mo), +500K (£149/mo), +1M (£249/mo). Extra workspaces can be purchased on Growth (£79/mo each) and Scale (£59/mo each).

There is also a 14-day free trial with Growth-tier features, 1 workspace, and 2 campaigns, so you can run a real test before committing.

Can I invite my agency or team members to my organization?

Yes. Every plan includes team member seats at the organization level. You invite team members by email, and they get their own login with access to all workspaces in the organization. Liftstack supports three roles:

  • Owner: full access, including billing and organization settings
  • Admin: full access to campaigns, snippets, integrations, and workspace settings
  • Member: can create and manage campaigns and snippets; cannot modify integrations or organization settings

If you need limited access for stakeholders, the Member role is the closest fit. Members can view all reports and dashboards and manage campaigns and snippets, but cannot modify integration credentials or organization settings.

Is Liftstack GDPR compliant?

Liftstack is designed with data minimisation in mind:

  • What Liftstack stores: Platform profile IDs (the identifier your ESP uses), email addresses (for audience sync), and engagement events (clicks, opens, conversions) with their metadata. These are necessary to run the test and attribute results.
  • What Liftstack does NOT store: Payment information (handled entirely by Stripe), email content rendered to recipients (that stays in your ESP), or any personal data beyond what's listed above.
  • Encryption at rest: API credentials, email addresses, audience profile properties, and event payloads are all encrypted at rest using Fernet symmetric encryption with per-workspace derived keys. All data in transit uses TLS.
  • Data location: Liftstack runs on infrastructure hosted in the EU/US (depending on your account region). Contact support for specifics about data residency.
  • Data processing: Liftstack acts as a data processor on your behalf. You remain the data controller for your subscriber data.

If your organisation requires a Data Processing Agreement (DPA), contact support and we will provide one.

Does Liftstack store Personally Identifiable Information (PII)?

Liftstack stores the minimum PII necessary to run tests: platform profile IDs or email addresses from your audience sync. These are used to match assignments to engagement events for attribution. No additional personal data (names, addresses, payment details) is collected or stored.

Email addresses and audience profile properties are encrypted at rest using per-workspace Fernet keys. Engagement event payloads are also encrypted. Platform profile IDs (the opaque identifiers your ESP assigns to each contact) are stored unencrypted because they are required for database lookups and attribution joins.
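
For readers who want to see the general pattern, here is a minimal sketch of deriving a workspace-specific Fernet key from a master secret; the derivation details are assumptions for illustration, not Liftstack's actual scheme:

```python
import base64
from cryptography.fernet import Fernet
from cryptography.hazmat.primitives import hashes
from cryptography.hazmat.primitives.kdf.hkdf import HKDF

def workspace_fernet(master_secret: bytes, workspace_id: str) -> Fernet:
    """Illustration only: derive a per-workspace Fernet key via HKDF.
    The actual derivation Liftstack uses is not documented here."""
    derived = HKDF(
        algorithm=hashes.SHA256(),
        length=32,                   # Fernet keys are 32 bytes, base64-encoded
        salt=None,
        info=workspace_id.encode(),  # binds the key to one workspace
    ).derive(master_secret)
    return Fernet(base64.urlsafe_b64encode(derived))

f = workspace_fernet(b"master-secret-from-a-vault", "ws_12345")
token = f.encrypt(b"subscriber@example.com")
print(f.decrypt(token))  # b'subscriber@example.com'
```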

Engagement events are stored with their metadata (timestamps, UTM parameters, event type) but do not include the content of the email itself or any personal data beyond the profile identifier.

What happens to my data if I cancel?

When you cancel your subscription:

  • Your workspace and all its data (campaigns, snippets, results, audience snapshots) remain accessible in read-only mode through the end of your current billing period.
  • After the billing period ends, your workspace enters a grace period. You can reactivate your subscription during this time to restore full access.
  • If you want your data deleted, contact support and we will permanently remove your workspace and all associated data.

Historical campaign results (verdicts, uplift numbers, variant performance) are yours. You can export CSV reports from any campaign before your access expires.

Who can see my test results?
Only members of your workspace. Liftstack is multi-tenant with strict workspace isolation. Users in one workspace cannot see campaigns, snippets, integrations, or results belonging to another workspace, even if they're on the same Liftstack account.
Does Liftstack have access to my ESP account?

Liftstack uses the API key you provide to make specific API calls: syncing audiences, writing profile properties, pushing templates, and fetching engagement events. It does not have access to your ESP dashboard, billing, or any data outside the scope of those API calls. The API key permissions determine exactly what Liftstack can and cannot do.

You can revoke access at any time by deleting the API key in your ESP's settings. Liftstack will immediately lose the ability to make any calls.

Troubleshooting

My test has been running for days but still says "Insufficient Data"

This usually means one of:

  • The variants perform very similarly. If the true difference is tiny, you need a very large audience to detect it. Consider whether the content differences are meaningful enough.
  • Small audience. Check whether your audience meets the minimum size guidance for the effect size you're trying to detect.
  • Low conversion rate. Broadcast campaign conversion rates are often 0.05-0.2%, which requires much larger audiences than flow campaigns. Liftstack automatically adjusts its decision thresholds for low rates and will show a data quality warning when this applies. Consider testing a higher-funnel metric like click rate or open rate for faster signal.
  • Not enough conversions yet. Each variant needs at least 3 conversions (configurable) before verdict computation begins. At low conversion rates, this takes more exposures.

The report will show an estimate of how many more exposures are needed. If that number is impractically large, the variants may simply be too similar to distinguish. That is a valid result; consider declaring them equivalent and moving on.

Why does one variant show zero events?

A variant with recipients but zero engagement events may indicate a tracking issue:

  • Check that the template conditional logic is rendering correctly for that variant
  • Verify that the tracking links contain the correct lf_cid parameter
  • Confirm that your webhook or event polling is functioning

Liftstack flags this as a data quality warning on the campaign report.
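
If you want to spot-check a variant's links yourself, here is a quick sketch (the URLs are made up; only the lf_cid parameter name comes from the guidance above):

```python
from urllib.parse import urlparse, parse_qs

# Flag any tracking link that is missing the lf_cid parameter.
links = [
    "https://shop.example.com/sale?utm_source=email&lf_cid=cmp_789",
    "https://shop.example.com/new-arrivals?utm_source=email",
]

for url in links:
    params = parse_qs(urlparse(url).query)
    status = "ok" if "lf_cid" in params else "MISSING lf_cid"
    print(f"{status}: {url}")
```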

Why was my winner blocked by a guardrail?

This means the variant with the best primary-metric performance also crossed a safety threshold. The guardrails that can block a winner depend on the channel:

  • Email: unsubscribe rate, spam complaint rate, or bounce rate exceeded the threshold vs the control
  • Push/SMS: opt-out rate exceeded the threshold vs the control
  • In-app: dismiss rate exceeded the threshold vs the control

The violation message in the report includes the specific metric, the observed rates, and the probability threshold used (which may be Bonferroni-adjusted when multiple guardrails are checked).

Consider:

  • Reviewing the variant's content for overly aggressive messaging
  • Looking at which audience segments are driving the negative metric
  • Whether the increase is acceptable given the conversion gains (you can acknowledge the guardrail and proceed if you've investigated)
  • For bounce rate violations, check whether the variant contains content that might trigger spam filters or whether there are deliverability issues with the variant's formatting
The report shows an SRM warning. What do I do?

An SRM (Sample Ratio Mismatch) means the traffic split doesn't match what was configured. Steps to investigate:

  1. Check for partial failures in the CRM profile write step (look for error logs during the writeback)
  2. Check whether audience members were suppressed or unsubscribed between assignment and send
  3. Verify that the template renders correctly for all variants (a broken conditional could funnel everyone to a default)
  4. Check for platform-side filtering (spam filters catching one variant's content)

Until the root cause is identified, the statistical results for this slot should not be trusted.
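
If you want to reproduce the check yourself, an SRM check is essentially a chi-square goodness-of-fit test of observed recipient counts against the configured split. A minimal sketch with made-up numbers:

```python
from scipy.stats import chisquare

observed = [10_480, 9_520]       # recipients actually assigned per variant
configured_split = [0.5, 0.5]    # the split the campaign was configured with
expected = [sum(observed) * p for p in configured_split]

stat, p_value = chisquare(observed, f_exp=expected)
if p_value < 0.001:
    print(f"Likely SRM: p = {p_value:.2e}")
else:
    print(f"No evidence of SRM: p = {p_value:.3f}")
```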

Can I re-run a test?
Yes. Create a new campaign with the same snippet and variants. Liftstack will use the historical data from previous campaigns to inform the new test (especially with Smart Allocation enabled). Each campaign is a fresh experiment with fresh assignments.
How do I export my data?
Click the "Export CSV" button on any campaign report to download the full metrics table. This includes all variants, all metrics, and the verdict information.
