Skip to main content

Introduction to Sitecore CDP data retention policy

Abstract

Overview of Sitecore CDP data retention policy.

This topic introduces the Sitecore CDP data retention policy.

Sitecore CDP uses four data storage tiers: technical storage; live storage; archival storage; and data mart.

Technical storage

Sitecore CDP technical storage is a distributed, high-performing data-storage layer that keeps actionable customer data available for other data layers.

The following table describes the data retention limits and removal frequency for the technical storage tier:

Data entity

Retention limits

Removal frequency

Guests with customer as the guest type

Sitecore CDP retains guests with customer as the guest type for an indefinite amount of time. These are site visitors that have passed your organization's identity rules.

does not apply

Guests with visitor as the guest type

Guests with visitor as the guest type are retained for six months from the last date they were active. These are site visitors that are unidentified or are anonymous.

For example, Sitecore CDP removes guests with visitor as the guest type whose last_seen date is greater than six months.

weekly

Guests with retired as the guest type

Sitecore CDP removes retired guests. A retired guest is a guest profile that matches another guest profile. Sitecore CDP matches the guest's data according to your organization's identity rules, then migrates the guest data to the other guest profile, creating a retired guest profile. You cannot search for a retired guest profile.

weekly

Orders

Orders that belong to retired guests are removed.

weekly

Sessions

Sitecore CDP retains the guest's last 1000 sessions.

For example, if a guest has 1200 sessions, Sitecore CDP removes the oldest 200 sessions.

weekly

Note

Events are not stored in technical storage.

Live storage

Sitecore CDP live storage is a real-time, distributed, high-performance data-storage layer that keeps a subset of actionable customer data available in real-time for using in personalization.

The live storage tier is used for big data and powers all the guest context related intelligence, decisioning, and analytics features. Built on Lambda architecture, the live storage tier contains guests' real-time and historical behaviors and transactional data.

Your organization might have different data limits or time durations that apply depending on your industry's regulations. The data limits in the following table are the default, and do not apply if your organization has requested a different configuration.

The following table provides details on the data retention limits and removal frequency for the live storage tier:

Data entity

Retention limits

Removal frequency

Guests with customer as the guest type

Sitecore CDP retains guests with customer as the guest type for an indefinite amount of time. These are site visitors that have passed your organization's identity rules.

does not apply

Guests with visitor as the guest type

Sitecore CDP retains guests with visitor as the guest type for 6 months from the last date they were active. These are site visitors that are unidentified or are anonymous.

daily

Orders

Sitecore CDP retains the guest's most recent 20 orders.

For example, if a guest has 29 orders, Sitecore CDP removes the oldest 9 orders.

daily

Sessions

Sitecore CDP retains the last 40 sessions within the last 90 days.

If any of the last 40 sessions are offline, Sitecore CDP only retains a maximum of 10 offline sessions. For example, if a guest's last 40 sessions consist of 12 offline sessions and 28 online sessions, Sitecore CDP only retains 10 offline sessions.

Sitecore CDP removes session data that is 91 days or older.

daily

Events

Sitecore CDP retains the last 100 events from a session. Events are stored in sessions. Only events from the last 40 sessions within the last 90 days are retained.

For example, Sitecore CDP removes an event from the 41st most recent session. Similarly, Sitecore CDP removes any event from sessions that are 91 days or older.

daily

Guest data extensions

Sitecore CDP retains the last 100 data extensions for a guest.

For example, if a guest has 103 data extensions, Sitecore CDP removes the oldest 3 guest data extensions.

daily

Guest identifiers

Sitecore CDP retains the last 50 identifiers for a guest.

For example, if a guest has 62 identifiers, Sitecore CDP removes the oldest 12 identifiers.

daily

Archival storage

Sitecore CDP archival storage consists of the Sitecore CDP data lake. The data retention policy for archival storage does not apply. Data is archived and available in the Sitecore CDP data lake for the agreed length of the contract for powering analytics and batch data synchronizations.

Data mart

Sitecore CDP data mart includes a subset of data from archival storage and is the data store used for reporting, performance analytics, and segmentation.

The following table provides details on the data retention limits and removal frequency for the data mart:

Location

Entity

Retention limits

Removal frequency

segmentation data mart

guests, orders, and sessions

For the life of the contract

upon customer request

segmentation data mart

events

Maximum of 2 years

manually

segmentation member history

guests

15 days

daily