Glossary

This guide lists the definitions of the RudderStack-related terms that you are likely to encounter throughout our documentation and while using RudderStack.

If you come across any term that is not listed here, you can contact us or start a conversation on our Slack channel.

API specification

The RudderStack API Spec helps you plan your event data and provides various options for tracking your events across all the RudderStack SDKs and APIs. As RudderStack has a unified event semantic for different destination platforms, you can easily translate your event data to different downstream tools by following this spec.

For more information on the RudderStack API Specification, refer to our docs.

Audit logs

Audit Logs is RudderStack's enterprise feature that gives you a detailed log of the user activities happening within your RudderStack workspace. These activities include the various user operations related to sources, destinations, transformations, and more.

For more information on the audit logs feature, refer to our docs.

Cloud Extract

Cloud Extract is RudderStack's ELT feature that lets you collect your raw events and data from various third-party cloud platforms such as Google Analytics, Marketo, Facebook Ads, Stripe, etc. and send it to your data warehouse with a user-specified frequency.

For more information on the various Cloud Extract sources and how to set them up in RudderStack, check out the docs.

Connection (Pipeline)

A connection is a one-to-one directional flow of events between a RudderStack source and a destination.

For more information on sources and destinations in RudderStack, check our Connections guide.

You can set up different types of connections in RudderStack to send your events, based on the type of source:

  • Event Stream: One source to many destinations
  • Cloud Extract: Multiple sources to one warehouse destination
  • Warehouse Action: One warehouse source to one downstream destination (mainly due to the mappings required when setting up the connection).

Setting up a connection involves adding a source and destination in RudderStack.

Connection modes (Cloud mode & Device mode)

You can send the event data from your website/mobile app to your desired destination via RudderStack in two ways:

  • Cloud mode: In this mode, the RudderStack SDKs track and send the event data directly to the RudderStack server for processing. RudderStack processes this data and routes it to the desired destination. This mode is useful when you want to transform your source events in a destination-specific format.
  • Device mode: In this mode, you can send the source events to your destination using the client-specific libraries on your website/mobile app. These libraries allow RudderStack to use the data you collect on your device to call the destination APIs without sending it to the RudderStack server first. This mode is useful when you want to send the events to a destination as-is, without any transformation.

These two modes are commonly referred to as RudderStack connection modes.

For more information, check the RudderStack connection modes guide.

Control plane

The control plane manages the configuration of your sources and destinations. The interface for the control plane is the RudderStack web app.

For more information on the control plane, refer to RudderStack's architecture.

Control Plane Lite

RudderStack's control plane offers an intuitive UI to configure your event data sources and destinations.

If you want to self-host these configurations, you can use the open-source Control Plane Lite utility to set up your control plane. You can then manage the source and destination configurations locally by exporting to or importing them from a JSON file.

For information on Control Plane Lite and how to use it, refer to the docs.

Customer Data Platform

A Customer Data Platform (CDP) is a software/collection of tools that unifies and persists all the customer-specific records across multiple data sources in a centralized location accessible to other tools/platforms. A CDP lets you build a comprehensive customer profile and use the insights for a variety of use-cases.

Data governance

RudderStack's data governance feature gives you the ability to access all your events and their metadata programmatically and identify any inconsistencies in them. This includes vital information related to the event schema, event payload versions, data types, and more.

Refer to the Data Governance API reference for the detailed technical documentation.

Data plane

The data plane is RudderStack's core engine responsible for:

  • Receiving and buffering the event data
  • Transforming the events into the required destination format
  • Relaying the events to the destination

For more information on the data plane, refer to RudderStack's architecture.

The data plane is intentionally separated from the control plane to give you complete ownership of your data. Depending on how RudderStack is deployed, you can set up your control plane and data plane via one of the following approaches:

  • RudderStack-hosted: RudderStack hosts both Control Plane and Data Plane
  • Hybrid: RudderStack hosts the Control Plane and the user hosts the Data Plane
  • Customer: User hosts both the Control Plane and Data Plane via open-source.

Destination

A destination is a tool or application where you want to send the data via RudderStack.

RudderStack currently supports over 80 destinations. These include data warehouses, analytics platforms, CRMs, marketing platforms, and more.

Check out the Destinations guide for the complete list of supported destinations.

ELT (Extract, Load, Transform)

The ELT (Extract, Load, Transform) process can be defined as:

  • Extract: Obtaining data from the source platform or application.
  • Load: Replicating the data from the source into the target system, typically a data warehouse or a data lake.
  • Transformation: Transforming the data in the desired format according to the business requirement/use-case.

Check out our blog to read more about ELT and how RudderStack facilitates it via the Cloud Extract feature.

Event

Events are the fundamental pieces that constitute clickstream data. They often correspond to user actions on websites or mobile apps such as clicks, page/screen views, logins, registrations, etc.

Tracking events in real-time helps businesses to understand user behavior and their product journey. These insights allow them to deliver better recommendations, send relevant push notifications, and optimize their product experience.

Check out our blog to read more about clickstream analytics.

Event Stream

RudderStack's Event Stream feature lets you collect your event data from all of your web and mobile apps and route it to a wide array of customer tools and data warehouses via RudderStack.

For more information on the various Event Stream sources supported by RudderStack, check out the docs.

Live Events

RudderStack's Live Events feature offers a debugger that shows the live events collected from your sources and sent to the connected destinations in real-time. With this feature, you can easily debug any errors in the failing events at a destination level and reduce your troubleshooting time and efforts.

Broadly speaking, this feature can be further classified into two categories:

  • Source Live Events: This feature gives you real-time visibility into the source events collected by RudderStack. This way, you can confirm if your source is configured correctly and is collecting & sending data as expected.
  • Destination Live Events: When routing events to a destination, sometimes events don't show up in your destination. This feature gives you real-time visibility into the destination's responses and helps you troubleshoot the problem.

For more information on this feature, check out our docs.

Reverse ETL

The ETL (Extract, Transform, Load) process can be defined as:

  • Extract: Collecting/pulling data from a variety of sources.
  • Transform: Refining and transforming the data as per the required data quality and business requirements.
  • Load: Pushing the data into a consolidated data store (for e.g., data warehouse)

Conversely, Reverse ETL is the process of routing the enriched data residing in your data warehouse to various downstream tools within your customer data stack. This includes various SaaS marketing, analytics, sales, and customer support tools.

Check out our blog to read more about Reverse ETL and how RudderStack facilitates it via the Warehouse Actions feature.

SDK

RudderStack offers client-side SDK support for web (JavaScript), mobile (Android, iOS), as well as server-side apps (Java, Python, Node.js, PHP, Go, and Ruby).

You can also track your event data from your Flutter app via the Flutter SDK or from your AMP page via the RudderStack AMP Component.

Refer to the SDK documentation for more details.

Source

A source is a platform or an application (web, mobile, server-side, or a third-party cloud app) from where RudderStack tracks and collects your event data. We highly recommend creating a source for every unique data source that you want to track.

For more information on RudderStack sources, check out our documentation.

Visual Data Mapper

The Visual Data Mapper(VDM) is a Warehouse Actions functionality. It offers an intuitive UI to map your data warehouse columns to specific destination fields without any second-guessing.

For more information on RudderStack sources, check out our documentation.

Teammates

RudderStack's Teammates feature enables you to add and manage other users in your current RudderStack workspace. It facilitates easier collaboration between you and other team members of your organization in using RudderStack effectively.

For more information on the Teammates feature, check out our documentation.

Transformations

RudderStack's Transformations feature lets you leverage your custom JavaScript functions that you can use to implement a variety of use-cases like:

  • Filtering or sampling events
  • Implementing a static logic to enrich your events
  • Removing any sensitive PII information from your customer events, and a lot more.

For more details on this feature and how to use it, check out our Transformations docs.

User suppression

User Suppression is RudderStack's enterprise feature that lets you programmatically suppress user data identified by a user ID. With this feature, you can block all the user data for all the sources and destinations in RudderStack.

Refer to the RudderStack User Suppression API reference for more information.

Warehouse Actions

Warehouse Actions is RudderStack's Reverse-ETL feature. It allows you to leverage the already processed customer data residing in your data warehouse and route this enriched information to your desired destinations, including SaaS marketing, analytics, sales, and customer support tools.

For more information on the various Warehouse Actions sources and how to set them up in RudderStack, check out the docs.

Warehouse destination

RudderStack supports sending events to your data warehouse. These are called the warehouse destinations.

Currently, RudderStack supports integration with all the leading data warehouses like Amazon Redshift, Azure Synapse, Google BigQuery, Snowflake, PostgreSQL, ClickHouse, and Microsoft SQL Server.

For more information on how to use these warehouse destinations, check out our Data Warehouse Integration guides.

Warehouse schema

When sending your events to a data warehouse via RudderStack, you need not define a schema for the event data before sending it from your source. Instead, RudderStack automatically does that for you by following a predefined warehouse schema. This schema defines the different tables and columns created based on different events.

Refer to the Warehouse Schema documentation for more details.

Workspace token

The workspace token is a unique identifier of your RudderStack workspace. You can find it by logging in to the RudderStack web app.

Write key

The write key (also referred to as the source write key) is a unique identifier for your source. It is used while sending events from a source to your specified destination via RudderStack.

Workspace Token vs Source Write Key Workspace Token vs Source Write Key

Contact us

For any questions on any of the topics covered in this guide, you can contact us. You can also start a conversation on our Slack channel.