Introduction to Storage
Cargo workflows are powered by underlying data models, which are structured tables containing organized data. These models can be imported into a workflow to generate runs.
To ensure efficient data management, Cargo leverages a data warehouse infrastructure to host these models. This warehouse collects, manages, and stores data for each workspace, ensuring accuracy, consistency, and accessibility across all data models within that workspace.
Cargo enables users to create data models from a variety of external and internal sources, including API endpoints, webhook events, and SQL queries on warehouse tables. This integration allows users to merge multiple data sources into a unified, cohesive model.
By supporting data model creation from both external sources and existing models within Cargo, the platform provides a flexible way to consolidate diverse datasets under a common schema.
Models support several advanced features, such as:
- Creating segments: Apply filters to data models to form segments, allowing for targeted workflows based on specific criteria.
- Change-based triggers: Enable workflows to respond in real-time to updates, additions, or deletions in the data, ensuring action based on the most pertient signals.
- Avoiding duplicate runs: Track records to prevent workflows from processing the same data multiple times.
- Model relationships: Allow models to connect to each other using common identifiers, providing a unified view.
- Custom columns: Use dynamic functions to create new fields that transform and enrich the data, adding calculated values or tags that enhance it.
- Seting up system of record: Allow using existing warehouse instance as a system of record kept up to data by Cargo
This framework ensures that Cargo users can effectively manage their data, offering a dynamic set of options for feeding data into workflows.
What you will learn
In this module, you will learn about:
- What are data models and how to use data loaders to import data from various sources into Cargo.
- Set model relationships and transform data within models to enhance its utility.
- Setting up segments and managing filters to trigger workflows.
- Use custom columns to create dynamic, calculated fields that add value to your data.
- Query and upsert data from within your workflows.
Core features
Data loaders
data:image/s3,"s3://crabby-images/82426/824268d5bbd1adf54df54559bedf0cd1cd431f41" alt="Cover image"
Among all of the integrations available in Cargo, those available as storage connectors allow the user to bring external data to a data model using a data loader. This capability enables you to integrate data from various sources, such as CRMs, marketing automation platforms, and e-commerce databases, into a single, unified model.
In Go-To-Market (GTM) functions, where fragmented data about prospects, customers, and their behaviors is often a limit on effective workflow creation, this is a game-changing feature. See the section on integration for more information on how this works.
There are three types of data loaders available in Cargo:
- API data loader: This loader allows you to connect to an external API endpoint to fetch data and import it into a model.
- Webhook data loader: This loader allows you to listen for events from an external source and import the data into a model.
- SQL data loader: This loader allows you to run SQL queries on tables in the data warehouse and import the results into a model.
As webhook-based data loaders create non-standard data structure, they have their own set of rules. For instance, once a webhook-based model is created, its columns are strongly typed. For instance, if column A was configured as an integer, it cannot be changed to a string for any subsequent records.
Model relationships
Relationships between models in Cargo are useful for creating a comprehensive view of your data. By connecting different models using common identifiers, you can create merged views of your customer data from different sources.
Defining relationships involves specifying how records in one model relate to records in another model. This includes determining the type of relationship (such as one-to-one, one-to-many, or many-to-many), and mapping the identifier columns between the models. Relating models to one another can provide richer context for decision-making and workflow triggers.
For instance, consider the example of connecting Salesforce data with Stripe invoice data. By linking the 'SalesforceAccounts' model with the 'StripeInvoices' model using a common identifier such as 'CustomerID', you can create a relationship that combines CRM data with financial data. This integrated view allows you to see how customer interactions in Salesforce relate to their invoicing history in Stripe.
Filters and segments triggers
By applying filters to data models, users can create focused subsets of data within the larger model. For example, a filter could be created to narrow down prospects who were participants in a webinar event.
Enabling a workflow allows it to respond to real-time changes in the underlying model's data. When an appropriate change is detected, the workflow can import records from that model to create runs. A workflow will periodically resync for changes according to a schedule or a 'cron job' (except for webhook-based models, which resync automatically when a new record is added). An appropriate change is triggered whenever a record's data matches the criteria set in the filter, whether due to a record's update, addition, or removal.
Importing records into workflows based on these changes allows users to build workflows that operate autonomously. For example, in a model containing customer data, records can be imported into a workflow whenever a new customer is added or when existing customer data is updated.
Another advantage of this system is the ability to track records that enter a workflow. By maintaining a history of imported records, Cargo prevents workflows from creating multiple runs for the same record.
Filter logic can be easily reused across Cargo workspace when saved as segments. Segments are named sets of filters.
Search and upsert model data
Within workflows, users have the ability to query data from any model in Cargo, ensuring that the data is accessible and actionable when needed.
data:image/s3,"s3://crabby-images/e4761/e476142a9596528d5723ac2a132c1ec7b3cedfcb" alt="Cover image"
This capability allows for one-off data retrieval operations without needing to create any merged views.
In a similar fashion, upserts to a model enable users to to push data to custom columns for existing records for transactional data, as it allows for synchronization of data across different systems.
data:image/s3,"s3://crabby-images/890c5/890c530a69dc188071d26574a0c22dd8525bbf23" alt="Cover image"
Custom columns
data:image/s3,"s3://crabby-images/a30a5/a30a5b1c447c1973100d73603d19e386eafaf836" alt="Cover image"
Cargo allows users to create custom columns. These columns let you transform data within your models by adding calculated fields or tags, enhancing the utility of your data. For example, you can categorize customers based on their purchase frequency or calculate their lifetime value.
You can use this to apply dynamic functions to your data models, creating new fields that aren't part of the original dataset. These columns can be used to add calculated values or to append tags that categorize data dynamically based on specific criteria. For instance, you might create a custom column that categorizes customers based on their purchase frequency.
There are three types of custom columns that you can create in Cargo:
- Custom columns: These columns are free-form columns that you can upsert data into from within a workflow. For example, you could create a custom column
- Computed columns: These columns are created using storage that calculate values based on other columns in the model. For example, you could create a computed column that classifies customers based on their job title and years of experience whether they are senior or not
- Metrics columns: These columns are used to store aggregated data, such as sums, averages, or counts, based on the values in other columns in other models. For example, you could create a metrics column inside the Salesforce data model that calculates the total revenue generated by each customer from the Stripe data model.
Guides
To help you get started, we have included practical guides that demonstrate how to use storage features effectively.
For example, learn how to set up data loaders, transform data using custom columns, and manage data queries and upserts. See here for instruction on connecting your Google BigQuery or Snowflake data warehouse to Cargo.
These guides provide step-by-step instructions to enhance your understanding and application of storage in real-world scenarios.