Deploy Hybrid Data Pipeline
- Last Updated: March 25, 2025
- Hybrid Data Pipeline
- Version 5.0
- Documentation
Hybrid Data Pipeline is a self-hosted solution that can be deployed on a single node or on multiple nodes. Multi-node deployments require a load balancer. For single-node deployments, a load balancer is optional. Deploying Hybrid Data Pipeline on one or more nodes behind a load balancer is generally recommended. In either case, Hybrid Data Pipeline can be deployed on an in-house network or on a cloud service such as Amazon Web Services, Azure, or Google Cloud Services, giving you the ability to build your environment according to your organization's needs.
The following image shows the basic structure of an environment that uses Salesforce Connect and Hybrid Data Pipeline to consume external data.
How you deploy Hybrid Data Pipeline depends in part on the location of the external data you want to expose. In the simplest scenario, your external data already resides in the cloud, and you simply deploy Hybrid Data Pipeline in the cloud.
In contrast, if your external data resides on a database behind a firewall, you have two options.
- First, you could install Hybrid Data Pipeline behind the firewall, close to your data. In this case, you would need to use a VPN or other gateway to enable communication between Salesforce and Hybrid Data Pipeline.
- Second, you could host Hybrid Data Pipeline in the cloud and access data behind a network firewall by using the On-Premises Connector (OPC). The OPC is an agent that runs behind the firewall, close to your data. It uses outbound SSL to communicate with the Hybrid Data Pipeline server, so there is no need to open ports with a VPN or other gateway.
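The deployment choices described above can be summarized as a small decision helper. This is only an illustrative sketch in Python: the `Deployment` class and the `deployment_options` function are hypothetical names introduced here, not part of Hybrid Data Pipeline, and the rules encoded are limited to those stated in this topic.

```python
from dataclasses import dataclass


@dataclass
class Deployment:
    """One way to deploy the server (hypothetical helper type, not a product API)."""
    server_location: str        # "cloud" or "behind-firewall"
    needs_vpn_or_gateway: bool  # Salesforce must reach the server through a VPN/gateway
    needs_opc: bool             # On-Premises Connector runs next to the data
    load_balancer: str          # "required" or "optional"


def deployment_options(data_behind_firewall: bool, node_count: int) -> list:
    """Return the deployment options this topic describes for a given environment."""
    if node_count < 1:
        raise ValueError("node_count must be at least 1")

    # Multi-node deployments require a load balancer; for a single node it is optional.
    lb = "required" if node_count > 1 else "optional"

    if not data_behind_firewall:
        # Simplest scenario: the data is already in the cloud, so deploy in the cloud.
        return [Deployment("cloud", False, False, lb)]

    return [
        # Option 1: install behind the firewall and expose it through a VPN or gateway.
        Deployment("behind-firewall", True, False, lb),
        # Option 2: host in the cloud and reach the data through the OPC.
        Deployment("cloud", False, True, lb),
    ]
```

For example, `deployment_options(True, 2)` returns both firewall options, each marked with a required load balancer. A load balancer is generally recommended even where this sketch reports it as optional.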
Refer to the following documentation resources for more detailed deployment information.
- Product requirements
Product requirements for the Hybrid Data Pipeline server and On-Premises Connector.
- Deployment scenarios and Deployment configurations
Your deployment scenario determines how you deploy and configure Hybrid Data Pipeline.
- Installing and upgrading the Hybrid Data Pipeline server
The server must be deployed before installing additional components, such as the OPC.
Important: When using Hybrid Data Pipeline to access external data with Salesforce Connect, a TLS certificate from a well-known certificate authority in PEM file format must be provided during installation.
- Installing the Hybrid Data Pipeline On-Premises Connector
Install the OPC for secure connections from cloud applications to data that resides behind a firewall without having to set up a VPN or other gateway.
- Hybrid Data Pipeline on AWS Guide
Shows how to prepare an AWS environment for a Hybrid Data Pipeline single-node deployment without a load balancer, how to install the service on an EC2 instance, and how to create a data source.
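Because the installer expects the TLS certificate in PEM file format when Salesforce Connect is used (see the Important note above), it can be useful to check a certificate file before starting the installation. The following Python sketch is not part of the product; `looks_like_pem_certificate` is a hypothetical helper that checks only the PEM armor and Base64 payload using the standard `ssl` module. It does not parse the X.509 content or verify that the certificate was issued by a well-known certificate authority.

```python
import ssl

PEM_HEADER = "-----BEGIN CERTIFICATE-----"
PEM_FOOTER = "-----END CERTIFICATE-----"


def looks_like_pem_certificate(text: str) -> bool:
    """Return True if text contains a PEM certificate block with a decodable payload.

    Assumption: this is a structural pre-check only. It does not validate the
    certificate itself, its chain, or its issuing authority.
    """
    start = text.find(PEM_HEADER)
    if start == -1:
        return False
    end = text.find(PEM_FOOTER, start + len(PEM_HEADER))
    if end == -1:
        return False

    # Isolate the first certificate block, including its header and footer lines.
    block = text[start:end + len(PEM_FOOTER)]
    try:
        # Raises ValueError (or a subclass) if the armor or Base64 body is malformed.
        ssl.PEM_cert_to_DER_cert(block)
    except ValueError:
        return False
    return True
```

To use it, read the certificate file as text and pass the contents to the function, for example `looks_like_pem_certificate(open(path).read())`, where `path` is whatever location you chose for your certificate. A file in binary DER format, or one that fails to decode as text, is not in PEM format and would need to be converted first.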