Configure mapping
The mapping of an integration's data source defines the ingested data and its destination. It allows you to specify:
- Which data you wish to ingest from the integrated tool.
- Which properties in the integration's blueprints will be filled with the ingested data.
How does mapping work?
Integration mapping is configured in the data sources page of your portal, under Exporters
.
Each integration has its own mapping, written in YAML
.
To understand how mapping works, let's take a look at an example. After you complete the onboarding and connect your Git provider to Port, you will see an exporter entry in your data sources page:
data:image/s3,"s3://crabby-images/35748/35748986a5556f6c738e21434d3c8e212f2e8289" alt=""
Clicking on this entry will open the mapping configuration. In the bottom left panel, you will see the YAML configuration of the mapping.
Note that Port provides default mapping, providing values to the properties defined in the relevant blueprint:
data:image/s3,"s3://crabby-images/4ffc2/4ffc25d6c8d59b24d03764abe48af30b51683a7f" alt=""
Configuration structure
This section will explain each part of the configuration YAML.
Some of the keys use JQ queries to filter the data ingested from the tool's API.
-
The
resources
key is the root of the YAML configuration:resources:
- kind: repository
...
-
The
kind
key is a specifier for the object you wish to map from the tool's API (in this example, a Github repository).
To see whichkinds
are available for mapping, refer to the integration's documentation. In this example, the available kinds are listed in the Github integration page.resources:
- kind: repository
selector:
...
-
The
selector
and thequery
keys let you filter exactly which objects of the specifiedkind
will be ingested into Port:resources:
- kind: repository
selector:
query: "true" # JQ boolean query. If evaluated to false - skip syncing the object.
port:Using a JQ query, you can define your desired conditions. For example, to ingest only repositories that have a name starting with
"service"
, use thequery
key like this:query: .name | startswith("service")
-
The
port.entity.mappings
key contains the section used to map the object fields to Port entities.
Here you can specify theblueprint
in Port to which the data should be mapped, and which API object will be ingested to each of its properties.resources:
- kind: repository
selector:
query: "true"
port:
entity:
mappings: # Mappings between one GitHub API object to a Port entity. Each value is a JQ query.
identifier: ".name"
title: ".name"
blueprint: '"service"'
properties:
description: ".description"
url: ".html_url"
defaultBranch: ".default_branch"To create multiple mappings of the same kind, you can add another item to the
resources
array:resources:
- kind: repository
selector:
query: "true"
port:
entity:
mappings: # Mappings between one GitHub API object to a Port entity. Each value is a JQ query.
identifier: ".name"
title: ".name"
blueprint: '"service"'
properties:
description: ".description"
url: ".html_url"
defaultBranch: ".default_branch"
- kind: repository # In this instance repository is mapped again with a different filter
selector:
query: '.name == "MyRepositoryName"'
port:
entity:
mappings: ...
Test your mapping - JQ playground
The mapping configuration window contains a JQ playground that allows you to test your JQ queries against example responses from the API of the integrated tool. This is useful for validating your queries and ensuring they return the expected results.
For integrations based on the Ocean framework, examples will be automatically generated for each resource kind
in your mapping, based on real data ingested from the tool. You can disable this behavior by setting the sendRawDataExamples
flag to false
in the integration's configuration.
To test your mapping against the example data, click on the Test mapping
button in the bottom-right panel.
Manually add test examples
For each resource kind
in your mapping (in the bottom-left panel), you can add an example in the Test examples
section.
Click on the Add kind
button to add an example:
data:image/s3,"s3://crabby-images/e12b4/e12b4ec55305084439178d925c2e993d9c82fcfe" alt=""
After adding your example, click on the Test mapping
button in the bottom-right panel to test your mapping against the example data.
Mapping relations
You can use the mapping YAML file to set the value of a relation between entities. This is very useful when you want to automatically assign an entity to the relation of another entity using a convention of your choice.
For example, say we have a service
blueprint and a PagerDuty Service
blueprint with a relation between them:
data:image/s3,"s3://crabby-images/4e33d/4e33d61eb63e88cc341cad58ace5e02110b49c89" alt=""
After ingesting all of our services and PagerDuty services, we want to connect each service
to its corresponding PagerDuty service
. To achieve this, we have two options:
-
Option 1 - manually assign a PagerDuty service to each service using the UI:
-
Go to the Services page of your software catalog.
-
Choose a service you want to assign a PagerDuty service to. Hover over it, click on the
...
button on the right, and selectEdit
. -
In the
PagerDuty service
field, select the relevant PagerDuty service from the dropdown list, then clickUpdate
:
-
-
Option 2 - use the integration's mapping YAML. In our example, we can add an entry to the mapping of the PagerDuty integration:
-
Go to your data sources page and click on the PagerDuty exporter:
-
Add the following entry to the mapping YAML:
- kind: services
selector:
query: "true"
port:
entity:
mappings:
identifier: .name
blueprint: '"service"'
properties: {}
relations:
pagerduty_service: .idNow, if a
service's
identifier is equal to aPagerDuty service's
name, that service will automatically have its on-call property filled with the relevant PagerDuty service.
This is just the convention we chose for this example, but you can use a different one if you'd like.
-
Create multiple entities from an array API object
In some cases, an application's API returns an array of objects that you want to map to multiple entities in Port.
To achieve this, Port provides you with the itemsToParse
key, its value should be a JQ query that returns an array.
In order to reference an array item attribute, use .item
in your JQ expression.
Here is an example mapping configuration of a Jira issue
, where we want to map each of the issue's comments
to a separate comment
entity:
- kind: issue
selector:
query: .item.name != 'test-item' and .issueType == 'Bug'
port:
itemsToParse: .fields.comments
entity:
mappings:
identifier: .item.id
blueprint: '"comment"'
properties:
text: .item.text
relations:
issue: .key
The object returned from Jira for which we would apply this mapping might look like this (note the comments
array):
Example Jira API response (click to expand)
{
"url": "https://example.com/issue/1",
"status": "Open",
"issueType": "Bug",
"comments": [
{
"id": "123",
"text": "This issue is not reproducing"
},
{
"id": "456",
"text": "Great issue!"
}
],
"assignee": "user1",
"reporter": "user2",
"creator": "user3",
"priority": "High",
"created": "2024-03-18T10:00:00Z",
"updated": "2024-03-18T12:30:00Z",
"key": "ISSUE-1"
}
Common use-cases
Splitting a kind
block
Sometimes the CreateRelatedMissingEntities
flag is passed as false
to prevent generation of additional entities for relations. This can lead to cases where entity ingestion will not happen because the target entity for a relation does not exist in your catalog.
To handle such cases, you can split a single kind
to multiple mappings like this:
createMissingRelatedEntities: false
- kind: services
selector:
query: "true"
port:
entity:
mappings:
identifier: .name
blueprint: '"service"'
properties:
#Properties mapping
- kind: services
selector:
query: "true"
port:
entity:
mappings:
identifier: .name
blueprint: '"service"'
properties: {}
relations:
pagerduty_service: .id
Looking at this mapping configuration we see the following:
- The first
kind
block is used to create the entity along with all of its properties. - The second
kind
block is used to update the same entity (notice the mapping for the identifier is the same in both configurations) with relations. If the target entity of the relation does not exist (i.e. you have no matching PagerDuty service), the update itself will fail, but the data ingestion will still complete successfully.
This case can also be expanded for handling multiple relation, for each relation that might not be established, you can split it into another kind mapping.