Lineage Diagram

 

The data flow "interactive" Analysis Diagram displays the columns/fields involved in the given data flow trace, not all the columns. The user can then select the columns/fields to be displayed to better present the business use case of that data flow. Then the user can interact within that diagram by selecting columns/fields to display its lineage. Furthermore, the Analysis Diagrams allow you to display conditional labels such as PII or Confidential Sensitivity Level, not only providing more critical information to the user, but also better visualization of the propagation of that information (e.g. PII) through the  data flow lineage trace.

Please refer to the diagram visualization common features.

In addition to those general features, additionally there features specific to the classic diagram presentation.

In order to see data flow lineage, one must

Define a configuration that contains all of the models potentially in the data flow

Stitch the models together by resolving connection definitions and Build the configuration

Once the configuration is ready, then you are ready to report on lineage.

End-to-end data flow lineage across models is only available at the classifier (e.g., table) and feature (e.g., column) level.  If instead, one goes to the object page for a schema or model, , as this is not classifier or feature, the data flow tab shows the overview lineage within the scope of that model only.

Steps

1.  Trace data flow lineage.

2.  Click the Analysis Diagram tab on the left side.

3.  From here you may

Pick the Direction in the pull-down in the header of the diagram:

-       Impact (Destination) direction

-       Lineage (Sources) direction

-       Any type for both data impact and lineage.

Select which columns  to display in the diagram using the Columns pull-down in the header of the diagram.

-       A list of possible columns with a quick find is presented with checkboxes.

Pick the Depth in the pull-down in the upper right.

-       1 (Adjacent) step in the lineage.  Objects in the lineage that are the next items in a lineage trace. 

For impact, adjacent can often be the data store (like a warehouse) that is the target of an object being loaded by DI/ETL that is the focus of the lineage.  For course lineage, it can often mean the data source directly loaded from to produce the object that is the focus of the lineage.

-       2 thru 9 steps in the lineage

-       Any type for both data impact and lineage.

Click the Show actions for the selected object icon and

-       Select Show/Hide Columns to show columns in the selected object, or all objects if none is selected.

-       Select Expand/Collapse All to expand the display of the selected object down to the current display level (columns or tables) or collapse to the highest level.  Applies to all objects if none is selected.

Click Save an image to produce a downloadable file with a lineage image.

Click Filters and specify lineage filter options.

Click Display Options and specify lineage display options.

Example

Navigate to the object page for the Customer table in the Staging DW.dbo schema.

Go to the Data Flow tab and click the Diagram tab on the left side.

Pick ANY for the Direction in the pull-down in the diagram header.

 

A screenshot of a computer

Description automatically generated

 

The red colored pin indicating the source of the lineage and impact trace.

The diagram defaults to the classifier (table) level for performance reasons.

Click the Show actions for the selected object icon and select Show Columns.

 

A screenshot of a computer

Description automatically generated

 

Now columns are visible, but still not the column lines.  Again, this is for reasons of performance and simplicity of presentation.

 

Click on the Display Options icon and click Show Conditional Labels

 

A screenshot of a computer

Description automatically generated

 

A screenshot of a computer

Description automatically generated

 

Here, you may pick and choose conditional labels to show in the diagram and the image shows all of them selected for display.

 

A screenshot of a computer

Description automatically generated

 

Click on the Display Options icon and select Show Term Definitions

 

A screenshot of a computer

Description automatically generated

 

A screenshot of a computer

Description automatically generated

 

Terms, like US Social Security Number (documenting the ID field), are used to document columns and tables that are in this lineage trace and this is shown in the diagram.

Lineage Diagram Visualization Common Features

There are a number of common features and tools available when visualizing a lineage trace, data model, etc.

Lineage Diagram Show Overview

You may click this Show overview icon to show or hide an Overview panel of the model diagram. Click in the overview to quickly move to a portion of the full diagram.

 

A screenshot of a computer

Description automatically generated

Lineage Diagram Zoom In/Out and Fit to content

Click Zoom in () or Zoom out () icons to adjust the aspect ratio of the diagram. Also, you may click on the Fit to content ()icon to view the entire diagram at the best zoom that will fit.

Lineage Diagram Collapse / Expand

 

 

Click Expand / Collapse to expand or collapse the entire diagram (ensure that you do not have an object selected, otherwise the action will only apply to that object).

You may also click on the plus sign for an object to expand and the minus sign to collapse just that object.

Lineage Diagram Show/Hide Columns

A screenshot of a computer

Description automatically generated

 

You may show or hide all columns or show specific ones.

In order for this option to be available, you must have specified Yes for COLUMN FLOW in the Data Flow Options menu.

Lineage Diagram Show actions for the selected object

Show the actions available in the context menu for the selected object.

A screenshot of a computer

Description automatically generated

 

Open to go to the object page for the object

Open Lineage to change the point of origin (red pin in diagram) to present a new lineage display.

 

Lineage Diagram Trace in General

Select the Analysis Diagram tab on the left to obtain this presentation. You will see a graphical presentation of the lineage (data impact or data source).

Additional options include:

Overview

You may click this icon to show or hide an Overview panel of the lineage trace diagram. Click in the overview to quickly move to a portion of the full diagram.

 

A screenshot of a computer

Description automatically generated

 

Zoom In/Out and Fit to content

Click Zoom in or Zoom out icons to adjust the aspect ratio of the diagram. Also, you may click on the Fit to content icon to view the entire diagram at the best zoom that will fit.

 

 

Collapse / Expand

 

 

Click Expand / Collapse to expand or collapse the entire diagram (ensure that you do not have an object selected, otherwise the action will only apply to that object).

 

You may also click on the plus sign for an object to expand and the minus sign to collapse just that object.

Open the object page

 

You may right-click and select Open (),to navigate to the object page.

Print

 

 

You may download a PNG or SVG image of the diagram.

Quick find

 

 

In the upper right, there is a search text box that will provide a quick list of object names that contain the text you type. You may click on any of the results to select that object in the diagram and moving the focus there.

 Explore Further

Invoking a lineage trace from any reference to an object

You may invoke a lineage trace from any diagram or any list of results (e.g., from a Browse or Search), either via right-click context menu

In the CLASSIC DIAGRAM

 

A screenshot of a computer

Description automatically generated

In the DIAGRAM

A close up of a sign

Description automatically generated

Interpreting the graphical lineage

In general, the lineage tools within MetaKarta function identically whether one is analyzing data flow lineage, semantic lineage or both. However, the presentation is different, as follows:        

 

 

In addition, MetaKarta has four levels of presentation:

Configuration Model Connections Overview – which is a diagram representing the various Models contained within a configuration and how they are related (or stitched) to each other based upon connection definitions manually assigned to MetaKarta.

Model Connections Overview – which is a diagram representing the various Models contained within the directory of an external repository and how they are related (or stitched) to each other based upon connection definitions already provided in the external metadata repository.

Model Lineage Overview – which is a diagram representing an overview of the lineage within a given Model.

Lineage Trace analysis at the configuration or Model level – which is a fully detailed trace of semantic and/or data flow lineage for detailed analysis.

Properties Panel

Click to select a object and view its properties in the Properties Panel on the right. You may show and hide this panel as needed.

 

Lineage Diagram Display Options

The  Display Options are available.

 

 

Show Term Definitions – Show semantic lineage back to glossary terms for those which provide definitions of objects in the data flow lineage

Compact View – Toggles between small boxes with icons and wider boxes with more text and detail.

Show Conditional Labels – Display conditional labels in the diagram as selected in Edit Conditional Labels.

Edit Conditional Labels – Pick the conditional labels to display

MAXIMUM NODE WIDTH – set the size of the object boxes.

Lineage Diagram Options Show Term Definitions

Here you may see the terms with Defined by relationships in the diagram.

Lineage Diagram Options Compact View

 

Lineage Diagram Options Show Conditional Labels

Click on the Display Options icon and click Show Conditional Labels

 

 

A screenshot of a computer

Description automatically generated

 

Here, you may pick and choose conditional labels to show in the diagram and the image shows all of them selected for display.

 

A screenshot of a computer

Description automatically generated

Lineage Diagram Options Show Mixed Connections

Sometimes an ETL/DI process will read from one table and write to another table in the same database. In this case, the lineage will often show process arrows that loop back because the normal presentation is to group tables inside their respective schemas. However, by default, the lineage in the Data Flow tab will attempt to produce a continuous diagram from left to right through the lineage by breaking up these tables giving a more understandable lineage picture.

 

Diagram

Description automatically generated with medium confidence

 

 When you click this checkbox the lineage is returned to the mode where all tables are grouped into their respective schemas and thus loops are shown.

 

Graphical user interface

Description automatically generated

Steps

1.  Begin a lineage trace.

The default setting where this option is not checked is presented.

2.  Check  Display Options > Show Mixed Connections to show these objects

The lineage is returned to the mode where all tables are grouped into their respective schemas and thus loops are shown

3.  You may uncheck to return to the default.

4.  Then select Show Mixed Connections from the Display Options menu.

Example

A screenshot of a computer

Description automatically generated

 

Expand Staging DW.dbo.CustomerPayment and select PaymentID.

 

A screenshot of a computer

Description automatically generated

 

Many new objects, which are not directly connected by data flow links now appear. Selecting Data Flow Settings >Control Flow > Complete shows any objects which are connected via control flow to the origin object and any subsequent objects.

Lineage Diagram Options Maximum Node Width

In many cases, names of objects may be too long to fit into the objects in the diagram. You may specify several different node width maximums to make the diagram more readable. Click on Display Options.

 

A screenshot of a computer

Description automatically generated

 

Pick the note width.