Data Profile Viewer F1 Help

Use the Data Profile Viewer to view the output of the Data Profiling task.

For more information about how to use the Data Profile Viewer, see Viewing Profile Output in the Data Profile Viewer. For more information about how to use the Data Profiling task, which creates the profile output that you analyze in the Data Profile Viewer, see Setting Up the Data Profiling Task.

Static Options

  • Open
    Click to browse for the saved file that contains the output of the Data Profiling task.

  • Profiles pane
    Expand the tree in the Profiles pane to see the profiles that are included in the output. Select a profile to view the results for that profile.

  • Message pane
    Displays status messages.

  • Drilldown pane
    Displays the rows of data that match a value in the output, if the data source that is used by the Data Profiling task is available.

    For example, if you are viewing the output of a Column Value Distribution profile for a US State column, the Detailed Value Distribution pane might contain a row for "WA". Double-click that row to see, in the Drilldown pane, the rows of data where the value of the State column is "WA".

Dynamic Options

Profile Type = Column Length Distribution Profile

Column Length Distribution Profile - <column> pane

  • Minimum Length
    Displays the minimum length of values in this column.

  • Maximum Length
    Displays the maximum length of values in this column.

  • Ignore Leading Spaces
    Displays whether this profile was computed with an IgnoreLeadingSpaces value of True or False. This property was set on the Profile Requests page of the Data Profiling Task Editor.

  • Ignore Trailing Spaces
    Displays whether this profile was computed with an IgnoreTrailingSpaces value of True or False. This property was set on the Profile Requests page of the Data Profiling Task Editor.

  • Row Count
    Displays the number of rows in the table or view.

Detailed Length Distribution pane

  • Length
    Displays the distinct value lengths found in the profiled column.

  • Count
    Displays the number of rows in which the value of the profiled column has the length shown in the Length column.

  • Percentage
    Displays the percentage of rows in which the value of the profiled column has the length shown in the Length column.
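
The relationship between the fields in these two panes can be sketched in a few lines of Python. This is only an illustration, not the Data Profiling task's implementation: the function name length_distribution is hypothetical, the column is assumed to hold non-null strings, and the two flags are assumed to strip only space characters before the length is measured.

```python
from collections import Counter

def length_distribution(values, ignore_leading_spaces=False,
                        ignore_trailing_spaces=False):
    """Return (minimum_length, maximum_length, rows) for a list of strings.

    rows is a list of (length, count, percentage) tuples sorted by length.
    Assumes values is non-empty and contains no nulls.
    """
    lengths = []
    for value in values:
        if ignore_leading_spaces:
            value = value.lstrip(" ")
        if ignore_trailing_spaces:
            value = value.rstrip(" ")
        lengths.append(len(value))

    counts = Counter(lengths)
    row_count = len(values)
    rows = [(length, count, 100.0 * count / row_count)
            for length, count in sorted(counts.items())]
    return min(lengths), max(lengths), rows
```

With the values "WA", "WA ", " OR", and "CAL", setting ignore_trailing_spaces=True makes "WA " count as length 2, so the sketch reports two rows of length 2 and two of length 3.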

Profile Type = Column Null Ratio Profile

Column Null Ratio Profile - <column> pane

  • Null Count
    Displays the number of rows in which the profiled column has a null value.

  • Null Percentage
    Displays the percentage of rows in which the profiled column has a null value.

  • Row Count
    Displays the number of rows in the table or view.
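
As a rough illustration of how the three fields above relate, the following Python sketch derives them from a list of column values. The function name null_ratio is hypothetical, and None stands in for a database null; this is not the task's own code.

```python
def null_ratio(values):
    """Return (null_count, null_percentage, row_count) for one column."""
    row_count = len(values)
    null_count = sum(1 for value in values if value is None)
    # Guard against an empty table to avoid dividing by zero.
    null_percentage = 100.0 * null_count / row_count if row_count else 0.0
    return null_count, null_percentage, row_count
```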

Profile Type = Column Pattern Profile

Column Pattern Profile - <column> pane

  • Row Count
    Displays the number of rows in the table or view.

Pattern Distribution pane

  • Pattern
    Displays the patterns computed for the profiled column.

  • Percentage
    Displays the percentage of rows whose values match the pattern displayed in the Pattern column.

Profile Type = Column Statistics Profile

Column Statistics Profile - <column> pane

  • Minimum
    Displays the minimum value found in the profiled column.

  • Maximum
    Displays the maximum value found in the profiled column.

  • Mean
    Displays the mean of the values found in the profiled column.

  • Standard Deviation
    Displays the standard deviation of the values found in the profiled column.

Profile Type = Column Value Distribution Profile

Column Value Distribution Profile - <column> pane

  • Number of Distinct Values
    Displays the count of distinct values found in the profiled column.

  • Row Count
    Displays the number of rows in the table or view.

Detailed Value Distribution pane

  • Value
    Displays the distinct values found in the profiled column.

  • Count
    Displays the number of rows in which the profiled column has the value shown in the Value column.

  • Percentage
    Displays the percentage of rows in which the profiled column has the value shown in the Value column.

Profile Type = Candidate Key Profile

Candidate Key Profile - <table> pane

  • Key Columns
    Displays the columns that were selected for profiling as a candidate key.

  • Key Strength
    Displays the strength (as a percentage) of the candidate key column or combination of columns. A key strength of less than 100% indicates that duplicate values exist.

Key Violations pane

  • <column1>, <column2>, etc.
    Displays the duplicate values that were found in the profiled key column or combination of columns.

  • Count
    Displays the number of rows that contain the duplicate key value shown in the preceding column or columns.
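
The following Python sketch shows one way to reason about Key Strength and the Key Violations pane. It rests on an assumption that this topic does not confirm: strength is taken to be the number of distinct key values divided by the row count. The function name candidate_key_profile and the dictionary-per-row layout are also hypothetical.

```python
from collections import Counter

def candidate_key_profile(rows, key_columns):
    """Return (key_strength_percent, violations) for a candidate key.

    rows is a non-empty list of dicts; key_columns is a list of column names.
    violations maps each duplicated key value (a tuple) to its row count.
    """
    keys = [tuple(row[column] for column in key_columns) for row in rows]
    counts = Counter(keys)
    # Assumption: strength = distinct key values / total rows.
    strength = 100.0 * len(counts) / len(keys)
    violations = {key: count for key, count in counts.items() if count > 1}
    return strength, violations
```

Under this assumption, a single-column key that takes the values 1, 2, 2, 3 has a strength of 75%, and the value 2 is listed as a violation with a count of 2.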

Profile Type = Functional Dependency Profile

Functional Dependency Profile pane

  • Determinant Columns
    Displays the column or columns selected as the determinant. For example, if the same United States Zip Code should always have the same state, the Zip Code is the determinant column.

  • Dependent Columns
    Displays the column or columns selected as the dependent column. In the example where the same United States Zip Code should always have the same state, the state is the dependent column.

  • Functional Dependency Strength
    Displays the strength (as a percentage) of the functional dependency between columns. A functional dependency strength of less than 100% indicates that there are cases where the determinant value does not determine the dependent value. In the example where the same United States Zip Code should always have the same state, this probably indicates that some state values are not valid.

Functional Dependency Violations pane

Note

A high percentage of erroneous values in the data could lead to unexpected results from a Functional Dependency profile. For example, suppose that 90% of the rows with a Postal Code value of "98052" have a State value of "WI". The profile then reports the rows that contain the correct State value of "WA" as violations.

  • <determinant column name>
    Displays the value of the determinant column or combination of columns in this instance of a functional dependency violation.

  • <dependent column name>
    Displays the value of the dependent column in this instance of a functional dependency violation.

  • Support Count
    Displays the number of rows in which the determinant column value determines the dependent column.

  • Violation Count
    Displays the number of rows in which the determinant column value does not determine the dependent column. (These are the rows where the dependent value is the value shown in the <dependent column name> column.)

  • Support Percentage
    Displays the percentage of rows in which the determinant column determines the dependent column.
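
The behavior described in the Note can be reproduced with a small Python sketch. It assumes that, for each determinant value, the most frequent dependent value is treated as the supported one and every other row counts as a violation; this topic does not state the task's exact rule. The function name and the keys of the returned dictionaries are hypothetical.

```python
from collections import Counter, defaultdict

def functional_dependency_profile(rows, determinant, dependent):
    """Return (strength_percent, violations) for determinant -> dependent.

    rows is a non-empty list of dicts. violations has one entry per
    determinant value that maps to more than one dependent value.
    """
    groups = defaultdict(Counter)
    for row in rows:
        groups[row[determinant]][row[dependent]] += 1

    supported_rows = 0
    violations = []
    for det_value, dep_counts in groups.items():
        # Assumption: the most frequent dependent value is the "correct" one.
        majority_value, support_count = dep_counts.most_common(1)[0]
        group_total = sum(dep_counts.values())
        supported_rows += support_count
        if group_total > support_count:
            violations.append({
                "determinant": det_value,
                "majority_dependent": majority_value,
                "support_count": support_count,
                "violation_count": group_total - support_count,
                "support_percentage": 100.0 * support_count / group_total,
            })

    strength = 100.0 * supported_rows / len(rows)
    return strength, violations
```

If nine rows pair "98052" with "WI" and one row pairs it with "WA", the sketch treats "WI" as supported and counts the single "WA" row as the violation, which mirrors the situation in the Note.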

Profile Type = Value Inclusion Profile

Value Inclusion Profile pane

  • Subset Side Columns
    Displays the column or combination of columns that were profiled to determine whether they are in the superset columns.

  • Superset Side Columns
    Displays the column or combination of columns that were profiled to determine whether they include the values in the subset columns.

  • Inclusion Strength
    Displays the strength (as a percentage) of the overlap between columns. An inclusion strength of less than 100% indicates that there are cases where the subset value is not found among the superset values.

Inclusion Violations pane

  • <column1>, <column2>, etc.
    Displays the values in the subset column or columns that were not found in the superset column or columns.

  • Count
    Displays the number of rows in which the specified column has the value shown in the first column.
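
A minimal Python sketch of the idea behind Inclusion Strength and the Inclusion Violations pane follows. It assumes that strength is the share of distinct subset values that also appear in the superset column; the task's exact formula is not given in this topic. The function name value_inclusion_profile is hypothetical, and each column is modeled as a plain list of single values.

```python
from collections import Counter

def value_inclusion_profile(subset_values, superset_values):
    """Return (inclusion_strength_percent, violations).

    violations maps each subset value missing from the superset
    to the number of subset rows that contain it.
    Assumes subset_values is non-empty.
    """
    superset = set(superset_values)
    counts = Counter(subset_values)
    violations = {value: count for value, count in counts.items()
                  if value not in superset}
    # Assumption: strength = distinct subset values found / distinct subset values.
    included = len(counts) - len(violations)
    strength = 100.0 * included / len(counts)
    return strength, violations
```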