theilsU

The theilsU function calculates the Theil's U uncertainty coefficient, a value that measures the association between two columns in a table. Its values range from 0.0 (no association) to 1.0 (perfect agreement).

Syntax

theilsU(column1, column2)

Arguments

  • column1 and column2 are the columns to be compared

Returned value

  • a value between 0 and 1

Return type is always Float64.

Example

The two columns compared below are only weakly associated, so theilsU returns a small positive value:

SELECT
    theilsU(a, b)
FROM
    (
        SELECT
            number % 10 AS a,
            number % 4 AS b
        FROM
            numbers(150)
    );

Result:

┌────────theilsU(a, b)─┐
│  0.30195720557678846 │
└──────────────────────┘
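
The value above can be reproduced from the textbook definition of the uncertainty coefficient, U(x | y) = (H(x) - H(x | y)) / H(x), where H is Shannon entropy. The following is a minimal Python sketch, not ClickHouse's implementation: the helper names `entropy` and `theils_u` are made up for illustration, and it assumes that theilsU(a, b) measures how much of the uncertainty in a is explained by b, which is consistent with the result shown.

```python
from collections import Counter
from math import log


def entropy(counts, total):
    """Shannon entropy (natural log) of a frequency table."""
    return -sum(c / total * log(c / total) for c in counts.values())


def theils_u(xs, ys):
    """U(x | y) = (H(x) - H(x | y)) / H(x), using H(x | y) = H(x, y) - H(y)."""
    xs, ys = list(xs), list(ys)
    n = len(xs)
    h_x = entropy(Counter(xs), n)
    if h_x == 0.0:
        # x is constant, so there is no uncertainty to explain.
        # Returning 0.0 here is a convention assumed for this sketch.
        return 0.0
    h_y = entropy(Counter(ys), n)
    h_xy = entropy(Counter(zip(xs, ys)), n)
    return (h_x + h_y - h_xy) / h_x


# Same data as the SQL example: number % 10 and number % 4 over numbers(150).
a = [i % 10 for i in range(150)]
b = [i % 4 for i in range(150)]

print(theils_u(a, b))  # about 0.302, matching the query result
print(theils_u(b, a))  # about 0.50: the coefficient is not symmetric
```

Here b reveals only whether a is even or odd, roughly ln 2 of the ln 10 nats of uncertainty in a, which is why the coefficient is close to 0.3.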

theilsU

Introduced in: v22.1

The theilsU function calculates the Theil's U uncertainty coefficient, a value that measures the association between two columns in a table. Its values range from 0.0 (no association) to 1.0 (perfect agreement).

Syntax

theilsU(column1, column2)

Arguments

  • column1 — First column to be compared. Any
  • column2 — Second column to be compared. Any

Returned value

Returns a value between 0 and 1. Float64

Examples

Usage example

SELECT theilsU(a, b)
FROM (
    SELECT
        number % 10 AS a,
        number % 4 AS b
    FROM
        numbers(150)
);

Result:

┌────────theilsU(a, b)─┐
│  0.30195720557678846 │
└──────────────────────┘