ColumnLength - AWS Glue
Services or capabilities described in AWS documentation might vary by Region. To see the differences applicable to the AWS European Sovereign Cloud Region, see the AWS European Sovereign Cloud User Guide.

ColumnLength

Checks whether the length of each row in a column conforms to a given expression.

Syntax

ColumnLength <COL_NAME><EXPRESSION>
  • COL_NAME – The name of the column that you want to evaluate the data quality rule against.

    Supported column types: String

  • EXPRESSION – An expression to run against the rule type response in order to produce a Boolean value. For more information, see Expressions.

Example: Column row length

The following example rule checks whether the value in each row in the column named Postal_Code is 5 characters long.

ColumnLength "Postal_Code" = 5 ColumnLength "weightinkgs" = 2 where "weightinkgs > 10"

Null behavior

The ColumnLength rule treats NULLs as 0 length strings. For a NULL row:

ColumnLength "Postal_Code" > 4 # this will fail
ColumnLength "Postal_Code" < 6 # this will succeed

The following example compound rule provides a way to explicitly fail NULL values:

(ColumnLength "Postal_Code" > 4) AND (ColumnValues "Postal_Code" != NULL)