Skip to content

SQL - PostgreSQL -Normalize Whitespace

Source: https://wiki.postgresql.org/wiki/Normalize_whitespace

NOTE: This function uses generic SQL.

CREATE OR REPLACE FUNCTION normalize_space(TEXT)
RETURNS TEXT
IMMUTABLE
LANGUAGE SQL
AS $$
SELECT regexp_replace(
    trim($1),
    E'\\s+',
    ' ',
    'g'
);
$$;
CREATE OR REPLACE FUNCTION normalize_space(text) 
RETURNS text LANGUAGE plperl AS $$
    $_ = $_[0]; 
    s/(^\s+|\s+$)//g;
    s/\s+/ /og;
    return $_;
$$;

So, if you want to update/insert to a table and normalize the text previously this action, you might create a trigger before insert for each row, like this:

CREATE TABLE pepito (wordy text, moody text, bar text);

DROP IF EXISTS trigger n_space on pepito;

CREATE TRIGGER n_space BEFORE INSERT OR UPDATE ON pepito FOR EACH ROW EXECUTE PROCEDURE n_space();

This function trigger, looks up fields of text type and touch NEW values with the replace expression.

NOTE: This function uses the PLPERL language based off of PERL.

CREATE OR REPLACE FUNCTION n_space() 
RETURNS trigger
VOLATILE
LANGUAGE plperl
AS $$
    my $rs = spi_exec_query(<<"    EOT");
    SELECT
        a.attname as name,
        format_type(a.atttypid,pg_attribute.atttypmod) 
    FROM
        pg_class AS c
    JOIN
        pg_attribute AS a
        ON (
            a.attrelid = c.oid
        AND c.oid = '$_TD->{table_name}'::regclass 
        AND a.attstorage <> 'p'
        AND format_type(a.atttypid,pg_attribute.atttypmod) = 'text'
    EOT

    my $rs_rows = $rs->{processed}-1;
    foreach my $rn (0 .. $rs_rows){
        my $row = $rs->{rows}[$rn]->{name};
        $_TD->{new}{$row} =~ s/(^\s+|\s+$)//g;
        $_TD->{new}{$row} =~ s/\s+/ /g;     
    }   
    return "MODIFY";
$$;
TRUNCATE TABLE pepito;
INSERT INTO pepito
VALUES
    ('asd asd AS     asd asf      ',' asd  4t45gr g er    ergt',' asd sa'),
    ('asd asd AS     asdbhfg',' asd  4t45gr g er    ergt',' asd sa'),
    ('    asd asd AS     asd asf      ',' asd  4t45gr g er    ergt',' asd sa');
SELECT * FROM pepito;

Backlinks:

list from [[SQL - Normalize Whitespace]] AND -"Changelog"