SQL - PostgreSQL - Ranking Data¶
Source: sql-snippets/rank.md at main · count/sql-snippets (github.com)
Description¶
A common requirement is the need to rank a particular row in a group by some quantity - for example, ‘rank stores by total sales in each region’.
Fortunately, PostgreSQL natively has a rank
window function to do this.
The syntax looks like this:
SELECT
RANK() OVER(PARTITION BY <fields> ORDER BY <rank_by>) AS RANK
FROM <schema.tablename>
where
fields
- zero or more columns to split the results by - a rank will be calculated for each unique combination of values in these columnsrank_by
- a column to decide the rankingschema.table
- where to pull these columns from
Example¶
We want to find out, for our random data sample of fruits, which fruit is most popular.
WITH data AS (
SELECT CASE
WHEN RANDOM() <= 0.25 THEN 'apple'
WHEN RANDOM() <= 0.5 THEN 'banana'
WHEN RANDOM() <= 0.75 THEN 'pear'
ELSE 'orange'
END AS fruit,
RANDOM() + (series^RANDOM()) AS popularity
FROM generate_series(1, 10) AS series
)
SELECT
fruit,
popularity,
RANK() OVER(ORDER BY popularity DESC) AS popularity_rank
FROM data
Output:
fruit | popularity | popularity_rank |
---|---|---|
banana | 3.994866211350613 | 1 |
banana | 3.2802621307398208 | 2 |
banana | 2.626913466142052 | 3 |
pear | 2.59071184223806 | 4 |
banana | 2.137745293125355 | 5 |
apple | 2.0741654350518637 | 6 |
apple | 2.0401770856253933 | 7 |
pear | 1.771223675755631 | 8 |
banana | 1.743055659082957 | 9 |
pear | 1.7158950640518675 | 10 |
Appendix: Links¶
Backlinks:
list from [[SQL - PostgreSQL - Ranking Data]] AND -"Changelog"