ADQL cheatsheet

-- random_index is a random permutation of 0..N-1, so this selects about one in every million objects
SELECT *
FROM gaiadr1.gaia_source
WHERE MOD(random_index, 1000000) = 0
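The same sampling trick can be tried locally. This is a minimal sketch, assuming a hypothetical 1000-row stand-in for gaia_source held in an in-memory SQLite database. SQLite writes MOD(a, b) as a % b, and the modulus is reduced to 100 to suit the small table.

```python
import random
import sqlite3

# Hypothetical stand-in for gaiadr1.gaia_source: 1000 rows whose
# random_index is a shuffled permutation of 0..999.
rng = random.Random(42)
indices = list(range(1000))
rng.shuffle(indices)

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE gaia_source (source_id INTEGER, random_index INTEGER)")
conn.executemany("INSERT INTO gaia_source VALUES (?, ?)", list(enumerate(indices)))

# SQLite spelling of MOD(random_index, 100) = 0: keep one row in every hundred.
sample = conn.execute(
    "SELECT source_id FROM gaia_source WHERE random_index % 100 = 0"
).fetchall()
print(len(sample))
```

Because random_index is a permutation, the sample size is exactly N divided by the modulus, and the selected rows are scattered over the table.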

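-- crossmatch condition: the point from the smaller catalog lies inside a circle around the larger catalog's position
-- (small, large and radius_deg are placeholders; the 'ICRS' argument is ADQL 2.0 style)
1 = CONTAINS(POINT('ICRS', small.ra, small.dec), CIRCLE('ICRS', large.ra, large.dec, radius_deg))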

Functions

SIN(x), COS(x), TAN(x): all in radians
ASIN(x), ACOS(x), ATAN(x)
ATAN2(y, x): arctan(y/x), quadrant taken from the signs of both arguments -> [-pi, +pi]
DEGREES(x): radians to degrees
RADIANS(x): degrees to radians
EXP(x)
LOG(x): natural log
LOG10(x)
POWER(x, y): x**y
SQRT(x)
ROUND(x, n): round to n decimal places. Positive n counts to the right of the decimal point, negative n to the left.
FLOOR(x)
CEILING(x)
TRUNCATE(x, n): truncate to n decimal places
ABS(x)
RAND(n): random value between 0.0 and 1.0; the optional n is a seed
MOD(x, y): x mod y
PI()

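Angles in geometry functions (POINT, CIRCLE, DISTANCE, ...) are in degrees; the trigonometric functions above work in radians.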
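The radians/degrees split and the ATAN2 argument order are easy to get wrong. This is a small sketch using Python's math module, which follows the same (y, x) convention for atan2; it does not run against an ADQL service.

```python
import math

# Point in the second quadrant: x = -1, y = 1.
x, y = -1.0, 1.0

angle_rad = math.atan2(y, x)          # note the (y, x) argument order
angle_deg = math.degrees(angle_rad)   # counterpart of DEGREES(x)

# A plain arctan(y/x) loses the quadrant information.
naive_deg = math.degrees(math.atan(y / x))

print(round(angle_deg, 6), round(naive_deg, 6))
```

The two-argument form gives 135 degrees for this point, while arctan(y/x) alone gives -45 degrees.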

Put column names with spaces in double quotes ("").
Put column values to compare in single quotes ('').
Arithmetic operators combine columns within a single row. To combine values across rows, use aggregate functions.
Comments:

-- single line
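/* multiple
   lines */
(Block comments are standard SQL; the ADQL grammar only defines -- comments, so some services may reject them.)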

LIKE: pattern matching
- %: wildcard e.g., WHERE column LIKE 'abc%'
- _: any individual character e.g., WHERE column LIKE 'ab_de'
- case sensitivity depends on the database backend
IN: compare to a set of values
- WHERE year IN (2009, 2010)
- WHERE flag IN ('big', 'small')
BETWEEN AND: range of values incl. bounds
- WHERE year_rank BETWEEN 5 AND 10 is exactly the same as WHERE year_rank >= 5 AND year_rank <= 10
IS NULL: missing or not
- = NULL does not work: any comparison with NULL evaluates to unknown, never true
NOT
- year_rank NOT BETWEEN 2 AND 3
- "group" NOT LIKE '%macklemore%'
- artist IS NOT NULL
ORDER BY
- ascending by default; use ORDER BY column DESC for descending
- multiple columns: ORDER BY year DESC, year_rank
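The filter operators above can be checked on a toy table. This is a minimal sketch, assuming a hypothetical songs table in an in-memory SQLite database; the rows are made up, and SQLite's LIKE happens to be case-insensitive for ASCII, which is one of the backend-dependent behaviours.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# "group" is a reserved word, so the column name needs double quotes.
conn.execute('CREATE TABLE songs (year_rank INTEGER, "group" TEXT, artist TEXT)')
conn.executemany(
    "INSERT INTO songs VALUES (?, ?, ?)",
    [
        (1, "Macklemore Band", "singer a"),
        (5, "band b", None),  # missing artist
        (10, "band c", "singer c"),
        (11, "band d", "singer d"),
    ],
)

def ranks(where):
    """Return the year_rank values that satisfy a WHERE clause."""
    sql = f"SELECT year_rank FROM songs WHERE {where} ORDER BY year_rank"
    return [row[0] for row in conn.execute(sql)]

between = ranks("year_rank BETWEEN 5 AND 10")   # bounds are included
eq_null = ranks("artist = NULL")                # never true, so no rows
is_null = ranks("artist IS NULL")
not_null = ranks("artist IS NOT NULL")
# In SQLite the lowercase pattern also matches 'Macklemore Band'.
not_like = ranks("\"group\" NOT LIKE '%macklemore%'")

print(between, eq_null, is_null, not_null, not_like)
```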

COUNT
- COUNT(*): count all rows
- COUNT(column): count the rows where column is not NULL
SUM: numerical columns only
MIN, MAX: also work on non-numerical columns (alphabetical order)
AVG: numerical columns only; NULL rows are ignored
GROUP BY: compute the aggregates separately for each distinct value of the listed column(s)

HAVING: filter on aggregates
- clause order matters: SELECT -> FROM -> WHERE -> GROUP BY -> HAVING -> ORDER BY
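The aggregate rules and the clause order can be seen together in one query. This is a sketch on a hypothetical obs table in SQLite, with invented values: WHERE filters rows before grouping, HAVING filters the groups afterwards.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE obs (band TEXT, mag REAL)")
conn.executemany(
    "INSERT INTO obs VALUES (?, ?)",
    [("g", 10.0), ("g", 12.0), ("g", None), ("r", 9.0), ("i", None)],
)

rows = conn.execute(
    """
    SELECT band, COUNT(*), COUNT(mag), AVG(mag)
    FROM obs
    WHERE band <> 'i'          -- row filter, applied before grouping
    GROUP BY band
    HAVING COUNT(mag) >= 2     -- group filter, applied after aggregation
    ORDER BY band
    """
).fetchall()
print(rows)
```

Band g has three rows but only two non-NULL magnitudes, so COUNT(*) and COUNT(mag) differ and AVG(mag) uses the two real values. Band r is dropped by HAVING.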

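JOIN = INNER JOIN: keeps only rows with a match in both tables; unmatched rows are dropped.
LEFT/RIGHT JOIN: keeps every row of the left/right table; columns from the other table are NULL where there is no match. A row with several matches appears once per match.
FULL JOIN = FULL OUTER JOIN: keeps unmatched rows from both tables.
Filter inside a JOIN: JOIN table2 ON table1.col1 = table2.col2 AND [condition]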
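The difference between the join types, and the effect of a condition placed in ON, shows up on a tiny example. This is a sketch assuming hypothetical stars and spectra tables in SQLite; only INNER and LEFT joins are used because older SQLite versions lack RIGHT and FULL joins.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE stars (id INTEGER, name TEXT);
    CREATE TABLE spectra (star_id INTEGER, snr REAL);
    INSERT INTO stars VALUES (1, 'a'), (2, 'b'), (3, 'c');
    INSERT INTO spectra VALUES (1, 50.0), (1, 5.0), (2, 8.0);
    """
)

def run(join_clause):
    """Join stars to spectra with the given clause, in a stable order."""
    sql = (
        "SELECT s.name, p.snr FROM stars s "
        f"{join_clause} ORDER BY s.name, p.snr"
    )
    return conn.execute(sql).fetchall()

inner = run("JOIN spectra p ON s.id = p.star_id")
left = run("LEFT JOIN spectra p ON s.id = p.star_id")
# The extra condition in ON restricts the matches, not the left-hand rows.
left_on = run("LEFT JOIN spectra p ON s.id = p.star_id AND p.snr > 10")

print(inner, left, left_on, sep="\n")
```

Star a appears twice because it has two spectra; star c survives only in the LEFT JOIN, with NULL for snr. With the filter in ON, star b is kept but loses its low-snr match.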

Casting (not in ADQL 2.0; CAST is an optional feature in ADQL 2.1, so support depends on the service)
- CAST(column AS integer); column::integer is PostgreSQL-only shorthand
Cleaning strings
- LEFT(column, 5): 5 characters from the left
- RIGHT(column, 5)
- LENGTH(column)
- TRIM([leading/trailing/both] 'characters' FROM column)

SELECT *
FROM (
  -- inner query
  SELECT ...
) sub
WHERE ...

When are subqueries useful?
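One possible answer is an aggregate of an aggregate, which a single query level cannot express. This is a sketch on a hypothetical obs table in SQLite: the inner query counts observations per band, and the outer query averages those counts.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE obs (band TEXT, mag REAL)")
conn.executemany(
    "INSERT INTO obs VALUES (?, ?)",
    [("g", 10.0), ("g", 12.0), ("g", 11.0), ("r", 9.0), ("i", 8.0), ("i", 8.5)],
)

(avg_per_band,) = conn.execute(
    """
    SELECT AVG(sub.n_obs)
    FROM (
      -- inner query: one row per band with its number of observations
      SELECT band, COUNT(*) AS n_obs
      FROM obs
      GROUP BY band
    ) sub
    """
).fetchone()
print(avg_per_band)
```

The bands have 3, 1 and 2 observations, so the outer AVG works on three rows rather than on the six original ones.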