Revenue
—
net of discount
Orders
—
distinct order keys
Line items
—
fact rows scanned
Avg discount
—
across all line items
Revenue by ship mode
SUM(extendedprice × (1 − discount)) GROUP BY ship mode
Revenue by market segment
lineitem ⋈ orders ⋈ customer, grouped by segment
Monthly revenue trend
lineitem ⋈ orders, GROUP BY date_trunc('month', CAST(o_orderdate AS TIMESTAMP))
🔍 Live SQL console
Run any Spark SQL against the 8 in-memory TPC-H tables. Try a join, a CTE, a window.
Run a query to see results.