Skip to main content

SQL Query Guide

In D.Hub, you can use standard SQL for dataset exploration and SQL transform nodes in pipelines.

SQL Transform Node

Used to filter or aggregate data within a pipeline.

Structure

  • Table Name: Connected input datasets are referenced by the default table name input, or by a user-specified alias.
  • Result: The query execution result (result of the SELECT statement) is passed to the next node.

Example

-- Filter items where 'status' is 'active' from input data
SELECT
id,
name,
created_at
FROM
input
WHERE
status = 'active'

Dataset Exploration (SQL Scratch Pad)

You can run ad-hoc queries to examine data in the 'Data' tab of the dataset detail page.

  • Restrictions: Only SELECT queries are allowed.
  • Table: You must use the current dataset's table name.
SELECT * FROM my_dataset_table LIMIT 100