SQL Query Guide
In D.Hub, you can use standard SQL for dataset exploration and SQL transform nodes in pipelines.
SQL Transform Node
Used to filter or aggregate data within a pipeline.
Structure
- Table Name: Connected input datasets are referenced by the default table name
input, or by a user-specified alias. - Result: The query execution result (result of the
SELECTstatement) is passed to the next node.
Example
-- Filter items where 'status' is 'active' from input data
SELECT
id,
name,
created_at
FROM
input
WHERE
status = 'active'
Dataset Exploration (SQL Scratch Pad)
You can run ad-hoc queries to examine data in the 'Data' tab of the dataset detail page.
- Restrictions: Only
SELECTqueries are allowed. - Table: You must use the current dataset's table name.
SELECT * FROM my_dataset_table LIMIT 100