Skip to main content
Version: v0.1.0

Creating a Dataset

This guide explains how to create a new dataset using the template wizard.

Step 1: Select Type

  1. Select the Dataset card.
  2. Choose a subtype.
    • Delta: General tabular data storage
    • Kafka / DDS: Real-time stream data
    • REST: External API integration

Step 2: Basic Info

[Screenshot] Basic info input screen of the dataset creation wizard

Define the dataset's metadata.

  • Name: Dataset name (required)
  • Alias: Display name
  • Description: Dataset description
  • Tags: Search tags
  • Additional settings (by type):
    • REST: Enter API Endpoint URL (required)
    • Kafka/DDS: Enter Topic name (AI auto-generation available)

Step 3: Schema Definition

Define the dataset's structure (columns). Three modes are supported.

UI Mode

[Screenshot] Dataset schema definition screen (with columns added in UI mode)

Add and configure columns in an intuitive table interface.

  • Add Column: Add a new column
  • Name: Column name
  • Type: Data type (Text, Integer, Decimal, Boolean, Timestamp, Date, etc.)
  • Nullable: Whether null values are allowed

JSON Mode

Write or paste a schema directly in JSON format. Useful for defining complex nested structures.

CSV Mode

Upload a sample CSV file to automatically infer the schema.

  • Drag and drop or select a file to upload, and the header and data types will be analyzed to auto-generate the schema.

Completion

Once all settings are complete, click the Create (or Submit Dataset) button to create the dataset.