Creating a Dataset
This guide explains how to create a new dataset using the template wizard.
Step 1: Select Type
- Select the Dataset card.
- Choose a subtype.
- Delta: General tabular data storage
- Kafka / DDS: Real-time stream data
- REST: External API integration
Step 2: Basic Info
[Screenshot] Basic info input screen of the dataset creation wizard
Define the dataset's metadata.
- Name: Dataset name (required)
- Alias: Display name
- Description: Dataset description
- Tags: Search tags
- Additional settings (by type):
- REST: Enter API Endpoint URL (required)
- Kafka/DDS: Enter Topic name (AI auto-generation available)
Step 3: Schema Definition
Define the dataset's structure (columns). Three modes are supported.
UI Mode
[Screenshot] Dataset schema definition screen (with columns added in UI mode)
Add and configure columns in an intuitive table interface.
- Add Column: Add a new column
- Name: Column name
- Type: Data type (Text, Integer, Decimal, Boolean, Timestamp, Date, etc.)
- Nullable: Whether null values are allowed
JSON Mode
Write or paste a schema directly in JSON format. Useful for defining complex nested structures.
CSV Mode
Upload a sample CSV file to automatically infer the schema.
- Drag and drop or select a file to upload, and the header and data types will be analyzed to auto-generate the schema.
Completion
Once all settings are complete, click the Create (or Submit Dataset) button to create the dataset.