Prepare DLHUB dataset
DLHUB currently supports 3 dataset formats for training. To learn about these formats and get a template for each:
- Select the ? icon next to the Detected Data Type to show the Training Data File Help dialog.
- Select the desired file format from the drop-down list to learn the details about it.
- Click Generate Sample File/Folders to download the selected format's template.
Based on the dataset template, you can prepare your own dataset for training.
Format 1: Classified Image Folder
Organize your image dataset into classified folders, where each folder name defines the output (class) of the images inside it.
For example, if you want to do image classification for Avengers, you will have classified folders such as Spiderman, Superman, and Wonderwoman.
Each folder contains the images that belong to that character.
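If your images are not yet sorted, the folder layout described above can be built with a short script. The following is a minimal sketch, not part of DLHUB: the function name `build_classified_folders` and the list of (image path, label) pairs are hypothetical, and it only assumes that each class gets its own folder under one dataset root.

```python
import shutil
from pathlib import Path


def build_classified_folders(labeled_images, dataset_root):
    """Copy each image into dataset_root/<label>/, creating folders as needed.

    labeled_images: iterable of (image_path, label) pairs (hypothetical input).
    Returns the list of destination paths.
    """
    dataset_root = Path(dataset_root)
    copied = []
    for image_path, label in labeled_images:
        # One folder per class; the folder name is the class label.
        class_dir = dataset_root / label
        class_dir.mkdir(parents=True, exist_ok=True)
        destination = class_dir / Path(image_path).name
        shutil.copy2(image_path, destination)
        copied.append(destination)
    return copied
```

For example, calling it with `[("raw/img1.jpg", "Spiderman"), ("raw/img2.jpg", "Superman")]` and a root of `dataset` would produce `dataset/Spiderman/img1.jpg` and `dataset/Superman/img2.jpg`. Images with the same file name in the same class would overwrite each other, so rename them first if that can happen.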
Format 2: Feature and Category (csv or txt)
This is the standard format where you list your data as columns, including:
- Column of labels (output)
- Column of features (input)
Here is a simple example that has 4 outputs and 8 inputs. Each row defines the classified output with its corresponding inputs.
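A file of this kind can be written with Python's standard `csv` module. This is a sketch under stated assumptions: the label is placed in the first column, the delimiter is a comma, and there is no header row. None of these is confirmed here, so compare the result with the template generated by DLHUB and adjust accordingly. The function name `write_feature_category_file` is hypothetical.

```python
import csv


def write_feature_category_file(samples, file_path, delimiter=","):
    """Write one row per sample: label first, then its feature values.

    samples: iterable of (label, features) pairs.
    Column order and delimiter are assumptions; match them to the
    template generated by DLHUB before training.
    Returns the number of feature columns written.
    """
    samples = list(samples)
    if not samples:
        raise ValueError("no samples to write")
    feature_count = len(samples[0][1])
    # Validate before writing so a bad row never leaves a partial file.
    for _, features in samples:
        if len(features) != feature_count:
            raise ValueError("every row needs the same number of features")
    with open(file_path, "w", newline="") as handle:
        writer = csv.writer(handle, delimiter=delimiter)
        for label, features in samples:
            writer.writerow([label, *features])
    return feature_count
```

Each call to `writer.writerow` produces one line of the file, so a sample with label `2` and eight feature values becomes a single row with nine columns.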
Format 3: Image Map File (csv or txt)
This file lists image paths and their classified outputs, separated by a Tab character.
The first column is the image path, and the second column is the classified output.
Make sure that every path listed in the file points to an actual image file.
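If the images are already sorted into classified folders, a map file in this layout can be generated from them. The sketch below is not part of DLHUB. It assumes that the folder name is used as the classified output, that paths are written as absolute paths, and that only the listed file extensions count as images. The names `write_image_map`, `find_missing_images`, and `IMAGE_EXTENSIONS` are hypothetical, and the exact label form DLHUB expects should be checked against the generated template.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to match your dataset.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp"}


def write_image_map(dataset_root, map_path):
    """Write '<image path><TAB><label>' lines, one per image found.

    Each subfolder of dataset_root is treated as one class.
    Returns the number of lines written.
    """
    dataset_root = Path(dataset_root)
    lines = []
    for class_dir in sorted(p for p in dataset_root.iterdir() if p.is_dir()):
        for image in sorted(class_dir.iterdir()):
            if image.is_file() and image.suffix.lower() in IMAGE_EXTENSIONS:
                lines.append(f"{image.resolve()}\t{class_dir.name}")
    Path(map_path).write_text(
        "".join(line + "\n" for line in lines), encoding="utf-8"
    )
    return len(lines)


def find_missing_images(map_path):
    """Return the paths in the first column that are not existing files."""
    missing = []
    for line in Path(map_path).read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue  # ignore blank lines
        image_path = line.split("\t")[0]
        if not Path(image_path).is_file():
            missing.append(image_path)
    return missing
```

Running `find_missing_images` on a map file before training is a quick way to catch paths that were moved or mistyped; an empty list means every listed image was found.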