Adding a New Dataset#

To add a new dataset into aeon internal dataset repository, please proceed with the following steps:

  1. From the root of your aeon local repository, create a <dataset-name> folder:

mkdir ./datasets/data/<dataset-name>
  1. In the above directory, add your dataset file <dataset-name>.<EXT>, where <EXT> is the file extension:

    • The list of supported file formats is available in the aeon/MANIFEST.in file (e.g., .csv, .txt).

    • If your file format <EXT> does not figure in the list, simply add it in the aeon/MANIFEST.in file:

"aeon/MANIFEST.in"
...
recursive-include aeon/datasets *.csv ... *.<EXT>
...
  1. In aeon/datasets/_single_problem_loaders.py, declare a load_<dataset-name>(...) function. Feel free to use any other declared functions as templates for either classification or regression datasets.

  2. In aeon/datasets/__init__.py, append "load_<dataset-name>" to the list __all__.

  3. In aeon/datasets/setup.py, append "<dataset-name>" to the tuple included_datasets.