Skip to content

Codabench Datasets  #1898

Description

@ihsaan-ullah

Datasets updates

RELEASE 1 (Done, Solved by #1955):

  • new page for public datasets
  • filters like competitions page for searching
  • new properties in datasets page (name, description, uploader, uploaded when, size, downloads, verified

RELEASE 2 (Done, Solved by #2050)

  • add croissant metadata for public datasets
  • fix license not added to db on dataset creation

RELEASE 3 (To be discussed)

  • Field for citation (also in json-ld)
  • Field for version (also in json-ld)
  • Documentation (Markdown) about data format and reading, source of data, references, etc.
  • Type (tabular, images etc)
  • Size (number of examples)
  • Tags (multiple)
  • Mechanism to update verified flag (verified by admins for quality and safety)
  • How to handle revisions or versioning of datasets?
    It should not be possible to update verified datasets, or at least updating should break the verified flag.

Suggestions (from Adrien)

  • Having an unique DOI for each dataset
  • Progress bar during upload

Metadata

Metadata

Assignees

No one assigned

    Labels

    EnhancementFeature suggestions and improvementsPost-itInternal ideas

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions