Skip to content Learn about the access keys available for MAST Registry

Concept help - Data Set

A Data Set describes a record of data, including any location or time boundaries for the data, that has been captured and is available for use under a specific licence. A Data Set may be included in a Data Catalog, and can reference multiple Distributions that record different parts or formats of the data that are available to download.

A a dataset in DCAT is defined as a "collection of data, published or curated by a single agent, and available for access or download in one or more formats". A dataset does not have to be available as a downloadable file. For example, a dataset that is available via an API can be defined as an instance of dcat:Dataset and the API can be defined as an instance of dcat:Distribution. DCAT itself does not define properties specific to APIs description. These are considered out of the scope of this version of the vocabulary. Nevertheless, this can be defined as a profile of the DCAT vocabulary.

Tips for creating Data Sets

************** creation tip for DS

Registry specific help

______extra help for DS

Fields available on this metadata type

Field ISO definition and Registry Help (where available)
Name The primary name used for human identification purposes.
Definition Representation of a concept by a descriptive statement which serves to differentiate it from related concepts. (3.2.39)
Is Federated
Is Not Federable
Version Unique version identifier of this metadata item.
References Significant documents that contributed to the development of the metadata item which were not the direct source for the metadata content.
Origin The source (e.g. document, project, discipline or model) for the item (8.1.2.2.3.5)
Comments Descriptive comments about the metadata item (8.1.2.2.3.4)
Deleted The date after which the item has been soft deleted and is no longer visible in the registry
License Information about the license document under which the dataset is made available.
Rights Information about rights held in and over the dataset.
Release Date Date of formal publication of the dataset.
Modification Date Most recent date on which the dataset was changed, updated or modified.
Frequency The frequency at which dataset is published.
Spatial Coverage Spatial or geographic coverage of the dataset.
Temporal Coverage The temporal or time period that the dataset covers.
Catalog An entity responsible for making the dataset available.
Landing Page A Web page that can be navigated to in a Web browser to gain access to the dataset, its distributions and/or additional information

Corresponds to the ONDC field "Access URL". The file path and/or URL that gives access to a distribution of the resource.

Contact Point Relevant contact information for the Dataset.
Conforming Specification An established standard to which the described resource conforms.
Item Base

Custom Fields

Field Short definition Long definition
Legislation All legal mandates under which the data asset was collected, created, received, used or disclosed.
Services Australia Keywords Provides additional high-level information regarding asset content in a manner that facilitates discovery, linkage and descriptive analysis of data asset records. Keywords should be meaningful and relevant to the data asset. For a list of available keywords and definitions, refer to the Data Asset Register Key Word Tags Definitions document. Provide one or more keywords as a comma-separated list. Contact metadata.management if needing to add additional keywords to the controlled vocabulary. Obligation: Mandatory
Test Webinar
Period of time covered(begin)
Period of time covered(end)
Language
Contact Person Email
Test System
External Custodian
Who are in key data roles?
Original ticket/agreement System (Zendesk) A link to the original ticket/agreement system (currently Zendesk) to see the origins of the data set/asset.
Regulatory Requirements Met Required regulatory requirements are - AS 4590, 4819 & 19115, AGRkMS, ONDC's Metadata Attributes Guide, PSPF and NSW Gov Data Policy - Metadata Management and AGLS Metadata Standard (AS 5044).

Write Yes, if all the requirements are met for this data asset; if not, write No.

Sensitivity

This field will allow assets to be marked as sensitive, and only the registry administrators will be able to see that it’s a sensitive asset.

 

https://metadata.nsw.gov.au/home/

Description The description of the data asset.
Keywords Standardised terms that describe the data asset subject matter.

Purpose: Describes the topic(s) covered by the data asset, using language consistent across government data inventories. It answers the question “what is this data asset about?” and supports data discovery.

Obligation: Mandatory

Additional comments: Terms are selected from the Australian Government Interactive Functions Thesaurus (AGIFT) and internal agency business terminology. When selecting keywords, consider what search terms your users may choose when searching for the data asset, and provide as much granularity as practicable. Contact metadata.management to add additional keywords to the list. A full list of AGIFT terms are published on the NAA website.

Controlled values: Select one or more keywords from the provided list - select the Plus (+) button to browse and add keywords. At least one tier 1 AGIFT term is required (tier 1 terms are formatted in uppercase text).

Example(s):

  • COMMUNITY SERVICES
  • Income Support Schemes

ONDC alignment: Keyword (core attribute)

Privacy settings test from Platypus Help!
Security Classification
DS date first text then date type
Business Definitions
Data custodian

Official Definition

A representation of a dataset in a catalog. Data Catalog Vocabulary (DCAT): 5.3 Class: Dataset