Full Program »
Improving the Discovery of Restricted Data: Identifying Metadata Commonalities Across Restricted Data Sources
Background
The challenge of finding and accessing restricted data for research purposes is a known issue; specifically, researchers encounter barriers when identifying prospective data sources, when locating and understanding available data within those sources, and when discerning whether they are eligible to access it. A prior study conducted by the authors of this presentation identified that many restricted data sources do not make use of metadata to ensure their data are findable and accessible.
Methods
To assess the readiness of restricted data sources to utilize a metadata standard, this study identified common elements of both dataset descriptions and access requirements/procedures across 48 restricted health data sources. These elements were subsequently mapped to current metadata standards (e.g. DataCite) to determine how closely they matched the elements in these existing standards.
Results
Our findings indicate that many restricted data sources already provide dataset information that aligns closely with existing metadata standards, that data sources would benefit from adopting metadata standards to improve the discovery of their data, and that generally, it would be possible for these data sources to adopt an existing common metadata standard to describe their data. Access information provided by these data sources, however, is not adequately supported by existing standards. To ensure that the access requirements/procedures needed to acquire restricted datasets can be discoverable and transparent, metadata standards bodies will need to revise their schemas to include more descriptive access information. This revision would also provide researchers – who collect restricted data and must comply with funder and publisher data sharing policies – with standard guidelines for describing their data access request processes in more detail.
Conclusion
This presentation will discuss our findings in detail, articulate key challenges in assigning metadata to restricted data, and suggest recommendations for improving the discovery of and access to restricted data.