Data Curator, Metadata/Ontology IT Specialist

Posted to IASSIST on: 2012-02-23

Employer: Duke University


[[This position requires an analytical, creative person to establish data management processes for Duke’s new faculty activities system, Scholars@Duke, built on the VIVO semantic web application.  Reporting to the Manager of Faculty Data and Systems, this position will oversee all aspects of the system’s data and vocabularies, from designing, developing and maintain the local ontology, to working with data sources and coordinating data management efforts. 

The Data Curator will work with the cross-functional teams including the project manager, web developers, digital curators, information architects, business analysts, domain experts and faculty, as well as ontology experts in the VIVO Ontology Project. While this position is responsible for the local usability of metadata, ensuring that the data is accessible and easy to share, this person must immediately contribute to the staged implementation of Scholars@Duke.]{.TEXT}]{#top}

Job Responsibilities

  • Become an expert in the standard VIVO ontologies and the local ontology used in the first Scholars@Duke implementation.
  • Redesign the local ontology as needed for the initial release, and develop extensions with other VIVO ontology specialists.
  • Work with institutional repository team to oversee the interfaces for publication data.
  • Continue to oversee data and ontology developments necessary with rollouts to additional Duke schools.
  • Assess the requirements for the new data sets to add to Scholars@Duke, identifying and assessing potential data sources and working with data owners to add data incrementally to Scholars@Duke.
  • Work with domain experts to understand the structure and relationships of data sets, and design a representative ontology.
  • Confirm usability with existing data users.
  • Advise the outreach coordinator in establishing data management/maintenance processes.
  • Collaborate with VIVO Ontology Project on all extensions and ontology changes.
  • Participate in weekly VIVO meetings, attend national conference, and contribute to the VIVO ontology.
  • Combine a librarian’s understanding of metadata and commitment to precise knowledge mapping with a programmer’s ability to implement new technologies, resulting in high levels of data quality, integrity, breadth, timeliness and privacy.

Work Performed

  • Ontology analysis and development:
  • Evaluate use of VIVO ontology for existing data set.
  • Re-engineer ontology based on feedback from users and other VIVO implementations.
  • Create extensions as needed.
  • Plan ontology strategy for new data loads.
  • Create and maintain documentation of ontology.
  • Analysis of publications data & interfaces:
  • Work with library team to load publication data and link to open access repository.
  • Ensure publication data loads with correct ontology classifications.
  • Procurement of new institutional data sources:
  • Discuss requests for new data sources.
  • Identify data owners and propose inclusion in Scholars@Duke.
  • Analyze fitness of data source, oversee conversion and data cleanup.
  • Act as liasion between data owners and developers.
  • Provide usability support to consumers of data:
  • Handle questions and requests for help using Scholars@Duke data.
  • Oversee necessary data cleanup efforts.
  • Work with data consumers to debug issues with downstream systems.
  • Act as liaison between data consumers and developers.
  • Contribute to national VIVO ontology project:
  • Participate in and/or lead biweekly ontology calls.
  • Respond to questions/problems posted on ontology list serve.
  • Participate in yearly ontology workshops and national VIVO conference.

Education required:

Bachelor’s or Master’s degree in Library Science, Information Science, Computer Science or Information Technology

Experience required:

At least two years of experience with metadata standards, taxonomies, and ontologies, or primarily database/web programming with some metadata experience. Experience with repositories and digital academic resources is required.


Experience with software implementations involving architectures with various data sources or input sources, and reconciling the ontologies in multiple systems. Knowledge of information architecture. Experience analyzing and understanding network graphs used to create appropriate relationships in an ontology.

Special Skills Required:

Knowledge of RDF and SPARQL, as well as experience with OWL editors, web services, and/or semantic web technologies. Excellent communication, presentation, and collaboration skills. Must be able to multitask, learn quickly, prioritize and think creatively. Proven ability to plan, manage, and complete projects.

**Preferred  **

Knowledge of SQL, XML, XSLT, Ruby or Java. Familiarity with semantic web technologies, linked open data and the software development lifecycle.

Archived on: 2012-03-30