Already a member?

Sign In

Conference Presentations 2016

  • IASSIST 2016-Embracing the 'Data Revolution': Opportunities and challenges for research, Bergen
    Host Institution: NSD - Norwegian Centre for Research Data

Poster Session (Thu, 2016-06-02)
Chair:Jenny Muilenburg

  • Data Management, Dissemination & Linkage in Add Health: The National Longitudinal Study of Adolescent to Adult Health
    Ashley Sorgi (University of North Carolina, Chapel Hill)

    [abstract]

    The National Longitudinal Study of Adolescent to Adult Health (Add Health) is a longitudinal study of a nationally representative sample of adolescents in grades 7-12 in the United States during the 1994-1995 school year. The Add Health cohort has been followed into young adulthood with four in-home interviews from 1995-2009 and will conduct a fifth wave of web based data collection and in-home visits in 2016-2018 to collect social and biological data on the respondents at ages 31-42. This poster provides an overview of our data dissemination strategies, a four tiered system set to minimize deductive disclosure risk for respondents. The poster also presents biomarker data available for each wave of data collection and provides a summary of Wave V data collection underway. We will discuss the most popular research areas and explore opportunities for new data users. Genome-Wide Association Study (GWAS) data will be disseminated through The NIH database of Genotypes and Phenotypes (dbGaP) in early 2016 and information on data linkage between social science phenotype and the newly available genetic data will be explored.

  • Integrating European survey research questions with Euro Question Bank
    Azadeh MahmoudHashemi (GESIS - Leibniz Institute for the Social Sciences)
    Wolfgang Zenk-Moltgen (GESIS - Leibniz Institute for the Social Sciences)

    [abstract]

    The project Euro Question Bank (EQB) will implement a searchable database of all the survey questions of studies, which are provided by CESSDA member archives. For social science researchers, the EQB will provide an easy and central facility to access the survey questions in different languages. So far, existing question databases were provided separately for different collections and in several different countries. This limited the access to existing survey questions as well as possibilities for cross-national comparative research. By providing different functionalities within EQB, such as "search", "comparison", and "multilingualism", users will be enabled to access the holdings of different research communities and browse by different elements such as "questions", "keywords", "concepts", "variables", "collections" and others. The implementation of EQB is based on the DDI-Lifecycle metadata standard and provides both DDI-Lifecycle and DDI-Codebook import/export functionalities. It is based on previous efforts, which have been made by GESIS in cooperation with other CESSDA member archives within several international projects. The content of EQB will be provided by the CESSDA member archives, after successful development of the EQB. The project aims to make it as easy as possible for CESSDA member archives to supply documentation to the EQB.

  • A lack of Persistent Identifiers means missing data - PIDs in CESSDA
    Kerrin Borschewski (GESIS - Leibniz Institute for the Social Sciences)
    Brigitte Hausstein (GESIS - Leibniz Institute for the Social Sciences)

    [abstract]

    The increasing amount of scientific digital data imposes the need to identify datasets with persistent identifiers (PIDs). The functionality to unambiguously locate and access digital resources and to associate them with related metadata is essential to allow data-preservation, -retrieval, and -citation. CESSDA (Consortium of European Social Science Data Archives) aims to promote the results of social science research. To do so and to increase the visibility of data, the persistent identification of the CESSDA data holdings is of great importance. Currently, the different CESSDA data archives use varying PID systems without any common approach, which diminishes CESSDA's potential to achieve its aim. Therefore, the consortium has established a task on PIDs, which aims for the development of a common CESSDA PID Policy, to foster research in general and the visibility of data. To establish this policy, the task-contributors chose diverse approaches (quantitative survey, personal interviews, group discussion), that give an overview of the CESSDA data archives' needs, wishes, and problems concerning the use of PIDs. The poster provides an overview of the current status of the CESSDA PID Task concerning the CESSDA PID Policy.

  • Making Nordic Health Data Visible
    Dag Kiberg (Norwegian Social Science Data Service (NSD))
    Mari Kleemola (Finnish Social Science Data Archive, University of Tampere)
    Annaleena Okuloff (Finnish Social Science Data Archive, University of Tampere)
    Bodil Stenvig (Danish Data Archive - FSD)
    Jeppe Klok Due (Danish Data Archive - FSD)
    Elisabeth Strandhagen (Swedish National Data Service)

    [abstract]

    The project is network collaboration between the Nordic social sciences data services with the primary aim to develop a discovery portal prototype for Nordic health data. Such portal requires:

    • Common metadata standards
    • Broadening existing documentation standards with controlled vocabularies
    • Harmonized formats across repositories A first prototype is already made and published.

    The project focus on metadata needed to discover, locate and search for Nordic health data, including access rights. One of the challenges is that DDI as such does not specify or restrict the values, or terms, that can or should be used to describe the data. Inconsistent use of terms leads to misunderstandings and complicates drastically the machine-actionability and interoperability. The project has during its first period:

    1. Charted the controlled vocabularies used by the four participating data archives and compared how they are used;

    2. Identified the vocabularies that would be most useful for a Nordic Health Data Portal and broadened them to include concepts relevant for health data;

    3. Mapped the DDI Codebook and DDI Lifecycle data descriptions.

    The presentation will focus on the objective of the project, controlled vocabulary and metadata, and the working method used.

  • Building a Metadata Portfolio for CESSDA
    Mari Kleemola (Finnish Social Science Data Archive, University of Tampere)
    Wolfgang zenk-Moltgen (GESIS, Leibniz Institute for the Social Sciences)
    Anne Etheridge (UK Data Service)

    [abstract]

    We will present the first outcomes of the Consortium of European Social Science Data Archives (CESSDA) Metadata Management Project (CMM). The project is a CESSDA Work Plan Task with an objective to develop, promote and implement a standardised metadata design, content and practice for all CESSDA data assets. The main output will be the Metadata Standards Portfolio Version 1 that will encompass support for resource discovery, question banks, preservation, data access and multilinguality, be platform independent, and help CESSDA Service Providers achieve the Data Seal of Approval (DSA) certification requirements related to metadata issues. The two main building blocks of the Portfolio are the Core Metadata Portfolio and the Controlled Vocabularies Portfolio. The Core will be built mainly upon the DDI-Lifecycle standard but will include elements from other relevant standards where appropriate. The CV Portfolio will contain CESSDA Controlled Vocabularies for relevant metadata fields, taking into account and supporting the DDI CVG work. The project period is November 2015 - April 2017 and the project is a collaboration between eight CESSDA Service Providers: FSD (lead), ADP, CASD, DDA, GESIS, NSD, SND and UKDS.

  • UK Data Service: the 'a' (Access) team
    Laura Beauchamp (UK Data Archive)
    Alix Taylor (UK Data Archive)

    [abstract]

    The Access team are responsible for processing user requests for access to data available via the UK Data Service and for managing user queries submitted to its Helpdesk. This poster session will present the work of the team in a graphical way to highlight: a) how the workflow changes depending on the level of access for the data requested, e.g. Open, Safeguarded, Controlled access, and b) the types of query submitted to the Helpdesk.

  • da|raSearchNet - Searching for Research Data
    Karoline Harzenetter (GESIS - Leibniz Institute for the Social Sciences)
    Brigitte Hausstein (GESIS - Leibniz Institute for the Social Sciences)

    [abstract]

    The DFG funded project da|raSearchNet aims for the development and establishment of an integrated search network. It is a project hosted by the GESIS Leibniz Institute for the Social Sciences and carried out in the framework of the Registration Agency for Social and Economic Data da|ra. The agency's service is to allocate persistent identifiers to data using the DOI system.

    Da|raSearchNet enables users to do research inside an up to date database of data references in one place and links it to data holdings worldwide. The da|ra interface permits to use value-added services that allow for individual search, use and management of data references. The search engine is based on the da|ra search index, which includes the metadata of all objects registered with da|ra and metadata harvested via web interfaces from selected national and international repositories. The content of the database is constantly expanded by registration activities and the harvesting process. Before entering the index of da|raSearchNet the metadata is transformed. This transformation includes among other tasks mapping to different metadata standards, combining linking and enriching metadata and checking for redundant data. Content extension and metadata transformation is strongly supported by technical solutions. Current important clients of da|ra are the GESIS Data Archive, the ICPSR and Research Data Centers like the Socio Economical Panel (SOEP), the Survey of Health, Ageing and Retirement in Europe (SHARE), and many others. da|raSearchNet is a service for data providers to gain greater visibility for their holdings, researchers to find and manage references and for service providers to access exposed structured metadata via OAI-PMH. The project complements the retrieval services of GESIS for to research data that is not stored itself in the Data Archive of GESIS.

  • Update Your Space: Tips for Renovating a Map and Data Centre
    Kevin Manuel (Ryerson University)

    [abstract]

    Are you planning on updating your Data Centre? Ryerson University Library (Toronto, Canada) renovated its Map and Data Centre starting in the autumn of 2015. Here are some tips and considerations from our experience that will help you in planning your next renovation. The first stage of planning starts with the physical redesign of the space: the architectural design, layout of collections and computer workstations, and selection of new furniture. Next there is planning for the installation of new computer hardware, installation of software, and setting up new internet connections. Once the physical and technical upgrades are completed then policies need to be established for a data centre. These include setting hours of operation, staffing, a food policy, and setting up an online booking system for users to access GIS and statistical software. Finally, involving feedback from users about the final renovation results can confirm if the renovation was successful or if any adjustments need to be made.

  • Using Stata for Web Scraping
    Rob O'Reilly (Emory University)

    [abstract]

    Practitioners of data journalism at The Guardian like to note that working with data is often "80% perspiration, 10% great idea, 10% output" (http://www.theguardian.com/news/datablog/2011/jul/28/data-journalism). Extracting data from web-based sources is a case in point: even if the data are "open" and accessible to all, that does not guarantee that they will be in a usable format for research and analysis. Instead, it often takes extensive effort to extract the contents of such sources and get them into clean, usable states, even when the original data are presented in tabular form. A common approach to web scraping is to make use of a programming language such as Python or R. However, Stata also possesses functionality that can be useful for this purpose. In this presentation, I will discuss how we have been taking both built-in Stata commands for loops, macros, and string functions and user-written Stata commands for parsing text files and using them in combination to grab data from web sites, clean their contents, and turn them into usable datasets for researchers.

  • The Introduction of SRDA
    Wan Yun Lo (Academia Sinica)

    [abstract]

    The Survey Research Data Archive (SRDA) founded in November 1984 by the Center of Survey Research engages in the systematic acquisition, organization, preservation, and dissemination of academic survey data in Taiwan. The datasets collected in SRDA are donated by researchers, surveys carried out by SRDA, government department, and other academic organizations and broadly divided into survey data, censuses, and In-house value-added data. Confidentiality and sensitivity are evaluated prior to the release of every survey data set. Standard data management and cleaning procedures are applied to ensure data accuracy and completeness. In addition, metadata and relevant supplement files are also edited and attached. For the access of the restricted data with personal, confidential, and sensitive information, SRDA provides two services: on-site service and remote service. Both services are provided only after the approval of the application. In order to ensure data security, SRDA has been awarded ISO 27001:2005 certification for its digital data storage and usage services from BSI management Systems in 2010. SRDA has also obtained ISO 27001: 2013 transition certifications in 2015. Since 2012 SRDA has developed an online comprehensive inquiry service of academic and government survey data. The use of concept terms or keywords as parts of question item allows users to search data more conveniently and efficiently in addition to the primary services for searching by topics and data sets. The SRDA set up Networked Social Science Tools and Resources (Nesstar) in 2009. The user interface of Nesstar allows users to search and browse survey data on the web.

  • IASSIST Quarterly

    Publications Special issue: A pioneer data librarian
    Welcome to the special volume of the IASSIST Quarterly (IQ (37):1-4, 2013). This special issue started as exchange of ideas between Libbie Stephenson and Margaret Adams to collect

    more...

  • Resources

    Resources

    A space for IASSIST members to share professional resources useful to them in their daily work. Also the IASSIST Jobs Repository for an archive of data-related position descriptions. more...

  • community

    • LinkedIn
    • Facebook
    • Twitter

    Find out what IASSISTers are doing in the field and explore other avenues of presentation, communication and discussion via social networking and related online social spaces. more...