Limits...
The custodian administered research extract server: "improving the pipeline" in linked data delivery systems.

Eitelhuber T, Davis G - Health Inf Sci Syst (2014)

Bottom Line: The introduction of CARES has reduced the workload burden of linked data extractions, while improving the efficiency, stability and predictability of turnaround times.CARES overcomes multiple obstacles with no sacrifice to the integrity, confidentiality or security of data.CARES is a valuable component of linkage infrastructure that is operable at any scale and adaptable to many data environments.

View Article: PubMed Central - PubMed

Affiliation: Data Linkage Branch, The Western Australian (WA) Department of Health, 189 Royal St East Perth, WA 6004 Perth, Australia.

ABSTRACT

Background: At Western Australia's Data Linkage Branch (DLB) the extraction of linked data has become increasingly complex over the past decade and classical methods of data delivery are unsuited to the larger extractions which have become the norm. The Custodian Administered Research Extract Server (CARES) is a fast, accurate and predictable approach to linked data extraction.

Methods: The Data Linkage Branch (DLB) creates linkage keys within and between datasets. To comply with the separation principal, these keys are sent to applicable data collection agencies for extraction. Routing requests through multiple channels is inefficient and makes it hard to monitor work and predict delivery times. CARES was developed to address these shortcomings and involved ongoing consultation with the Custodians and staff of collections, plus challenges of hardware, programming, governance and security.

Results: The introduction of CARES has reduced the workload burden of linked data extractions, while improving the efficiency, stability and predictability of turnaround times.

Conclusions: As the scope of a linkage system broadens, challenges in data delivery are inevitable. CARES overcomes multiple obstacles with no sacrifice to the integrity, confidentiality or security of data. CARES is a valuable component of linkage infrastructure that is operable at any scale and adaptable to many data environments.

No MeSH data available.


Related in: MedlinePlus

“The Pipeline”; an illustration of the timeline of a linked data project in Western Australia’s Data Linkage Branch, from initial application; through approvals, cohort creation and linkage; to data delivery.
© Copyright Policy - open-access
Related In: Results  -  Collection

License 1 - License 2
getmorefigures.php?uid=PMC4376516&req=5

Fig1: “The Pipeline”; an illustration of the timeline of a linked data project in Western Australia’s Data Linkage Branch, from initial application; through approvals, cohort creation and linkage; to data delivery.

Mentions: The DLB performs routine linkages, value add functions, and facilitates an extract tailoring service to researchers. The DLB performs routine linkages of Department of Health data collections including Hospital Morbidity, Emergency Department, Cancer Registry, Mental Health and Midwives Notifications, as well as non-Health collections such as the Registry of Births, Deaths and Marriages, Road Crash data and the State Electoral Roll. In total, DLB maintains linkage keys for well over 30 different state-based data collections [4], in addition to overseeing value adding processes relating to geocoding and genealogy. The branch also provides a service to researchers by providing tailored extracts of de-identified, linked information about study cohorts, often including multiple cohorts of cases, controls and family connections. Every research request must go through initial review and feedback, ethical approval, custodian and DLB approval, cohort creation, linkage, service data extraction and checking before data is provided. This series of processes form the titular “pipeline”, which is illustrated in Figure 1. The final few phases, wherein the linkable datasets are requested, extracted, standardised, checked and made available to the researcher (for example, through a secure laboratory environment or the provision of aggregate tables or deidentified datasets to the researcher) are collectively referred to in this paper as “data delivery.”Figure 1


The custodian administered research extract server: "improving the pipeline" in linked data delivery systems.

Eitelhuber T, Davis G - Health Inf Sci Syst (2014)

“The Pipeline”; an illustration of the timeline of a linked data project in Western Australia’s Data Linkage Branch, from initial application; through approvals, cohort creation and linkage; to data delivery.
© Copyright Policy - open-access
Related In: Results  -  Collection

License 1 - License 2
Show All Figures
getmorefigures.php?uid=PMC4376516&req=5

Fig1: “The Pipeline”; an illustration of the timeline of a linked data project in Western Australia’s Data Linkage Branch, from initial application; through approvals, cohort creation and linkage; to data delivery.
Mentions: The DLB performs routine linkages, value add functions, and facilitates an extract tailoring service to researchers. The DLB performs routine linkages of Department of Health data collections including Hospital Morbidity, Emergency Department, Cancer Registry, Mental Health and Midwives Notifications, as well as non-Health collections such as the Registry of Births, Deaths and Marriages, Road Crash data and the State Electoral Roll. In total, DLB maintains linkage keys for well over 30 different state-based data collections [4], in addition to overseeing value adding processes relating to geocoding and genealogy. The branch also provides a service to researchers by providing tailored extracts of de-identified, linked information about study cohorts, often including multiple cohorts of cases, controls and family connections. Every research request must go through initial review and feedback, ethical approval, custodian and DLB approval, cohort creation, linkage, service data extraction and checking before data is provided. This series of processes form the titular “pipeline”, which is illustrated in Figure 1. The final few phases, wherein the linkable datasets are requested, extracted, standardised, checked and made available to the researcher (for example, through a secure laboratory environment or the provision of aggregate tables or deidentified datasets to the researcher) are collectively referred to in this paper as “data delivery.”Figure 1

Bottom Line: The introduction of CARES has reduced the workload burden of linked data extractions, while improving the efficiency, stability and predictability of turnaround times.CARES overcomes multiple obstacles with no sacrifice to the integrity, confidentiality or security of data.CARES is a valuable component of linkage infrastructure that is operable at any scale and adaptable to many data environments.

View Article: PubMed Central - PubMed

Affiliation: Data Linkage Branch, The Western Australian (WA) Department of Health, 189 Royal St East Perth, WA 6004 Perth, Australia.

ABSTRACT

Background: At Western Australia's Data Linkage Branch (DLB) the extraction of linked data has become increasingly complex over the past decade and classical methods of data delivery are unsuited to the larger extractions which have become the norm. The Custodian Administered Research Extract Server (CARES) is a fast, accurate and predictable approach to linked data extraction.

Methods: The Data Linkage Branch (DLB) creates linkage keys within and between datasets. To comply with the separation principal, these keys are sent to applicable data collection agencies for extraction. Routing requests through multiple channels is inefficient and makes it hard to monitor work and predict delivery times. CARES was developed to address these shortcomings and involved ongoing consultation with the Custodians and staff of collections, plus challenges of hardware, programming, governance and security.

Results: The introduction of CARES has reduced the workload burden of linked data extractions, while improving the efficiency, stability and predictability of turnaround times.

Conclusions: As the scope of a linkage system broadens, challenges in data delivery are inevitable. CARES overcomes multiple obstacles with no sacrifice to the integrity, confidentiality or security of data. CARES is a valuable component of linkage infrastructure that is operable at any scale and adaptable to many data environments.

No MeSH data available.


Related in: MedlinePlus