Evaluation of a hierarchical ascendant clustering process implemented in a veterinary syndromic surveillance system
•The HAC clustering process was evaluated by submitting an a priori labelled dataset.•Se, Sp, robustness and stability of the HAC were evaluated for this set of data.•“MoSS-E2” showed to be a valid configurable tool to cluster clinical observations. Syndromic surveillance is considered as one of the...
Saved in:
Published in | Preventive veterinary medicine Vol. 120; no. 2; pp. 141 - 151 |
---|---|
Main Authors | , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
Netherlands
Elsevier B.V
15.06.2015
Elsevier |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | •The HAC clustering process was evaluated by submitting an a priori labelled dataset.•Se, Sp, robustness and stability of the HAC were evaluated for this set of data.•“MoSS-E2” showed to be a valid configurable tool to cluster clinical observations.
Syndromic surveillance is considered as one of the surveillance components for early warning of health-related events, as it allows detection of aberrations in health indicators before laboratory confirmation. “MoSS-Emergences 2” (MoSS-E2), a tool for veterinary syndromic surveillance, aggregates groups of similar clinical observations by hierarchical ascendant classification (HAC).
In the present study, this HAC clustering process was evaluated using a reference set of data that, for the purpose of this evaluation, was a priori divided and defined as Bluetongue (BTV) positive cases (PC) on the one hand and BTV negative cases (NC) on the other hand. By comparing the clustering result of MoSS-E2 with the expected outcome, the sensitivity (the ability to cluster PC together) and specificity (the ability to exclude NC from PC) of the clustering process were determined for this set of data. The stability of the classes obtained with the clustering algorithm was evaluated by comparing the MoSS-E2 generated dendrogram (applying complete linkage) with dendrograms of STATA® software applying average and single linkage methods. To assess the systems’ robustness, the parameters of the distance measure were adjusted according to different scenarios and obtained outcomes were compared to the expected outcome based on the a priori known labels. Rand indexes were calculated to measure similarity between clustering outcomes. The clustering algorithm in its default settings successfully segregated the reference BTV cases from the non-BTV cases, resulting in a sensitivity of 100.0% (95% CI: 89.0–100.0) and a specificity of 100.0% (95% CI: 80.0–100.0) for this set of data. The different linkage methods showed similar clustering results indicating stability of the classes (Rand indexes of respectively 0.77 for average and 0.75 for single linkage). The system proved to be robust when changing the parameters as the BTV cases remained together in meaningful clusters (Rand indexes between 0.72 and 1). The configurable MoSS-E2 system demonstrated its suitability to identify meaningful clusters of clinical syndromes. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0167-5877 1873-1716 |
DOI: | 10.1016/j.prevetmed.2015.03.002 |