VHSMarker and the Canine Cardiac Keypoint (CCK) Dataset: A Benchmark for Veterinary Cardiac X-ray Analysis
Overview
Overall Novelty Assessment
The paper introduces three contributions: VHSMarker, a web-based annotation tool for cardiac keypoint labeling; the Canine Cardiac Keypoint (CCK) Dataset comprising 21,465 annotated radiographs; and MambaVHS, a baseline model integrating Mamba state-space blocks with convolutional layers. Within the taxonomy, the work resides in the 'Hybrid and Transformer-Enhanced Keypoint Models' leaf under 'Deep Learning Architectures for Keypoint Detection'. This leaf contains only three papers total, indicating a relatively sparse research direction compared to the more crowded 'EfficientNet-Based Keypoint Localization' leaf with eight papers and the 'Regression-Based VHS Prediction' leaf with eight papers.
The taxonomy reveals that most prior work clusters in two neighboring branches: pure CNN architectures (EfficientNet, ResNet, HRNet, ConvNeXt) for keypoint detection, and direct VHS regression or classification methods that bypass explicit landmark localization. The paper's hybrid approach—combining state-space modeling with convolutional layers—sits at the intersection of these directions. Nearby leaves include 'Specialized CNN Architectures for Landmark Detection' and 'Object Detection Frameworks for VHS Keypoints', both of which rely on standard convolutional backbones without transformer or state-space components. The taxonomy's scope notes clarify that this leaf specifically covers architectures integrating attention mechanisms or state-space models, distinguishing it from pure CNN approaches.
Among the 17 candidates examined, the annotation tool contribution (VHSMarker) showed no refutable prior work in the single candidate reviewed. The dataset contribution (CCK Dataset) examined nine candidates, with two appearing to provide overlapping prior work—suggesting that large-scale annotated canine cardiac datasets may exist in the limited search scope. The MambaVHS model contribution examined seven candidates with no refutations, indicating that state-space modeling for VHS estimation appears less explored among the top semantic matches. The limited search scale (17 total candidates) means these findings reflect a focused subset of the literature rather than exhaustive coverage.
Based on the top-17 semantic matches and taxonomy structure, the work appears to occupy a less crowded methodological niche (hybrid state-space models) while addressing a task with established CNN and regression baselines. The dataset contribution shows some overlap with prior resources among examined candidates, though the annotation tool and model architecture appear more distinctive within the limited search scope. The analysis does not cover broader veterinary imaging literature or recent preprints outside the candidate pool.
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors introduce VHSMarker, a clinician-oriented web tool that reduces annotation time from over a minute to 10–12 seconds per image while supporting real-time keypoint placement, automated VHS calculation, built-in quality checks, and seamless data export for scalable dataset creation.
The authors constructed a large-scale benchmark dataset comprising 21,465 annotated canine thoracic radiographs from 12,385 dogs across 144 breeds, making it the largest curated resource for canine cardiac analysis and providing a standardized benchmark for training and evaluation.
The authors propose MambaVHS, a hierarchical deep learning model that integrates state-space modeling (Mamba blocks) with convolutional layers to achieve robust and accurate VHS prediction, achieving 91.8% test accuracy and establishing state-space modeling as a promising direction for veterinary imaging.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
[16] EdgeANet: A Transformer-based Edge Representation Learning Network for Canine X-ray Verification PDF
[35] A Hybrid EfficientNet-B7 and Transformer Model for Predicting Vertebral Heart Scale in Canine Chest Radiographs PDF
Contribution Analysis
Detailed comparisons for each claimed contribution
VHSMarker: Web-based annotation tool for canine cardiac keypoints
The authors introduce VHSMarker, a clinician-oriented web tool that reduces annotation time from over a minute to 10–12 seconds per image while supporting real-time keypoint placement, automated VHS calculation, built-in quality checks, and seamless data export for scalable dataset creation.
[51] Adaptive learning methods based on crowdsourced data labelling for medical images data mining PDF
Canine Cardiac Keypoint (CCK) Dataset
The authors constructed a large-scale benchmark dataset comprising 21,465 annotated canine thoracic radiographs from 12,385 dogs across 144 breeds, making it the largest curated resource for canine cardiac analysis and providing a standardized benchmark for training and evaluation.
[4] An automated deep learning method and novel cardiac index to detect canine cardiomegaly from simple radiography PDF
[53] Regressive vision transformer for dog cardiomegaly assessment PDF
[15] Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection PDF
[52] Exploring large-scale public medical image datasets PDF
[54] Development of an artificial intelligence-based method for the diagnosis of the severity of myxomatous mitral valve disease from canine chest radiographs PDF
[55] Use of deep learning to detect cardiomegaly on thoracic radiographs in dogs PDF
[56] Pilot study: application of artificial intelligence for detecting left atrial enlargement on canine thoracic radiographs PDF
[58] Cardiac radiography PDF
[59] Canine thoracic radiographic images as an educational dataset for distance learning and research on vertebral heart score PDF
MambaVHS: Baseline model integrating Mamba blocks for VHS estimation
The authors propose MambaVHS, a hierarchical deep learning model that integrates state-space modeling (Mamba blocks) with convolutional layers to achieve robust and accurate VHS prediction, achieving 91.8% test accuracy and establishing state-space modeling as a promising direction for veterinary imaging.