Graph convolutional neural network (GCN) has effectively boosted the multi-label image recognition task by modeling correlation among labels. In previous methods, label correlation is computed based on statistical information through label diffusion, and therefore the same for all samples. This, however...
To address these challenges, we propose a skeleton-aware distance transform (SDT) that combines the merits of object skeleton in preserving connectivity and DT in modeling geometric arrangement to represent instances with arbitrary structures. Comprehensive experiments on histopathology image segmentation ...
Gall, M., Rinderle-Ma, S.: Visual modeling of instance-spanning constraints in process-aware information systems. pp. 597-595. Advanced Information Systems Engineering (May 2017)Gall, M., Rinderle-Ma, S.: Visual modeling of instance-spanning constraints in process-aware information systems. In...
We propose the use of geometry-aware flow, which serves as a well-suited representation for modeling the transformation between instance-level facial attributes. Specifically, we leverage the facial landmarks as the geometric guidance to learn the differentiable flows automatically, despite of the ...
Inspired by the explicit difficulty modeling scheme in self-paced learning, CAAF utilizes a pairwise manifold ranking loss to evaluate the ranking confidence of each unlabeled sample. The ranking confidence improves not only the interaction efficiency by indicating valuable feedback candidates but also ...
Gall, M., Rinderle-Ma, S.: Visual modeling of instance-spanning constraints in process-aware information systems. pp. 597-595. Advanced Information Systems Engineering (May 2017)Indiono, C., Mangler, J., Fdhila, W., Rinderle-Ma, S.: Rule-based runtime moni- toring of instance-spanning...
DCFA-LVIS is capable of modeling videos of arbitrary lengths without needing to partition the frames into smaller groups. During inference, the entire video is input and then downsampled to 360p, a step that follows the approach of MaskTrack R-CNN [2]. This model learns to represent video-...