At present, most CT scanning systems adopt the traditional horizontal O-frame, suspension or C-arm structures, which can not balance the flexibility and functionality. With the development of robot technology, CT scan...
详细信息
Vessel centerline extraction is essential for carotid stenosis assessment and atherosclerotic plaque identification in clinical diagnosis. Simultaneously, it provides a region of interest identification and boundary i...
Vessel centerline extraction is essential for carotid stenosis assessment and atherosclerotic plaque identification in clinical diagnosis. Simultaneously, it provides a region of interest identification and boundary initialization for computer-assisted diagnosis tools. In magnetic resonance imaging (MRI) cross-sectional images, the lumen shape and vascular topology result in a challenging task to extract the centerline accurately. To this end, we propose a space-refine framework, which exploits the positional continuity of the carotid artery from frame to frame to extract the carotid artery centerline. The proposed framework consists of a detector and a refinement module. Specifically, the detector roughly extracts the carotid lumen region from the original image. Then, we introduce a refinement module that uses the cascade of regressors from a detector to perform sequence realignment of lumen bounding boxes for each subject. It improves the lumen localization results and further enhances the centerline extraction accuracy. Verified by large carotid artery data, the proposed framework achieves state-of-the-art performance compared to conventional vessel centerline extraction methods or standard convolutional neural network *** relevance— Our proposed framework can be used as an important aid for physicians to quantitatively analyze the carotid artery in clinical practice. It is also used as a new paradigm for extracting the centerline of carotid vessels in computer-assisted tools.
Monocular depth estimation is a fundamental technique for robots to perceive the real (unseen) scene. Supervised methods rely on large-scale datasets with groundtruth (GT) depth labels, which cannot be well generalize...
Monocular depth estimation is a fundamental technique for robots to perceive the real (unseen) scene. Supervised methods rely on large-scale datasets with groundtruth (GT) depth labels, which cannot be well generalized to other scenes. A dominant solution is to directly train the model on target scenes in self-supervised way with pseudo depth labels (e.g. generated by stereo matching). However, pseudo depth labels are often unreliable especially near object boundaries. It may disturb the training of the model and consequently decrease the depth quality in the inference. In this paper, we investigate the structure similarity of RGB-Depth based on Gaussian kernels, because the structure of RGB image is always reliable. Such RGB-Depth structure similarity measurement is then used to improve the self-supervised depth estimation in two aspects. It is first utilized to measure the confidence of pseudo depth labels and filter unreliable pixels. It is then utilized to limit the structure of predicted depth maps in the loss. Experiments on the KITTI Eigen Splits datasets verify that the proposed method achieves better or comparable quantitative results and always achieves better visual results with clear depth boundaries compared with five recent baselines.
Adaptive control is widely used in nonlinear systems, and is under active research. Although the conventional adaptive control guarantees the asymptotic stability, it may yield limited performance in real experiments ...
详细信息
As a kind of pneumatic actuators, pneumatic artificial muscles (PAMs) possess remarkable merits, such as lightweight, compliance, higher security. However, with significant characteristics, e.g., strong bearing capaci...
详细信息
Traditional road-level topological maps cannot meet the demand for high precision navigation services required by human drivers and autonomous vehicles. Meanwhile, current enhanced lane-level topological maps with red...
Traditional road-level topological maps cannot meet the demand for high precision navigation services required by human drivers and autonomous vehicles. Meanwhile, current enhanced lane-level topological maps with redundant details pose practical challenges for automatic map construction and efficient lane-level route planning. To address these challenges, this paper proposes a novel topological map representation. Based on this representation, a hierarchical mapping framework is established, which includes the extraction of road primitives, generation of directed graph, and automatic construction of lane-level topological map. The constructed topological map enables efficient lane-level route planning by the shortest path search algorithm with having the minimal nodes of topological graph. The proposed mapping framework and lane-level route planning have been extensively tested and evaluated on two real urban traffic scenarios, demonstrating the feasibility and effectiveness of this novel topological map representation.
Embodied AI, where agents accomplish specific tasks through interaction with their surrounding environment, is attracting attention in the community. As a more comprehensive and practical embodied task, visual room re...
Embodied AI, where agents accomplish specific tasks through interaction with their surrounding environment, is attracting attention in the community. As a more comprehensive and practical embodied task, visual room rearrangement aims to rearrange the initially misplaced objects in a room to the target configuration. In this task, the misplaced targets are mostly small objects that are difficult to locate, and there is no indication of which object needs to be rearranged in sensory input. Direct search for small and unknown targets poses a challenge. Previous works tend to directly incorporate sensory input into a reward-based framework. These methods do not learn the relationships between objects and infer which objects need to be rearranged, resulting in low performance. Understanding the scene structure and reasoning task goals can enable agents to rearrange targets more efficiently. In this paper, we propose a receptacle-target relation graph, which considers the spatial relationship between target objects and the receptacle objects appearing around them, enabling agents to understand the environment. Furthermore, we also propose a target reasoning network to explicitly predict the target object. Finally, our goal-aware policy can plan a series of actions based on their understanding of the task goal. Extensive compmethodarative and ablation experiments were conducted on the AI2Thor platform, demonstrating the effectiveness of our method.
Diabetic retinopathy(DR),the main cause of irreversible blindness,is one of the most common complications of *** present,deep convolutional neural networks have achieved promising performance in automatic DR detection...
详细信息
Diabetic retinopathy(DR),the main cause of irreversible blindness,is one of the most common complications of *** present,deep convolutional neural networks have achieved promising performance in automatic DR detection *** convolution operation of methods is a local cross-correlation operation,whose receptive field de-termines the size of the local neighbourhood for ***,for retinal fundus photographs,there is not only the local information but also long-distance dependence between the lesion features(*** and exudates)scattered throughout the whole *** proposed method incorporates correlations between long-range patches into the deep learning framework to improve DR ***-wise re-lationships are used to enhance the local patch features since lesions of DR usually appear as *** Long-Range unit in the proposed network with a residual structure can be flexibly embedded into other trained *** experimental results demon-strate that the proposed approach can achieve higher accuracy than existing state-of-the-art models on Messidor and EyePACS datasets.
This paper presents a demolition system, which is composed of a spherical inspection robot and a demolition robot. For the former, it is designed to get information about the demolition scene. The demolition robot wit...
详细信息
The lifespan of a generator greatly depends on the quality and aging of its stator bar insulation material. Aging of insulation materials can lead to premature equipment failure and significant material loss, resultin...
详细信息
ISBN:
(数字)9781737749769
ISBN:
(纸本)9798350371420
The lifespan of a generator greatly depends on the quality and aging of its stator bar insulation material. Aging of insulation materials can lead to premature equipment failure and significant material loss, resulting in substantial economic losses. However, existing methods for predicting the lifespan of electronic wire bars have several drawbacks, such as slow training speed, the need for a large amount of training data, and a tendency to overfit. To address this issue, we propose a characteristic enhancement algorithm based on conditional uncorrelation. This algorithm leverages characteristic enhancement to generate an extensive dataset and utilizes subset selection to identify relevant electrical parameters for predicting the remaining life span of the stator bar’s main insulation configurations. Experimental results demonstrate the advantages of our research compared to deep learning models. Our approach offers a promising solution for accurately predicting the remaining life of stator bar insulation, thereby facilitating effective maintenance planning and minimizing economic losses.
暂无评论