The interpretability of deep learning models has emerged as a compelling area in artificial intelligence *** safety criteria for medical imaging are highly stringent,and models are required for an ***,existing convolu...
详细信息
The interpretability of deep learning models has emerged as a compelling area in artificial intelligence *** safety criteria for medical imaging are highly stringent,and models are required for an ***,existing convolutional neural network solutions for left ventricular segmentation are viewed in terms of inputs and ***,the interpretability of CNNs has come into the *** medical imaging data are limited,many methods to fine-tune medical imaging models that are popular in transfer models have been built using massive public Image Net datasets by the transfer learning ***,this generates many unreliable parameters and makes it difficult to generate plausible explanations from these *** this study,we trained from scratch rather than relying on transfer learning,creating a novel interpretable approach for autonomously segmenting the left ventricle with a cardiac *** enhanced GPU training system implemented interpretable global average pooling for graphics using deep *** deep learning tasks were *** included data management,neural network architecture,and *** system monitored and analyzed the gradient changes of different layers with dynamic visualizations in real-time and selected the optimal deployment *** results demonstrated that the proposed method was feasible and efficient:the Dice coefficient reached 94.48%,and the accuracy reached 99.7%.It was found that no current transfer learning models could perform comparably to the ImageNet transfer learning *** model is lightweight and more convenient to deploy on mobile devices than transfer learning models.
Visible‐infrared person re‐identification(VI‐ReID)is a supplementary task of single‐modality re‐identification,which makes up for the defect of conventional re‐identification under insufficient *** is more chall...
详细信息
Visible‐infrared person re‐identification(VI‐ReID)is a supplementary task of single‐modality re‐identification,which makes up for the defect of conventional re‐identification under insufficient *** is more challenging than single‐modality ReID because,in addition to difficulties in pedestrian posture,camera shoot-ing angle and background change,there are also difficulties in the cross‐modality *** works only involve coarse‐grained global features in the re‐ranking calculation,which cannot effectively use fine‐grained ***,fine‐grained features are particularly important due to the lack of information in cross‐modality re‐*** this end,the Q‐center Multi‐granularity K‐reciprocal Re‐ranking Algorithm(termed QCMR)is proposed,including a Q‐nearest neighbour centre encoder(termed QNC)and a Multi‐granularity K‐reciprocal Encoder(termed MGK)for a more comprehensive feature *** converts the probe‐corresponding modality features into gallery corresponding modality features through modality transfer to narrow the modality *** takes a coarse‐grained mutual nearest neighbour as the dominant and combines a fine‐grained nearest neighbour as a supplement for similarity *** experiments on two widely used VI‐ReID benchmarks,SYSU‐MM01 and RegDB have shown that our method achieves state‐of‐the‐art ***,the mAP of SYSU‐MM01 is increased by 5.9%in all‐search mode.
In many practical applications, 3D point cloud analy-sis requires rotation invariance. In this paper, we present a learnable descriptor invariant under 3D rotations and reflections, i.e., the O(3) actions, utilizing t...
详细信息
ISBN:
(数字)9798350353006
ISBN:
(纸本)9798350353013
In many practical applications, 3D point cloud analy-sis requires rotation invariance. In this paper, we present a learnable descriptor invariant under 3D rotations and reflections, i.e., the O(3) actions, utilizing the recently intro-duced steerable 3D spherical neurons and vector neurons. Specifically, we propose an embedding of the 3D spherical neurons into 4D vector neurons, which leverages end-to-end training of the model. In our approach, we perform TetraTransform-an equivariant embedding of the 3D input into 4D, constructed from the steerable neurons-and ex-tract deeper O(3)-equivariant features using vector neurons. This integration of the TetraTransform into the VN-DGCNN framework, termed TetraSphere, negligibly increases the number of parameters by less than 0.0002%. TetraSphere sets a new state-of-the-art performance classifying randomly rotated real-world object scans of the challenging subsets of ScanObjectNN. Additionally, TetraSphere outperforms all equivariant methods on randomly rotated synthetic data: classifying objects from ModelNet40 and segmenting parts of the ShapeNet shapes. Thus, our results reveal the prac-tical value of steerable 3D spherical neurons for learning in 3D Euclidean space. The code is available at https: //***/pavlo-melnyk/tetrasphere.
Electrocardiogram (ECG) is an important non-invasive technique for diagnosing cardiovascular diseases (CVD). After acquiring the patients' raw ECG signal data, signal processing is essential for the diagnosis. Con...
详细信息
Inverse Kinematics (IK) is one of the most fundamental challenges in robotics. It refers to the process of determining the joint configurations required to achieve the desired position and orientation (pose) of a robo...
详细信息
ISBN:
(数字)9798350377705
ISBN:
(纸本)9798350377712
Inverse Kinematics (IK) is one of the most fundamental challenges in robotics. It refers to the process of determining the joint configurations required to achieve the desired position and orientation (pose) of a robot end-effector. Although numerous Data-Driven (DD) IK solvers have demonstrated encouraging results, they have not achieved the same accuracy when compared to other IK methods for complex robot configurations (e.g., numerical methods for higher Degrees of Freedom (DoF)). In this work, we propose a new Learning-by-Example method, and show that such a scheme considerably improves the IK learning results when compared to other DD learners. In our approach, the network input incorporates an example of joint-pose pair along with the query pose to predict the desired robot joint configuration. We show that the example joint-pose pair does not need to be too close to the query – i.e. example and query can be as far as 20 degrees apart in the joint configuration space. Furthermore, we investigate the utilization of residual and dense skip connections in Multilayer Perceptron for DDIK solvers and employ the resulting networks for two redundant robotic manipulators: a 7-DoF-7R commensurate robot and a 7DoF-2RP4R incommensurate robot. Our experimental results show that the resulting DDIK solver can reliably predict IK solutions with accuracy better than 1mm in position and 1deg in orientation.
Map-free LiDAR localization systems accurately localize within known environments by predicting sensor position and orientation directly from raw point clouds, eliminating the need for large maps and descriptors. Howe...
详细信息
ISBN:
(数字)9798331510831
ISBN:
(纸本)9798331510848
Map-free LiDAR localization systems accurately localize within known environments by predicting sensor position and orientation directly from raw point clouds, eliminating the need for large maps and descriptors. However, their long training times hinder rapid adaptation to new environments. To address this, we propose FlashMix, which uses a frozen, scene-agnostic backbone to extract local point descriptors, aggregated with an MLP mixer to predict sensor pose. A buffer of local descriptors is used to accelerate training by orders of magnitude, combined with metric learning or contrastive loss regularization of aggregated descriptors to improve performance and convergence. We evaluate FlashMix on various LiDAR localization benchmarks, examining different regularizations and aggregators, and demonstrating its effectiveness for rapid and accurate LiDAR localization in real-world scenarios. The code is available at https://***/raktimgg/FlashMix.
This paper introduces a novel approach integrating Differential Evolution (DE) with multi-objective optimization techniques for enhancing Type-1 Takagi-Sugeno-Kang (TSK) fuzzy rule-based systems to attain both fair pr...
详细信息
In this paper, we propose a method to automatically and efficiently tune high-dimensional vectors of controller parameters. The proposed method first learns a mapping from the high-dimensional controller parameter spa...
详细信息
ISBN:
(数字)9798350384574
ISBN:
(纸本)9798350384581
In this paper, we propose a method to automatically and efficiently tune high-dimensional vectors of controller parameters. The proposed method first learns a mapping from the high-dimensional controller parameter space to a lower dimensional space using a machine learning-based algorithm. This mapping is then utilized in an actor-critic framework using Bayesian optimization (BO). The proposed approach is applicable to complex systems (such as quadruped robots). In addition, the proposed approach also enables efficient generalization to different control tasks while also reducing the number of evaluations required while tuning the controller parameters. We evaluate our method on a legged locomotion application. We show the efficacy of the algorithm in tuning the high-dimensional controller parameters and also reducing the number of evaluations required for the tuning. Moreover, it is shown that the method is successful in generalizing to new tasks and is also transferable to other robot dynamics.
In this paper, we present an AI-based treadmill assistant that employs computervision and Deep learning algorithms to offer a comprehensive safety net for treadmill users. The system is designed and implemented with ...
详细信息
作者:
Johnson, HansSaniie, Jafar
Department of Electrical and Computer Engineering Research Laboratory ChicagoIL United States
This paper presents the design flow of an IoT human-machine touchless interface. The device uses embedded computing in conjunction with the Leap Motion Controller to provide an accurate and intuitive touchless interfa...
详细信息
暂无评论