Unstructured Numerical Image Dataset Separation (UNIDS) method employing an enhanced unsupervised clustering technique. The objective is to delineate an optimal number of distinct groups within the input grayscale (G-...
详细信息
Background Three-dimensional(3D)shape representation using mesh data is essential in various applications,such as virtual reality and simulation *** methods for extracting features from mesh edges or faces struggle wi...
详细信息
Background Three-dimensional(3D)shape representation using mesh data is essential in various applications,such as virtual reality and simulation *** methods for extracting features from mesh edges or faces struggle with complex 3D models because edge-based approaches miss global contexts and face-based methods overlook variations in adjacent areas,which affects the overall *** address these issues,we propose the Feature Discrimination and Context Propagation Network(FDCPNet),which is a novel approach that synergistically integrates local and global features in mesh *** FDCPNet is composed of two modules:(1)the Feature Discrimination Module,which employs an attention mechanism to enhance the identification of key local features,and(2)the Context Propagation Module,which enriches key local features by integrating global contextual information,thereby facilitating a more detailed and comprehensive representation of crucial areas within the mesh *** Experiments on popular datasets validated the effectiveness of FDCPNet,showing an improvement in the classification accuracy over the baseline ***,even with reduced mesh face numbers and limited training data,FDCPNet achieved promising results,demonstrating its robustness in scenarios of variable complexity.
With the adoption of cutting-edge communication technologies such as 5G/6G systems and the extensive development of devices,crowdsensing systems in the Internet of Things(IoT)are now conducting complicated video analy...
详细信息
With the adoption of cutting-edge communication technologies such as 5G/6G systems and the extensive development of devices,crowdsensing systems in the Internet of Things(IoT)are now conducting complicated video analysis tasks such as behaviour *** applications have dramatically increased the diversity of IoT ***,behaviour recognition in videos usually requires a combinatorial analysis of the spatial information about objects and information about their dynamic actions in the temporal *** recognition may even rely more on the modeling of temporal information containing short-range and long-range motions,in contrast to computer vision tasks involving images that focus on understanding spatial ***,current solutions fail to jointly and comprehensively analyse short-range motions between adjacent frames and long-range temporal aggregations at large scales in *** this paper,we propose a novel behaviour recognition method based on the integration of multigranular(IMG)motion features,which can provide support for deploying video analysis in multimedia IoT crowdsensing *** particular,we achieve reliable motion information modeling by integrating a channel attention-based short-term motion feature enhancement module(CSEM)and a cascaded long-term motion feature integration module(CLIM).We evaluate our model on several action recognition benchmarks,such as HMDB51,Something-Something and *** experimental results demonstrate that our approach outperforms the previous state-of-the-art methods,which confirms its effective-ness and efficiency.
Kirsten rat sarcoma viral oncogene homolog(namely KRAS)is a key biomarker for prognostic analysis and targeted therapy of colorectal ***,the advancement of machine learning,especially deep learning,has greatly promote...
详细信息
Kirsten rat sarcoma viral oncogene homolog(namely KRAS)is a key biomarker for prognostic analysis and targeted therapy of colorectal ***,the advancement of machine learning,especially deep learning,has greatly promoted the development of KRAS mutation detection from tumor phenotype data,such as pathology slides or radiology ***,there are still two major problems in existing studies:inadequate single-modal feature learning and lack of multimodal phenotypic feature *** this paper,we propose a Disentangled Representation-based Multimodal Fusion framework integrating Pathomics and Radiomics(DRMF-PaRa)for KRAS mutation ***,the DRMF-PaRa model consists of three parts:(1)the pathomics learning module,which introduces a tissue-guided Transformer model to extract more comprehensive and targeted pathological features;(2)the radiomics learning module,which captures the generic hand-crafted radiomics features and the task-specific deep radiomics features;(3)the disentangled representation-based multimodal fusion module,which learns factorized subspaces for each modality and provides a holistic view of the two heterogeneous phenotypic *** proposed model is developed and evaluated on a multi modality dataset of 111 colorectal cancer patients with whole slide images and contrast-enhanced *** experimental results demonstrate the superiority of the proposed DRMF-PaRa model with an accuracy of 0.876 and an AUC of 0.865 for KRAS mutation detection.
With the increasing demand for high-quality 3D holographic reconstruction, visual clarity and accuracy remain significant challenges in various imaging applications. Current methods struggle for higher image resolutio...
详细信息
Deep learning-based infrared small target detection (IRSTD) methods typically exploit spatial domain cues to infer dim and weak infrared targets. However, relying solely on spatial domain information is sub-optimal du...
详细信息
computer-generated aesthetic patterns arewidely used as design materials in various fields. Themost common methods use fractals or dynamicalsystems as basic tools to create various patterns. Toenhance aesthetics and c...
详细信息
computer-generated aesthetic patterns arewidely used as design materials in various fields. Themost common methods use fractals or dynamicalsystems as basic tools to create various patterns. Toenhance aesthetics and controllability, some researchershave introduced symmetric layouts along with thesetools. One popular strategy employs dynamical systemscompatible with symmetries that construct functionswith the desired symmetries. However, these aretypically confined to simple planar symmetries. Theother generates symmetrical patterns under theconstraints of tilings. Although it is slightly moreflexible, it is restricted to small ranges of tilingsand lacks textural variations. Thus, we proposed anew approach for generating aesthetic patterns bysymmetrizing quasi-regular patterns using general kuniformtilings. We adopted a unified strategy toconstruct invariant mappings for k-uniform tilings thatcan eliminate texture seams across the tiling ***, we constructed three types of symmetriesassociated with the patterns: dihedral, rotational, andreflection symmetries. The proposed method can beeasily implemented using GPU shaders and is highlyefficient and suitable for complicated tiling with regularpolygons. Experiments demonstrated the advantages of our method over state-of-the-art methods in terms offlexibility in controlling the generation of patterns withvarious parameters as well as the diversity of texturesand styles.
Traditional object detection requires a large amount of annotated data to ensure model accuracy and generalization, but in practical scenarios, labeled samples are often limited. Few-shot Object Detection (FSOD) addre...
详细信息
Learners' affective states play a crucial role in learning evaluation, and the external expressions that can directly reflect affect are facial expressions. However, the sample size of the database for the learnin...
详细信息
The security of vehicle-to-everything (V2X) communication systems is of critical importance for intelligent transportation systems. This paper discusses the use of reconfigurable intelligent surface (RIS) technology t...
详细信息
暂无评论