Large Language Models (LLMs) have evolved into Multimodal Large Language Models (MLLMs), significantly enhancing their capabilities by integrating visual information and other types, thus aligning more closely with th...
详细信息
Video frame interpolation (VFI) aims to generate predictive frames by motion-warping from bidirectional references. Most examples of VFI utilize spatiotemporal semantic information to realize motion estimation and int...
详细信息
Intelligent vehicle applications provide convenience but raise privacy and security *** of sensitive data,including vehicle location,and facial recognition information,poses a threat to user ***,traffic classification...
详细信息
Intelligent vehicle applications provide convenience but raise privacy and security *** of sensitive data,including vehicle location,and facial recognition information,poses a threat to user ***,traffic classification is vital for promptly overseeing and controlling applications with sensitive *** this paper,we propose ETNet,a framework that combines multiple features and leverages self-attention mechanisms to learn deep relationships between ***-Net employs a multisimilarity triplet network to extract features from raw bytes,and exploits self-attention to capture long-range dependencies within packets in a session and contextual information ***,we utilizing the loss function to more effectively integrate information acquired from both byte sequences and their corresponding *** simulated evaluations on datasets with similar attributes,ET-Net demonstrates the ability to finely distinguish between nine categories of applications,achieving superior results compared to existing methods.
Belief propagation(BP)decoding outputs soft information and can be naturally used in iterative *** list(BPL)decoding provides comparable error-correction performance to the successive cancellation list(SCL)*** this pa...
详细信息
Belief propagation(BP)decoding outputs soft information and can be naturally used in iterative *** list(BPL)decoding provides comparable error-correction performance to the successive cancellation list(SCL)*** this paper,we firstly introduce an enhanced code construction scheme for BPL decoding to improve its errorcorrection ***,a GPU-based BPL decoder with adoption of the new code construction is ***,the proposed BPL decoder is tested on NVIDIA RTX3070 and *** results show that the presented BPL decoder with early termination criterion achieves above 1 Gbps throughput on RTX3070 for the code(1024,512)with 32 lists under good channel conditions.
Facial expressions can provide a better understanding of people's mental status and attitudes towards specific things. However, facial occlusion in real world is an unfavorable phenomenon that greatly affects the ...
详细信息
The global trend of population aging poses significant challenges to society and healthcare systems,particularly because of neurocognitive disorders(NCDs)such as Parkinson's disease(PD)and Alzheimer's disease(...
详细信息
The global trend of population aging poses significant challenges to society and healthcare systems,particularly because of neurocognitive disorders(NCDs)such as Parkinson's disease(PD)and Alzheimer's disease(AD).In this context,artificial intelligence techniques have demonstrated promising potential for the objective assessment and detection of *** contactless screening technologies,such as speech-language processing,computer vision,and virtual reality,offer efficient and convenient methods for disease diagnosis and progression *** paper systematically reviews the specific methods and applications of these technologies in the detection of NCDs using data collection paradigms,feature extraction,and modeling ***,the potential applications and future prospects of these technologies for the detection of cognitive and motor disorders are *** providing a comprehensive summary and refinement of the extant theories,methodologies,and applications,this study aims to facilitate an in-depth understanding of these technologies for researchers,both within and outside the *** the best of our knowledge,this is the first survey to cover the use of speech-language processing,computer vision,and virtual reality technologies for the detection of NSDs.
How to protect cultural retics is of great significance to the transmission and dissemination of history and *** 3-dimensional(3D)modeling of cultural relics is an effective way to preserve *** efficiency and complexi...
详细信息
How to protect cultural retics is of great significance to the transmission and dissemination of history and *** 3-dimensional(3D)modeling of cultural relics is an effective way to preserve *** efficiency and complexity of cultural relic model reconstruction algorithms are significant challenges due to redundant *** tackle the above issue,a 3D reconstruction algorithm,named COLMAP+LSH,was proposed for movable cultural relics based on salient region ***+LSH algorithm introduces saliency region detection and locality-sensetive Hashing(LSH)to achieve efficient,accurate,and robust digital 3D modeling of cultural ***,400 cultural model data were collected through offline and online ***+LSH algorithm detects the salient region interactively and reduces the number of images in the salient region by feature ***,COLMAP+LSH algorithm utilizes LSH to calculate the image selection scores and employs the image selection scores to reduce the redundant *** experiments on the self-constructed cultural relics dataset show that COLMAP+LSH algorithm can efficiently achieve image feature diffusion and ensure the quality of artifact reconstruction while selecting most of the redundant image data.
Mobile Edge Computing(MEC)is a technology for the fifth-generation(5G)wireless communications to enable User Equipment(UE)to offload tasks to servers deployed at the edge of ***,taking both delay and energy consumptio...
详细信息
Mobile Edge Computing(MEC)is a technology for the fifth-generation(5G)wireless communications to enable User Equipment(UE)to offload tasks to servers deployed at the edge of ***,taking both delay and energy consumption into consideration in the 5G MEC system is usually complex and ***-orthogonal multiple access(NOMA)enable more UEs to offload their computing tasks to MEC servers using the same spectrum resources to enhance the spectrum efficiency for 5G,which makes the problem even more complex in the NOMA-MEC *** this work,a system utility maximization model is present to NOMA-MEC system,and two optimization algorithms based on Newton method and greedy algorithm respectively are proposed to jointly optimize the computing resource allocation,SIC order,transmission time slot allocation,which can easily achieve a better trade-off between the delay and energy *** simulation results prove that the proposed method is effective for NOMA-MEC systems.
Aiming at the problem of long time-consuming and low accuracy of existing age estimation approaches,a new age estimation method using Gabor feature fusion,and an improved atomic search algorithm for feature selection ...
详细信息
Aiming at the problem of long time-consuming and low accuracy of existing age estimation approaches,a new age estimation method using Gabor feature fusion,and an improved atomic search algorithm for feature selection is ***,texture features of five scales and eight directions in the face region are extracted by Gabor wavelet *** statistical histogram is introduced to encode and fuse the directional index with the largest feature value on Gabor ***,a new hybrid feature selection algorithm chaotic improved atom search optimisation with simulated annealing(CIASO-SA)is presented,which is based on an improved atomic search algorithm and the simulated annealing ***,the CIASO-SA algorithm introduces a chaos mechanism during atomic initialisation,significantly improving the convergence speed and accuracy of the ***,a support vector machine(SVM)is used to get classification results of the age *** verify the performance of the proposed algorithm,face images with three resolutions in the Adience dataset are *** the Gabor real part fusion feature at 48�48 resolution,the average accuracy and 1-off accuracy of age classification exhibit a maximum of 60.4%and 85.9%,*** results prove the superiority of the proposed algorithm over the state-of-the-art methods,which is of great referential value for application to the mobile terminals.
As the application of Industrial Robots(IRs)scales and related participants increase,the demands for intelligent Operation and Maintenance(O&M)and multi-tenant collaboration *** methods could no longer cover the r...
详细信息
As the application of Industrial Robots(IRs)scales and related participants increase,the demands for intelligent Operation and Maintenance(O&M)and multi-tenant collaboration *** methods could no longer cover the requirements,while the Industrial Internet of Things(IIoT)has been considered a promising ***,there’s a lack of IIoT platforms dedicated to IR O&M,including IR maintenance,process optimization,and knowledge *** this context,this paper puts forward the multi-tenant-oriented ACbot platform,which attempts to provide the first holistic IIoT-based solution for O&M of *** on an information model designed for the IR field,ACbot has implemented an application architecture with resource and microservice management across the cloud and multiple *** this basis,we develop four vital applications including real-time monitoring,health management,process optimization,and knowledge *** have deployed the ACbot platform in real-world scenarios that contain various participants,types of IRs,and *** date,ACbot has been accessed by 10 organizations and managed 60 industrial robots,demonstrating that the platform fulfills our ***,the application results also showcase its robustness,versatility,and adaptability for developing and hosting intelligent robot applications.
暂无评论