State-of-the-art recommender systems are increasingly focused on optimizing implementation efficiency, such as enabling on-device recommendations under memory constraints. Current methods commonly use lightweight embe...
详细信息
State-of-the-art recommender systems are increasingly focused on optimizing implementation efficiency, such as enabling on-device recommendations under memory constraints. Current methods commonly use lightweight embeddings for users and items or employ compact embeddings to enhance reusability and reduce memory usage. However, these approaches consider only the coarse-grained aspects of embeddings, overlooking subtle semantic nuances. This limitation results in an adversarial degradation of meta-embedding performance, impeding the system's ability to capture intricate relationships between users and items, leading to suboptimal recommendations. To address this, we propose a novel approach to efficiently learn meta-embeddings with varying grained and apply fine-grained meta-embeddings to strengthen the representation of their coarse-grained counterparts. Specifically, we introduce a recommender system based on a graph neural network, where each user and item is represented as a node. These nodes are directly connected to coarse-grained virtual nodes and indirectly linked to fine-grained virtual nodes, facilitating learning of multi-grained semantics. Fine-grained semantics are captured through sparse meta-embeddings, which dynamically balance embedding uniqueness and memory constraints. To ensure their sparseness, we rely on initialization methods such as sparse principal component analysis combined with a soft thresholding activation function. Moreover, we propose a weight-bridging update strategy that aligns coarse-grained meta-embedding with several fine-grained meta-embeddings based on the underlying semantic properties of users and items. Comprehensive experiments demonstrate that our method outperforms existing baselines. The code of our proposal is available at https://***/htyjers/C2F-MetaEmbed.
With the widespread use of blockchain technology for smart contracts and decentralized applications on the Ethereum platform, the blockchain has become a cornerstone of trust in the modern financial system. However, i...
详细信息
With the widespread use of blockchain technology for smart contracts and decentralized applications on the Ethereum platform, the blockchain has become a cornerstone of trust in the modern financial system. However, its anonymity has provided new ways for Ponzi schemes to commit fraud, posing significant risks to investors. Current research still has some limitations, for example, Ponzi schemes are difficult to detect in the early stages of smart contract deployment, and data imbalance is not considered. In addition, there is room for improving the detection accuracy. To address the above issues, this paper proposes LT-SPSD (LSTM-Transformer smart Ponzi schemes detection), which is a Ponzi scheme detection method that combines Long Short-Term Memory (LSTM) and Transformer considering the time-series transaction information of smart contracts as well as the global information. Based on the verified smart contract addresses, account features, and code features are extracted to construct a feature dataset, and the SMOTE-Tomek algorithm is used to deal with the imbalanced data classification problem. By comparing our method with the other four typical detection methods in the experiment, the LT-SPSD method shows significant performance improvement in precision, recall, and F1-score. The results of the experiment confirm the efficacy of the model, which has some application value in Ethereum Ponzi scheme smart contract detection.
Embodied visual exploration is critical for building intelligent visual agents. This paper presents the neural exploration with feature-based visual odometry and tracking-failure-reduction policy(Ne OR), a framework f...
详细信息
Embodied visual exploration is critical for building intelligent visual agents. This paper presents the neural exploration with feature-based visual odometry and tracking-failure-reduction policy(Ne OR), a framework for embodied visual exploration that possesses the efficient exploration capabilities of deep reinforcement learning(DRL)-based exploration policies and leverages feature-based visual odometry(VO) for more accurate mapping and positioning results. An improved local policy is also proposed to reduce tracking failures of feature-based VO in weakly textured scenes through a refined multi-discrete action space, keyframe fusion, and an auxiliary task. The experimental results demonstrate that Ne OR has better mapping and positioning accuracy compared to other entirely learning-based exploration frameworks and improves the robustness of feature-based VO by significantly reducing tracking failures in weakly textured scenes.
A fuzzy visual image denoising algorithm based on Bayesian estimation is proposed to address the problems of poor denoising performance and long denoising time in traditional image denoising algorithms. First, analyse...
详细信息
Improving website security to prevent malicious online activities is crucial,and CAPTCHA(Completely Automated Public Turing test to tell computers and Humans Apart)has emerged as a key strategy for distinguishing huma...
详细信息
Improving website security to prevent malicious online activities is crucial,and CAPTCHA(Completely Automated Public Turing test to tell computers and Humans Apart)has emerged as a key strategy for distinguishing human users from automated ***-based CAPTCHAs,designed to be easily decipherable by humans yet challenging for machines,are a common form of this ***,advancements in deep learning have facilitated the creation of models adept at recognizing these text-based CAPTCHAs with surprising *** our comprehensive investigation into CAPTCHA recognition,we have tailored the renowned UpDown image captioning model specifically for this *** approach innovatively combines an encoder to extract both global and local features,significantly boosting the model’s capability to identify complex details within CAPTCHA *** the decoding phase,we have adopted a refined attention mechanism,integrating enhanced visual attention with dual layers of Long Short-Term Memory(LSTM)networks to elevate CAPTCHA recognition *** rigorous testing across four varied datasets,including those from Weibo,BoC,Gregwar,and Captcha 0.3,demonstrates the versatility and effectiveness of our *** results not only highlight the efficiency of our approach but also offer profound insights into its applicability across different CAPTCHA types,contributing to a deeper understanding of CAPTCHA recognition technology.
Knowledge graphs(KGs) effectively mitigate data sparsity in recommendation systems(RSs) by providing valuable auxiliary information [1]. However, traditional centralized KG-based RSs increase the risk of user privacy ...
Knowledge graphs(KGs) effectively mitigate data sparsity in recommendation systems(RSs) by providing valuable auxiliary information [1]. However, traditional centralized KG-based RSs increase the risk of user privacy *** learning(FL) enhances RS's privacy by enabling model training on decentralized data [2]. Although integrating KG and FL can address both data sparsity and privacy issues in RSs [3], several challenges persist. CH1,Each client's local model relies on a consistent global model from the server, limiting personalized deployment to endusers.
Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing de...
详细信息
Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing details about the speaker’s goals and desires, as well as their internal condition. Certain vocal characteristics reveal the speaker’s mood, intention, and motivation, while word study assists the speaker’s demand to be understood. Voice emotion recognition has become an essential component of modern HCC networks. Integrating findings from the various disciplines involved in identifying vocal emotions is also challenging. Many sound analysis techniques were developed in the past. Learning about the development of artificial intelligence (AI), and especially Deep Learning (DL) technology, research incorporating real data is becoming increasingly common these days. Thus, this research presents a novel selfish herd optimization-tuned long/short-term memory (SHO-LSTM) strategy to identify vocal emotions in human communication. The RAVDESS public dataset is used to train the suggested SHO-LSTM technique. Mel-frequency cepstral coefficient (MFCC) and wiener filter (WF) techniques are used, respectively, to remove noise and extract features from the data. LSTM and SHO are applied to the extracted data to optimize the LSTM network’s parameters for effective emotion recognition. Python Software was used to execute our proposed framework. In the finding assessment phase, Numerous metrics are used to evaluate the proposed model’s detection capability, Such as F1-score (95%), precision (95%), recall (96%), and accuracy (97%). The suggested approach is tested on a Python platform, and the SHO-LSTM’s outcomes are contrasted with those of other previously conducted research. Based on comparative assessments, our suggested approach outperforms the current approaches in vocal emotion recognition.
A Brain Tumors are highly dangerous illnesses that significantly reduce the life expectancy of patients. The classification of brain tumors plays a crucial role in clinical diagnosis and effective treatment. The misdi...
详细信息
A Brain Tumors are highly dangerous illnesses that significantly reduce the life expectancy of patients. The classification of brain tumors plays a crucial role in clinical diagnosis and effective treatment. The misdiagnosis of brain tumors will result in wrong medical intercession and reduce chance of survival of patients Precisely diagnosing brain tumors is of utmost importance for devising suitable treatment plans that can effectively cure and improve the quality of life for patients afflicted with this condition. To tackle this challenge, present a framework that harnesses deep convolutional layers to automatically extract crucial and resilient features from the input data. Systems that use computers and with the help of convolutional neural networks have provided huge success stories in early detection of tumors. In our framework, utilize VGG19 model combined with fuzzy logic type-2 where used fuzzy logic type-2 that applied to enhancement the images brain where Type-2 fuzzy logic better handles uncertainty in medical images, improving the interpretability of image enhancement by managing noise and subtle differences with greater precision than Type-1 fuzzy logic for MRI images often contain ambiguous or low-contrast areas where noise, lighting conditions different and greatly improve accuracy. while used the VGG19 architecture to feature extraction and classify Tumor and non- Tumor. This approach enhances the accuracy of tumors classification, aiding in the development of targeted treatment strategies for patients. The method is trained on the Br35H dataset, resulting in a training accuracy of 0.9983 % and Train loss of 0.2118 while the validation accuracy of 0.9953 % validation loss of 0.2264. This demonstrates effective pattern learning and generalization capabilities. The model achieves outstanding accuracy, with a best accuracy for the model of 0.9983 %, While the test accuracy of the model reached of 99 %, and both of sensitivity and specificity at 0.9967
The evolving field of Alzheimer’s disease(AD)diagnosis has greatly benefited from deep learning models for analyzing brain magnetic resonance(MR)*** study introduces Dynamic GradNet,a novel deep learning model design...
详细信息
The evolving field of Alzheimer’s disease(AD)diagnosis has greatly benefited from deep learning models for analyzing brain magnetic resonance(MR)*** study introduces Dynamic GradNet,a novel deep learning model designed to increase diagnostic accuracy and interpretability for multiclass AD ***,four state-of-the-art convolutional neural network(CNN)architectures,the self-regulated network(RegNet),residual network(ResNet),densely connected convolutional network(DenseNet),and efficient network(EfficientNet),were comprehensively compared via a unified preprocessing pipeline to ensure a fair *** these models,EfficientNet consistently demonstrated superior performance in terms of accuracy,precision,recall,and F1 *** a result,EfficientNetwas selected as the foundation for implementing Dynamic *** GradNet incorporates gradient weighted class activation mapping(GradCAM)into the training process,facilitating dynamic adjustments that focus on critical brain regions associated with early dementia *** adjustments are particularly effective in identifying subtle changes associated with very mild dementia,enabling early diagnosis and *** model was evaluated with the OASIS dataset,which contains greater than 80,000 brain MR images categorized into four distinct stages of AD *** proposed model outperformed the baseline architectures,achieving remarkable generalizability across all *** findingwas especially evident in early-stage dementia detection,where Dynamic GradNet significantly reduced false positives and enhanced classification *** findings highlight the potential of Dynamic GradNet as a robust and scalable approach for AD diagnosis,providing a promising alternative to traditional attention-based *** model’s ability to dynamically adjust spatial focus offers a powerful tool in artificial intelligence(AI)assisted precisionmedicine,particularly in the early det
Deep learning algorithms have been rapidly incorporated into many different applications due to the increase in computational power and the availability of massive amounts of ***,both deep learning and ensemble learni...
详细信息
Deep learning algorithms have been rapidly incorporated into many different applications due to the increase in computational power and the availability of massive amounts of ***,both deep learning and ensemble learning have been used to recognize underlying structures and patterns from high-level features to make predictions/*** the growth in popularity of deep learning and ensemble learning algorithms,they have received significant attention from both scientists and the industrial community due to their superior ability to learn features from big *** deep learning has exhibited significant performance in enhancing learning generalization through the use of multiple deep learning *** ensemble deep learning has large quantities of training parameters,which results in time and space overheads,it performs much better than traditional ensemble *** deep learning has been successfully used in several areas,such as bioinformatics,finance,and health *** this paper,we review and investigate recent ensemble deep learning algorithms and techniques in health care domains,medical imaging,health care data analytics,genomics,diagnosis,disease prevention,and drug *** cover several widely used deep learning algorithms along with their architectures,including deep neural networks(DNNs),convolutional neural networks(CNNs),recurrent neural networks(RNNs),and generative adversarial networks(GANs).Common healthcare tasks,such as medical imaging,electronic health records,and genomics,are also ***,in this review,the challenges inherent in reducing the burden on the healthcare system are discussed and ***,future directions and opportunities for enhancing healthcare model performance are discussed.
暂无评论