Website security detection is important for Internet security. Existing machine learning-based malicious UrL detection methods have a low accuracy and weak generalization ability. Thus, we proposed a new multi-feature...
详细信息
In the past decade, Fake Base Stations (FBS) have been consistently employed by criminals to target mobile users through spam text messages. despite the introduction of several techniques to mitigate this problem, spa...
详细信息
dynamic metasurface antenna (dMA) array is a new type of array structure with low complexity and low cost, which is emerging as a promising technique for next-generation wireless networks. In this paper, a downlink mu...
详细信息
Multi-modal Named Entity recognition (MNEr) aims to better identify meaningful textual entities by integrating information from images. Previous work has focused on extracting visual semantics at a fine-grained level,...
详细信息
Multi-modal Named Entity recognition (MNEr) aims to better identify meaningful textual entities by integrating information from images. Previous work has focused on extracting visual semantics at a fine-grained level, or obtaining entity related external knowledge from knowledge bases or Large Language Models (LLMs). However, these approaches ignore the poor semantic correlation between visual and textual modalities in MNErdatasets anddo not explore different multi-modal fusion approaches. In this paper, we present MMAVK, a multi-modal named entity recognition model with auxiliary visual knowledge and word-level fusion, which aims to leverage the Multi-modal Large Language Model (MLLM) as an implicit knowledge base. It also extracts vision-based auxiliary knowledge from the image for more accurate and effective recognition. Specifically, we propose vision-based auxiliary knowledge generation, which guides the MLLM to extract external knowledge exclusively derived from images to aid entity recognition by designing target-specific prompts, thus avoiding redundant recognition and cognitive confusion caused by the simultaneous processing of image-text pairs. Furthermore, we employ a word-level multi-modal fusion mechanism to fuse the extracted external knowledge with each word-embedding embedded from the transformer-based encoder. Extensive experimental results demonstrate that MMAVK outperforms or equals the state-of-the-art methods on the two classical MNErdatasets, even when the large models employed have significantly fewer parameters than other baselines.
Aiming at the shortcomings of existing user profile construction algorithms that do not fully utilize contextual structural information and multidimensional feature information., a user profile construction model base...
详细信息
For internet companies, multi-dimensional KPIs (Key Performance Indicators) are crucial monitoring data and it is essential for operations and maintenance personnel to rapidly and precisely identify the attribute comb...
详细信息
In the method of sleep staging task, the artificial sleep staging process is complicated, while the traditional machine learning only realizes feature classification andrelies on manual feature extraction. This paper...
详细信息
Virtual Power Plant (VPP) calculation is a problem of searching optima in high-dimensional space. Its basic optimal solving methodology is to trim uninteresting regions of the search space where global optima are unli...
详细信息
In operation and maintenance of wind turbines, predictive maintenance is widely used to reduce the downtime and low operating efficiency due to failure of a wind turbine, in which the condition of a wind turbine is co...
详细信息
To meet the needs of future powersystemdevelopment and safe, stable, and high-quality operation, it is proposed to build a new generation of dispatching technology support system. The new generation dispatching tech...
详细信息
暂无评论