检索结果-内蒙古大学图书馆

Automated object recognition in high-resolution optical remote sensing imagery

National science Review 2023年第6期10卷 38-41页

作者： Yazhou Yao Tao Chen Hanbo Bi Xinhao Cai Gensheng Pei Guoye Yang Zhiyuan Yan Xian Sun Xing Xu Hai Zhang School of Computer Science and Engineering Nanjing University of Science and Technology Aerospace Information Research Institute Chinese Academy of Sciences School of Electronic Electrical and Communication Engineering University of Chinese Academy of Sciences Key Laboratory of Network Information System Technology (NIST) Aerospace Information Research Institute Chinese Academy of Sciences Department of Computer Science and Technology Tsinghua University School of Computer Science and Engineering University of Electronic Science and Technology of China Pazhou Laboratory (Huangpu) School of Mathematics Northwest University

INTRODUCTION With the rapid development of remote sensing technology,high-quality remote sensing images have become widely *** automated object detection and recognition of these images,which aims to automatically locate objects of interest in remote sensing images and distinguish their specific categories,is an important fundamental task in the *** provides an effective means for geospatial object monitoring in many social applications,such as intelligent transportation,urban planning,environmental monitoring and homeland security.

关键词：

来源：评论

学校读者我要写书评

暂无评论

OCRBench: on the hidden mystery of OCR in large multimodal models

引用

science China(information sciences) 2024年第12期67卷 23-35页

作者： Yuliang LIU Zhang LI Mingxin HUANG Biao YANG Wenwen YU Chunyuan LI Xu-Cheng YIN Cheng-Lin LIU Lianwen JIN Xiang BAI School of Artificial Intelligence and Automation Huazhong University of Science and Technology School of Electronic and Information Engineering South China University of Technology Microsoft Research School of Computer & Communication Engineering University of Science and Technology Beijing Institute of Automation Chinese Academy of Sciences School of Software Engineering Huazhong University of Science and Technology

Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In this paper, we conducted a comprehensive evaluation of large multimodal models, such as GPT4V and Gemini, in various text-related visual tasks including text recognition, scene text-centric visual question answering(VQA), document-oriented VQA, key information extraction(KIE), and handwritten mathematical expression recognition(HMER). To facilitate the assessment of optical character recognition(OCR) capabilities in large multimodal models, we propose OCRBench, a comprehensive evaluation benchmark. OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available. Furthermore, our study reveals both the strengths and weaknesses of these models, particularly in handling multilingual text, handwritten text, non-semantic text, and mathematical expression *** importantly, the baseline results presented in this study could provide a foundational framework for the conception and assessment of innovative strategies targeted at enhancing zero-shot multimodal *** evaluation pipeline and benchmark are available at https://***/Yuliang-Liu/Multimodal OCR.

关键词： large multimodal model OCR text recognition scene text-centric VQA document-oriented VQA key information extraction handwritten mathematical expression recognition

来源：评论

学校读者我要写书评

暂无评论

Explainable AI for Pancreatic Cancer Prediction and Survival Prognosis: An Interpretable Deep Learning and Machine Learning Approach

Informatica (Slovenia)

引用

Informatica (Slovenia) 2024年第4期48卷 623-640页

作者： Srinidhi, B. Bhargavi, M.S. Department of Computer Science and Engineering Bangalore Institute of Technology Bengaluru India

Pancreatic cancer's devastating impact and low survival rates call for improved detection methods. While Artificial Intelligence has shown remarkable progress, its increasing complexity has led to "black box" models, hindering their acceptance in critical fields like healthcare. To address this, Explainable Artificial Intelligence (XAI) has gained traction, aiming to create transparent AI systems. In this study, we propose a comprehensive approach that combines the power of Deep Learning for pancreatic cancer detection using Computed Tomography (CT) images and Machine Learning (ML) for survival prognosis based on clinical data. By leveraging CT images with Deep learning models such as Convolutional Neural Networks, VGG-16 and DenseNet-201, effective diagnosis of Pancreatic Cancer is achieved and comprehensive insights into the tumor's spatial characteristics are obtained. DenseNet-201 outperformed the other models in terms of accuracy and interpretability with a predictive accuracy of 95%. The integration of ML techniques such as Stochastic Gradient Descent, Naïve Bayes and Extra Tree classifiers with clinical data predicts the chances of survival, providing vital information for treatment planning and personalized care. To validate the model's accuracy and interpretability, a comprehensive XAI validation is conducted using state-of-the-art techniques like Local Interpretable Model-agnostic Explanations and Shapley Additive Explanations. These methods provide localized explanations for predictions, allowing clinicians to understand risk and survival chances. This study holds immense potential to aid healthcare professionals in diagnosis, prognosis, and personalized treatment strategies, contributing to enhanced patient outcomes in the fight against pancreatic cancer. © 2024 Slovene Society Informatika. All rights reserved.

关键词： Stochastic systems

来源：评论

学校读者我要写书评

暂无评论

A Lightweight Object Detection Algorithm for Remote Sensing Images

引用

engineering Letters 2025年第3期33卷 704-711页

作者： Hou, Donghao Zhang, Yujun Ren, Jia School of Computer and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Electronic and Information Engineering University of Science and Technology Liaoning Anshan114051 China

With the continuous advancement of satellite technology, remote sensing images has been increasingly applied in fields such as urban planning, environmental monitoring, and disaster response. However, remote sensing images often feature small target sizes and complex backgrounds, posing significant computational challenges for object detection tasks. To address this issue, this paper proposes a lightweight remote sensing images object detection algorithm based on YOLOv9. The proposed algorithm incorporates the SimRMB module, which effectively reduces computational complexity while improving the efficiency and accuracy of feature extraction. Through a dynamic attention mechanism, SimRMB is capable of focusing on important regions while minimizing background interference, and by integrating residual learning and skip connections, it ensures the stability of deep networks. To further enhance detection performance, the FasterRepNCSPELAN4 module is introduced, which employs PConv operations to reduce computational load and memory usage. It also utilizes dilated convolutions and DFC attention mechanisms to strengthen feature extraction, thereby increasing the efficiency and accuracy of object detection. Additionally, this study integrates the GhostModuleV2 module, which generates core feature maps and employs lightweight operations to create redundant features, greatly reducing the computational complexity of *** results show that on the SIMD dataset, the improved YOLOv9 model has a parameter size of 167.88 MB and GFLOPs of 208.6. Compared to the baseline YOLOv9 model (parameter size: 194.57 MB, GFLOPs: 239.0), the parameter size is reduced by 13.71%, GFLOPs are reduced by 12.72%, and detection accuracy is improved by 1.4%. These results demonstrate that the proposed lightweight YOLOv9 model effectively reduces computational overhead while maintaining excellent detection performance, providing an efficient solution for object detection tasks in resou

关键词： Satellite imagery

来源：评论

学校读者我要写书评

暂无评论

Socially Aware V2X Localized QoS

引用

IEEE Internet of Things Journal 2024年第15期11卷 25925-25938页

作者： Kaliski, Rafael Han, Yue-Hua National Sun Yat-sen University Department of Computer Science and Engineering Kaohsiung804 Taiwan Academia Sinica Research Center for Information Technology Innovation Taipei115 Taiwan National Taiwan University of Science and Technology Department of Computer Science and Information Engineering Taipei106 Taiwan

Vehicle to Everything (V2X) is a core 5G technology. V2X and its enabler, Device-to-Device (D2D), are essential for the Internet of Things (IoT) and the Internet of Vehicles (IoV). V2X enables vehicles to communicate with other vehicles (V2V), networks (V2N), and infrastructure (V2I). While V2X enables ubiquitous vehicular connectivity, the impact of bursty data on the network's overall Quality of Service (QoS), such as when a vehicle accident occurs, is often ignored. In this work, we study both 4G and 5G V2X utilizing Evolved Universal Terrestrial Radio Access New Radio (E-UTRA-NR) and propose the use of socially aware 5G NR dual connectivity (en-DC) for traffic differentiation. We also propose localized QoS, wherein high-priority QoS flows traverse 5G road side units (RSUs) and normal-priority QoS flows traverse 4G base station (BS). We formulate a max-min fair QoS-aware nonorthogonal multiple access (NOMA) resource allocation scheme, QoS reclassify. QoS reclassify enables localized QoS and traffic steering to mitigate bursty network traffic's impact on the network's overall QoS. We then solve QoS reclassify via integer linear programming (ILP) and derive its approximation. We demonstrate that both optimal and approximation QoS reclassify resource allocation schemes in our socially aware QoS management methodology outperform socially unaware legacy 4G V2X algorithms (no localized QoS support, no traffic steering) and socially aware 5G V2X (no localized QoS support, yet utilizes traffic steering). Our proposed QoS reclassify scheme's QoS flow end-to-end latency requires only approx 15% of the time legacy 4G V2X requires. © 2014 IEEE.

关键词： Quality of service

来源：评论

学校读者我要写书评

暂无评论

DLP4DA-RPL: A Distributed Lightweight Protocol for Detection and Avoidance of Discarded DIO and DAO Attacks on RPL Routing Protocol in IoT

引用

IEEE Sensors Journal 2025年第12期25卷 22880-22894页

作者： Deepavathi, P. Mala, C. Department of Computer Science and Engineering National Institute of Technology Tiruchirappalli 620015 India

The Internet of Things (IoT) occupies the entire world in its hands. IoT devices have a resource-constrained nature known as Low Power and Lossy Networks (LLN). The Routing Protocol for Low Power and Lossy Networks (RPL) is provided by the Internet engineering Task Force (IETF) group to secure the IoT networks. The control messages Destination Oriented Directed Acyclic Graph (DODAG) information Object (DIO) and DODAG Advertisement Object (DAO) play a crucial part in RPL. Attackers focused on these control messages to degrade the performance of IoT networks and slowly bring them to a halt. To overcome these problems, this paper proposes a DLP4DA-RPL protocol to detect and avoid Discarded DIO (DDIO) and Discarded DAO (DDAO) control message attacks. The Contiki operating system simulates this proposed protocol using the Cooja simulator and this proposed protocol is implemented in our college environment. It is inferred from the simulation and real-time results that the proposed DLP4DA-RPL protocol outperforms the existing RPL protocols, such as RPL with Attacks, RPL without Attacks, SecTrust-RPL, and DDoS-RPL concerning End-to-end Delay, Energy Consumption, Packet Delivery Ratio, Throughput and Network Performance. © 2025 IEEE.

关键词： Routing protocols

来源：评论

学校读者我要写书评

暂无评论

Comprehensive overview of Alzheimer's disease utilizing Machine Learning approaches

引用

Multimedia Tools and Applications 2024年第37期83卷 85277-85329页

作者： Kumar, Rahul Azad, Chandrashekhar Department of Computer Science and Engineering National Institute of Technology Jamshedpur India

Alzheimer's disease is a common and complex brain disorder that primarily affects the elderly. Because it is progressing and has few effective therapies, it requires a thorough understanding of the condition;our study offers a comprehensive analysis of AD with a dual approach that combines both bibliometric and experimental analyses. The bibliometric analysis applies statistical and mathematical techniques to figure out the states of AD research, including publishing trends, prestigious journals, and collaborative networks. Concurrently, the experimental examination explores current advancements, focusing on Machine Learning, Deep Learning, and Metaheuristic approaches, tackling complex issues resulting from varied datasets. The experimental work is fascinating because it uses twenty classifiers and two datasets, initially without feature selection and then with seven feature selection techniques. This thorough investigation focuses on developments in disease processes, therapeutic approaches, and diagnostic tool development. This research offers a multidimensional overview of Alzheimer's disease by combining bibliometric and experimental methods, addressing problems and highlighting the shortcomings of earlier studies. By helping academics, policymakers, and healthcare professionals navigate the complexities of AD, this novel methodology advances a more thorough understanding of the Alzheimer's disease domain. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Speech processing

来源：评论

学校读者我要写书评

暂无评论

Applying sentiment analysis in social web for smart decision support marketing

引用

Journal of Ambient Intelligence and Humanized Computing 2024年第3期15卷 1927-1936页

作者： Wu, Shih-Jung Chiang, Rui-Dong Chang, Han-Chi Department of Innovative Information and Technology Tamkang University New Taipei Taiwan Department of Computer Science and Information Engineering Tamkang University New Taipei Taiwan

Because of the rapid development of communication and service in Taiwan, competition among telecommunication companies has become ever fiercer. Differences in marketing strategy usually become the key factor in keeping existing customers while attracting new ones. Although electronic word-of-mouth (e-WOM) is one of the most important pieces of information to a consumer making a purchase decision, very few articles on opinion mining have discussed and compared the relationship between multifaceted word-of-mouth (WOM) and marketing strategy. In this paper, we use our Chinese opinion-mining system (Wu et al. in J Supercomput 73:2987–3001, 2017) not only to retrieve articles related to 4G and conduct reputation analysis but also to discuss the relation between WOM and marketing strategy. The results show that (1) e-WOM can immediately and directly reflect the results of marketing strategy, and (2) although users are primarily concerned with aspects of price, online speed, and signal quality, for most Taiwanese customers, price is the key in choosing a telecommunication company. Moreover, although this paper used 4G-related articles from June 2014 to June 2015 for analysis, the results are consistent with the Taiwanese telecommunication companies’ current marketing strategy of attracting customers through low pricing. © Springer-Verlag GmbH Germany, part of Springer Nature 2018.

关键词： Sentiment analysis

来源：评论

学校读者我要写书评

暂无评论

Single- and Dual-Beam Pattern Reconfigurable Patch Antenna With Dual Switchable CP for Internet of Vehicles

引用

IEEE Internet of Things Journal 2025年第12期12卷 21904-21914页

作者： Wang, Shi-Tong Zhu, Lei University of Macau Department of Electrical and Computer Engineering Faculty of Science and Technology China

In this article, a novel method is proposed to facilitate the design of compact, low-profile, pattern reconfigurable antennas with fixed or switchable circular polarization (CP) for Internet of Vehicles (IoV) applications. The proposed method eliminates the need for a complex beamforming or phase-tuning network by innovatively exploiting four resonant modes of a square patch antenna. Specifically, the TM01 and TM10 modes are utilized to accomplish the single-beam CP pattern, while the dual-beam CP pattern is achieved by combining the TM11 and TM02 modes. An electric current model is developed to investigate the far-field radiation, revealing that the radiation pattern of TM11 mode can be effectively reshaped from quad-beam to dual-beam by manipulating the surface current distribution. The complete design method is verified by an antenna prototype working at 3.5 GHz, which consists of a radiative patch, a pair of open-ended stubs, a pair of slots, and four equivalent switchable pins. Different from the conventional pin-loading method, the proposed equivalent switchable pins attain excellent CP performance during switching by compensating for the detrimental electric coupling. The prototype is fabricated and measured. Its radiation can be switched between single-beam (7.8 dBic) and dual-beam CP pattern (3.5 dBic), exhibiting an overlapped CP coverage from −123° to 73°. Furthermore, another prototype with the extended switchable CP capability is realized. Without increasing the number of p-i-n diodes, this prototype achieves four operating states with independent pattern (single- and dual-beam) and polarization (right- and left-handed CP) reconfiguration. The merits of compact size, low profile, and flexible reconfigurability make the proposed method and design cases excellent candidates for IoV applications and intelligent transportation systems (ITSs). © 2014 IEEE.

关键词： Circular polarization

来源：评论

学校读者我要写书评

暂无评论

引用

Multimedia Tools and Applications 2024年 1-21页

作者： Gorai, Joy Shaw, Dilip Kumar Department of Computer Science and Engineering National Institute of Technology Jamshedpur India

As internet use in communication networks has grown, fake news has become a big problem. The misleading heading of the news loses the trust of the reader. Many techniques have emerged, but they fail because fraudsters or exploiters find new ways to deceive them. Semantic analysis and machine learning techniques play a significant role in fake news detection. We must semantically assess terms used in the headline and main content before filtration because words can have different meanings in different contexts. In the paper, a method for determining fake news is introduced by calculating the dissimilarity between the title and content of the news. Vector distance calculators are used to extract semantic dissimilarities, which were then utilized as an additional feature. Initially, Term frequency-inverse document frequency (Tf-Idf) and Mutual information (MI) are employed for text Feature Extraction (FE) on the ‘title’ and ’content’ of the news articles. Subsequently, four different vector distance calculators are used to extract vector distance-based features. The resulting distance values are used to train various machine learning classifiers, achieving the highest accuracy of 99%. Our method provides a comprehensive analysis by capturing diverse aspects of semantic dissimilarity from various distance calculators. The proposed method is then compared with previous techniques to demonstrate its effectiveness. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：