Search Results - Inner Mongolia University Library

Deep features and metaheuristics guided optimization-based method for breast cancer diagnosis

Multimedia Tools and Applications, 2025, vol. 84, no. 16, pp. 16683-16707

Authors: Asad, Emon; Mollah, Ayatullah Faruk; Basu, Subhadip; Chakraborti, Tapabrata. Affiliations: Department of Computer Science and Engineering, Aliah University, Kolkata 700160, India; Department of Computer Science and Engineering, Jadavpur University, Kolkata 700032, India; The Alan Turing Institute and University College London, London, United Kingdom

Breast cancer is one of the most prevalent cancer types and the second leading cause of death among women. Fortunately, early diagnosis and treatment of breast cancer reduce mortality rates and significantly improve quality of life. In this work, a computer-aided three-stage pipeline for the diagnosis of breast cancer from mammogram images has been proposed. Firstly, an attention-aided VGG16 network is applied for deep feature extraction from mammogram images. Then, the extracted features are passed to the adaptive beta hill climbing-aided whale optimization algorithm to select the most informative features. Lastly, the features from stage 2 are evaluated again by the grey wolf optimizer combined with the adaptive beta hill climbing algorithm to select the best subsets of features, and cancers are then identified using the k-nearest neighbours classifier. The proposed three-stage method has achieved an average accuracy of 97.63% and has reduced the original feature set by 81.78%. Considering that well-annotated imaging data is a costly resource for modern data-intensive deep networks, this work shows that a fairly lightweight off-the-shelf network like VGG-16 can be coupled with a robust feature selection policy to outperform several recent approaches which employ more data-intensive learning strategies with larger networks. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

Keywords: Mammography
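
The feature-selection stages described in the abstract above are wrapper methods: a candidate feature subset is scored by how well a k-nearest neighbours classifier performs on it. The sketch below illustrates only that scoring step, in plain Python. It is not the authors' implementation: the weighted fitness form, the `alpha` weight, the leave-one-out evaluation, and the names `knn_loo_accuracy` and `fitness` are all assumptions chosen for illustration, and the whale and grey wolf optimizers that would search over masks are not shown.

```python
def knn_loo_accuracy(X, y, mask, k=1):
    """Leave-one-out k-NN accuracy using only the features where mask is 1."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0  # an empty subset cannot classify anything
    correct = 0
    for i, xi in enumerate(X):
        # Squared Euclidean distance to every other sample on the kept columns.
        dists = sorted(
            (sum((xi[j] - xo[j]) ** 2 for j in cols), y[o])
            for o, xo in enumerate(X)
            if o != i
        )
        votes = [label for _, label in dists[:k]]
        predicted = max(set(votes), key=votes.count)
        if predicted == y[i]:
            correct += 1
    return correct / len(X)


def fitness(X, y, mask, alpha=0.99):
    """Lower is better: weighted sum of error rate and fraction of features kept.

    The weighting is a hypothetical choice, not taken from the abstract.
    """
    error = 1.0 - knn_loo_accuracy(X, y, mask)
    kept_ratio = sum(mask) / len(mask)
    return alpha * error + (1 - alpha) * kept_ratio


# Toy data: feature 0 separates the classes, feature 1 is large-scale noise.
X = [[0.0, 5.0], [0.1, -5.0], [1.0, 5.2], [1.1, -5.1]]
y = [0, 0, 1, 1]
print(fitness(X, y, [1, 0]), fitness(X, y, [1, 1]))
```

On this toy data the mask that drops the noisy feature scores better than the mask that keeps both, which is the behaviour a metaheuristic search would exploit.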

A Huffman-based short message service compression technique using adjacent distance array

International Journal of Information and Communication Technology, 2024, vol. 25, no. 2, pp. 118-136

Authors: Sarker, Pranta; Rahman, Mir Lutfur. Affiliations: Department of Computer Science and Engineering, North East University Bangladesh, Telihaor, Sheikhghat, Sylhet 3100, Bangladesh; Department of Computer Science, University of Hertfordshire, Hertfordshire, United Kingdom

The short message service (SMS) is a wireless medium of transmission that allows you to send brief text messages. Cell phone devices have a maximum SMS capacity of 1,120 bits in the traditional system. Moreover, the conventional SMS employs seven bits for each character, allowing at most 160 characters to be transmitted in an SMS text message. This research demonstrated that an SMS message could contain more than 200 characters by representing each character with around five bits, introducing a data structure, namely, the adjacent distance array (ADA), using the Huffman principle. Following the concept of lossless data compression, the proposed method generates each character's codeword using standard Huffman coding. However, the ADA encodes the message by storing the ASCII value distances of all characters, and decoding proceeds without traversing the whole Huffman tree, which is the pivotal contribution of the research to develop an effective SMS compression technique for personal digital assistants (PDAs). The encoding and decoding processes have been discussed and contrasted with the conventional SMS text message system; across all evaluated outcomes, the proposed ADA technique outperforms the conventional system. Copyright © 2024 Inderscience Enterprises Ltd.

Keywords: Personal digital assistants
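
The capacity figures in the abstract above follow from simple arithmetic on the 1,120-bit payload, and the per-character cost of a standard Huffman code can be measured directly. The sketch below shows both in Python. It covers only textbook Huffman coding; the adjacent distance array (ADA) is not reproduced, since the abstract does not specify its layout. The names `max_chars`, `huffman_code_lengths`, and `encoded_bits` are hypothetical helpers.

```python
import heapq
from collections import Counter

SMS_PAYLOAD_BITS = 1120  # payload size stated in the abstract


def max_chars(bits_per_char, payload_bits=SMS_PAYLOAD_BITS):
    """Largest whole number of characters that fits in the payload."""
    return payload_bits // bits_per_char


def huffman_code_lengths(text):
    """Return {symbol: codeword length in bits} for a standard Huffman code."""
    freq = Counter(text)
    if len(freq) == 1:
        return {next(iter(freq)): 1}  # a single symbol still needs one bit
    # Heap entries: (weight, unique tie-breaker, symbols in this subtree).
    heap = [(w, i, [s]) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    lengths = dict.fromkeys(freq, 0)
    tie = len(heap)
    while len(heap) > 1:
        w1, _, s1 = heapq.heappop(heap)
        w2, _, s2 = heapq.heappop(heap)
        for s in s1 + s2:
            lengths[s] += 1  # each merge pushes these symbols one level deeper
        heapq.heappush(heap, (w1 + w2, tie, s1 + s2))
        tie += 1
    return lengths


def encoded_bits(text):
    """Total bits needed to encode text with its own Huffman code."""
    lengths = huffman_code_lengths(text)
    return sum(lengths[ch] for ch in text)


print(max_chars(7))             # conventional 7-bit characters
print(max_chars(5))             # about five bits per character
print(encoded_bits("aaaabbc"))  # bits for a small sample message
```

At five bits per character the same payload holds 224 characters, which is consistent with the abstract's claim of more than 200.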

Computer vision in smart agriculture and precision farming: Techniques and applications

Artificial Intelligence in Agriculture, 2024, vol. 13, no. 3, pp. 64-83

Authors: Sumaira Ghazal; Arslan Munir; Waqar S. Qureshi. Affiliations: Department of Computer Science, Kansas State University, Manhattan, KS 66506, USA; Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA; School of Computer Science, University of Galway, Galway H91 TK33, Ireland

The transformation of age-old farming practices through the integration of digitization and automation has sparked a revolution in agriculture that is driven by cutting-edge computer vision and artificial intelligence (AI)*** transformation not only promises increased productivity and economic growth, but also has the potential to address important global issues such as food security and *** survey paper aims to provide a holistic understanding of the integration of vision-based intelligent systems in various aspects of precision *** providing a detailed discussion on key areas of digital life cycle of crops, this survey contributes to a deeper understanding of the complexities associated with the implementation of vision-guided intelligent systems in challenging agricultural *** focus of this survey is to explore widely used imaging and image analysis techniques being utilized for precision farming *** paper first discusses various salient crop metrics used in digital *** this paper illustrates the usage of imaging and computer vision techniques in various phases of digital life cycle of crops in precision agriculture, such as image acquisition, image stitching and photogrammetry, image analysis, decision making, treatment, and *** establishing a thorough understanding of related terms and techniques involved in the implementation of vision-based intelligent systems for precision agriculture, the survey concludes by outlining the challenges associated with implementing generalized computer vision models for real-time deployment of fully autonomous farms.

Keywords: Digital agriculture; Computer vision; Smart agriculture; Image analysis; Vision-guided intelligent systems

A Systematic Literature Review on the Security Attacks and Countermeasures Used in Graphical Passwords

IEEE Access, 2024, vol. 12, pp. 53408-53423

Authors: Por, Lip Yee; Ng, Ian Ouii; Chen, Yen-Lin; Yang, Jing; Ku, Chin Soon. Affiliations: Universiti Malaya, Faculty of Computer Science and Information Technology, Department of Computer System and Technology, Kuala Lumpur 50603, Malaysia; National Taipei University of Technology, Department of Computer Science and Information Engineering, Taipei 106344, Taiwan; Universiti Tunku Abdul Rahman, Department of Computer Science, Kampar 31900, Malaysia

This systematic literature review delves into the dynamic realm of graphical passwords, focusing on the myriad security attacks they face and the diverse countermeasures devised to mitigate these threats. The core objective of this paper is to identify existing security threats to graphical password schemes and the corresponding countermeasures developed to mitigate these attacks. The study process begins by identifying the usable databases and search engines to locate all the relevant resources. The inclusion and exclusion criteria were carefully selected to prioritize the study, focusing mostly on attacks and countermeasures related to graphical password schemes between 2009 and 2023. After a thorough identification and selection process, 59 studies met all the criteria. Among these studies, 47 mentioned shoulder surfing as a threat to graphical password schemes, while 20 discussed brute force attacks. Additionally, there were 21 papers on dictionary attacks, 13 on smudge attacks, spyware attacks, and social engineering, and 19 that discussed guessing attacks as threats to graphical password schemes. Furthermore, the papers identified several other attacks, including frequency of occurrence analysis attacks, video recording, eavesdropping, computer vision, sonar, and image gallery attacks, with the corresponding numbers of papers being 9, 17, 5, 2, 2, and 1, respectively. The results also highlight the countermeasures proposed in the study papers to mitigate the aforementioned attacks. Among the various countermeasures identified, most revolve around randomization, obfuscation, and password space complexity as the most commonly used techniques for enhancing the security of graphical password schemes. © 2013 IEEE.

Keywords: Authentication

Diluie: constructing diverse demonstrations of in-context learning with large language model for unified information extraction

Neural Computing and Applications, 2024, vol. 36, no. 22, pp. 13491-13512

Authors: Guo, Qian; Guo, Yi; Zhao, Jin. Affiliations: Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai 200237, China; Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University, Shanghai 200433, China

Large language models (LLMs) have demonstrated promising in-context learning capabilities, especially with instructive prompts. However, recent studies have shown that existing large models still face challenges in specific information extraction (IE) tasks. Moreover, it could have more effectively utilized various prompts such as instruction tuning, diverse demonstrations of in-context learning, and long-range token sequences for assisting language modeling in understanding context. In this study, we propose DILUIE, a unified information extraction framework based on in-context learning with diverse demonstration examples. DILUIE is encoded with an EVA attention mechanism and incremental encoding technology. Based on the constructed diverse demonstrations, we expand the size of instances efficiently in both instruction tuning and in-context learning to gain insights into the potential benefits of utilizing diverse information extraction datasets. To deepen the understanding of context, we further design three auxiliary tasks to assist in aligning contextual semantics. Experimental results demonstrate that DILUIE achieves 2.23 and 2.53% improvements in terms of Micro-/Macro-F1 on average relative to the current state-of-the-art baseline, which also significantly outperforms the GPT-3.5-turbo in zero-shot settings, and the average token length of achieving the best performance over tasks is around 15k. Furthermore, we observe that in-context learning shows enhanced performance when provided with more demonstrations during multiple-shot instruction tuning (8k). Additionally, increasing the length of instructions (10k) can result in a more substantial improvement in the upper limits of scaling for in-context learning. Code is available on https://***/Phevos75/DILUIE. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

Keywords: Demonstrations

Robust video question answering via contrastive cross-modality representation learning

Science China (Information Sciences), 2024, vol. 67, no. 10, pp. 211-226

Authors: Xun YANG; Jianming ZENG; Dan GUO; Shanshan WANG; Jianfeng DONG; Meng WANG. Affiliations: School of Information Science and Technology, University of Science and Technology of China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center; School of Computer Science and Information Engineering, Hefei University of Technology; Institutes of Physical Science and Information Technology, Anhui University; School of Computer Science and Technology, Zhejiang Gongshang University

Video question answering (VideoQA) is a challenging yet important task that requires a joint understanding of low-level video content and high-level textual semantics. Despite the promising progress of existing efforts, recent studies revealed that current VideoQA models mostly tend to over-rely on the superficial correlations rooted in the dataset bias while overlooking the key video content, thus leading to unreliable results. Effectively understanding and modeling the temporal and semantic characteristics of a given video for robust VideoQA is crucial but, to our knowledge, has not been well investigated. To fill the research gap, we propose a robust VideoQA framework that can effectively model the cross-modality fusion and enforce the model to focus on the temporal and global content of videos when making a QA decision instead of exploiting the shortcuts in datasets. Specifically, we design a self-supervised contrastive learning objective to contrast the positive and negative pairs of multimodal input, where the fused representation of the original multimodal input is enforced to be closer to that of the intervened input based on video perturbation. We expect the fused representation to focus more on the global context of videos rather than some static keyframes. Moreover, we introduce an effective temporal order regularization to enforce the inherent sequential structure of videos for video representation. We also design a Kullback-Leibler divergence-based perturbation invariance regularization of the predicted answer distribution to improve the robustness of the model against temporal content perturbation of videos. Our method is model-agnostic and can be easily compatible with various VideoQA backbones. Extensive experimental results and analyses on several public datasets show the advantage of our method over the state-of-the-art methods in terms of both accuracy and robustness.

Keywords: video question answering; cross-modality fusion; contrastive learning; cross-media reasoning
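
The Kullback-Leibler regularization mentioned in the abstract above penalizes a model when its predicted answer distribution changes after the video is perturbed. The Python sketch below shows one way such a term could be computed from two sets of answer logits. It is an illustration under stated assumptions, not the paper's loss: the direction of the divergence (original relative to perturbed), the `eps` smoothing, and the names `softmax`, `kl_divergence`, and `perturbation_invariance_loss` are all hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) = sum_i p_i * log(p_i / q_i), smoothed by eps."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def perturbation_invariance_loss(logits_original, logits_perturbed):
    """Divergence between answer distributions before and after perturbation."""
    p = softmax(logits_original)
    q = softmax(logits_perturbed)
    return kl_divergence(p, q)


# Identical predictions cost nothing; diverging predictions are penalized.
print(perturbation_invariance_loss([2.0, 0.5, -1.0], [2.0, 0.5, -1.0]))
print(perturbation_invariance_loss([2.0, 0.5, -1.0], [-1.0, 0.5, 2.0]))
```

Minimizing this term alongside the main QA loss would push the model toward answers that stay stable under the perturbation.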

A Low Resource Multi-lingual Simultaneous Script Identification and Text Recognition Model

SN Computer Science, 2024, vol. 5, no. 6, p. 740

Authors: Mukherjee, Jayati; Roy, Utpal. Affiliations: Computer Science and Engineering, Academy of Technology; Department of Computer and System Sciences, Visva-Bharati

In this paper, we have proposed a multi-task learning model for multi-lingual Optical Character Recognition. Our model performs script identification and text recognition simultaneously on offline machine-printed documents. We have extracted the spatial and temporal features of a line image using a combination of several CNN and BLSTM layers. The features are shared between the script identification and text recognition modules. A fully connected layer and softmax identify the script. The identified script works as a case selector for the text recognizer, which is a CTC layer. Finally, the text is identified by the text recognizer. The model is applied to two public datasets, ISIDDI and RETAS, containing degraded Bengali pages and English pages. We have created a dataset of Devnagari/Hindi and Tamil scripts to test our model. The model has achieved 99.2% accuracy for script recognition. The text recognition accuracies achieved on the Bengali, English, Hindi, and Tamil scripts are 91.68%, 97.07%, 95.68% and 92.27%, respectively. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024.

Keywords: Deep learning; Multi-task learning; Optical character recognition

Minimal Context-Switching Data Race Detection with Dataflow Tracking

Journal of Computer Science & Technology, 2024, vol. 39, no. 1, pp. 211-226

Authors: 郑龙; 李洋; 辛杰; 刘海峰; 郑然; 廖小飞; 金海. Affiliations: National Engineering Research Center for Big Data Technology and System, School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China; Services Computing Technology and System Laboratory, School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China; Cluster and Grid Computing Laboratory, School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China

Data race is one of the most important concurrent anomalies in multi-threaded *** constraint-based techniques are leveraged into race detection, which is able to find all the races that can be found by any other sound race ***, this constraint-based approach has serious limitations on helping programmers analyze and understand data ***, it may report a large number of false positives due to the unrecognized dataflow propagation of the ***, it recommends a wide range of thread context switches to schedule the reported race (including the false one) whenever this race is exposed during the constraint-solving *** ad hoc recommendation imposes too many context switches, which complicates the data race *** address these two limitations in the state-of-the-art constraint-based race detection, this paper proposes DFTracker, an improved constraint-based race detector to recommend each data race with minimal thread context ***, we reduce the false positives by analyzing and tracking the dataflow in the *** this means, DFTracker thus reduces the unnecessary analysis of false race *** further propose a novel algorithm to recommend an effective race schedule with minimal thread context switches for each data *** experimental results on the real applications demonstrate that 1) without removing any true data race, DFTracker effectively prunes false positives by 68% in comparison with the state-of-the-art constraint-based race detector; 2) DFTracker recommends as low as 2.6-8.3 (4.7 on average) thread context switches per data race in the real world, which is 81.6% fewer context switches per data race than the state-of-the-art constraint based race ***, DFTracker can be used as an effective tool to understand the data race for programmers.

Keywords: data race; satisfiability modulo theory; multi-threaded program; dynamic detection

Impact of optimizers functions on detection of Melanoma using transfer learning architectures

Multimedia Tools and Applications, 2025, vol. 84, no. 14, pp. 13787-13807

Authors: Kılıçarslan, Serhat; Aydın, Hatice Aktas; Adem, Kemal; Yılmaz, Esra Kavalcı. Affiliations: Software Engineering, Bandırma Onyedi Eylul University, Balıkesir, Turkey; Computer Engineering, Sivas University of Science and Technology, Sivas, Turkey; Computer Engineering, Sivas Cumhuriyet University, Sivas, Turkey

Early diagnosis and treatment of melanoma is very important because of its dangerous nature and rapid spread. When diagnosed correctly and early, the recovery rate of patients increases significantly. Physical methods are not sufficient for diagnosis and classification. The aim of this study is to use a hybrid method that combines different deep learning methods in the classification of melanoma and to investigate the effect of the optimizers used in deep learning methods on classification performance. In the study, melanoma detection was carried out from skin lesion images through a simulation created with the deep learning architectures DenseNet, InceptionV3, ResNet50, InceptionResNetV2 and MobileNet and seven optimizers: SGD, Adam, RmsProp, AdaDelta, AdaGrad, Adamax and Nadam. The results of the study show that SGD has better and more stable performance in terms of convergence rate, training speed and performance than other optimizers. In addition, the momentum parameter added to the structure of the SGD optimizer reduces the oscillation and training time compared to other functions. It was observed that the best melanoma detection among the combined methods was achieved using the DenseNet model and SGD optimizer with a test accuracy of 0.949, test sensitivity 0.9403, and test F score 0.9492. © The Author(s) 2024.

Keywords: Deep learning

A Deep Learning-Based Salient Feature-Preserving Algorithm for Mesh Simplification

Computers, Materials & Continua, 2025, vol. 83, no. 5, pp. 2865-2888

Authors: Jiming Lan; Bo Zeng; Suiqun Li; Weihan Zhang; Xinyi Shi. Affiliations: Sichuan Key Provincial Research Base of Intelligent Tourism, Sichuan University of Science and Engineering, Zigong 644005, China; School of Computer Science and Engineering, Sichuan University of Science and Engineering, Zigong 644005, China

The Quadric Error Metrics (QEM) algorithm is a widely used method for mesh simplification; however, it often struggles to preserve high-frequency geometric details, leading to the loss of salient *** address this limitation, we propose the Salient Feature Sampling Points-based QEM (SFSP-QEM), also referred to as the Deep Learning-Based Salient Feature-Preserving Algorithm for Mesh Simplification, which incorporates a Salient Feature-Preserving Point Sampler (SFSP). This module leverages deep learning techniques to prioritize the preservation of key geometric features during *** results demonstrate that SFSP-QEM significantly outperforms traditional QEM in preserving geometric ***, for general models from the Stanford 3D Scanning Repository, which represent typical mesh structures used in mesh simplification benchmarks, the Hausdorff distance of simplified models using SFSP-QEM is reduced by an average of 46.58% compared to those simplified using traditional *** customized models such as the Zigong Lantern used in cultural heritage preservation, SFSP-QEM achieves an average reduction of 28.99% in Hausdorff ***, the running time of this method is only 6% longer than that of traditional QEM while significantly improving the preservation of geometric *** results demonstrate that SFSP-QEM is particularly effective for applications requiring high-fidelity simplification while retaining critical features.

Keywords: Deep learning; mesh simplification; quadric error metrics (QEM); salient feature preservation; point sampling
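
For readers unfamiliar with the baseline named in the abstract above, the classic quadric error metric scores a candidate vertex position by the sum of its squared distances to a set of planes. The Python sketch below shows only that traditional error term. The SFSP sampler and its deep learning component are not reproduced, and the function names are hypothetical. Each plane is assumed to be given as (a, b, c, d) with a unit normal, so that ax + by + cz + d is the signed distance of a point from the plane.

```python
def plane_quadric(a, b, c, d):
    """4x4 quadric p * p^T for the plane ax + by + cz + d = 0 (unit normal)."""
    p = (a, b, c, d)
    return [[pi * pj for pj in p] for pi in p]


def add_quadrics(q1, q2):
    """Quadrics are additive: the sum scores distance to both plane sets."""
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(q1, q2)]


def quadric_error(q, point):
    """v^T Q v for the homogeneous point v = (x, y, z, 1)."""
    v = (*point, 1.0)
    return sum(v[i] * q[i][j] * v[j] for i in range(4) for j in range(4))


# Two planes meeting at a vertex: z = 0 and x = 0.
q = add_quadrics(plane_quadric(0.0, 0.0, 1.0, 0.0),
                 plane_quadric(1.0, 0.0, 0.0, 0.0))

# Moving the vertex to (3, 0, 4) costs 4^2 (from z = 0) plus 3^2 (from x = 0).
print(quadric_error(q, (3.0, 0.0, 4.0)))
```

In edge-collapse simplification, the collapse with the lowest such error is typically performed first, which is why flat regions are simplified aggressively while sharp detail can be lost without extra guidance.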
