检索结果-内蒙古大学图书馆

Sparse point spread function-based multi-image optical encryption

COMMUNICATIONS PHYSICS 2025年第1期8卷 1-9页

作者： Xu, Ning Qi, Dalong Cheng, Long Pan, Zhen Zhou, Chengyu Lin, Wenzhang Ma, Hongmei Yao, Yunhua Shen, Yuecheng Deng, Lianzhong Sun, Zhenrong Zhang, Shian East China Normal Univ Sch Phys & Elect Sci State Key Lab Precis Spect Shanghai Peoples R China East China Normal Univ Joint Res Ctr Light Manipulat Sci & Photon Integra Shanghai Peoples R China Shanxi Univ Collaborat Innovat Ctr Extreme Opt Taiyuan Peoples R China

Multi-image optical encryption (MOE) has demonstrated promising potential in image data protection owing to its parallel processing capability and abundant degrees of freedom. However, existing methods suffer from either low compression ratios or stringent experimental conditions, such as accurate calibration of phase modulation, precise manufacturing of encryption elements, and no ambient light interference. This work introduces a lensless sparse point spread function-based multi-image optical encryption (sPSF-MOE) technique that addresses these challenges and enhances performance. In the encryption process, each plaintext image is encoded using a sparsely distributed PSF with specifically designed geometric shapes through spatial phase engineering. The resulting ciphertexts are superimposed to produce a compressed ciphertext. During decryption, an iterative algorithm recovers encrypted images with improved reconstruction quality. We show that sPSF-MOE ensures high fidelity for binary (gray-scale) images at a compression ratio of 12 (6) and resists autocorrelation-based attacks. Integrating principal component analysis (PCA) into decryption preserves image high fidelity under ambient light interference. sPSF-MOE reduces the bandwidth requirement for data transmission while ensuring data integrity.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

A Multi-staining Digital Pathology image Registration Method Based on Global and Local Computing 27

A Multi-staining Digital Pathology Image Registration Method...

引用

27th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and parallel/distributed Computing (SNPD)

作者： Ding, Yi Zhang, Xin Lu, Cheng Commun Univ China Sch Comp & Cyber Sci Beijing Peoples R China Guangdong Prov Peoples Hosp Dept Radiol Guangzhou Peoples R China

ISBN: (纸本)9798350391961;9798350391954

To accurately evaluate the patient's condition, medical workers usually need to register multiple pathological images of the lesion site samples. Using computer technology to assist in registration work can effectively improve the efficiency of doctors analyzing pathological images. One of the most advanced methods currently is the Virtual Alignment of Pathology image Series method, which is a multi-staining digital pathology image registration method that combines global and local calculations. However, this method may encounter certain biases when processing images with significant angle differences. Through a detailed analysis of this method, this article proposes an improvement plan which optimizes the acquisition of non-rigid registration mask images, enabling the method to obtain mask images more reasonably and achieve better registration results for images with significant angle differences. This provides more accurate judgment basis and helps doctors diagnose and develop treatment plans more accurately.

关键词： digital pathological images global rigid registration local non-rigid registration feature point detection

来源：评论

学校读者我要写书评

暂无评论

Log-Scale Quantization in distributed First-Order methods: Gradient-Based Learning From distributed Data

引用

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2025年 22卷 10948-10959页

作者： Doostmohammadian, Mohammadreza Qureshi, Muhammad I. Khalesi, Mohammad Hossein Rabiee, Hamid R. Khan, Usman A. Semnan Univ Fac Mech Engn Semnan *** Iran Tufts Univ Dept Elect & Comp Engn Medford MA 02155 USA Sharif Univ Technol Comp Engn Dept Tehran *** Iran

Decentralized strategies are of interest for learning from large-scale data over networks. This paper studies learning over a network of geographically distributed nodes/agents subject to quantization. Each node possesses a private local cost function, collectively contributing to a global cost function, which the considered methodology aims to minimize. In contrast to many existing papers, the information exchange among nodes is log-quantized to address limited network-bandwidth in practical situations. We consider a first-order computationally efficient distributed optimization algorithm (with no extra inner consensus loop) that leverages node-level gradient correction based on local data and network-level gradient aggregation only over nearby nodes. This method only requires balanced networks with no need for stochastic weight design. It can handle log-scale quantized data exchange over possibly time-varying and switching network setups. We study convergence over both structured networks (for example, training over data-centers) and ad-hoc multi-agent networks (for example, training over dynamic robotic networks). Through experimental validation, we show that (i) structured networks generally result in a smaller optimality gap, and (ii) log-scale quantization leads to a smaller optimality gap compared to uniform quantization. Note to Practitioners-Motivated by recent developments in cloud computing, parallel processing, and the availability of low-cost CPUs and communication networks, this paper considers distributed and decentralized algorithms for machine learning and optimization. These algorithms are particularly relevant for decentralized data mining, where data sets are distributed across a network of computing nodes. A practical example of this is the classification of images over a networked data centre. In real-world scenarios, practical model nonlinearities such as data quantization must be addressed for information exchange among the computing nodes. T

关键词： Quantization (signal) Convergence Optimization Cost function Costs Heuristic algorithms distributed databases Machine learning algorithms Ad hoc networks Training distributed algorithm data classification quantization graph theory optimization

来源：评论

学校读者我要写书评

暂无评论

FERI: Feature Enhancement and Relational Interaction for image-text Matching 30

FERI: Feature Enhancement and Relational Interaction for Ima...

引用

30th IEEE International Conference on parallel and distributed Systems, ICPADS 2024

作者： Zhang, Yu Zhang, Jianqiang Song, Gongpeng Lu, Qin Zhao, Shuo Key Laboratory of Computing Power Network and Information Security Ministry of Education Shandong Computer Science Center Jinan China Shandong Engineering Research Center of Big Data Applied Technology Faculty of Computer Science and Technology Jinan China Shandong Fundamental Research Center for Computer Science Shandong Provincial Key Laboratory of Computer Networks Jinan China Shandong Branch of China Mobile Communication Group Design Institute Co. Jinan China

ISBN: (纸本)9798331515966

image-text matching is an important problem at the intersection of computer vision and natural language processing. It aims to establish the semantic link between image and text to achieve high-quality semantic alignment between the two modalities. However, the existing methods have the problem that the meaning expressed in the image or the complex narrative in the text cannot be fully understood due to insufficient feature extraction. Moreover, due to the essential modal differences between images and texts, how to effectively and accurately align the semantic contents in images and texts has become the key of research. In order to solve the above problems, this paper proposes a method based on feature enhancement and relationship interaction. When processing images, the proposed method fuses labeled features, region features and location features to represent images. When processing text, a combination of Bi-GRU and self-attention mechanism is used to represent the text. In order to further align the semantic content in images and texts accurately, this paper improves two relational interaction mechanisms by identifying connection relationships and learning association relationships. Thus, the relation enhanced embedding is obtained. Finally, it calculated the similarity of the enhanced embedding to judge the matching degree of the image and text. Extensive experiments on the public datasets Flickr30K and MSCOCO demonstrate the effectiveness of our method. © 2024 IEEE.

关键词： cross-modal retrieval feature enhancement image-text matching relationship interaction

来源：评论

学校读者我要写书评

暂无评论

image Compression and Reconstruction Based on Quantum Network

Image Compression and Reconstruction Based on Quantum Networ...

引用

1st International Conference on Smart Energy Systems and Artificial Intelligence (SESAI)

作者： Ji, Xun Liu, Qin Huang, Shan Chen, Andi Wu, Shengjun Nanjing Univ Sch Phys Natl Lab Solid State Microstruct Nanjing 210093 Peoples R China Nanjing Univ Collaborat Innovat Ctr Adv Microstruct Nanjing 210093 Peoples R China Nanjing Univ Inst Brain Sci Nanjing 210023 Peoples R China Nanjing Univ Kuang Yarning Honors Sch Nanjing 210023 Peoples R China Univ Sci & Technol China Hefei Natl Lab Hefei 230088 Peoples R China

ISBN: (纸本)9798350364613;9798350364606

Quantum network is an emerging type of network structure that leverages the principles of quantum mechanics to transmit and process information. Compared with classical data reconstruction algorithms, quantum networks make image reconstruction more efficient and accurate. They can also process more complex image information using fewer bits and faster parallel computing capabilities. Therefore, this paper will discuss image reconstruction methods based on our quantum network and explore their potential applications in image processing. We will introduce the basic structure of the quantum network, the process of image compression and reconstruction, and the specific parameter training method. Through this study, we can achieve a classical image reconstruction accuracy of 97.57%. Our quantum network design will introduce novel ideas and methods for image reconstruction in the future.

关键词： image compression and reconstruction quantum network parameter training

来源：评论

学校读者我要写书评

暂无评论

Improving Visual Question Answering by image Captioning

引用

IEEE ACCESS 2025年 13卷 46299-46311页

作者： Shao, Xiangjun Dong, Hongsong Wu, Guangsheng Hunan Univ Arts & Sci Sch Comp & Elect Engn Changde 415000 Peoples R China Key Lab Hunan Prov Control Technol Distributed Ele Changde 415000 Peoples R China Wuhan Univ Sch Comp Sci Wuhan 430072 Peoples R China Lyuliang Univ Dept Comp Sci Luliang 033000 Peoples R China Xinyu Univ Sch Math & Comp Sci Xinyu 338004 Peoples R China

Visual Question Answering (VQA) is a challenging task that bridges the computer vision and natural language processing communities. It provide natural language answers to questions related to an associated image. Most existing VQA methods focus on the fusion and inference of visual features with the textual question. However, visual features often lack the necessary semantic information required to answer the questions accurately. To address this limitation, we propose a novel approach called Question-Guided parallel Attention (QGPA), which effectively leverages the semantic information provided by an embedded image captioning model to answer related questions. First, we introduce an Attention-Aware (AA) mechanism that extends the traditional attention mechanism, helping to filter out incorrect or irrelevant information during answer prediction. Second, QGPA incorporates AA, which simultaneously utilizes visual features and semantic information from the embedded image captioning model to answer questions. Experiments results demonstrate that the accuracy of "Overall" of our proposed model delivers 72.57% and 72.55% on the test-dev and test-std split set of VQA-v2.0 dataset, respectively, which outperforms most existing VQA methods. The experiment results and ablation studies demonstrate that the proposed method has good performance.

关键词： Visualization Semantics Long short term memory Question answering (information retrieval) Predictive models Accuracy Knowledge based systems Feature extraction Information filters Context modeling Deep learning image captioning multimodal learning visual question answering

来源：评论

学校读者我要写书评

暂无评论

RETRACTED: A distributed submerged object detection and classification enhancement with deep learning (Retracted Article)

引用

distributed AND parallel DATABASES 2023年第1-2期41卷 161-162页

作者： Madhan, E. S. Kannan, K. S. Rani, P. Shobha Rani, J. Vakula Anguraj, Dinesh Kumar SRM Inst Sci & Technol Sch Comp Kattankulathur 603203 Tamil Nadu India CMR Engn Coll Dept Comp Sci & Engn Hyderabad 501401 India RMD Engn Coll Dept Comp Sci & Engn Kavaraipettai 601206 India CMR Inst Technol Dept MCA Bengaluru Karnataka India Koneru Lakshmaiah Educ Fdn Dept Comp Sci & Engn Vaddeswaram Andhra Pradesh India

Research in the autonomous underwater detection system has become rapidly increasing in Ocean Technology. In a recent object detection research study, there a need to enhance the quality, which needs to handle submerged object image processing techniques and a lot of demand to develop an intelligent vision system to improve the Blurred images and low-quality illumination. Manual research in undersea water leads to more significant pressures and complex environments in cost and workforce. It is necessary to develop a high acceptable autonomous image quality system to upgrade image quality. This paper proposed two approaches: (i) Gray shade and Max-RGB filter techniques to improve image quality. (ii) For optimization and low illumination problem modified Convolution Neural Technique (CNN) incorporated for classification and detection. Moreover, our proposed model has compared with Single-shot Detector (SDD), You Only Look Once (Yolo), Fast RCNN, Faster RCNN to uphold the quality detection found objects. This research article aids to found real-time underwater objects classification and detection. It helps to incorporate an Autonomous operation Vehicle (AOV) underwater research. Our experiment results show detection runs speed as 30 FPs (Frame per second).

关键词： Object detection Computer vision Convolution neural network RGB filter

来源：评论

学校读者我要写书评

暂无评论

Two-Dimensional parallel Spatio-Temporal Pyramid Pooling for Hand Gesture Recognition

引用

IEEE ACCESS 2023年 11卷 133755-133766页

作者： Jafari, Farzaneh Basu, Anup Univ Alberta Dept Comp Sci Edmonton AB T6G 2E8 Canada

Hand Gesture Recognition (HGR) plays a crucial role in user-friendly interactions between humans and computers. In recent years, using the Convolutional Neural Network (CNN) has improved the accuracy of image processing problems. Inspired by the high recognition rate of CNN and its efficiency, we propose a model for hand gesture recognition based on CNN and evaluate the results using images with plain and complex backgrounds. Recognizing different hand signs by Two-Dimensional parallel Spatio-Temporal Pyramid Pooling (2DPSTPP) features with deep learning methods reduces the size of the map, minimizes training complexity, and by paying attention to more details, improves detection performance. The effectiveness of the proposed method is evaluated using regular cross-validation tests on six datasets, namely American Sign Language (ASL), the NUS hand posture dataset I, the NUS hand posture dataset ii, the digits dataset, the hand gesture dataset, and the leap gesture recognition dataset.

关键词： ASL dataset convolutional neural network (CNN) hand gesture detection hand gesture dataset NUS hand posture dataset I NUS hand posture dataset ii two-dimensional parallel spatio-temporal pyramid pooling (2DPSTPP)

来源：评论

学校读者我要写书评

暂无评论

Spatial-Frequency Integration Network with Dual Prompt Learning for Few-shot image Classification 22

Spatial-Frequency Integration Network with Dual Prompt Learn...

引用

22nd IEEE International Symposium on parallel and distributed processing with Applications, ISPA 2024

作者： Liu, Yi He, Jie Xu, Shoukun Jiang, Guangqi Changzhou University Academy of Computing and Artificial Intelligence Jiangsu Changzhou213164 China

ISBN: (纸本)9798331509712

Few-shot image classification is a challenging task that aims to recognize image classes based on only a few training images. However, existing methods face the following two main challenges: (1) Ignoring the frequency domain information during image feature extraction. (2) It does not take the semantic gap between multiple modalities into consideration, which limits the classification performance. To overcome these limitations, we propose a novel method named Spatial-Frequency Integration Network with Dual Prompt Learning for few-shot image classification. Firstly, we introduce a spatial-frequency integration module that combines spatial domain and low-frequency information to extract discriminative image features from the image modality. Secondly, we design a dual prompting module, which integrates learnable prompts and hand-crafted prompts to improve the generalization of applications to new classes. Thirdly, we propose an image-text interaction module to enhance inter-modal complementary and consistency. Both theoretical and experimental validations confirm the effectiveness of the proposed method in few-shot image classification. © 2024 IEEE.

关键词： Zero-shot learning

来源：评论

学校读者我要写书评

暂无评论

Scalable SubXPCA on a distributed Platform Applied to Classification of Hyperspectral images 8

Scalable SubXPCA on a Distributed Platform Applied to Classi...

引用

8th IEEE International Conference on Information and Communication Technology, CICT 2024

作者： Rupa, Bogolu Negi, Atul School of Computer and Information Sciences University of Hyderabad India Vardhaman College of Engineering Department of Computer Science and Engineering Telangana Hyderabad India

ISBN: (纸本)9798331531539

The exponential growth of technological advancements in satellite and airborne remote sensing is giving rise to large volumes of high-dimensional hyperspectral image data. Apache Spark is one of the most popular, extensively used and open-source distributed processing frameworks, which is proven effective in processing large volumes of remotely sensed hyperspectral images in a time-efficient manner. Open-source distributed processing frameworks have proven effective in processing large volumes of remotely sensed hyperspectral images quickly and efficiently. While computational power has been increasing, the rate of data accumulation is more than the processing capabilities. Therefore, more efficient algorithms such as dimensionality reduction are needed to process and get accurate performance for the application. This paper proposes an efficient and parallel spectral dimensionality reduction approach based on feature partitioning principal component analysis called scalable SubXPCA. We implemented scalable SubXPCA on a spark cluster distributed environment. We compared scalable SubXPCA against other distributed feature partitioning and various non-feature partitioning dimensionality reduction methods. Our experiments on different real and synthetic datasets of hyperspectral images confirm that SubXPCA classification performance is not only better than its competitors but also that the running time of SubXPCA is better in distributed processing than serial processing. As the size of the hyperspectral image dataset increased, SubXPCA showed a speed up factor of 5.7× and more in the spark cluster compared to the serial version. © 2024 IEEE.

关键词： Dimensionality reduction

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：