检索结果-内蒙古大学图书馆

1st International Conference on Data Engineering and machine Intelligence, ICDEMI 2023

作者： Dhinesh, A. Sumathy, P. Department of Computer Science Bharathidasan University Tamilnadu Tiruchirappalli620023 India

ISBN: (纸本)9789819776153

Remote Sensing image Captioning (RSIC) is crucial for many researchers since it has many applications in environmental monitoring, disaster management, urban planning, image retrieval, performance of building planes, military intelligence, and autonomous vehicles. The effective procedure to generate the captions from remote sensing images complements the above-mentioned application domains. Various baseline data sets have been created by the researchers to enhance the quality of captioning by processing the diverse features of the geospatial information. In this paper, we have technically reviewed important literature that follow different algorithms for generating the captions. For example, we have presented the technical review on vision-Language Aligning Paradigm (VLCA) under the bi-lingual caption generation model, Joint-Training Two-Stage (JTTS) technique under multimodel fusion category, Multilevel and Contextual Attention Network (MLCA-Net) under context-aware captioning, LEVIR-CC belongs to transfer learning model, BERT and GPT-3 models belong to transfer-based model, Multiscale Attention (MSA) and Multifeat Attention (MFA) of Multiscale captioning model and Summarization Driven (SD)-RSIC of fine-grained captioning model. We have also presented the performance of each of these methods on various benchmark datasets. For evaluation, different well-known performance metrics are considered. The result is critically evaluated and commented on. In the future, a more rigorous review of these methods along with other relevant methods will be presented along with implementation data. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

关键词： Urban planning

来源：评论

学校读者我要写书评

暂无评论

Real-Time Object Detection and Tracking Design Using Deep Learning with Spatial–Temporal Mechanism for Video Surveillance applications 10th

Real-Time Object Detection and Tracking Design Using Deep Le...

引用

10th International Conference on Innovations in Computer Science and Engineering, ICICSE 2022

作者： Kusuma, T. Ashwini, K. Global Academy of Technology Bangalore India

ISBN: (纸本)9789811974540

We propose a CNN-based framework for "real-time object detection and tracking using deep learning" in this paper, which includes a spatial–temporal mechanism. The impact of efficient data on performance benchmarks in terms of accuracy has changed. The data processing is handled by industry buzzwords: deep learning (DL) and computer vision (CV). The CNN-based framework uses the single object tracker value to match arrival models and find targets in the next frame. Simply applying single object tracking to multiple object tracking will encounter problems in computational efficiency and results due to occlusion. In this paper, we introduce a "spatial attention mechanism (STAM)" to manage occlusion bias and target interaction. Object tracking is a sensational technology in image processing with great future implications. Multiple object tracking (MOT) has seen an extensive boom in the last few years due to machine learning, deep learning, computer vision, and more. This paper aims to provide an object tracking software solution. Using YOLO’s "You Only Look Once" technology with the help of Tensor flow, the system is geared toward object detection, tracking, and counting. Proven, effective detection and tracking on various dataset. Algorithms that offer real-time, accurate, and precise identifications appropriate for real-time applications. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

Advancements in low light image enhancement techniques and recent applications

引用

JOURNAL OF VISUAL COMMUNICATION AND image REPRESENTATION 2024年 103卷

作者： Anoop, P. P. Deivanathan, R. Vellore Inst Technol Sch Mech Engn Chennai 600127 India

Low-light image enhancement is an effective solution for improving image recognition by both humans and machines. Due to low illuminance, images captured in such conditions possess less color information compared to those taken in daylight, resulting in occluded images characterized by distortion, low contrast, low brightness, a narrow gray range, and noise. Low-light image enhancement techniques play a crucial role in enhancing the effectiveness of object detection. This paper reviews state-of-the-art low-light image enhancement techniques and their developments in recent years. Techniques such as gray transformation, histogram equalization, defogging, Retinex, image fusion, and wavelet transformation are examined, focusing on their working principles and assessing their ability to improve image quality. Further discussion addresses the contributions of deep learning and cognitive approaches, including attention mechanisms and adversarial methods, to image enhancement.

关键词： Computer vision Low-light image enhancement Deep learning image processing machine learning image quality assessment

来源：评论

学校读者我要写书评

暂无评论

Evolving Convolutional Neural Networks with Meta-Heuristics for Transfer Learning in Computer vision 3

Evolving Convolutional Neural Networks with Meta-Heuristics ...

引用

3rd International Conference on Evolutionary Computing and Mobile Sustainable Networks, ICECMSN 2023

作者： Srilakshmi, V. Kiran, G. Uday Mounika, M. Sravanthi, A. Sravya, N.V.K. Akhil, V.N.S. Manasa, M. B V Raju Institute of Technology Telangana Narsapur India

In the rapidly evolving landscape of computer vision and artificial intelligence, transfer learning has emerged as a powerful tool for efficiently applying pre-trained models to new tasks. This article delves into the intriguing concept of evolving Convolutional Neural Networks (CNNs) with meta-heuristics for transfer learning in computer vision. The primary focus is on enhancing the adaptability and efficiency of CNNs, making them better suited for specialized tasks. The article covers the significance of transfer learning, the challenges faced in transfer learning with CNNs, the basics of CNN architecture, and the role of meta-heuristics in optimizing CNNs. Real-world applications and success stories demonstrate the transformative potential of these techniques in fields like medical image analysis and autonomous vehicles. It explores emerging trends and potential developments in the domain, emphasizing the impact on various sectors, including healthcare, natural language processing, and robotics. The promise of evolving CNNs with meta-heuristics lies in their capacity to tackle intricate problems with greater precision, ultimately reshaping the landscape of artificial intelligence and machine learning. Ongoing research ensures a promising future for this amalgamation of technologies, promising breakthroughs that will have a lasting impact on the world of computer vision and beyond. © 2023 Elsevier B.V.. All rights reserved.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Towards machine learning for heterogeneous inverse scattering in 3D microscopy

引用

OPTICS EXPRESS 2022年第6期30卷 9854-9868页

作者： Wertheimer, Zsolt-Alon Bar, Chen Levin, Anat Technion Israel Inst Technol Dept Elect Engn Haifa Israel

Light propagating through a nonuniform medium scatters as it interacts with particles with different refractive properties such as cells in the tissue. In this work we aim to utilize this scattering process to learn a volumetric reconstruction of scattering parameters, in particular particle densities. We target microscopy applications where coherent speckle effects are an integral part of the imaging process. We argue that the key for successful learning is modeling realistic speckles in the training process. To this end, we build on the development of recent physically accurate speckle simulators. We also explore how to incorporate speckle statistics, such as the memory effect, in the learning framework. Overall, this paper contributes an analysis of multiple aspects of the network design including the learning architecture, the training data and the desired input features. We hope this study will pave the road for future design of learning based imaging systems in this challenging domain. (C) 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

关键词： image processing Imaging systems Inverse scattering Speckle noise Speckle reduction Three dimensional microscopy

来源：评论

学校读者我要写书评

暂无评论

Outlier-Robust Estimation: Hardness, Minimally Tuned Algorithms, and applications

引用

IEEE TRANSACTIONS ON ROBOTICS 2022年第1期38卷 281-301页

作者： Antonante, Pasquale Tzoumas, Vasileios Yang, Heng Carlone, Luca MIT Lab Informat & Decis Syst Cambridge MA 02139 USA Univ Michigan Dept Aerosp Engn Ann Arbor MI 48109 USA

Nonlinear estimation in robotics and vision is typically plagued with outliers due to wrong data association or incorrect detections from signal processing and machine learning methods. This article introduces two unifying formulations for outlier-robust estimation, generalized maximum consensus ($\textrm{G}$-$\textrm{MC}$) and generalized truncated least squares ($\textrm{G-TLS}$), and investigates fundamental limits, practical algorithms, and applications. Our first contribution is a proof that outlier-robust estimation is inapproximable: In the worst case, it is impossible to (even approximately) find the set of outliers, even with slower-than-polynomial-time algorithms (particularly, algorithms running in quasi-polynomial time). As a second contribution, we review and extend two general-purpose algorithms. The first, adaptive trimming ($\textrm{ADAPT}$), is combinatorial and is suitable for $\textrm{G}$-$\textrm{MC}$;the second, graduated nonconvexity ($\textrm{GNC}$), is based on homotopy methods and is suitable for $\textrm{G-TLS}$. We extend $\textrm{ADAPT}$ and $\textrm{GNC}$ to the case where the user does not have prior knowledge of the inlier-noise statistics (or the statistics may vary over time) and is unable to guess a reasonable threshold to separate inliers from outliers (as the one commonly used in RANdom SAmple Consensus $(\textrm{RANSAC})$. We propose the first minimally tuned algorithms for outlier rejection, which dynamically decide how to separate inliers from outliers. Our third contribution is an evaluation of the proposed algorithms on robot perception problems: mesh registration, image-based object detection (shape alignment), and pose graph optimization. $\textrm{ADAPT}$ and $\textrm{GNC}$ execute in real time, are deterministic, outperform $\textrm{RANSAC}$, and are robust up to 80-90% outliers. Their minimally tuned versions also compare favorably with the state of the art, even though they do not rely on a noise bound for the inliers.

关键词： Estimation Signal processing algorithms Probabilistic logic Approximation algorithms Simultaneous localization and mapping Particle measurements Measurement uncertainty Algorithms autonomous systems computational complexity computer vision maximum likelihood estimation resilient perception robust estimation

来源：评论

学校读者我要写书评

暂无评论

A comprehensive survey on convolutional neural network in medical image analysis

引用

MULTIMEDIA TOOLS AND applications 2022年第29期81卷 41361-41405页

作者： Yao, Xujing Wang, Xinyue Wang, Shui-Hua Zhang, Yu-Dong Univ Leicester Sch Informat Leicester LE1 7RH Leics England King Abdulaziz Univ Fac Comp & Informat Technol Dept Informat Syst Jeddah 21589 Saudi Arabia Loughborough Univ Sch Architecture Bldg & Civil Engn Loughborough LE11 3TU Leics England

CNN is inspired from Primary Visual (V1) neurons. It is a typical deep learning technique and can help teach machine how to see and identify objects. In the most recent decade, deep learning develops rapidly and has been well used in various fields of expertise such as computer vision and natural language processing. As the representative algorithm of deep learning, Convolution Neural Network (CNN) has been regarded as a breakthrough of historic significance in image processing and visual recognition tasks since the astonishing results achieved on imageNet Large Scale Visual Recognition Competition (ILSVRC) Unlike methods based on handcrafted features, CNN models can build high-level features from low-level ones in a data-driven fashion and have displayed great potential in medical image analysis among the aspects of segmentation of histological images identification, lesion detection, tissue classification, etc. This paper provides a review on CNN from the perspectives of its basic mechanism introduction, structure, typical architecture and main application in medical image analysis through analyzing over 100 references from Google Scholar, PubMed, Web of Science and various sources published from 1958 to 2020.

关键词： Deep learning Feedforward Neural Network Convolutional neural network Breast Cancer Lung Nodule Brain Tumor Medical image analysis

来源：评论

学校读者我要写书评

暂无评论

A Hybrid Density-Based Clustering Pipeline for Track Reconstruction

A Hybrid Density-Based Clustering Pipeline for Track Reconst...

引用

image processing, Computer vision and machine Learning (ICICML), International Conference on

作者： Bijia You Zhiyun Xia School of Computer Science Beijing University of Posts and Telecommunications Beijing China School of Cyberspace Security Beijing University of Posts and Telecommunications Beijing China

ISBN: (数字)9798350355413

ISBN: (纸本)9798350355420

In high-energy physics, the capability to accurately and efficiently track charged particles is essential for effective data analysis. This article introduces an innovative density-based clustering pipeline intended for the track reconstruction task, incorporating Density-Based Spatial Clustering of applications with Noise (DBSCAN) algorithm and Ordering Points To Identify the Clustering Structure (OPTICS) algorithm. Results on simulated data suggest that the proposed method offers improvements in both effectiveness and robustness compared to traditional techniques, with performance on par with state-of-the-art neural network-based approaches. Furthermore, this pipeline demonstrates significant potential for real-time applications in high-energy physics experiments, offering a scalable and robust solution.

关键词： machine learning algorithms Pipelines Noise Clustering algorithms Optics Robustness Real-time systems Pattern recognition Trajectory image reconstruction

来源：评论

学校读者我要写书评

暂无评论

Target tracking using video surveillance for enabling machine vision services at the edge of marine transportation systems based on microwave remote sensing

引用

JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND applications 2024年第1期13卷 47-47页

作者： Li, Meiyan Wang, Qinyong Liao, Yuwei Baise Univ Sch Informat Engn Baise 533000 Peoples R China Zhejiang Coll Secur Technol Coll Artificial Intelligence Wenzhou 325000 Peoples R China Guangxi Baise Agr Sch Ind Robot Technol Baise 533000 Peoples R China

Automatic target tracking in emerging remote sensing video-generating tools based on microwave imaging technology and radars has been investigated in this paper. A moving target tracking system is proposed to be low complexity and fast for implementation through edge nodes in a mini-satellite or drone network enabling machine intelligence into large-scale vision systems, in particular, for marine transportation systems. The system uses a group of image processing tools for video pre-processing, and Kalman filtering to do the main task. For testing the system performance, two measures of accuracy and false alarms probability are computed for real vision data. Two types of scenes are analyzed including the scene with single target, and the scene with multiple targets that is more complicated for automatic target detection and tracking systems. The proposed system has achieved a high performance in our tests.

关键词： Edge computing Radar imaging Microwave remote sensing Automatic target tracking False alarm

来源：评论

学校读者我要写书评

暂无评论

Asynchronous Perception machine for Efficient Test Time Training 38

Asynchronous Perception Machine for Efficient Test Time Trai...

引用

38th Conference on Neural Information processing Systems, NeurIPS 2024

作者： Modi, Rajat Singh Rawat, Yogesh Centre for Research in Computer Vision University of Central Florida OrlandoFL32765 United States

In this work, we propose Asynchronous Perception machine (APM), a computationally-efficient architecture for test-time-training (TTT). APM can process patches of an image one at a time in any order asymmetrically, and still encode semantic-awareness in the net. We demonstrate APM's ability to recognize out-of-distribution images without dataset-specific pre-training, augmentation or any-pretext task. APM offers competitive performance over existing TTT approaches. To perform TTT, APM just distills test sample's representation once. APM possesses a unique property: it can learn using just this single representation and starts predicting semantically-aware features. APM demostrates potential applications beyond test-time-training: APM can scale up to a dataset of 2D images and yield semantic-clusterings in a single forward pass. APM also provides first empirical evidence towards validating GLOM's insight, i.e. if input percept is a field. Therefore, APM helps us converge towards an implementation which can do both interpolation and perception on a shared-connectionist hardware. Our code is publicly available at this link. © 2024 Neural information processing systems foundation. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：