Search Results - Inner Mongolia University Library

An effective framework based on hybrid learning and kernel principal component analysis for face manipulation detection

SIGNAL IMAGE AND VIDEO PROCESSING, 2024, Vol. 18, No. 5, pp. 4811-4820

Authors: Thakur, Rahul; Rohilla, Rajesh. Affiliation: Delhi Technol Univ, Elect & Commun Engn, Delhi 110042, India

Face manipulation is the process of modifying facial features in videos or images to produce a variety of artistic or deceptive effects. Face manipulation detection looks for altered or falsified visual media in order to differentiate between real and fake facial photographs or videos. The intricacy of the techniques used makes it difficult to detect face manipulation, particularly in the context of technologies like DeepFake. This paper presents an efficient framework based on Hybrid learning and Kernel Principal Component Analysis (KPCA) to extract more extensive and refined face-manipulation attributes. The proposed method utilizes the EfficientNetV2-L model for feature extraction, followed by KPCA for feature dimensionality reduction, to distinguish between real and fake facial images. The proposed method is robust to various facial manipulation techniques such as identity swap, expression swap, attribute-based manipulation, and entirely synthesized faces. In this work, data augmentation is used to solve the problem of class imbalance present in the dataset. The proposed method has a lower execution time while achieving an accuracy of 99.3% and an F1 Score of 0.98 on the Diverse Fake Face Dataset (DFFD).
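The abstract reports both accuracy and F1 score on a dataset it describes as class-imbalanced. As a rough illustration of why the two are reported separately, the sketch below computes both from confusion-matrix counts. The function name and the counts are hypothetical and are not taken from the paper; which class counts as "positive" is also an assumption.

```python
def accuracy_and_f1(tp, fp, fn, tn):
    """Return (accuracy, F1) for a binary classifier from confusion counts.

    tp/fp/fn/tn are true positives, false positives, false negatives and
    true negatives. All values used below are made up for illustration.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return accuracy, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, f1


# Hypothetical imbalanced case: 100 positive samples, 900 negative samples.
acc, f1 = accuracy_and_f1(tp=90, fp=5, fn=10, tn=895)
print(f"accuracy={acc:.3f}, f1={f1:.3f}")
```

With these made-up counts the accuracy is dominated by the large negative class, while F1 depends only on how the minority (positive) class is handled, which is why an imbalanced benchmark is usually summarised with both.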

Keywords: DeepFake; Face manipulation detection; Deep learning; Hybrid learning; EfficientNetV2

Hardware-Independent Deep Signal Processing: A Feasibility Study in Echocardiography

IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2024, Vol. 71, No. 11, pp. 1491-1500

Authors: Gundersen, Erlend Loland; Smistad, Erik; Jahren, Tollef Struksnes; Masoy, Svein-Erik. Affiliations: GE Vingmed Ultrasound AS, N-3183 Horten, Norway; Norwegian Univ Sci & Technol, Dept Circulat & Med Imaging, N-7491 Trondheim, Norway; SINTEF Hlth, N-7052 Trondheim, Norway

Deep learning (DL) models have emerged as alternative methods to conventional ultrasound (US) signal processing, offering the potential to mimic signal processing chains, reduce inference time, and enable the portability of processing chains across hardware. This article proposes a DL model that replicates the fine-tuned BMode signal processing chain of a high-end US system and explores the potential of using it with a different probe and a lower end system. A deep neural network (DNN) was trained in a supervised manner to map raw beamformed in-phase and quadrature component data into processed images. The dataset consisted of 30 000 cardiac image frames acquired using the GE HealthCare Vivid E95 system with the 4Vc-D matrix array probe. The signal processing chain includes depth-dependent bandpass filtering, elevation compounding, frequency compounding, and image compression and filtering. The results indicate that a lightweight DL model can accurately replicate the signal processing chain of a commercial scanner for a given application. Evaluation on a 15-patient test dataset of about 3000 image frames gave a structural similarity index measure (SSIM) of 98.56 +/- 0.49. Applying the DL model to data from another probe showed equivalent or improved image quality. This indicates that a single DL model may be used for a set of probes on a given system that targets the same application, which could be a cost-effective tuning and implementation strategy for vendors. Furthermore, the DL model enhanced image quality on a Verasonics dataset, suggesting the potential to port features from high-end US systems to lower end counterparts.
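The evaluation above is expressed as a structural similarity index measure (SSIM). As a minimal sketch of what that metric compares, the code below applies the standard SSIM formula once over a whole image (a single global window), using population variances. This is a simplification: practical implementations average the formula over local windows, and the paper's exact settings are not given in the abstract. The function name, the pixel values, and the reading of 98.56 as a percentage-scaled score are assumptions.

```python
def global_ssim(x, y, data_range=255.0):
    """Single-window SSIM of two equally sized, flattened grayscale images.

    Simplified illustration: statistics are taken over the whole image
    rather than averaged over local windows.
    """
    if len(x) != len(y) or not x:
        raise ValueError("images must be non-empty and the same size")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    # Conventional stabilising constants with K1 = 0.01 and K2 = 0.03.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    numerator = (2 * mean_x * mean_y + c1) * (2 * cov_xy + c2)
    denominator = (mean_x ** 2 + mean_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Hypothetical pixel values: a reference frame and a slightly altered copy.
reference = [10, 50, 90, 130, 170, 210]
altered = [12, 45, 95, 120, 180, 200]
print(global_ssim(reference, reference))
print(100 * global_ssim(reference, altered))  # percentage-style scaling
```

An image compared with itself scores 1, and any difference in mean, contrast, or structure pulls the score below 1.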

Keywords: Probes; Signal processing; Array signal processing; Filtering; Image quality; Image coding; Recording; Deep learning (DL); In-phase and quadrature; Medical ultrasound imaging; Supervised learning

Detection of Image Tampering Using Deep Learning, Error Levels and Noise Residuals

NEURAL PROCESSING LETTERS, 2024, Vol. 56, No. 2, p. 112

Authors: Chakraborty, Sunen; Chatterjee, Kingshuk; Dey, Paramita. Affiliations: Haldia Inst Technol, Dept Comp Sci & Engn, Haldia, India; Govt Coll Engn & Ceram Technol, Dept Comp Sci & Engn, Kolkata, India; Govt Coll Engn & Ceram Technol, Dept Informat Technol, Kolkata, India

Images were once considered a reliable source of information. However, the spread of photo-editing software gave rise to an illegal activity known as image tampering. These days, innumerable tampered images can be found across the internet. Software such as Photoshop and the GNU Image Manipulation Program can turn real images into tampered ones in just a few minutes. For discovering hidden signs of tampering in an image, deep learning models are a more effective tool than other methods. Models used in deep learning are capable of extracting intricate features from an image automatically. Here we propose a combination of traditional handcrafted features and a deep learning model to differentiate between authentic and tampered images. We present a dual-branch Convolutional Neural Network in conjunction with Error Level Analysis and noise residuals from the Spatial Rich Model. For our experiment, we utilized the freely accessible CASIA dataset. After training for 16 epochs, the dual-branch network reached an accuracy of 98.55%. We also provide a comparative analysis with previously proposed work in the field of image forgery detection. This hybrid approach shows that deep learning models combined with well-known traditional approaches can provide better results for detecting tampered images.

Keywords: Image tampering; Error level analysis; Spatial rich model; Deep learning; Convolutional neural networks

High-speed hardware accelerator based on brightness improved by Light-DehazeNet

JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, Vol. 21, No. 3, p. 87

Authors: Teng, Peiyi; Du, Gaoming; Li, Zhenmin; Wang, Xiaolei; Yin, Yongsheng. Affiliation: Hefei Univ Technol, Hefei, Peoples R China

Due to the increasing demand for artificial intelligence technology in today's society, the entire industrial production system is undergoing a transformative process related to automation, reliability, and robustness, seeking higher productivity and product competitiveness. Additionally, many hardware platforms are unable to deploy complex algorithms due to limited resources. To address these challenges, this paper proposes a computationally efficient lightweight convolutional neural network called Brightness Improved by Light-DehazeNet, which removes the impact of fog and haze to reconstruct clear images. Additionally, we introduce an efficient hardware accelerator architecture based on this network for deployment on low-resource platforms. Furthermore, we present a brightness visibility restoration method to prevent brightness loss in dehazed images. To evaluate the performance of our method, extensive experiments were conducted, comparing it with various traditional and deep learning-based methods, including images with artificial synthesis and natural blur. The experimental results demonstrate that our proposed method excels in dehazing ability, outperforming other methods in comprehensive comparisons. Moreover, it achieves rapid processing speeds, with a maximum frame rate of 105 frames per second, meeting the requirements of real-time processing.

Keywords: CNN; Defogging; Hardware accelerator; FPGA; Image processing

Non-invasive vision-based personal comfort model using thermographic images and deep learning

AUTOMATION IN CONSTRUCTION, 2024, Vol. 168

Authors: Zakka, Vincent Gbouna; Lee, Minhyun; Zhang, Ruixiaoxiao; Huang, Lijie; Jung, Seunghoon; Hong, Taehoon. Affiliations: Aston Univ, Coll Engn & Phys Sci, Birmingham, England; Hong Kong Polytech Univ, Dept Bldg & Real Estate, Kowloon, Hong Kong, Peoples R China; Yonsei Univ, Dept Architecture & Architectural Engn, Seoul, South Korea

An efficient method for predicting occupants' thermal comfort is crucial for developing optimal environmental control strategies while minimizing energy consumption in buildings. This paper presents a non-invasive vision-based personal comfort model that integrates thermographic images and deep learning. Unlike previous studies, the entire thermographic image of the upper body is directly used during model training, minimizing complex data processing and maximizing the use of rich skin temperature distribution. The proposed method is validated using thermographic images and corresponding thermal sensation votes (TSV) from 10 participants under different experimental conditions. Results show that the model based on a 3-point TSV scale achieves exceptional classification performance with an average accuracy of 99.51%, outperforming existing models. The model performance using a 7-point TSV scale is slightly lower, with an average accuracy of 89.90%. This method offers potential for integrating thermal comfort models into real-time building environmental control, optimizing occupant comfort and energy consumption.

Keywords: Occupant thermal comfort; Personal comfort model; Thermographic imaging; Deep learning; Non-invasive approach; Occupant-centric control

EFFECTIVE FEATURE INFORMATION IDENTIFICATION OF SUGARCANE BASED ON HYBRID DEEP LEARNING MODELS

APPLIED ENGINEERING IN AGRICULTURE, 2024, Vol. 40, No. 3, pp. 243-257

Authors: Pan, Mingzhang; Gou, Xuanyuan; Zeng, Yue; Wang, Zongrun; Yuan, Leyi; Liang, Ke. Affiliations: Guangxi Univ, State Key Lab Conservat & Utilizat Biol Resources, Nanning, Guangxi, Peoples R China; Guangxi Univ, Nanning, Guangxi, Peoples R China

The harvesting efficiency of intelligent sugarcane harvesters depends on how effectively the sugarcane is identified and located during the harvesting process. In actual harvesting, accurately extracting valid features of sugarcane amidst dense and interwoven stalks is a challenging task. To address this issue, we propose a hybrid deep learning approach to efficiently extract sugarcane stem contours and internal stem node feature information in the context of a complex harvest. Firstly, this study combined the MobileNetV3 and U-Net networks to segment overall images that contain information about the external contours of the sugarcane stem. Then, the extracted overall profile images were optimized using a variety of image processing techniques to meet the requirements of harvesting. Lastly, the improved YOLOX model was utilized to identify the internal stem node features of sugarcane from the optimized overall images. The experimental results on a real sugarcane dataset show that the proposed external sugarcane stem segmentation model achieves a high mean intersection over union (MIoU) of 91.68% with an average segmentation time of just 0.025 seconds. Moreover, the proposed model for internal stem node recognition in sugarcane achieves an average precision (AP) of 96.19% with an average detection time of 0.026 seconds. Additionally, this study compares image segmentation models such as PSPNet and DeepLabv3+ with target detection models such as YOLOv5 and YOLOv7. The experimental results show that the sugarcane feature extraction models proposed in this article all exhibit high accuracy and robustness.
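The segmentation result above is reported as mean intersection over union (MIoU). A minimal sketch of how that metric is commonly computed from per-pixel class labels follows. The function name, the flattened label layout, and the toy masks are assumptions made for illustration and do not come from the paper; classes that appear in neither mask are skipped here, which is one common convention among several.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection over union across classes.

    pred and target are equally long sequences of integer class labels,
    one per pixel. Classes absent from both masks are skipped.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    ious = []
    for c in range(num_classes):
        intersection = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union:
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 6-pixel masks: 0 = background, 1 = stem (hypothetical labels).
predicted = [0, 0, 1, 1, 1, 0]
ground_truth = [0, 1, 1, 1, 0, 0]
print(mean_iou(predicted, ground_truth, num_classes=2))
```

Each class contributes the ratio of correctly overlapping pixels to all pixels assigned to that class by either mask, so both missed pixels and spurious pixels lower the score.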

Keywords: Deep learning; Image segmentation; Machine vision; Sugarcane identification; Target detection

Real-Time Image Processing and Deep Learning 2019

Real-Time Image Processing and Deep Learning 2019

ISBN: (print) 9781510626577

The proceedings contain 27 papers. The topics discussed include: fast multi-modal reuse: co-occurrence pre-trained deep learning models; deep learning for fast super-resolution reconstruction from multiple images; an efficient algorithm for fast block matching motion estimation using an adaptive threshold scheme; low exposure image frame generation algorithms for feature extraction and classification; parallel image and video self-recovery scheme with high recovery capability; learning optimal actions with imperfect images; CNN classification based on global and local features; Kalman-based motion estimation in video surveillance systems for safety applications; and recent advances in integrated photonic-electronic technologies for high-speed processing and communication circuits for light-based transducers.

Autonomous traffic sign detection for self-driving car system using convolutional neural network algorithm

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, Vol. 46, No. 3, pp. 5975-5984

Authors: Yu, Zhao; Ye, Ting. Affiliations: Shanghai Inst Technol, Sch Art & Design, Shanghai, Peoples R China; Shanghai Normal Univ, Res Adm, Shanghai, Peoples R China

The accurate detection of traffic signs is a critical component of self-driving systems, enabling safe and efficient navigation. In the literature, various methods have been investigated for traffic sign detection, among which deep learning-based approaches have demonstrated superior performance compared to other techniques. This paper justifies the widespread adoption of deep learning due to its ability to provide highly accurate results. However, the current research challenge lies in addressing the need for high accuracy rates and real-time processing requirements. In this study, we propose a convolutional neural network based on the YOLOv8 algorithm to overcome the aforementioned research challenge. Our approach involves generating a custom dataset with diverse traffic sign images, followed by conducting training, validation, and testing sets to ensure the robustness and generalization of the model. Experimental results and performance evaluation demonstrate the effectiveness of the proposed method. Extensive experiments show that our model achieved remarkable accuracy rates in traffic sign detection, meeting the real-time requirements of the input data.

Keywords: Traffic sign detection; Deep learning; YOLOv8 model; Self-driving cars; Real-time processing

DLgram cloud service for deep-learning analysis of microscopy images

MICROSCOPY RESEARCH AND TECHNIQUE, 2024, Vol. 87, No. 5, pp. 991-998

Authors: Matveev, Andrey V.; Nartova, Anna V.; Sankova, Natalya N.; Okunev, Alexey G. Affiliations: Novosibirsk State Univ, Inst Intellectual Robototechn, Novosibirsk, Russia; Boreskov Inst Catalysis SB RAS, Dept Physico Chem Res Methods, Novosibirsk, Russia; Boreskov Inst Catalysis SB RAS, Dept Nontradit Catalyt Proc, Novosibirsk, Russia; Novosibirsk State Univ, Inst Intellectual Robototechn, Novosibirsk 630090, Russia

To analyze images in various fields of science and technology, it is often necessary to count observed objects and determine their parameters. This can be quite labor-intensive and time-consuming. This article presents DLgram, a universal, user-friendly cloud service developed for this purpose. It is based on deep learning technologies and does not require programming skills. The user labels several objects in the image and uploads it to the cloud, where the neural network is trained to recognize the objects being studied. The user receives the recognition results and can, if necessary, correct them by removing errors or adding missing objects. In addition, the data obtained can be processed mathematically to get information about the sizes, areas, and coordinates of the observed objects. The article describes the service features and discusses examples of its application. The DLgram service significantly reduces the time spent on quantitative image analysis, reduces the influence of subjective factors, and increases the accuracy of analysis.

Keywords: Automation; Deep learning; Image processing; Microscopy; Recognition

Harnessing the Power of 6G Connectivity for Advanced Big Data Analytics with Deep Learning

WIRELESS PERSONAL COMMUNICATIONS, 2024, pp. 1-18

Authors: Sun, Maojin; Sun, Luyi. Affiliation: CEICloud Data Storage Technol Beijing Co Ltd, 15 Countyard, Kechuang Nine Rd, Beijing 101111, Peoples R China

The development of smart applications worldwide demands ultra-reliable data communication to ensure rich data and timely processing. These smart applications create massive amounts of data to be processed in 6G networks with advanced technologies. 6G big data analytics is becoming a requirement for next-generation data communication and smart city applications. Traditional data analytics algorithms lag in efficiency when processing big data because of its huge volume, data dependencies, and the need for timely processing. Reinforcement learning combined with deep learning is promising for processing big data in smart applications. The proposed study, Advanced Big Data Analytics using Deep Learning (ABDAS-DL), presents a pioneering approach that combines a deep reinforcement learning (DRL) based deep Q-network (DQN) with long short-term memory (LSTM) to harness the vast capacity of 6G connectivity for advanced big data analytics. This study uses smart-transport data for taxi route optimisation by analysing climatic and surrounding factors. 6G connectivity promises very high data transmission speeds and extremely low latency, opening new horizons for managing large datasets in real time. The performance of the proposed model is measured in terms of processing time, network, reliability and scalability. The proposed model takes 30 s to process the data and fix the taxi route, while a traditional model takes more than an hour.

Keywords: Advanced big data analytics; Deep learning; 6G connectivity; DRL; DQN; LSTM
