检索结果-内蒙古大学图书馆

2nd International Conference on Big Data, machine Learning, and applications, BigDML 2021

ISBN: (纸本)9789819934805

The proceedings contain 58 papers. The special focus in this conference is on Big Data, machine Learning, and applications. The topics include: A Comparative Study of Loss Functions for Deep Neural Networks in Time Series Analysis;learning Algorithm for Threshold Softmax Layer to Handle Unknown Class Problem;traffic Monitoring and violation Detection Using Deep Learning;conjugate Gradient Method for finding Optimal Parameters in Linear Regression;rugby Ball Detection, Tracking and Future Trajectory Prediction Algorithm;early Detection of Heart Disease Using Feature Selection and Classification Techniques;Gun Detection System for Surveillance Cameras Using HOG-Assisted KNN Classifier;Optimized Detection, Classification, and Tracking with YOLOv5, HSv Color Thresholding, and KCF Tracking;realtime Object Distance Measurement Using Stereo vision image processing;COvID-19 Detection Using Chest X-ray images;Comparative Analysis of LDA Algorithm for Low Resource Indian Languages with Its Translated English Documents;text Style Transfer: A Comprehensive Study on Methodologies and Evaluation;classification of Hindustani Musical Ragas Using One-Dimensional Convolutional Neural Networks;w-Tree: A Concept Correlation Tree for Data Analysis and Annotations;crawl Smart: A Domain-Specific Crawler;evaluating the Effect of Leading Indicators in Customer Churn Prediction;classification of Skin Lesion Using image processing and ResNet50;data Collection and Pre-processing for machine Learning-Based Student Dropout Prediction;Nested Named-Entity Recognition in Multilingual Code-Switched NLP;an Insight on Drone applications in Surveillance Domain;deep Learning-Based Semantic Segmentation of Blood Cells from Microscopic images;a Partitioned Task Offloading Approach for Privacy Preservation at Edge;Artificial Intelligence in Radiological COvID-19 Detection: A State-of-the-Art Review.

关键词：

来源：评论

学校读者我要写书评

暂无评论

A systematic review on diabetic retinopathy detection and classification based on deep learning techniques using fundus images

引用

PEERJ COMPUTER SCIENCE 2024年 10卷 e1947页

作者： Bhulakshmi, Dasari Rajput, Dharmendra Singh Vellore Inst Technol Sch Comp Sci Engn & Informat Syst Vellore Tamil Nadu India

Diabetic retinopathy (DR) is the leading cause of visual impairment globally. It occurs due to long-term diabetes with fluctuating blood glucose levels. It has become a significant concern for people in the working age group as it can lead to vision loss in the future. Manual examination of fundus images is time-consuming and requires much effort and expertise to determine the severity of the retinopathy. To diagnose and evaluate the disease, deep learning -based technologies have been used, which analyze blood vessels, microaneurysms, exudates, macula, optic discs, and hemorrhages also used for initial detection and grading of DR. This study examines the fundamentals of diabetes, its prevalence, complications, and treatment strategies that use artificial intelligence methods such as machine learning (ML), deep learning (DL), and federated learning (FL). The research covers future studies, performance assessments, biomarkers, screening methods, and current datasets. various neural network designs, including recurrent neural networks (RNNs), generative adversarial networks (GANs), and applications of ML, DL, and FL in the processing of fundus images, such as convolutional neural networks (CNNs) and their variations, are thoroughly examined. The potential research methods, such as developing DL models and incorporating heterogeneous data sources, are also outlined. Finally, the challenges and future directions of this research are discussed.

关键词： DR DL Convolutional neural networks Recurrent neural networks Generative adversarial networks Fundus image

来源：评论

学校读者我要写书评

暂无评论

A comprehensive survey on techniques to handle face identity threats: challenges and opportunities

引用

MULTIMEDIA TOOLS AND applications 2023年第2期82卷 1669-1748页

作者： Rusia, Mayank Kumar Singh, Dushyant Kumar MNNIT Allahabad CSED Prayagraj Uttar Pradesh India

The human face is considered the prime entity in recognizing a person's identity in our society. Henceforth, the importance of face recognition systems is growing higher for many applications. Facial recognition systems are in huge demand, next to fingerprint-based systems. Face-biometric has a highly dominant role in various applications such as border surveillance, forensic investigations, crime detection, access management systems, information security, and many more. Facial recognition systems deliver highly meticulous results in every of these application domains. However, the face identity threats are evenly growing at the same rate and posing severe concerns on the use of face-biometrics. This paper significantly explores all types of face recognition techniques, their accountable challenges, and threats to face-biometric-based identity recognition. This survey paper proposes a novel taxonomy to represent potential face identity threats. These threats are described, considering their impact on the facial recognition system. State-of-the-art approaches available in the literature are discussed here to mitigate the impact of the identified threats. This paper provides a comparative analysis of countermeasure techniques focusing on their performance on different face datasets for each identified threat. This paper also highlights the characteristics of the benchmark face datasets representing unconstrained scenarios. In addition, we also discuss research gaps and future opportunities to tackle the facial identity threats for the information of researchers and readers.

关键词： Biometrics Face recognition Authentication Computer vision machine learning Deep learning image processing

来源：评论

学校读者我要写书评

暂无评论

MisConv: Convolutional Neural Networks for Missing Data 22

MisConv: Convolutional Neural Networks for Missing Data

引用

22nd IEEE/CvF Winter Conference on applications of Computer vision (WACv)

作者： Likowski, Marcin Przewiez Smieja, Marek Struski, Lukasz Tabor, Jacek Jagiellonian Univ Fac Math & Comp Sci 6 Lojasiewicza St PL-30348 Krakow Poland

ISBN: (纸本)9781665409155

processing of missing data by modern neural networks, such as CNNs, remains a fundamental, yet unsolved challenge, which naturally arises in many practical applications, like image inpainting or autonomous vehicles and robots. While imputation-based techniques are still one of the most popular solutions, they frequently introduce unreliable information to the data and do not take into account the uncertainty of estimation, which may be destructive for a machine learning model. In this paper, we present MisConv, a general mechanism, for adapting various CNN architectures to process incomplete images. By modeling the distribution of missing values by the Mixture of Factor Analyzers, we cover the spectrum of possible replacements and find an analytical formula for the expected value of convolution operator applied to the incomplete image. The whole framework is realized by matrix operations, which makes MisConv extremely efficient in practice. Experiments performed on various image processing tasks demonstrate that MisConv achieves superior or comparable performance to the state-of-the-art methods.

关键词： Adaptation models Uncertainty Convolution image processing Neural networks Estimation machine learning

来源：评论

学校读者我要写书评

暂无评论

Thermal image Enhancement by Artificial Multiscale-Exposure image Fusion

Thermal Image Enhancement by Artificial Multiscale-Exposure ...

引用

Conference on Multimodal image Exploitation and Learning

作者： voronin, v. Gapon, N. Zhdanova, M. Semenishchev, E. Moscow State Univ Technol STANKIN Ctr Cognit Technol & Machine Vis Moscow Russia Don State Tech Univ Rostov Na Donu Russia

ISBN: (纸本)9781510673854;9781510673847

Infrared and thermal images have been used widely in different security applications. One of the drawbacks of such images is low contrast and noisy images, which should be enhanced. We present a new image enhancement algorithm based on block-rooting processing with artificial multi-scale-exposure image fusion. The proposed block-based multi-scale enhancement method is based on a 3-D block-rooting transform domain technique comprised: finding similar blocks in the image by block-matching;block-grouping for different block sizes;applying 3-D block-matching image enhancement;decomposition of the weight map and multi-scale enhanced images into the Gaussian and Laplacian pyramids;fusion by multiplying multi-scale images and weights. A new stage is proposed to obtain a local-global estimate of high-contrast images, also used in the general artificial fusion model. Some presented experimental results illustrate the performance of the proposed method on the thermal image dataset compared with the traditional methods.

关键词： image enhancement thermal imaging multi-scale processing frequency-domain transform multi-exposure fusion

来源：评论

学校读者我要写书评

暂无评论

Microneural Network System Based on MoS2/h-BN/Graphene van der Waals Heterojunction Transistor

引用

ACS APPLIED NANO MATERIALS 2023年第17期6卷 16046-16054页

作者： Liu, Xiangkai Wang, Zhongzheng Huang, Hao Liu, Congye Niu, Wencheng Xie, Zhengdao Hao, Dandan Fu, Houqiang Liu, Xingqiang Zou, Xuming Shan, Fukai Yang, Zhenyu Qingdao Univ Coll Elect & Informat Qingdao 266071 Peoples R China Shanghai Police Coll Shanghai 200137 Peoples R China Guangxi Univ Guangxi Key Lab Proc Nonferrous Met & Featured Ma Nanning 530004 Peoples R China Harbin Engn Univ Coll Informat & Commun Engn Harbin 150001 Peoples R China Hunan Univ Sch Phys & Elect Changsha 410082 Peoples R China Arizona State Univ Sch Elect Comp & Energy Engn Tempe AZ 85287 USA

The integration of data storage and computing capabilities into a single physical component has led to the development of a microneuronal network system with high precision and speed rates. To achieve this system architecture, bold innovations in the underlying hardware structure and neural network architecture are required. This article introduces an optoelectronic storage device based on the MoS2/h-BN/graphene van der Waals heterojunction transistor. At room temperature, the transistor exhibits an electron mobility of up to 340 cm(2)/(v center dot s) and a large storage window due to its unique van der Waals heterojunction. The transistor's reconfigurable nonvolatile optoelectronic properties enable the construction of logic gates, including "AND", "OR", "NAND", and "NOR". Leveraging these logic gates, a microneural network system is created that simulates future machine vision applications. The system achieves a remarkable recognition rate of 96.3% for images in a multidimensional color space, demonstrating the significant development potential of the microneural network system based on the MoS2/h-BN/graphene vdW heterojunction transistor in future machine vision.

关键词： heterojunction electron mobility optoelectronic storage logic gate microneural network machine vision

来源：评论

学校读者我要写书评

暂无评论

Gen-CNN: a framework for the automatic generation of CNNs for image classification

引用

Neural Computing and applications 2025年第1期37卷 149-168页

作者： García-Aguirre, Rogelio Navarro-López, Eva María Torres-Treviño, Luis Facultad de Ingeniería Mecánica y Eléctrica Universidad Autónoma de Nuevo León Ave. Universidad S/N San Nicolás de los Garza Nuevo León66455 Mexico School of Interactive Games and Media Golisano College of Computing and Information Sciences Rochester Institute of Technology 20 Lomb Memorial Drive RochesterNY14623 United States School of Environment Education and Development University of Manchester Oxford Road ManchesterM13 9PL United Kingdom

Convolutional neural networks (CNNs) have become widely adopted for computer vision tasks. However, the vast amount of design choices and the complex interactions among their hyperparameters, which ultimately influence the model’s performance, impede their accessibility to users who are not experts in machine learning (ML). To address this challenge, we present AutoML as a solution, leveraging hyperparameter optimization (HPO) for effective parameter selection. Particularly good at handling non-convex, non-differentiable optimization tasks, genetic algorithms are easy to implement and parallelize, making them well suited for deep learning applications. In this context, we introduce Gen-CNN, an AutoML framework based on a genetic algorithm that generates CNN models for image classification. Our framework incorporates transfer learning and operates in a low-compute regime to accelerate the hyperparameter optimization phase. We test Gen-CNN on four datasets, including Sign Language Digits for convergence assessment and KvASIR-v2, ISIC-2019, and BreakHis for performance evaluation. Our results prove that Gen-CNN automatically generates CNN models with classification performance comparable to state-of-the-art custom models already published in the literature. Moreover, in the recommended testing regime for heuristic optimization techniques, we surpassed other HPO algorithms by achieving better mean categorical accuracy. Gen-CNN code is available at—omitted for anonymous review. © The Author(s), under exclusive licence to Springer-verlag London Ltd., part of Springer Nature 2024.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Automatic Rice variety Identification System: state-of-the-art review, issues, challenges and future directions

引用

MULTIMEDIA TOOLS AND applications 2023年第18期82卷 27305-27336页

作者： Komal, Ganesh Kumar Sethi, Ganesh Kumar Bawa, Rajesh Kumar Punjabi Univ Patiala India MM Modi Coll Dept Comp Sci Patiala India Punjabi Univ Dept Comp Sci Patiala India

Automatic rice variety identification or quality analysis is a challenging task in image processing and reflects advanced insights into agricultural research with the help of emerging computational technologies. It is the process of identifying the variety of the rice grains by matching them with the training dataset. It is an arduous task because the quality of rice grains is distinct from each other due to the availability of their numerous varieties in the market and unique inherent characteristics. Therefore, customers must identify the superior quality of rice from different available types in the market. This paper demonstrates an exhaustive and transparent perspective on the recent research studies for developing various identification systems using other techniques and a broad view towards this peculiar research area. The paper's main aim is to present in an organized way the related works on identification systems of rice and finally throws exposure on the synthesis analysis based on the research findings. This research study provides valuable and valuable assistance to novice researchers in the agricultural field by amalgamating the studies of various methods and techniques of feature extractions and classification required for automatic variety identification of rice. It is evident from the study that research work carried out on the automated variety identification systems with higher accuracy rates in deep learning using a conjunction of various features of rice is minimal as compared to other techniques and indeed presents a future direction.

关键词： machine learning Neural networks Computer vision Support vector machine Discriminant analysis

来源：评论

学校读者我要写书评

暂无评论

Analysis of Nanoscale Ferroelectric Domain Dynamics Based on image processing of Local C-v Maps

Analysis of Nanoscale Ferroelectric Domain Dynamics Based on...

引用

2024 IEEE Ultrasonics, Ferroelectrics, and Frequency Control Joint Symposium

作者： Hiranaga, Yohiomi Mimura, Takanori Shimizu, Takao Funakubo, Hiroshi Cho, Yasuo Tohoku Univ Res Inst Elect Commun Sendai Miyagi Japan Tokyo Inst Technol Sch Mat & Chem Technol Yokohama Kanagawa Japan Tohoku Univ New Ind Creat Hatchery Ctr Sendai Miyagi Japan

ISBN: (纸本)9798350371918;9798350371901

Local C-v mapping is a method to analyze and visualize the dynamics of polarization switching in ferroelectric materials with nanoscale resolution. This method uses a probe electrode to measure the butterfly-shaped C-v curve characteristic of ferroelectrics, and then repeats the measurement while scanning the probe to investigate the inplane distribution. This method enables the acquisition of a large amount of measurement data reflecting the spatial distribution of domain switching characteristics in a short time. On the other hand, we are still searching for a method to analyze the huge amount of data and extract meaningful information from it. So far, we have attempted to analyze the data using unsupervised cluster analysis to classify each pixel into a pre-specified number of clusters based on the similarity of the C-v curve shape. This time, we introduced a different image processing method, more specifically, a differential filtering method, and attempted to extract information different from conventional methods.

关键词： Scanning Probe Microscopy Domain Switching Dynamics Grain Boundary Hafnium Oxide machine Learning

来源：评论

学校读者我要写书评

暂无评论

PMTL: A Progressive Multi-level Training Framework for Retail Taxonomy Classification

PMTL: A Progressive Multi-level Training Framework for Retai...

引用

IEEE/CvF Winter Conference on applications of Computer vision (WACv)

作者： Bhattacharya, Gaurab Sharma, Gaurav Chatterjee, Kallol Chakrapani Bagya, Lakshmi, v Pal, Jayavardhana Gubbi Arpan Rajagopalan, Ramachandran Tata Consultancy Serv Mumbai Maharashtra India

ISBN: (纸本)9798350370287;9798350370713

Retail taxonomy classification provides hierarchical labelling of items and it has widespread applications, ranging from product on-boarding, product arrangement and faster retrieval. It is fundamental to both physical space as well as e-commerce. Manual processing based on meta-data was adopted and more recently, image based approaches have emerged. Traditionally, hierarchical classification in retail domain is performed using feature extractors and using different classifier branches for different levels. There are two challenges with this approach: error propagation from previous levels which affects the decision-making of the model and the label inconsistency within levels creating unlikely taxonomy tree. Further, the training frameworks rely on large datasets for generalized performance. To address these challenges, we propose PMTL, a progressive multi-level training framework with logit-masking strategy for retail taxonomy classification. PMTL employs a level-wise training framework using cumulative global representation to enhance and generalize output at every level and minimize error propagation. Also, we have proposed logit masking strategy to mask all irrelevant logits of a level and enforce the model to train using only the relevant logits, thereby minimizing label inconsistency. Further, PMTL is a generalized framework that can be employed to any full-shot and few-shot learning scheme without bells and whistles. Our experiments with three datasets with varied complexity in full-shot and few-shot scenario demonstrates the effectiveness of our proposed method compared to the state-of-the-art.

关键词： Large datasets

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：