Search Results - Inner Mongolia University Library

17th International Conference on Interfaces and Human Computer Interaction 2023, IHCI 2023, the 17th International Conference on Computer Graphics, Visualization, Computer vision and image processing 2023, CGVCVIP 2023 and 16th International Conference on Game and Entertainment Technologies 2023, GET 2023

Authors: Sautter, Rubens Andreas; Rosa, Reinaldo Roberto; Alavarce, Debora Cristina; Silva, Daniel Guimarães (Av. Dos Astronautas 1.758, CEP 12245-027, São José dos Campos, SP, Brazil; Hipocampus EdTech - Digital Learning, R. do Serimbura 320, CEP 12243-360, São José dos Campos, SP, Brazil)

ISBN: (print) 9789898704498

This paper describes a new application of the technique known as Gradient Pattern Analysis (GPA), focused here on computer vision. In the GPA domain, the image is translated into a tessellation triangulation field based on the vector positions that make up the gradient lattice of the matrix image. The GPA version considered here generates three attributes (G1, G2 and G3) that can be used as labels for a supervised machine-learning model. The case study presented here shows that GPA is a useful tool for real-time fetal biometry from 2D ultrasound images. The application in obstetrics indicates that the technique can also be useful for learning diagnostic imaging in gynecology, hepatology and oncology. The generalization of the technique to other applications in practical learning in health is discussed. © 2023 Proceedings of the International Conferences on Interfaces and Human Computer Interaction 2023, IHCI 2023; Computer Graphics, Visualization, Computer Vision and Image Processing 2023, CGVCVIP 2023; and Game and Entertainment Technologies 2023, GET 2023. All rights reserved.

Keywords: Computer vision

A new stereo matching energy model based on image local features

Multimedia Tools and Applications, 2023, Vol. 82, No. 23, pp. 35651-35684

Authors: Hongjin, Zhang; Hui, Wei; Gang, Ma (Fudan Univ, Sch Comp Sci, Lab Algorithms Cognit Models, Shanghai 200433, Peoples R China)

This paper constructs an energy model based on local features used in stereo matching. The local features include the similarity between different image areas, the matching cost function pattern, the connection between neighboring pixels, and the occlusion geometric relationship. Based on these features, we define the weight of each data term and smoothing term in the energy function and then design an algorithm to solve the energy model and obtain disparity results. The main contributions of this paper are as follows. 1) We modify the structure of the energy function. First, we define the weight of the data term based on the reliability of its corresponding disparity result, which is obtained from cost function features and the occlusion geometric relationship. Then we define the weight of the smoothing term by analyzing the characteristic relation between neighboring super-pixels. We can also reduce the computational complexity by detecting and removing some low-strength connections. 2) We propose an algorithm based on a pairwise Markov random field (MRF) (Taniai et al., IEEE Trans Pattern Anal Mach Intell 40(11): 2725-2739, 2017) and local greedy iteration, which can be used to solve the energy model. 3) In post-optimization, we select some areas with severe occlusion and fewer matching clues for post-interpolation fitting to optimize the results. The experiment shows that the proposed method reduced the average percentage of bad pixels (in bad 3) to 6.06 on the Middlebury dataset and 1.42 on the KITTI dataset. Finally, we compare our results with those of MC-CNN (Zbontar and LeCun 2015), CF-Net (Shen et al., 2021), Guided-Stereo (Poggi et al., 2019), Gwc-Net (Guo et al., 2019) and Patchmatch-Net (PM-Net) (Wang et al., 2021) to verify the improved speed and accuracy of our algorithm, especially at recognizing the depth of changing edges and small objects. This paper's relevant research can contribute to practical engineering practices such as assisted vision, i

Keywords: Computer vision; Stereo image processing; Disparity reliable; Local feature; Cost function; Weight measurement
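
The weighted energy function described in the abstract above can be illustrated with the generic form commonly used for pairwise MRF stereo models. This is only a sketch under stated assumptions, not the paper's actual formulation: the symbols $w_p$, $w_{pq}$, $C$, $V$ and $\mathcal{N}$ are introduced here for illustration and do not appear in the record.

```latex
% Generic weighted pairwise-MRF stereo energy (illustrative notation only).
% D            : disparity map, d_p is the disparity assigned to pixel/super-pixel p
% C(p, d_p)    : matching cost (data term) of assigning d_p to p
% V(d_p, d_q)  : smoothness penalty between neighbors p and q
% w_p          : data-term weight, e.g. reflecting the reliability of d_p
% w_{pq}       : smoothing-term weight, e.g. reflecting the relation between p and q
% \mathcal{N}  : set of neighboring pairs kept after pruning weak connections
\[
E(D) \;=\; \sum_{p} w_p \, C(p, d_p)
      \;+\; \sum_{(p,q) \in \mathcal{N}} w_{pq} \, V(d_p, d_q)
\]
```

In this reading, removing low-strength connections corresponds to dropping pairs from $\mathcal{N}$, which reduces the number of smoothing terms that have to be evaluated.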

Continual Learning for Blind Image Quality Assessment

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, Vol. 45, No. 3, pp. 2864-2878

Authors: Zhang, Weixia; Li, Dingquan; Ma, Chao; Zhai, Guangtao; Yang, Xiaokang; Ma, Kede (Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China; Peng Cheng Lab, Shenzhen, Guangdong, Peoples R China; City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China)

The explosive growth of image data facilitates the fast development of image processing and computer vision methods for emerging visual applications, meanwhile introducing novel distortions to processed images. This poses a grand challenge to existing blind image quality assessment (BIQA) models, which are weak at adapting to subpopulation shift. Recent work suggests training BIQA methods on the combination of all available human-rated IQA datasets. However, this type of approach is not scalable to a large number of datasets and is cumbersome to incorporate a newly created dataset as well. In this paper, we formulate continual learning for BIQA, where a model learns continually from a stream of IQA datasets, building on what was learned from previously seen data. We first identify five desiderata in the continual setting with three criteria to quantify the prediction accuracy, plasticity, and stability, respectively. We then propose a simple yet effective continual learning method for BIQA. Specifically, based on a shared backbone network, we add a prediction head for a new dataset and enforce a regularizer to allow all prediction heads to evolve with new data while being resistant to catastrophic forgetting of old data. We compute the overall quality score by a weighted summation of predictions from all heads. Extensive experiments demonstrate the promise of the proposed continual learning method in comparison to standard training techniques for BIQA, with and without experience replay. We made the code publicly available at https://***/zwx8981/BIQA_CL.
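
The weighted summation of per-head predictions mentioned in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `overall_quality`, the way the weights are supplied, and their normalization are all assumptions made here, since the record does not say how the weights are obtained.

```python
from typing import Sequence


def overall_quality(head_scores: Sequence[float],
                    head_weights: Sequence[float]) -> float:
    """Combine per-head quality predictions into one score by a weighted sum.

    head_scores  -- one predicted quality score per prediction head
    head_weights -- one non-negative weight per head (hypothetical input;
                    how the weights are computed is not stated in the abstract)
    """
    if len(head_scores) != len(head_weights):
        raise ValueError("one weight per prediction head is required")
    total = sum(head_weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    # Normalizing by the weight total is an assumption of this sketch;
    # it keeps the result on the same scale as the individual scores.
    return sum(s * w for s, w in zip(head_scores, head_weights)) / total


# Two heads: the second is trusted three times as much as the first.
score = overall_quality([60.0, 80.0], [1.0, 3.0])
print(score)  # 75.0
```

Adding a head for a new dataset then only extends both sequences by one entry; the combination rule itself does not change.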

Keywords: Blind image quality assessment; continual learning; subpopulation shift

PROP - where learning is made fun

2024 IEEE International Students' Conference on Electrical, Electronics and Computer Science, SCEECS 2024

Authors: Rajeshwari, P.; Banushree, B.; Nair, Niveditha G.; Nivya, Nivya; Krupa, Shri (Dayananda Sagar College of Engineering, Department of Electronics and Telecommunication Engineering, Bangalore, India)

ISBN: (print) 9798350348460

Writing in air has become a significant research area in image processing and pattern recognition, contributing to automation and improving human-machine interfaces in various applications. Object tracking, a crucial task in computer vision, is increasingly popular due to faster computers, high-quality video cameras, and the demand for automated video analysis. This study compares a virtual air canvas built with OpenCV and Mediapipe against an air canvas application built with OpenCV and Numpy in Python. It further delves into the use of Augmented Reality for educational purposes and how integrating the two could revolutionize teaching and learning. © 2024 IEEE.

Keywords: Augmented Reality; Computer vision; Mediapipe; Numpy; Object Tracking; Virtual Air Canvas

A Flexible Machine Vision AI System for Edge-Oriented Deep Learning Accelerators

2023 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2023

Authors: Song, Joon Boum; Kim, Yumi; Lee, Minkyu; Lee, Sang-Seol; Kim, Kyungho (Korea Electronics Technology Institute, Korea, Republic of)

ISBN: (print) 9798350326413

Over the past 10 years, deep learning has shown its effectiveness in various computer vision fields such as autonomous vehicles, robotics, and AI surveillance. Numerous machine vision AI systems have accordingly been developed to run those deep learning algorithms. However, existing PC-based machine vision AI systems have the disadvantage that the entire system must be modified even when only a small change is required. They are also considerably costly and power-hungry for the edge-device use targeted in this study. In addition, many FPGA-based machine vision AI systems are not suitable for multiple applications because they are designed for a dedicated purpose. In this work, to overcome those disadvantages, we have developed a flexible FPGA-based machine vision AI system in which various types of accelerators can be implemented. An accelerator integrated with a post-processing unit (PPU) that runs object detection was successfully demonstrated on the system, satisfying the required input image resolution/format. The proposed system can also provide multiple resolutions/formats of input images, as verified via Xilinx integrated logic analyzer (ILA) debugging. We therefore expect that this system can support more accelerators, which can be deployed in diverse applications. © 2023 IEEE.

Keywords: Object detection

Transparent Machine Vision Techniques For Medicinal Plant Species Identification

5th International Conference on Image Processing and Capsule Networks, ICIPCN 2024

Authors: Kikome, Christine; Okumu, Geoffrey; Wagisha, Emmanuel; Jjingo, Daudi; Kizito, John; Marvin, Ggaliwango (Makerere University, Computer Science Department, Kampala, Uganda)

ISBN: (print) 9798350367171

Medicinal plants have long been the foundation of the medical system and a source of health and healing, but many people nowadays are unaware of these priceless natural resources or the range of possible applications they provide. Traditional medical professionals known as herbalists mostly depend on their understanding of therapeutic herbs to cure a wide range of illnesses. Unfortunately, identifying medicinal plants by hand from their leaves takes a lot of time, labour, and specialised knowledge. The purpose of this project was to design and build a web application for the identification of plant species, namely medicinal plants that are present across Uganda. On a publicly available dataset, many machine learning and deep learning models, including Convolutional Neural Networks (CNN), VGG19 and VGG16, ResNet50, Vision Transformer (ViT), and EfficientNet B0, were trained. Many features in the dataset were extracted from each leaf, such as its length, width, texture, area, and color. For the ten classes of medicinal plant species identified in the globe, the models' accuracy rates were 99%, 95.70%, 92.58%, 97.90%, 91.20%, and 35.69%, respectively. In order to improve transparency and provide visual explanations for its picture classifications, this model makes use of Grad-CAM and SHAP, which are explainable AI (XAI) approaches. It is anticipated that a web-based system for the automatic identification of medicinal plants will contribute significantly to the production of pharmaceutical drugs, help members of the community increase their knowledge of medicinal plants, and assist taxonomists in creating more effective methods for identifying species. © 2024 IEEE.

Keywords: Medicinal chemistry

Electoral Symbols and Vote Detection in Paper Ballots - A Case from Nepal's Election

17th International Conference on Machine Vision, ICMV 2024

Authors: Shrestha, Raju; Acharya, Suraj (Department of Computer Science, Oslo Metropolitan University, Oslo, Norway)

ISBN: (print) 9781510688278

This paper investigates the potential of advanced object detection technologies to automate and enhance the accuracy and efficiency of the vote counting process in democratic elections that utilize paper-based ballots with electoral symbols. The study focuses on detecting electoral symbols and votes on paper ballots by utilizing two state-of-the-art object detection models: Faster R-CNN and YOLO. These models were fine-tuned by training them with ballot papers created using a dataset prepared from the electoral symbols used in Nepal's general election, to ensure high accuracy and reliability in recognizing and validating votes. The system's effectiveness was demonstrated through a comparison of the models, highlighting the superior performance of Faster R-CNN in terms of precision, despite its slower processing speed compared to YOLO. The results indicate that incorporating object detection technologies into electoral systems can significantly improve the efficiency of the vote counting process. The study underscores the potential for broader applications of these technologies in promoting transparent and fair elections, especially in countries like Nepal, where traditional paper ballots are still prevalent. This innovative approach ensures a more reliable and efficient electoral process, reducing human error and increasing trust in election outcomes. © 2025 SPIE.

Keywords: Object detection; Systems modeling; Image processing; Performance modeling; Transparency; Data modeling; Artificial intelligence; Deep learning

Analysis of Impact of Image Restoration and Segmentation on Classification Model

2023 7th International Conference On Computing, Communication, Control And Automation, ICCUBEA 2023

Authors: Vispute, Sushma Rahul; Rajeswari, K.; Nema, Aryan; Jagtap, Arya; Kulkarni, Mrugendra; Mohite, Pranav (PCCOE, Department of Computer Engineering, Pune, India)

ISBN: (print) 9798350304268

A widely studied problem in computer science is the restoration, segmentation, and classification of images, which involves image processing, computer vision, and machine learning techniques. Deep learning has made significant contributions to this field, bringing machine learning closer to artificial intelligence. Deep learning has a broad range of applications, including in surveillance, healthcare, medicine, and sports. Convolutional neural networks (CNN), a combination of artificial neural networks (ANN) and deep learning techniques, have made incredible advancements in deep learning. This survey compares methods for restoring noisy images, such as the Wiener filter, the wavelet method, and Wiener filtering with the BM3D technique, using Gaussian blurring and noising methods. The survey also examines the RGB colour model and the YCbCr colour model for image segmentation. Image classification is studied using CNN, where the survey discusses various parameters of convolutional neural networks, including activation functions and pooling methods. © 2023 IEEE.

Keywords: computer vision; convolutional neural network; face detection; face recognition; image classification; image restoration; noise reduction; segmentation

Proceedings - 2024 International Conference on Advances in Electrical Engineering and Computer Applications, AEECA 2024

5th International Conference on Advances in Electrical Engineering and Computer Applications, AEECA 2024

ISBN: (print) 9798350355253

The proceedings contain 127 papers. The topics discussed include: advanced data storage and processing technologies in a next-generation electric information acquisition system; analyzing file access characteristics for deep learning workloads on mobile devices; optimal scheduling of distributed energy storage for electric vehicles based on evolutionary dissipation theory; a novel semi-supervised learning approach for referring expression comprehension; research and implementation of material image subject segmentation method based on machine vision; application of image recognition and 3D reconstruction technology in virtual museum system; knowledge graph technology-based active research and judgment technology for electric power customer complaint risk; and path planning for unmanned underwater vehicles based on improved ant colony algorithm.

Automated Fundus Image Standardization Using a Dynamic Global Foreground Threshold Algorithm

8th International Conference on Image, Vision and Computing, ICIVC 2023

Authors: Kiefer, Riley; Abid, Muhammad; Ardali, Mahsa Raeisi; Steen, Jessica; Amjadian, Ehsan (Florida Polytechnic University, Department of Computer Science, Lakeland, FL, United States; College of Optometry, Nova Southeastern University, Fort Lauderdale, FL, United States; Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, Canada)

ISBN: (print) 9798350335231

A generic fundus foreground extractor is required for the standardization of fundus datasets in machine-learning applications due to the vast range of retinal fundus images. Some fundus images have a large amount of non-essential background data and others have missing data because of clipping. To standardize these varied images for machine learning applications while preserving the aspect resolution, a generalized threshold algorithm is needed to separate the foreground and background. Existing threshold algorithms fail to segment images with low contrast. There is a need for a generalized algorithm to handle varied image conditions in a dynamic manner. The proposed segmentation algorithm uses shifts in histogram frequency using intensity extrema to find the ideal threshold value. The proposed post-processing algorithm crops, pads, and resizes the image to a standardized size of 512x512 pixels using the segmentation map output. To demonstrate the effectiveness of this proposed standardization approach on downstream tasks, an ablation experiment of popular standardization strategies is evaluated on a newly proposed benchmark dataset, EyePACS-light. The experimental results demonstrate the benefits of using this standardization approach for resizing fundus images. © 2023 IEEE.
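
The crop, pad, and resize post-processing described in the abstract above can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' algorithm: the function name `standardize` is hypothetical, a caller-supplied fixed `threshold` stands in for the paper's dynamic threshold (whose details are not given in this record), the image is assumed to be grayscale and stored as nested lists, and nearest-neighbor sampling is assumed for the resize. Plain Python lists are used only to keep the sketch dependency-free.

```python
def standardize(image, threshold, size=512):
    """Crop to the foreground, pad to a square, and resize to size x size.

    image     -- grayscale image as a list of equal-length rows of numbers
    threshold -- pixels strictly above this value count as foreground
                 (a fixed stand-in for the paper's dynamic threshold)
    """
    # Rows and columns that contain at least one foreground pixel.
    rows = [r for r, row in enumerate(image) if any(v > threshold for v in row)]
    cols = [c for c in range(len(image[0]))
            if any(row[c] > threshold for row in image)]
    if not rows:
        raise ValueError("no foreground pixels above the threshold")

    # Crop to the bounding box of the foreground.
    crop = [row[cols[0]:cols[-1] + 1] for row in image[rows[0]:rows[-1] + 1]]
    h, w = len(crop), len(crop[0])

    # Pad with zeros to a centered square so the aspect ratio is preserved.
    side = max(h, w)
    top, left = (side - h) // 2, (side - w) // 2
    padded = [[0] * side for _ in range(side)]
    for r in range(h):
        padded[top + r][left:left + w] = crop[r]

    # Nearest-neighbor resize: map each output index to a source index.
    idx = [i * side // size for i in range(size)]
    return [[padded[r][c] for c in idx] for r in idx]


# A 100 x 200 dark image with a bright 40 x 100 region.
img = [[0] * 200 for _ in range(100)]
for r in range(20, 60):
    for c in range(50, 150):
        img[r][c] = 200
out = standardize(img, threshold=10)
print(len(out), len(out[0]))  # 512 512
```

Padding before resizing is what keeps the retina from being stretched when the cropped region is not square.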

Keywords: Standardization
