检索结果-内蒙古大学图书馆

Reconstruction of Film and TV Scenes Based on Computer-Aided Design and machine vision

Computer-Aided Design and applications 2024年第S15期21卷 290-307页

作者： Li, Qingyang Wang, Kai College of Arts Cheongju University Cheongju28503 Korea Republic of School of Literature and Journalism Yantai University Yantai264005 China

The reconstruction of film and TV scenes is an important part of the film and TV production process, which has a decisive impact on the visual effect of the film and the audience's viewing experience. The modeling method of automatically obtaining the 3D geometric structure of natural scenes using 3D reconstruction technology can break away from the tedious manual interaction mode of traditional 3D modeling, making the 3D modeling process simpler and more convenient. This study attempts to apply computer-aided design (CAD) and machine vision technology to the reconstruction of film and TV scenes, aiming to reduce the complexity of the model while ensuring its accuracy, thereby improving the overall efficiency of film and TV scene reconstruction. The study also introduced an assessment function based on wavelet transform (WT) to evaluate the quality of film and TV scene reconstruction. Compared with the WT model, the improved algorithm proposed in this article significantly improves image processing efficiency and reduces processing time. In addition, by introducing lighting and texture information, the reconstructed model has a higher sense of realism, providing the audience with an immersive viewing experience, thereby improving the quality of the viewing experience. The research results have played a crucial role in various stages of film and TV scene reconstruction, bringing higher value and broader creative space to film and TV production. © 2024 U-turn Press LLC.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Computer vision: applications of Visual AI and image processing

引用

2023年

ISBN: (数字)9783110756722;9783110756821

ISBN: (纸本)9783110756678

This book focuses on the latest developments in the fields of visual AI, image processing and computer vision. It shows research in basic techniques like image pre-processing, feature extraction, and enhancement, along with applications in biometrics, healthcare, neuroscience and forensics. The book highlights algorithms, processes, novel architectures and results underlying machine intelligence with detailed execution flow of models.

关键词： Computer Sciences

来源：评论

学校读者我要写书评

暂无评论

Robust visual-based method and new datasets for ego-lane index estimation in urban environment

引用

machine vision AND applications 2024年第5期35卷 112页

作者： Wang, Dianzheng Liang, Dongyi Li, Shaomiao Chinese Acad Sci Inst Engn Thermophys Beijing 100190 Peoples R China HAOMO AI Technol Co Ltd Beijing 100192 Peoples R China

Correct and robust ego-lane index estimation is crucial for autonomous driving in the absence of high-definition maps, especially in urban environments. Previous ego-lane index estimation approaches rely on feature extraction, which limits the robustness. To overcome these shortages, this study proposes a robust ego-lane index estimation framework upon only the original visual image. After optimization of the processing route, the raw image was randomly cropped in the height direction and then input into a double supervised LaneLoc network to obtain the index estimations and confidences. A post-process was also proposed to achieve the global ego-lane index from the estimated left and right indexes with the total lane number. To evaluate our proposed method, we manually annotated the ego-lane index of public datasets which can work as an ego-lane index estimation baseline for the first time. The proposed algorithm achieved 96.48/95.40% (precision/recall) on the CULane dataset and 99.45/99.49% (precision/recall) on the TuSimple dataset, demonstrating the effectiveness and efficiency of lane localization in diverse driving environments. The code and dataset annotation results will be exposed publicly on https://***/haomo-ai/LaneLoc.

关键词： Ego-lane index estimation Visual image Dataset Double supervision Urban environments Autonomous driving

来源：评论

学校读者我要写书评

暂无评论

Annotation Tools for Computer vision Tasks 17

Annotation Tools for Computer Vision Tasks

引用

17th International Conference on machine vision, ICMV 2024

作者： Moschidis, Christos Vrochidou, Eleni Papakostas, George A. Department of Informatics Democritus University of Thrace Kavala65404 Greece

ISBN: (纸本)9781510688278

Common computer vision (CV) tasks include image classification, object detection, segmentation, and recognition. To handle such tasks, machine learning (ML) models for image processing require a great amount of annotated training data. While datasets are expanding in size and variety, annotation becomes demanding, since its quality can severely affect the models' performance. Thus, several annotation tools have been developed and designated for specific applications and model requirements. This work aims to provide an overview of the most up-to-date annotation tools for computer vision tasks, including 2D and 3D image data and video, comparatively highlighting their advantages and limitations. The appropriateness of each tool for specific tasks is emphasized, providing a reference map for researchers towards determining the annotation tool best tailored to their needs. Future trends in image annotation are also discussed. © 2025 SPIE.

关键词： image annotation

来源：评论

学校读者我要写书评

暂无评论

TGGLinesPlus: A Robust Topological Graph-Guided Computer vision Algorithm for Line Detection From images

引用

TRANSACTIONS IN GIS 2025年第1期29卷

作者： Yang, Liping Driscol, Joshua Gong, Ming Slack, Katie Zhang, Wenbin Wang, Shujie Potts, Catherine G. Univ New Mexico Dept Geog & Environm Studies GeoAIR Lab Albuquerque NM 87106 USA Univ New Mexico Ctr Adv Spatial Informat Res & Educ ASPIRE Albuquerque NM 87106 USA Univ New Mexico Dept Comp Sci Albuquerque NM 87106 USA Woodwell Climate Res Ctr Falmouth MA USA Univ Dayton Dept Elect & Comp Engn Dayton OH USA Florida Int Univ Knight Fdn Sch Comp & Informat Sci Miami FL USA Penn State Univ Dept Geog University Pk PA USA Penn State Univ Earth & Environm Syst Inst University Pk PA USA D Wave Quantum Inc Burnaby BC Canada

Line detection is a classic and essential problem in image processing, computer vision, and machine intelligence. Line detection has many important applications, including image vectorization (e.g., document recognition and art design), indoor mapping, and important societal challenges (e.g., sea ice fracture line extraction from satellite imagery). Many line detection algorithms and methods have been developed, but robust and intuitive methods are still lacking. In this paper, we proposed and implemented a topological graph-guided algorithm, named TGGLinesPlus, for line detection. Our experiments on images from a wide range of domains have demonstrated the flexibility of our TGGLinesPlus algorithm. We benchmarked our algorithm with five classic and state-of-the-art line detection methods and evaluated the benchmark results qualitatively and quantitatively, the results demonstrate the robustness of TGGLinesPlus.

关键词： algorithms computational geometry computer vision edge and feature detection graph theory image processing line detection spatial sciences topological graph

来源：评论

学校读者我要写书评

暂无评论

Accelerating Brillouin fiber sensing via destructive-interference-enabled precise raw data acquisition and nonredundant image denoising

引用

OPTICA 2025年第2期12卷 216-227页

作者： Li, Zonglei Zhou, Yin Hu, Jianqi Yao, Jianping Yan, Lianshan Southwest Jiaotong Univ Ctr Informat Photon & Commun Sch Informat Sci & Technol Chengdu 610031 Sichuan Peoples R China Southwest Jiaotong Univ Sch Informat Sci & Technol Lab Intelligent Percept & Smart Operat & Maintenan Chengdu 610031 Sichuan Peoples R China Sorbonne Univ Coll France CNRS Lab Kastler BrosselENS Univ PSL 24 Rue Lhomond F-75005 Paris France Univ Ottawa Sch Elect Engn & Comp Sci Microwave Photon Res Lab 25 Templeton St Ottawa ON K1N 6N5 Canada

Distributed Brillouin fiber sensing, based on the linear relationship between Brillouin frequency shift (BFS) and physical quantities applied to sensing fibers, has found numerous applications in the past few decades. Recently, various advanced image denoising methods have been used for performance enhancements in Brillouin fiber sensors. Yet, even though these methods do significantly remove noises contained in raw data, the BFS measurement uncertainty is not reduced-the newly introduced image denoising appears redundant with the conventional signal processing. Here, in order to truly make Brillouin fiber sensing benefit from image denoising, we directly map BFS from the image-denoised data via the slope-assisted analysis of the Brillouin phase-gain ratio. As such, noise reduction resulting from image denoising fully translates into measurement uncertainty reduction. In order to further optimize the performance of image-denoising-enhanced Brillouin fiber sensing, we improve the quality of the raw Brillouin gain and phase data by designing an advanced coherent detection scheme called a microwave-photonic interferometer, which converts some amplitude and phase noises into common-mode noises and further eliminates them through destructive interference. A more than 20-fold sensing speed acceleration compared to the state-of-the-art is experimentally achieved. This remarkable performance enhancement is obtained by only optimizing the signal detection and processing unit, without modifying Brillouin scattering between pump and probe waves. Our method seamlessly connects Brillouin fiber sensing with advanced image denoising methods developed for computer vision and artificial intelligence, and makes imagedenoising-enhanced Brillouin fiber sensing outperform the state-of-the art significantly. (c) 2025 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

关键词： Brillouin scattering Fiber optic sensors Heterodyne detection machine vision Phase noise Raman scattering

来源：评论

学校读者我要写书评

暂无评论

Optimized vision transformer encoder with cnn for automatic psoriasis disease detection

引用

MULTIMEDIA TOOLS AND applications 2023年第21期83卷 59597-59616页

作者： Vishwakarma, Gagan Nandanwar, Amit Kumar Thakur, Ghanshyam Singh Indian Inst Informat Technol Comp Sci & Engn Bhopal Madhya Pradesh India Maulana Azad Natl Inst Technol Comp Sci Engn Bhopal MP India Maulana Azad Natl Inst Technol Dept Comp Applicat Bhopal MP India

Psoriasis is a skin disorder that results in swollen skin cells and red, itchy areas on the skin. 40% of the world's population is currently affected by psoriasis. Nowadays, using skin image analysis technology is the main way for detecting psoriasis. Additionally, a number of academics have identified potential machine learning methods for categorising the psoriasis illness. However, the accuracy and computational efficiency of the model still need to be improved. Thus, in this paper, we present an optimized vision transformer for autonomous psoriasis disease detection. Following pre-processing, feature optimized image is attained using convolutional neural network (CNN) which embeds full image and concatenates to each vision transformer encoder layer. It leads the network to always "retain" the full image at the end of each transformer block output. In parallel, the pre-processed images are cropped into patches and these patches along with its positional encoded information are given as input to the optimized transformer encoder. To enhance the performance of transformer, the hyper-parameters of it are optimized using adaptive rabbit optimization algorithm (AROA). Results of this article confirm that the proposed optimized vision transformer model achieved better classification accuracy of 97.7% and F-Score of 96.5%.

关键词： CNN vision transformer AROA Psoriasis

来源：评论

学校读者我要写书评

暂无评论

Quantum Visual Computing

引用

IEEE COMPUTER GRAPHICS AND applications 2024年第5期44卷 10-13页

作者： Mueller-Roemer, Johannes S. Golyanik, Vladislav Birdal, Tolga Fraunhofer IGD D-64283 Darmstadt Germany Tech Univ Darmstadt D-64283 Darmstadt Germany Max Planck Inst Informat D-66123 Saarbrucken Germany Imperial Coll London London SW7 2AZ England

Quantum computing is emerging as a transformative force in computer science, offering significant advantages in speed and efficiency over classical computing methods. Despite this promise, the practical application of quantum computing to visual computing faces numerous challenges, including the complexity of quantum algorithms and the limitations of current quantum hardware. These challenges underscore the necessity for focused research and collaboration in this interdisciplinary area. This Special Issue of IEEE Computer Graphics and applications, "Quantum Visual Computing," aims at drawing attention to these challenges and bringing together pioneering research at the intersection of quantum and visual computing. By fostering dialogue and innovation between these fields, we hope to inspire new solutions and advance the state of the art in both domains.

关键词： machine Learning image processing Computer vision Computer Science Quantum Computing Interactive Tool Computer Graphics Computer applications Advantages Of Speed Use Of Registers Quantum Algorithms Interdisciplinary Area Current Hardware Quantum applications Quantum machine image Segmentation Software Development Articles In Issue Quantum State Game Development Quantum Phenomena applications In Quantum Computing

来源：评论

学校读者我要写书评

暂无评论

machine Learning applications: From Computer vision to Robotics

引用

2024年

作者： Chatterjee Indranath

ISBN: (数字)9781394173341;9781394173334

ISBN: (纸本)9781394173327

machine Learning applications Practical resource on the importance of machine Learning and Deep Learning applications in various technologies and real-world situations machine Learning applications discusses methodological advancements of machine learning and deep learning, presents applications in image processing, including face and vehicle detection, image classification, object detection, image segmentation, and delivers real-world applications in healthcare to identify diseases and diagnosis, such as creating smart health records and medical imaging diagnosis, and provides real-world examples, case studies, use cases, and techniques to enable the reader’s active learning. Composed of 13 chapters, this book also introduces real-world applications of machine and deep learning in blockchain technology, cyber security, and climate change. An explanation of AI and robotic applications in mechanical design is also discussed, including robot-assisted surgeries, security, and space exploration. The book describes the importance of each subject area and detail why they are so important to us from a societal and human perspective. Edited by two highly qualified academics and contributed to by established thought leaders in their respective fields, machine Learning applications includes information on:

Content based medical image retrieval (CBMIR), covering face and vehicle detection, multi-resolution and multisource analysis, manifold and image processing, and morphological processing
Smart medicine, including machine learning and artificial intelligence in medicine, risk identification, tailored interventions, and association rules
AI and robotics application for transportation and infrastructure (e.g., autonomous cars and smart cities), along with global warming and climate change
Identifying diseases and diagnosis, drug discovery and manufacturing, medical imaging diagnosis, personalized medicine, and smart health records

关键词：

来源：评论

学校读者我要写书评

暂无评论

vision based leather defect detection: a survey

引用

MULTIMEDIA TOOLS AND applications 2023年第1期82卷 989-1015页

作者： Jawahar, Malathy Anbarasi, L. Jani Geetha, S. CSIR Cent Leather Res Inst Leather Proc Technol Div Chennai 600020 Tamil Nadu India Vellore Inst Technol Sch Comp Sci & Engn Chennai 600127 Tamil Nadu India

Increasing consumer quality awareness and increase in consumer wealth drives the market demand for high quality leather and leather products. Reliable and effective detection and classification of leather surface defects is of profound significance to tanneries and industries where leather is a major raw material for leather accessories and leather parts manufacturers. This paper presents a methodical and a detailed review of the leather surface defects detection methods starting from leather image acquisition, leather image processing, feature extraction and classification for defect detection. Firstly, we introduce the fundamentals of leather image acquisition and various related image processing methods, feature extraction and classification for the defect inspection. Next, the existing datasets and summary of the recent methodologies used in this field are discussed. Finally, the challenges and suggested improvements to further the development of the application of advanced machine Learning and Deep Learning in this field are discussed. Deep learning algorithms are shown to have a great potential for leather surface defect detection and can help prepare a robust system that would greatly guarantee quality leather and provide monetary wealth from such leather products. Finally, research guidelines are presented to fellow researchers regarding data augmentation, leather defect detection models which need to be investigated in the future to make progress in this crucial area of research.

关键词： Leather defect detection vision based approach image capture Stages in leather processing Leather quality classification

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：