检索结果-内蒙古大学图书馆

Atomvision: A machine vision Library for Atomistic images

JOURNAL OF CHEMICAL INFORMATION AND MODELING 2023年第6期63卷 1708-1722页

作者： Choudhary, Kamal Gurunathan, Ramya DeCost, Brian Biacchi, Adam NIST Mat Measurement Lab Gaithersburg MD 20899 USA NIST Phys Measurement Lab Gaithersburg MD 20899 USA

Computer vision techniques have immense potential for materials design applications. In this work, we introduce an integrated and general-purpose Atomvision library that can be used to generate and curate microscopy image (such as scanning tunneling microscopy and scanning transmission electron microscopy) data sets and apply a variety of machine learning techniques. To demonstrate the applicability of this library, we (1) establish an atomistic image data set of about 10 000 materials with large structural and chemical diversity, (2) develop and compare convolutional and atomistic line graph neural network models to classify the Bravais lattices, (3) demonstrate the application of fully convolutional neural networks using U-Net architecture to pixelwise classify atom versus background, (4) use a generative adversarial network for super resolution, (5) curate an image data set on the basis of natural language processing using an open-access arXiv data set, and (6) integrate the computational framework with experimental microscopy images for Rh, Fe3O4, and SnS systems. The Atomvision library is available at https://***/ usnistgov/atomvision.

关键词： Scanning tunneling microscopy

来源：评论

学校读者我要写书评

暂无评论

A court line extraction algorithm for badminton tournament videos with horizontal line projection learning

引用

IET image processing 2023年第10期17卷 2907-2924页

作者： Wei, Chun-Ta Weng, Shiuh-Ku Natl Def Univ Chung Cheng Inst Technol Sch Def Sci Taoyuan Taiwan Chien Hsin Univ Sci & Technol Dept Elect Engn Taoyuan Taiwan Chien Hsin Univ Sci & Technol Dept Elect Engn Taoyuan 320678 Taiwan

Court line extraction is one of the important steps in the analysis of sport videos. The court extraction is the foundation of the analysis of badminton video, and an efficient method with horizontal line projection K-means machine learning algorithm to extract court lines from different broadcast badminton tournament videos is proposed in this paper. The horizontal lines are projected into 1-D histogram signal;then the signal is trained to learn the intensity of the histogram signal for locating the positions of the horizontal court lines. After the equations of the horizontal court lines and the court lines in the vertical direction have been formulized, the intersection points of the court lines can be calculated and the court line can be extracted. The experimental results show that the proposed method can extract the court lines more efficiently than that done by the Hough transform-related algorithms, which are widely applied in computer vision and self-driving car applications.

关键词： court lines horizontal line projections Hough transform k-means machine learning algorithm self-driving car applications

来源：评论

学校读者我要写书评

暂无评论

Enhanced Artificial vision for Visually Impaired Using Visual Implants

引用

IEEE ACCESS 2023年 11卷 80020-80029页

作者： Mohammadi, Hossein Mahvash Edrisi, Mohammad Hadi Savaria, Yvon Univ Isfahan Comp Engn Dept Esfahan *** Iran Polytech Montreal Elect Engn Dept Montreal PQ H3T 1J4 Canada

Argus ii is the most advanced retina implants approved by the US FDA and almost 350 visually impaired people are using it. This implant uses 60 microelectrodes implanted in the retina. The goal of this implant is to improve mobility and quality of life of its users. However, users' satisfaction is not very high due to the very low resolution of the phosphene images and features created by this device. This article proposes a system to improve the artificial vision created by visual implants. The proposed method extracts information about the people around the visually impaired person by using image processing and machine vision algorithms. This information includes the number of the people in the scene, whether they are known or unknown, their gender, estimated ages, facial emotions, and approximate distance from the visually impaired person. This information is extracted from the frames received by a camera mounted on the glasses of the user to generate signals that are fed into a visual stimulator. This information is shown to the user by a schematic vision created by some pre-trained patterns of phosphenes reflecting the information communicated to the user. The proposed system is validated with a simulated prosthetic vision comprising 150 microelectrodes that is compatible with the retina and visual cortex implants. A low-cost and energy efficient implementation of the proposed method executing on a Raspberry Pi 4 B at a frame rate of 4.5 frames/second shows the feasibility of using it in portable systems.

关键词： Argus ii artificial vision for visually impaired people simulated prosthetic vision scene understanding visual prosthesis visual implants retina implant visual cortex implant

来源：评论

学校读者我要写书评

暂无评论

SpikeSen: Low-Latency In-Sensor-Intelligence Design With Neuromorphic Spiking Neurons

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS ii-EXPRESS BRIEFS 2023年第6期70卷 1876-1880页

作者： Li, Ziru Zheng, Qilin Chen, Yiran Li, Hai Duke Univ Dept Elect & Comp Engn Durham NC 27707 USA

In-sensor-processing (ISP) paradigm has been exploited in state-of-the-art vision system designs to pave the way towards power-efficient sensing and processing. The redundant data transmission between sensors and processors is significantly minimized by local computation within each pixel. However, existing ISP designs suffer from limited frame rates and degraded fill factors. In this brief, we introduce a low-latency in-sensor-intelligence neuromorphic vision system using neuromorphic spiking neurons, namely SpikeSen. SpikeSen directly operates on the photocurrents and executes the computation in the frequency domain, reducing the long exposure time and speeding up the computation. Experiments show that SpikeSen can achieve more than 6.1x computation speedup compared to existing ISP designs with competitive energy consumption per pixel.

关键词： Neurons Photoconductivity Program processors image sensors Sensors machine vision Low latency communication In-sensor-processing neuromorphic computing low latency frequency-domain computation CMOS

来源：评论

学校读者我要写书评

暂无评论

3D Information in Robot vision System Based on Artificial Neural Network 14th

3D Information in Robot Vision System Based on Artificial Ne...

引用

14th International Conference on Frontier Computing, FC 2024

作者： Liu, Xiaoxiao Department of Mechanical and Electrical Engineering Jinan Engineering Vocational Technical College Shandong Jinan China

ISBN: (纸本)9789819627974

In the exploration of robot vision systems based on artificial neural networks, the research mainly focuses on their applications in 3D information recognition and processing. By simulating the processing of the human visual system, this technology enables robots to more effectively interpret and understand the three-dimensional spatial information of their environment, which has a revolutionary role in robot navigation, object recognition, obstacle avoidance and the execution of complex tasks. And this technology shows great potential in many fields such as industrial automation, autonomous vehicles, drone monitoring, and service robots. Not only that, it also plays a very important role in providing in-depth information, accurate positioning and efficient decision-making applications. According to the experimental data, among the 800 experimental subjects, more than 754 people were satisfied with the recognition accuracy, recognition speed, machine efficiency improvement, image recognition clarity, and overall satisfaction with the system. These findings indicate that with the continuous advancement of technology, 3D vision systems based on artificial neural networks will show more significant performance and value in future applications. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： machine vision

来源：评论

学校读者我要写书评

暂无评论

Semantic Document Layout Analysis of Handwritten Manuscripts

引用

Computers, Materials & Continua 2023年第5期75卷 2805-2831页

作者： Emad Sami Jaha Department of Computer Science Faculty of Computing and Information TechnologyKing Abdulaziz UniversityJeddah21589Saudi Arabia

A document layout can be more informative than merely a document’s visual and structural ***,document layout analysis(DLA)is considered a necessary prerequisite for advanced processing and detailed document image analysis to be further used in several applications and different *** research extends the traditional approaches of DLA and introduces the concept of semantic document layout analysis(SDLA)by proposing a novel framework for semantic layout analysis and characterization of handwritten *** proposed SDLA approach enables the derivation of implicit information and semantic characteristics,which can be effectively utilized in dozens of practical applications for various purposes,in a way bridging the semantic gap and providingmore understandable high-level document image analysis and more invariant characterization via absolute and relative *** approach is validated and evaluated on a large dataset ofArabic handwrittenmanuscripts comprising complex *** experimental work shows promising results in terms of accurate and effective semantic characteristic-based clustering and retrieval of handwritten *** also indicates the expected efficacy of using the capabilities of the proposed approach in automating and facilitating many functional,reallife tasks such as effort estimation and pricing of transcription or typing of such complex manuscripts.

关键词： Semantic characteristics semantic labeling document layout analysis semantic document layout analysis handwritten manuscripts clustering retrieval image processing computer vision machine learning

来源：评论

学校读者我要写书评

暂无评论

Systematic Review of Retinal Blood Vessels Segmentation Based on AI-driven Technique

引用

JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024年第4期37卷 1783-1799页

作者： Verma, Prem Kumari Kaur, Jagdeep Dr B R Ambedkar Natl Inst Technol Dept Comp Sci & Engn Jalandhar 144008 Punjab India

image segmentation is a crucial task in computer vision and image processing, with numerous segmentation algorithms being found in the literature. It has important applications in scene understanding, medical image analysis, robotic perception, video surveillance, augmented reality, image compression, among others. In light of this, the widespread popularity of deep learning (DL) and machine learning has inspired the creation of fresh methods for segmenting images using DL and ML models respectively. We offer a thorough analysis of this recent literature, encompassing the range of ground-breaking initiatives in semantic and instance segmentation, including convolutional pixel-labeling networks, encoder-decoder architectures, multi-scale and pyramid-based methods, recurrent networks, visual attention models, and generative models in adversarial settings. We study the connections, benefits, and importance of various DL- and ML-based segmentation models;look at the most popular datasets;and evaluate results in this Literature.

关键词： Retinal image segmentation machine learning Deep learning

来源：评论

学校读者我要写书评

暂无评论

A Review on machine Learning Styles in Computer vision-Techniques and Future Directions

引用

IEEE ACCESS 2022年 10卷 107293-107329页

作者： Mahadevkar, Supriya, V Khemani, Bharti Patil, Shruti Kotecha, Ketan Vora, Deepali R. Abraham, Ajith Gabralla, Lubna Abdelkareim Symbiosis Int Deemed Univ Symbiosis Inst Technol Pune 412115 Maharashtra India Symbiosis Int Deemed Univ Symbiosis Ctr Appl Artificial Intelligence Symbiosis Inst Technol Pune 412115 Maharashtra India Machine Intelligence Res Labs MIR Labs Auburn WA 98071 USA Princess Nourah Bint Abdulrahman Univ Coll Appl Dept Comp Sci & Informat Technol Riyadh 11671 Saudi Arabia

Computer applications have considerably shifted from single data processing to machine learning in recent years due to the accessibility and availability of massive volumes of data obtained through the internet and various sources. machine learning is automating human assistance by training an algorithm on relevant data. Supervised, Unsupervised, and Reinforcement Learning are the three fundamental categories of machine learning techniques. In this paper, we have discussed the different learning styles used in the field of Computer vision, Deep Learning, Neural networks, and machine learning. Some of the most recent applications of machine learning in computer vision include object identification, object classification, and extracting usable information from images, graphic documents, and videos. Some machine learning techniques frequently include zero-shot learning, active learning, contrastive learning, self-supervised learning, life-long learning, semi-supervised learning, ensemble learning, sequential learning, and multi-view learning used in computer vision until now. There is a lack of systematic reviews about all learning styles. This paper presents literature analysis of how different machine learning styles evolved in the field of Artificial Intelligence (AI) for computer vision. This research examines and evaluates machine learning applications in computer vision and future forecasting. This paper will be helpful for researchers working with learning styles as it gives a deep insight into future directions.

关键词： machine learning Computer vision Object detection Artificial intelligence machine learning algorithms image segmentation Feature extraction machine learning techniques computer vision supervised learning multi-task learning object detection artificial intelligence image categorization zero-shot learning

来源：评论

学校读者我要写书评

暂无评论

On a Variational Problem with a Nonstandard Growth Functional and Its applications to image processing

引用

JOURNAL OF MATHEMATICAL IMAGING AND vision 2023年第3期65卷 472-491页

作者： D'Apice, Ciro Kogut, Peter, I Kupenko, Olha P. Manzo, Rosanna Univ Salerno Dipartimento Sci Aziendali Management & Innovat S Via Giovanni Paolo II 132 I-84084 Salerno Italy Oles Honchar Dnipro Natl Univ Dept Differential Equat Gagarin Av 72 UA-49005 Dnipro Ukraine EOS Data Analyt Ukraine Gagarin Av 103a UA-49005 Dnipro Ukraine Dnipro Univ Technol Dept Syst Anal & Control Yavornitskii Av 19 UA-49005 Dnipro Ukraine Natl Acad Sci Inst Appl Syst Anal Peremogy Av 37-35 UA-03056 Kviv Ukraine Minist Educ & Sci Ukraine Peremogy Av 37-35 UA-03056 Kviv Ukraine Univ Salerno Dept Informat Engn Elect Engn & Appl Math Via Giovanni Paolo II 132 I-84084 Salerno Italy

We propose a new variational model in Sobolev-Orlicz spaces with non-standard growth conditions of the objective functional and discuss its applications to image processing. The characteristic feature of the proposed model is that the variable exponent, which is associated with non-standard growth, is unknown a priori and it depends on a particular function that belongs to the domain of objective functional. So, we deal with a constrained minimization problem that lives in variable Sobolev-Orlicz spaces. In view of this, we discuss the consistency of the proposed model, give the scheme for its regularization, derive the corresponding optimality system, and propose an iterative algorithm for practical implementations.

关键词： Inverse problem Nonconvex programming image reconstruction Constrained minimization problems Approximation methods Sobolev-Orlicz space

来源：评论

学校读者我要写书评

暂无评论

machine vision inspection systems /

引用

2020年

作者： edited by Muthukumaran Malarvel Soumya Ranjan Nayak Sury Narayan Panda Prasant Kumar Pattnaik...

来源：内蒙古大学图书馆图书评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：