Search Results - Inner Mongolia University Library

Author: Getty, Neil (Illinois Institute of Technology)

Degree level: Ph.D., Doctor of Philosophy

While AI in medicine is an extremely rich research field, it has been much slower to reach real-world clinical settings than other applications of AI such as natural language processing (NLP) and image processing/generation. Often the stakes of failure are more dire, access to private and proprietary data is more costly, and the burden of proof required by expert clinicians is much higher. Beyond these barriers, the typical data-driven approach to validation is interrupted by a need for expertise to analyze results. Whereas the results of a trained ImageNet or machine translation model are easily verified by a computational researcher, analysis in medicine can be much more demanding and multi-disciplinary. AI in medicine is motivated by a great demand for progress in health care, but carries an even greater responsibility for high accuracy, model transparency, and expert validation. This thesis develops machine and deep learning techniques for medical image enhancement, patient outcome prognosis, and minimally invasive robotic surgery awareness and augmentation. Each of the works presented was undertaken in direct collaboration with medical domain experts, and the efforts could not have been completed without them. In pursuing medical image enhancement we worked with radiologists, neuroscientists and a neurosurgeon. In patient outcome prognosis we worked with clinical neuropsychologists and a cardiovascular surgeon. For robotic surgery we worked with surgical residents and a surgeon expert in minimally invasive surgery. Each of these collaborations guided priorities for problem and model design, analysis, and long-term objectives that ground this thesis as a concerted effort towards clinically actionable medical AI. The contributions of this thesis focus on three specific medical domains. (1) Deep learning for medical brain scans: developed processing pipelines and deep learning models for image annotation, registration, segmentation and diagnosis in both tr

Keywords: Computer vision; Deep learning; Machine learning; Robotic surgery

Noncontact Clearance Measurement Research Based on Machine Vision

11th International Workshop of Advanced Manufacturing and Automation, IWAMA 2021

Authors: Che, Kai; Lu, Dongli; Guo, Jun; Chen, Yufeng; Peng, Guosheng; Xu, Lianbing (School of Electrical and Information Engineering, Hubei University of Automotive Technology, Hubei Shiyan 442002, China; Zhejiang Hong Cheng Computer Systems Co. Ltd., Hangzhou 311100, China)

ISBN: (print) 9789811905711

Parts assembly clearance measurement is trending towards high-precision, noncontact methods. This work aims to measure clearance by image processing based on machine vision. The machine vision system highlights the assembly clearance region. The clearance regions are then segmented and rotated to vertical, and the geometric center of each region and its inclination relative to the horizontal direction are obtained. The two points intersecting the boundary of the region are obtained through the linear relationship, and the clearance width is the pixel distance between these two points mapped to the actual width in the world coordinate system. The measurement results show that the system works effectively and meets the requirements, which makes it suitable for industrial applications. © 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

Keywords: Computer vision
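
The last step described in the abstract above, mapping the pixel distance between two boundary points to a physical width, can be sketched as follows. This is a minimal illustration and not the authors' implementation: the function name `clearance_width_mm` and the single uniform `mm_per_pixel` scale factor are assumptions, and a real system would derive the pixel-to-world mapping from camera calibration.

```python
import math

def clearance_width_mm(p1, p2, mm_per_pixel):
    """Return the physical distance between two boundary points.

    p1, p2: (x, y) pixel coordinates of the two boundary intersections.
    mm_per_pixel: assumed uniform scale factor (hypothetical calibration value).
    """
    # Euclidean distance between the two points, in pixels.
    pixel_distance = math.hypot(p2[0] - p1[0], p2[1] - p1[1])
    # Map the pixel distance to a physical width.
    return pixel_distance * mm_per_pixel

# Two example boundary points that are 5 pixels apart.
width = clearance_width_mm((120.0, 80.0), (123.0, 84.0), mm_per_pixel=0.05)
```

A single scale factor is only valid when the clearance lies in a plane roughly parallel to the image sensor; otherwise a full calibration model would be needed.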

Image Information Assistance Neural Network for VideoPose3D-based Monocular 3D Pose Estimation

17th International Conference on Machine Vision Applications (MVA)

Authors: Wang, Hao; Luo, Dingli; Ikenaga, Takeshi (Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka 8080135, Japan)

ISBN: (print) 9784901122207

3D pose estimation based on a monocular camera can be applied to various fields such as human-computer interaction and human action recognition. As a two-stage 3D pose estimator, VideoPose3D achieves state-of-the-art accuracy. However, because of the limitation of two-stage processing, image information is partially lost in the process of mapping 2D poses to 3D space, which limits the final accuracy. This paper proposes an image-assisting pose estimation model and a back-projection based offset generating module. The image-assisting pose estimation model consists of a 2D pose processing branch and an image processing branch. Image information is processed to generate an offset that refines the intermediate 3D pose produced by the 2D pose processing network. The back-projection based offset generating module projects the intermediate 3D poses to 2D space and calculates the error between the projection and the input 2D pose. By combining this error with the extracted image features, the neural network generates an offset to decrease the error. In evaluation, the accuracy on each action of the Human3.6M dataset improves by an average of 0.9 mm over the VideoPose3D baseline.

Keywords: Human computer interaction; Three-dimensional displays; Machine vision; Pose estimation; Neural networks; Feature extraction; Cameras
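
The back-projection step in the abstract above (project the intermediate 3D pose to 2D, then compare it with the input 2D pose) can be illustrated with a small sketch. The abstract does not state the camera model, so this example assumes a simple pinhole camera; the function names and the parameters `focal`, `cx`, and `cy` are hypothetical and are not taken from the paper.

```python
def project_points(points_3d, focal, cx, cy):
    """Project camera-space 3D joints to 2D with an assumed pinhole model."""
    return [(focal * x / z + cx, focal * y / z + cy) for x, y, z in points_3d]

def reprojection_error(points_3d, points_2d, focal, cx, cy):
    """Per-joint (du, dv) difference between the input 2D pose and the projection."""
    projected = project_points(points_3d, focal, cx, cy)
    return [(u - pu, v - pv) for (u, v), (pu, pv) in zip(points_2d, projected)]

# Hypothetical intermediate 3D pose (two joints, in metres) and input 2D pose (pixels).
pose_3d = [(0.0, 0.0, 2.0), (0.5, -0.5, 2.0)]
pose_2d = [(502.0, 499.0), (750.0, 250.0)]
errors = reprojection_error(pose_3d, pose_2d, focal=1000.0, cx=500.0, cy=500.0)
```

In the paper's pipeline this error is fed, together with image features, to a network that predicts a corrective offset; the sketch covers only the error computation.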

Novel Sensing Approaches for Structural Deformation Monitoring and 3D Measurements

IEEE Sensors Journal, 2021, vol. 21, no. 10, pp. 11318-11328

Authors: Castro-Toscano, Moises J.; Rodriguez-Quinonez, Julio C.; Sergiyenko, Oleg; Flores-Fuentes, Wendy; Ramirez-Hernandez, Luis Roberto; Hernandez-Balbuena, Daniel; Lindner, Lars; Rascon, Raul (Univ Autonoma Baja California, Fac Ingn, Mexicali 21280, Baja California, Mexico; Univ Autonoma Baja California, Inst Ingn, Mexicali 21280, Baja California, Mexico)

Laser vision systems have enabled the development of different applications such as reverse engineering, manufacturing, navigation systems, and structural health monitoring (SHM). However, most machine vision systems for structural behavior analysis have a restricted field of view, consume high levels of computational resources for image processing, and require special illumination conditions to achieve lower error rates. Therefore, the purpose of this paper is to present a technical vision system (TVS) for structural behavior analysis using dynamic laser triangulation and a k-Nearest Neighbor (k-NN) machine learning regression algorithm. The proposed vision system was tested to demonstrate its practicality: different deformations and displacements were analyzed over real structures in controlled laboratory conditions to assure the reproducibility of the experimentation. The TVS prototype proved to be a reliable option for SHM tasks, balancing precision and operating range without the aforementioned issues.

Keywords: Dynamic triangulation; vision sensors; laser scanner; vision systems; machine learning; SHM
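
The abstract above names k-NN regression without giving details, so the following is only a generic sketch of the technique: predict a value as the mean of the targets of the k nearest training samples. The one-dimensional feature, the sample data, and the function name `knn_regress` are all hypothetical and do not come from the paper.

```python
def knn_regress(train_x, train_y, query, k=3):
    """Predict a target as the mean of the k nearest neighbours' targets."""
    if not 0 < k <= len(train_x):
        raise ValueError("k must be between 1 and the number of training samples")
    # Sort sample indices by distance to the query (1-D feature assumed).
    order = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - query))
    nearest = order[:k]
    return sum(train_y[i] for i in nearest) / k

# Hypothetical calibration data: raw sensor reading -> known reference value.
readings = [0.0, 1.0, 2.0, 3.0, 4.0]
references = [0.0, 10.0, 20.0, 30.0, 40.0]
prediction = knn_regress(readings, references, query=2.1, k=3)
```

With these values the three nearest readings are 2.0, 3.0, and 1.0, so the prediction is the mean of 20, 30, and 10.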

Artificial Intelligence Applications in Intrusion Detection Systems for Unmanned Aerial Vehicles

Author: Hamadi, Raby (King Abdullah University of Science and Technology)

Degree level: Master's

This master's thesis focuses on the cutting-edge application of AI in developing intrusion detection systems (IDS) for unmanned aerial vehicles (UAVs) in smart cities. The objective is to address the escalating problem of UAV intrusions, which pose a significant risk to the safety and security of citizens and critical infrastructure. The thesis explores the current state of the art and provides a comprehensive understanding of recent advancements in the field, encompassing both physical and network attacks. The literature review examines various techniques and approaches employed in the development of AI-based IDS, including machine learning algorithms, computer vision technologies, and edge computing. A proposed solution leveraging computer vision technologies is presented to detect and identify intruding UAVs in the sky effectively. The system employs machine learning algorithms to analyze video feeds from city-installed cameras, enabling real-time identification of potential intrusions. The proposed approach encompasses the detection of unauthorized drones, dangerous UAVs, and UAVs carrying suspicious payloads. Moreover, the thesis introduces a CycleGAN network for image denoising that can translate noisy images to clean images without the need for paired training data. This approach employs two generators and two discriminators, incorporating a cycle consistency loss that ensures the generated images align with their corresponding input images. Furthermore, a distributed architecture is proposed for processing collected images using an edge-offloading approach within the UAV network. This architecture allows flying and ground cameras to leverage the computational capabilities of their IoT peers to process captured images. A hybrid neural network is developed to predict, based on input tasks, the potential edge computers capable of real-time processing. The edge-offloading approach reduces the computational burden on the centralized system a

Rate-Distortion in Image Coding for Machines

Picture Coding Symposium (PCS)

Authors: Harell, Alon; De Andrade, Anderson; Bajic, Ivan V. (Simon Fraser Univ, Sch Engn Sci, Burnaby, BC, Canada)

ISBN: (print) 9781665492577

In recent years, there has been a sharp increase in transmission of images to remote servers specifically for the purpose of computer vision. In many applications, such as surveillance, images are mostly transmitted for automated analysis, and rarely seen by humans. Using traditional compression for this scenario has been shown to be inefficient in terms of bit-rate, likely due to the focus on human based distortion metrics. Thus, it is important to create specific image coding methods for joint use by humans and machines. One way to create the machine side of such a codec is to perform feature matching of some intermediate layer in a Deep Neural Network performing the machine task. In this work, we explore the effects of the layer choice used in training a learnable codec for humans and machines. We prove, using the data processing inequality, that matching features from deeper layers is preferable in the sense of rate-distortion. Next, we confirm our findings empirically by re-training an existing model for scalable human-machine coding. In our experiments we show the trade-off between the human and machine sides of such a scalable model, and discuss the benefit of using deeper layers for training in that regard.

Keywords: Image coding; Deep neural networks; Collaborative intelligence; Object detection

Deep Ensemble Learning with Frame Skipping for Face Anti-Spoofing

12th International Conference on Image Processing Theory, Tools and Applications, IPTA 2023

Authors: Muhammad, Usman; Hoque, Md Ziaul; Oussalah, Mourad; Laaksonen, Jorma (University of Oulu, Center for Machine Vision and Signal Analysis, Finland; Aalto University, Department of Computer Science, Finland)

ISBN: (print) 9798350325416

Face presentation attacks, also known as spoofing attacks, pose a substantial threat to biometric systems that rely on facial recognition systems, such as access control systems, mobile payments, and identity verification systems. To mitigate the spoofing risk, several video-based methods have been presented in the literature that analyze facial motion in successive video frames. However, estimating the motion between adjacent frames is a challenging task and requires high computational cost. In this paper, we rephrase the face anti-spoofing task as a motion prediction problem and introduce a deep ensemble learning model with a frame skipping mechanism. In particular, the proposed frame skipping adopts a uniform sampling approach by dividing the original video into video clips of fixed size. By doing so, every nth frame of the clip is selected to ensure that the temporal patterns can easily be perceived during the training of three different recurrent neural networks (RNNs). Motivated by the performance of individual RNNs, a meta-model is developed to improve the overall detection performance by combining the prediction of individual RNNs. Extensive experiments were performed on four datasets, and state-of-the-art performance is reported on MSU-MFSD (3.12%), Replay-Attack (11.19%), and OULU-NPU (12.23%) databases by using half total error rates (HTERs) in the most challenging cross-dataset testing scenario. © 2023 IEEE.

Keywords: Face recognition
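
The frame-skipping idea in the abstract above can be sketched in a few lines. This is one possible reading of the description (split the video into fixed-size clips, then keep every nth frame within each clip), not the authors' code; the names `skip_frames`, `clip_size`, and `step` are hypothetical. The `hter` helper uses the standard definition of half total error rate as the mean of the false acceptance rate and the false rejection rate.

```python
def skip_frames(num_frames, clip_size, step):
    """Split frame indices into fixed-size clips and keep every `step`-th frame."""
    clips = []
    for start in range(0, num_frames, clip_size):
        clip = range(start, min(start + clip_size, num_frames))
        clips.append(list(clip[::step]))
    return clips

def hter(far, frr):
    """Half total error rate: the mean of false acceptance and false rejection rates."""
    return (far + frr) / 2.0

# A 10-frame video, clips of 5 frames, keeping every 2nd frame of each clip.
selected = skip_frames(num_frames=10, clip_size=5, step=2)
example_hter = hter(far=0.04, frr=0.02)
```

The selected frame indices would then be the inputs to the recurrent models; how the paper orders or batches them is not stated in the abstract.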

Real Time Object Detection and Clasification Using Small and Similar Figures in Image Processing

3rd IEEE Asian Conference on Innovation in Technology, ASIANCON 2023

Authors: Thakur, Kanika; Banerjee, Pallab; Naaz, Farheen; Banerjee, Probal (Amity School of Engg. and Technology, Amity University Jharkhand, Ranchi, India; Govt. Polytechnic, Jharkhand, Ranchi, India)

ISBN: (print) 9798350302288

Many adaptations of object detection have been developed in recent years. Object detection means identifying an object's name and its other characteristics in an image or a video. This field is known to be the most difficult technique in computing. This paper shows a new and simple way of detecting images and objects using a machine learning approach. Two methodologies are adopted in this project. The first is OpenCV, a library of machine learning software that is open to all developers. It provides a common infrastructure for computer applications, which helps accelerate commercial machine learning work and products. The second is the YOLO algorithm, short for You Only Look Once. This algorithm is used to predict objects and images and defines the point location of each object in terms of its dimensions. Here, object detection is applied to like figures that are small and similar to one another. © 2023 IEEE.

Keywords: Computer vision

Development of a Low-Cost Software to Obtain Quantitative Parameters in the Open Field Test for Application in Neuroscience Research

27th Brazilian Congress on Biomedical Engineering (CBEB)

Authors: Costalat, T. R. M.; Negrao, I. P. R.; Gomes-Leal, W. (Fed Univ Para, Inst Technol, Belem, Para, Brazil; Fed Univ Para, Inst Biol Sci, Lab Expt Neuroprotect & Neuroregenerat, Belem, Para, Brazil)

ISBN: (print) 9783030706012; 9783030706005

This paper describes the development of low-cost software, called Rat Steps, which yields quantitative data (total distance traveled and average speed) as well as the graphic trajectory of an animal in the open field test. This behavioral test is widely used in neuroscience to visualize locomotor impairment following acute brain injury, including stroke, as well as the effect of experimental therapies for these neural disorders. The main tools used for the software development were digital image processing techniques, Python programming, the OpenCV library, and machine learning algorithms, including the Mean Shift method. The software was successfully developed and effectively obtains quantitative parameters from the Open Field Test, which allows several applications in neuroscience research.

Keywords: Computer vision; Image processing; Machine learning; Neuroscience; Open field test
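
The two quantitative parameters named in the abstract above, total distance traveled and average speed, follow directly from a tracked trajectory. The sketch below is not the Rat Steps code: it assumes the tracker has already produced one (x, y) position per video frame in centimetres, and the names `trajectory_stats` and `fps` are hypothetical.

```python
import math

def trajectory_stats(positions, fps):
    """Return (total distance, average speed) for a per-frame trajectory.

    positions: list of (x, y) coordinates, one per frame (assumed centimetres).
    fps: frames per second of the recording (assumed constant).
    """
    # Sum the straight-line distances between consecutive positions.
    total = sum(math.dist(a, b) for a, b in zip(positions, positions[1:]))
    duration = (len(positions) - 1) / fps
    speed = total / duration if duration > 0 else 0.0
    return total, speed

# Hypothetical three-frame track recorded at 2 frames per second.
distance_cm, speed_cm_s = trajectory_stats([(0, 0), (3, 4), (3, 10)], fps=2)
```

Tracking noise inflates the summed distance, so a practical tool would usually smooth the positions or ignore sub-threshold movements before summing.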

3D reconstruction of moving object by double sampling based on phase shifting profilometry

9th Symposium on Novel Photoelectronic Detection Technology and Applications

Authors: Zhang, Qinghui; Li, Hao; Lu, Lei; Pan, Wei; Su, Zhilong; Zhang, Mengya; Lv, Pengtao (Key Laboratory of Grain Information Processing and Control of Ministry of Education, Henan University of Technology, Ministry of Education, Zhengzhou 450001, China; College of Information Science and Engineering, Henan University of Technology, Zhengzhou 450001, China; Department of R & D, OPT Machine Vision Tech Co. Ltd, Dongguan 523860, China; Shanghai Institute of Applied Mathematics and Mechanics, School of Mechanics and Engineering Science, Shanghai University, Shanghai 200444, China)

ISBN: (print) 9781510664432

When using traditional phase-shift profilometry for 3D measurement, it is necessary to keep the measured object static during the shooting process. When the measured object is moving, errors will occur if the projection and capture of the fringe image is not fast enough. This paper proposes a new method to reconstruct the moving object by double sampling. A trigger control device is applied to the camera and projector, which ensures that after each projection, two consecutive images are captured before the next projection. Then, the phase information is retrieved by analyzing the relationship between the motion and fringe patterns. Finally, the moving object is retrieved successfully. The proposed method increased the frame rate of the moving object reconstruction. © 2023 SPIE.

Keywords: Profilometry
