In end-to-end driving, human driving demonstrations are used to train perception-based driving models by imitation learning. This process is supervised by vehicle signals (e.g., steering angle, acceleration) but does not require extra costly supervision such as human labeling of sensor data. As a representative of such vision-based end-to-end driving models, CILRS is commonly used as a baseline against which new driving models are compared. Several recent models outperform CILRS by using expensive sensor suites and/or large amounts of human-labeled training data. Given this performance gap, one might conclude that vision-based pure end-to-end driving is not worth pursuing. However, we argue that the approach retains great value and potential in terms of cost and maintenance. In this paper, we present CIL++, which improves on CILRS by processing higher-resolution images with a human-inspired horizontal field of view (HFOV) as an inductive bias and by incorporating a proper attention mechanism. CIL++ achieves competitive performance compared to models that are far more costly to develop. We propose CIL++ as a replacement for CILRS: a strong vision-based pure end-to-end driving baseline, supervised only by vehicle signals and trained by conditional imitation learning.
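The recipe described in this abstract lends itself to a compact sketch. The following is a minimal, hypothetical PyTorch illustration of conditional imitation learning with attention over multi-view image tokens; the view count, layer sizes, and token layout are illustrative assumptions, not the published CIL++ architecture.

```python
import torch
import torch.nn as nn

class ConditionalAttentionDriver(nn.Module):
    """Hypothetical sketch of conditional imitation learning with attention.
    Sizes and token layout are assumptions, not the published CIL++ model."""

    def __init__(self, n_views: int = 3, d_model: int = 256, n_commands: int = 4):
        super().__init__()
        # Per-view CNN encoder producing a grid of feature tokens.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 64, 5, stride=4), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2), nn.ReLU(),
            nn.Conv2d(128, d_model, 3, stride=2), nn.ReLU(),
        )
        # Self-attention over the tokens of all views fuses the wide HFOV.
        layer = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)
        self.fusion = nn.TransformerEncoder(layer, num_layers=4)
        # Conditioning: discrete navigation command plus current speed.
        self.cmd_embed = nn.Embedding(n_commands, d_model)
        self.speed_proj = nn.Linear(1, d_model)
        # Vehicle-signal head: steering angle and acceleration.
        self.head = nn.Linear(d_model, 2)

    def forward(self, views, command, speed):
        # views: (B, V, 3, H, W); command: (B,) int64; speed: (B, 1) float
        b = views.shape[0]
        feats = self.encoder(views.flatten(0, 1))         # (B*V, D, h, w)
        tokens = feats.flatten(2).transpose(1, 2)         # (B*V, h*w, D)
        tokens = tokens.reshape(b, -1, tokens.shape[-1])  # (B, V*h*w, D)
        cond = (self.cmd_embed(command) + self.speed_proj(speed)).unsqueeze(1)
        fused = self.fusion(torch.cat([cond, tokens], dim=1))
        return self.head(fused[:, 0])                     # (B, 2): steer, accel
```

Imitation learning then reduces to regressing the recorded vehicle signals, e.g. an L1 loss between the model output and the demonstrated (steering, acceleration) pair, with no human labeling of the sensor data itself.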
Analysis of passive acoustic monitoring (PAM) recordings for biodiversity monitoring is time-consuming and challenged by the presence of background noise in the recordings. Existing models for sound event detection ...
We introduce novel methods that speed up the pose-graph generation for global Structure-from-Motion algorithms. We replace the widely used "accept-or-reject" strategy for image pairs, where often thousands o...
The aorta is the largest vessel of the human body, and its pathological degenerations, such as dissections and aneurysms, can be life-threatening. An automatic and fast segmentation of the aorta can therefore be a help...
ISBN (digital): 9798331510831
ISBN (print): 9798331510848
Developing robust drone detection systems is often constrained by the limited availability of large-scale annotated training data and the high costs associated with real-world data collection. However, leveraging synthetic data generated via game-engine-based simulations provides a promising and cost-effective solution to this issue. Therefore, we present SynDronevision, a synthetic dataset specifically designed for RGB-based drone detection in surveillance applications. Featuring diverse backgrounds, lighting conditions, and drone models, SynDronevision offers a comprehensive training foundation for deep learning algorithms. To evaluate the dataset's effectiveness, we perform a comparative analysis across a selection of recent YOLO detection models. Our findings demonstrate that SynDronevision is a valuable resource for real-world data enrichment, achieving notable enhancements in model performance and robustness, while significantly reducing the time and costs of real-world data acquisition. SynDronevision can be accessed at https://***/records/13360116.
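As a hedged illustration of how such a synthetic dataset would typically be consumed, the snippet below fine-tunes a recent YOLO model with the Ultralytics API. The data YAML file names are hypothetical stand-ins for whatever configs one writes for SynDronevision and a real-world validation set.

```python
from ultralytics import YOLO

# Start from a pretrained checkpoint; any recent YOLO variant is used the same way.
model = YOLO("yolov8s.pt")

# "syndronevision.yaml" is a hypothetical Ultralytics data config pointing at
# the downloaded SynDronevision images and labels.
model.train(data="syndronevision.yaml", epochs=100, imgsz=640)

# Measure transfer to real imagery on a (hypothetical) real-world validation set.
metrics = model.val(data="real_drones.yaml")
print(metrics.box.map50)  # mAP@0.5
```

The enrichment experiments the abstract describes would follow the same pattern, mixing synthetic and real images in the training config and validating on real data only.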
ISBN (digital): 9798350349399
ISBN (print): 9798350349405
Predominant methods for image-based drone detection frequently rely on generic object detection algorithms such as YOLOv5. While proficient at identifying drones against homogeneous backgrounds, these algorithms often struggle in complex, highly textured environments. In such scenarios, drones blend seamlessly into the background, creating camouflage effects that degrade detection quality. To address this issue, we introduce a novel deep learning architecture called YOLO-FEDER FusionNet. Unlike conventional approaches, YOLO-FEDER FusionNet combines generic object detection methods with the specialized strengths of camouflage object detection techniques to enhance drone detection capabilities. Comprehensive evaluations of YOLO-FEDER FusionNet demonstrate the efficacy of the proposed model, with substantial reductions in both missed detections and false alarms.
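The abstract does not spell out how the two branches are combined, but one plausible reading is a late fusion of detections: boxes confirmed by both the generic and the camouflage-aware branch get boosted confidence, while camouflage-only boxes are kept to recover drones missed against textured backgrounds. The sketch below implements that idea; it is an illustrative assumption, not the published FusionNet mechanism.

```python
import numpy as np

def iou(a, b):
    """IoU between two [x1, y1, x2, y2] boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area = lambda box: (box[2] - box[0]) * (box[3] - box[1])
    union = area(a) + area(b) - inter
    return inter / union if union > 0 else 0.0

def fuse_detections(generic, camo, iou_thr=0.5):
    """Merge (box, score) lists from a generic detector and a camouflage
    branch. Agreeing boxes get a probabilistic-OR score boost; unmatched
    camouflage boxes are kept to recover otherwise missed drones."""
    fused, matched = [], set()
    for g_box, g_score in generic:
        best = max(((i, iou(g_box, c_box)) for i, (c_box, _) in enumerate(camo)),
                   key=lambda t: t[1], default=(None, 0.0))
        if best[1] >= iou_thr:
            matched.add(best[0])
            c_score = camo[best[0]][1]
            fused.append((g_box, 1 - (1 - g_score) * (1 - c_score)))
        else:
            fused.append((g_box, g_score))
    fused += [camo[i] for i in range(len(camo)) if i not in matched]
    return sorted(fused, key=lambda t: t[1], reverse=True)
```

In practice one would re-run non-maximum suppression on the merged list; the probabilistic-OR score combination is just one reasonable choice among several.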
ISBN (digital): 9798350360325
ISBN (print): 9798350360332
Floods are an increasingly common global threat, causing emergencies and severe damage to infrastructure. During crises, organisations such as the World Food Programme use remotely sensed imagery, typically obtained through drones, for rapid situational analysis to plan life-saving actions. Computer vision tools are needed to support task-force experts on site in evaluating this imagery, improving their efficiency and helping them allocate resources strategically. We introduce the BlessemFlood21 dataset to stimulate research on efficient flood detection tools. The imagery was acquired during the 2021 Erftstadt-Blessem flooding event and consists of high-resolution, georeferenced RGB-NIR images. In the resulting RGB dataset, the images are supplemented with detailed water masks obtained via a semi-supervised human-in-the-loop technique, in which the NIR information in particular is leveraged to classify pixels as either water or non-water. We evaluate our dataset by training and testing established deep learning models for semantic segmentation. With BlessemFlood21 we provide labeled high-resolution RGB data and a baseline for further development of algorithmic solutions tailored to flood detection in RGB imagery.
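For a sense of why the NIR channel is valuable here: water strongly absorbs near-infrared light while reflecting green, so a normalized difference water index (NDWI) separates water pixels well as a first pass. The sketch below is a generic NDWI thresholding step, not the dataset's actual human-in-the-loop pipeline, whose corrections and threshold choices are not detailed in the abstract.

```python
import numpy as np

def ndwi_water_mask(green: np.ndarray, nir: np.ndarray, thr: float = 0.0) -> np.ndarray:
    """Normalized Difference Water Index: (G - NIR) / (G + NIR).
    Water reflects green and strongly absorbs NIR, so NDWI > thr is a
    common first-pass water classifier; thr is tuned per scene."""
    g = green.astype(np.float32)
    n = nir.astype(np.float32)
    ndwi = (g - n) / np.clip(g + n, 1e-6, None)  # avoid division by zero
    return ndwi > thr
```

A human in the loop would then adjust thr per scene and correct residual errors before the masks are used as segmentation labels.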