检索结果-内蒙古大学图书馆

IEEE Conference on Advanced Video and Signal Based Surveillance (AVSS)

作者： François-Guillaume Landry Moulay A. Akhloufi Department of Computer Science Perception Robotics and Intelligent Machines Research Group (PRIME) Université de Moncton Moncton NB Canada

ISBN: (数字)9781665463829

ISBN: (纸本)9781665463836

Road surface monitoring in winter conditions is of great importance to ensure the safety of road users. Estimation of snow coverage on roads can be included in intelligent transportation systems to alert drivers or improve snow removal processes. Several models have been proposed for estimating snow coverage using surveillance cameras, but these models have focused on predicting few snow levels, which limits their usefulness in practice. In this paper, we present a model that allows a more granular estimation of the percentage of road surface covered by snow by predicting snow coverage from 0% (no snow) to 100% (fully snow-covered) using increments of 10%. We propose an ensemble learning model combining a deep convolutional neural network (CNN) and a support-vector machine (SVM). The accuracy of our model is similar to the state-of-the-art accuracy despite the higher task complexity associated with the increased granularity of predictions.

关键词： Support vector machines Image segmentation Snow Computational modeling Roads Surveillance Predictive models

来源：评论

学校读者我要写书评

暂无评论

Distributed Monitoring and Control System of Gas Concentrations in Different Environment

Distributed Monitoring and Control System of Gas Concentrati...

引用

International Conference on Control Systems and computer science (CSCS)

作者： Iulian Furdu Eusebiu Pruteanu Costică Nițu Department of Mathematics and Informatics University Vasile Alecsandri of Bacau Bacau Romania Department of Energy and Computer Science University Vasile Alecsandri of Bacau Bacau Romania Department of Intelligent Robotics) Polytechnic University) Bucharest Romania

Wireless Sensor Systems (WSN) is a broad, exciting area with new perspectives and growing growth over the past decades, where more research is being done. WSNs contain many (hundreds of thousands) of micro-sized, cheap chips, powered by low-cost wireless interconnected batteries. These chips are called nodes, which could be of several types, including acoustic, radar, low-frequency magnetic, thermal, and visual sampling frequencies. Many applications are based nowadays on WSN, such as environmental control, smart cities, wildlife monitoring, vehicles and infrastructure, natural disasters, home security, underwater investigations, military, airplane surveillance and drone body sensors. One of the aims of the paper is to develop a smart architecture based on intelligent microprocessor sensors, WSN802GCA/GPA and the software needed to operate with a TINI control unit. TINI allows this system to be connected to Ethernet via TCP/I$\mathbf{P}$

关键词：

来源：评论

学校读者我要写书评

暂无评论

Data Engineering and AI-Powered Skin Cancer Identification for Healthcare Applications

引用

Procedia computer science 2024年 246卷 179-188页

作者： Imran Ahmed Misbah Ahmad Abdellah Chehri Gwanggil Jeon School of Computing and Information Science Anglia Ruskin University Cambridge UK Animal and Agriculture Department Hartpury University Gloucester UK Centre for Machine Vision Bristol Robotics Laboratory University of the West of England Bristol UK Department of Mathematics and Computer Science at the Royal Military College of Canada (RMC) Canada Department of Embedded Systems Engineering Incheon National University Incheon Korea

Skin cancer diagnosis, a critical task in the medical domain, can be revolutionized through the application of advanced deep-learning techniques. This work investigates the efficacy of Convolutional Neural Networks (CNNs) in the automated classification of skin cancer. The process begins with a comprehensive explanation of key CNN layers: Conv2D, MaxPool2D, Dropout, and Dense. The Conv2D layers employ learnable filters that transform localized image segments, while MaxPool2D contributes to downsampling, effectively reducing computational cost and overfitting risk. Integrating these layers enables the network to capture local and global characteristics, which is crucial for accurate classification. Adding Dropout layers enhances generalization and mitigates overfitting by introducing randomness during training. ReLU activation functions infuse non-linearity, and the Flatten layer facilitates the transition to fully connected layers. The proposed CNN architecture is meticulously designed considering filter counts, kernel sizes, and pooling dimensions. The trained model demonstrates promising performance by utilizing the HAM10000 dataset, encompassing diverse skin lesion images across seven classes. The CNN model’s parameters and architecture are systematically presented, offering insights into its design rationale. The model undergoes optimization with the Adam optimizer and annealing techniques to facilitate convergence. The model’s effectiveness is evaluated on validation and test datasets, demonstrating an accuracy of 78.55% and 76.49%, respectively, for skin cancer classification. Data augmentation strategies are introduced to enhance model generalization further. The results underscore CNN’s potential as a robust tool for automating skin cancer diagnosis, aligning with the broader trend of leveraging deep learning for medical image analysis

关键词： Image Processing Medical Imaging Deep learning Skin Cancer Data Analytics

来源：评论

学校读者我要写书评

暂无评论

P1AC: Revisiting Absolute Pose From a Single Affine Correspondence

P1AC: Revisiting Absolute Pose From a Single Affine Correspo...

引用

International Conference on computer vision (ICCV)

作者： Jonathan Ventura Zuzana Kukelova Torsten Sattler Dániel Baráth Department of Computer Science & Software Engineering Cal Poly San Luis Obispo Visual Recognition Group Faculty of Electrical Engineering Czech Technical University in Prague Czech Institute of Informatics Robotics and Cybernetics Czech Technical University in Prague Computer Vision and Geometry Group ETH Zürich

Affine correspondences have traditionally been used to improve feature matching over wide baselines. While recent work has successfully used affine correspondences to solve various relative camera pose estimation problems, less attention has been given to their use in absolute pose estimation. We introduce the first general solution to the problem of estimating the pose of a calibrated camera given a single observation of an oriented point and an affine correspondence. The advantage of our approach (P1AC) is that it requires only a single correspondence, in comparison to the traditional point-based approach (P3P), significantly reducing the combinatorics in robust estimation. P1AC provides a general solution that removes restrictive assumptions made in prior work and is applicable to large-scale image-based localization. We propose a minimal solution to the P1AC problem and evaluate our novel solver on synthetic data, showing its numerical stability and performance under various types of noise. On standard image-based localization benchmarks we show that P1AC achieves more accurate results than the widely used P3P algorithm. Code for our method is available at https://***/jonathanventura/P1AC/.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Augmented Box Replay: Overcoming Foreground Shift for Incremental Object Detection

Augmented Box Replay: Overcoming Foreground Shift for Increm...

引用

International Conference on computer vision (ICCV)

作者： Yuyang Liu Yang Cong Dipam Goswami Xialei Liu Joost van de Weijer State Key Laboratory of Robotics Shenyang Institute of Automation Chinese Academy of Sciences Institutes for Robotics and Intelligent Manufacturing Chinese Academy of Sciences University of Chinese Academy of Sciences South China University of Technology Computer Vision Center Barcelona VCIP CS Nankai University Department of Computer Science Universitat Autònoma de Barcelona

In incremental learning, replaying stored samples from previous tasks together with current task samples is one of the most efficient approaches to address catastrophic forgetting. However, unlike incremental classification, image replay has not been successfully applied to incremental object detection (IOD). In this paper, we identify the overlooked problem of foreground shift as the main reason for this. Foreground shift only occurs when replaying images of previous tasks and refers to the fact that their background might contain foreground objects of the current task. To overcome this problem, a novel and efficient Augmented Box Replay (ABR) method is developed that only stores and replays foreground objects and thereby circumvents the foreground shift problem. In addition, we propose an innovative Attentive RoI Distillation loss that uses spatial attention from region-of-interest (RoI) features to constrain current model to focus on the most important information from old model. ABR significantly reduces forgetting of previous classes while maintaining high plasticity in current classes. Moreover, it considerably reduces the storage requirements when compared to standard image replay. Comprehensive experiments on Pascal-VOC and COCO datasets support the state-of-the-art performance of our model 1 .

关键词：

来源：评论

学校读者我要写书评

暂无评论

Entropy-Guided Reinforced Open World Active 3D Object Detection Learning

Entropy-Guided Reinforced Open World Active 3D Object Detect...

引用

Chinese Automation Congress (CAC)

作者： Haozhe Zhang Liyan Ma Shihui Ying Institute of Artificial Intelligence Shanghai University Shanghai China Shanghai Key Laboratory of Intelligent Manufacturing and Robotics School of Computer Engineering and Science School of Mechatronic Engineering and Automation Shanghai University Shanghai China Department of Mathematics School of Science Shanghai University Shanghai China

ISBN: (数字)9798350368604

ISBN: (纸本)9798350368611

Traditional fully annotated closed set 3D object detection methods improve model performance but are impractical in real-world settings due to the emergence of new categories and the complexity of 3D annotations. Open-World Object Detection (OWOD) addresses these issues but relies heavily on manual labeling, which is costly. This paper focuses on open world active learning and proposes an entropy-guided reinforced open world active 3D object detection (EROA). EROA regards active learning as a reinforcement learning problem tailored for open driving scenarios. We use entropy as a reward metric for efficient reinforcement learning. We also leverage knowledge from the 2D domain using object-level large-scale vision-language models to enhance sample selection. Extensive experiments evidence that the proposed EROA meets the dynamic and cost-sensitive requirements of autonomous driving, enabling real-time detection of both known and unknown objects.

关键词： Measurement Solid modeling Three-dimensional displays Active learning Object detection Reinforcement learning Entropy Real-time systems Labeling Autonomous vehicles

来源：评论

学校读者我要写书评

暂无评论

FeCAM: Exploiting the Heterogeneity of Class Distributions in Exemplar-Free Continual Learning

arXiv

引用

arXiv 2023年

作者： Goswami, Dipam Liu, Yuyang Twardowski, Bartlomiej van de Weijer, Joost Department of Computer Science Universitat Autònoma de Barcelona Spain Computer Vision Center Barcelona Spain University of Chinese Academy of Sciences China State Key Laboratory of Robotics Shenyang Institute of Automation Chinese Academy of Sciences China Institutes for Robotics and Intelligent Manufacturing Chinese Academy of Sciences China IDEAS-NCBR

Exemplar-free class-incremental learning (CIL) poses several challenges since it prohibits the rehearsal of data from previous tasks and thus suffers from catastrophic forgetting. Recent approaches to incrementally learning the classifier by freezing the feature extractor after the first task have gained much attention. In this paper, we explore prototypical networks for CIL, which generate new class prototypes using the frozen feature extractor and classify the features based on the Euclidean distance to the prototypes. In an analysis of the feature distributions of classes, we show that classification based on Euclidean metrics is successful for jointly trained features. However, when learning from non-stationary data, we observe that the Euclidean metric is suboptimal and that feature distributions are heterogeneous. To address this challenge, we revisit the anisotropic Mahalanobis distance for CIL. In addition, we empirically show that modeling the feature covariance relations is better than previous attempts at sampling features from normal distributions and training a linear classifier. Unlike existing methods, our approach generalizes to both many- and few-shot CIL settings, as well as to domain-incremental settings. Interestingly, without updating the backbone network, our method obtains state-of-the-art results on several standard continual learning benchmarks. Code is available at https://***/dipamgoswami/FeCAM. © 2023, CC BY.

关键词： Normal distribution

来源：评论

学校读者我要写书评

暂无评论

Fast-moving object counting with an event camera

arXiv

引用

arXiv 2022年

作者： Bialik, Kamil Kowalczyk, Marcin Blachut, Krzysztof Kryjak, Tomasz Embedded Vision Systems Group Computer Vision Laboratory Department of Automatic Control and Robotics AGH University of Science and Technology Al. Mickiewicza 30 Krakow30-059 Poland

This paper proposes the use of an event camera as a component of a vision system that enables counting of fast-moving objects – in this case, falling corn grains. These type of cameras transmit information about the change in brightness of individual pixels and are characterised by low latency, no motion blur, correct operation in different lighting conditions, as well as very low power consumption. The proposed counting algorithm processes events in real time. The operation of the solution was demonstrated on a stand consisting of a chute with a vibrating feeder, which allowed the number of grains falling to be adjusted. The objective of the control system with a PID controller was to maintain a constant average number of falling objects. The proposed solution was subjected to a series of tests to determine the correctness of the developed method operation. On their basis, the validity of using an event camera to count small, fast-moving objects and the associated wide range of potential industrial applications can be confirmed. © 2022, CC BY.

关键词： Cameras

来源：评论

学校读者我要写书评

暂无评论

PointPillars Backbone Type Selection For Fast and Accurate LiDAR Object Detection

TechRxiv

引用

TechRxiv 2022年

作者： Lis, Konrad Kryjak, Tomasz Embedded Vision Systems Group Computer Vision Laboratory Department of Automatic Control and Robotics AGH University of Science and Technology Al. Mickiewicza 30 Krakow30-059 Poland

3D object detection from LiDAR sensor data is an important topic in the context of autonomous cars and drones. In this paper, we present the results of experiments on the impact of backbone selection of a deep convolutional neural network on detection accuracy and computation speed. We chose the PointPillars network, which is characterised by a simple architecture, high speed, and modularity that allows for easy expansion. During the experiments, we paid particular attention to the change in detection efficiency (measured by the mAP metric) and the total number of multiply-addition operations needed to process one point cloud. We tested 10 different convolutional neural network architectures that are widely used in image-based detection problems. For a backbone like MobilenetV1, we obtained an almost 4x speedup at the cost of a 1.13% decrease in mAP. On the other hand, for CSPDarknet we got an acceleration of more than 1.5x at an increase in mAP of 0.33%. We have thus demonstrated that it is possible to significantly speed up a 3D object detector in LiDAR point clouds with a small decrease in detection efficiency. This result can be used when PointPillars or similar algorithms are implemented in embedded systems, including SoC FPGAs. The code is available at https://***/vision-agh/pointpillars backbone. © 2022, CC BY.

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

Signal propagation in transformers: theoretical perspectives and the role of rank collapse 22

Signal propagation in transformers: theoretical perspectives...

引用

Proceedings of the 36th International Conference on Neural Information Processing Systems

作者： Lorenzo Noci Sotiris Anagnostidis Luca Biggio Antonio Orvieto Sidak Pal Singh Aurelien Lucchi Dept of Computer Science ETH Zürich Dept of Computer Science ETH Zürich and Robotics & ML CSEM SA Alpnach Switzerland Dept of Computer Science ETH Zürich and MPI for Intelligent Systems Tübingen Department of Mathematics and Computer Science University of Basel

ISBN: (纸本)9781713871088

Transformers have achieved remarkable success in several domains, ranging from natural language processing to computer vision. Nevertheless, it has been recently shown that stacking self-attention layers — the distinctive architectural component of Transformers — can result in rank collapse of the tokens' representations at initialization. The question of if and how rank collapse affects training is still largely unanswered, and its investigation is necessary for a more comprehensive understanding of this architecture. In this work, we shed new light on the causes and the effects of this phenomenon. First, we show that rank collapse of the tokens' representations hinders training by causing the gradients of the queries and keys to vanish at initialization. Furthermore, we provide a thorough description of the origin of rank collapse and discuss how to prevent it via an appropriate depth-dependent scaling of the residual branches. Finally, our analysis unveils that specific architectural hyperparameters affect the gradients of queries and values differently, leading to disproportionate gradient norms. This suggests an explanation for the widespread use of adaptive methods for Transformers' optimization.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：