Ultra-Wide Band (UWB) is a widely used technology to provide real-time and accurate indoor localization to mobile robots, allowing their safe operation in the absence of a satellite-based navigation solution. However,...
详细信息
In this article, a system for automatic recognition of selected road users using artificial neural networks is proposed. Five models of neural networks were tested with various configurations. With the special prepare...
详细信息
ISBN:
(数字)9788362065424
ISBN:
(纸本)9788362065424
In this article, a system for automatic recognition of selected road users using artificial neural networks is proposed. Five models of neural networks were tested with various configurations. With the special prepared image database, training, validation and tests were conducted to identify five classes of road users: cyclists, people on electric scooters, roller skates, pedestrians and people using personal transport devices. A web application that recognize these classes of road users was also prepared.
This paper presents Agri Bot, a clever automated framework intended to help farmers in overseeing crop well being and water systems. The framework coordinates controller, plant infection discovery, soil dampness detec...
详细信息
Cross-domain few-shot learning (CDFSL) hyper-spectral image classification (HSIC) techniques have significantly enhanced the performance of HSIC under a few annotations. Due to the scattered spatial distribution of HS...
详细信息
Recent advances in vision transformers (ViTs) have achieved outstanding performance in visual recognition tasks, including image classification and detection. ViTs can learn global representations with their self-atte...
详细信息
ISBN:
(纸本)9781728198354
Recent advances in vision transformers (ViTs) have achieved outstanding performance in visual recognition tasks, including image classification and detection. ViTs can learn global representations with their self-attention mechanism, but they are usually heavy-weight and unsuitable for resource-constrained devices. In this paper, we propose a novel linear feature attention (LFA) module to reduce computation costs for vision transformers and combine efficient mobile CNN modules to form a parameter-efficient and high-performance CNN-ViT hybrid model, called LightFormer, which can serve as a general-purpose backbone to learn both global and local representation. Comprehensive experiments demonstrate that LightFormer achieves competitive performance across different visual recognition tasks. On the ImageNet-1K dataset, LightFormer achieves top-1 accuracy of 78.5% with 5.5 million parameters. Our model also performs well when transferred to object detection and semantic segmentation tasks. On the MS COCO dataset, LightFormer attains mAP of 33.2 within the YOLOv3 framework, and on the Cityscapes dataset, with only a simple all-MLP decoder, LightFormer achieves mIoU of 78.5 and FPS of 15.3, surpassing state-of-the-art lightweight segmentation networks.
Keyword spotting is necessary for triggering human-machine speech interaction. It is a challenging task especially in low signal-to-noise ratio and moving scenarios, such as on a sweeping robot with strong ego-noise. ...
详细信息
ISBN:
(纸本)9781665405409
Keyword spotting is necessary for triggering human-machine speech interaction. It is a challenging task especially in low signal-to-noise ratio and moving scenarios, such as on a sweeping robot with strong ego-noise. This paper proposes a novel approach for joint ego-noise suppression and keyword detection. The keyword detection model accepts outputs from multi-look adaptive beamformers. The noise covariance matrix in the beamformer is in turn updated using the keyword absence probability given by the model, forming an end-to-end loop-back. The keyword model also adopts a multi-channel feature fusion using self-attention, and a hidden Markov model for online decoding. The performance of the proposed approach is verified on real-word datasets recorded on a sweeping robot.
The defects of the Printed Circuit Board(PCB) directly affect the performance and reliability of electronic products. Therefore, detecting PCB defects is crucial. Lightweight models in PCB production inspection can ef...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
The defects of the Printed Circuit Board(PCB) directly affect the performance and reliability of electronic products. Therefore, detecting PCB defects is crucial. Lightweight models in PCB production inspection can effectively reduce equipment costs, but they exhibit limited feature extraction capabilities. Moreover, complex background conditions can interfere with the model's ability to locate and recognize small defects. To address these challenges, we propose LSDM-PCB, a lightweight PCB defect detection model based on YOLOv8n. Firstly, we improve the network structure to reduce the number of model parameters while enhancing the model's ability to capture small defects. Additionally, we adopt Receptive-Field Attention Convolution(RFAConv) as a downsampling module to enhance the model's feature extraction by considering the importance of each feature within the receptive field. Finally, we propose a Global and Local Mixed Attention(GLMA) mechanism to strengthen multi-scale feature representation, allowing the model to focus more on small defects. Results show LSDM-PCB reduces model parameters by 74% and improves mAP50 to 96.8%, a 2.7% enhancement compared to the baseline model YOLOv8n.
As the prevalence of lower limb motor dysfunction due to conditions such as cerebral palsy, hemiplegia, and paraplegia continues to rise, the current medical infrastructure for sports rehabilitation is insufficient to...
详细信息
In this article, a system for automatic recognition of Polish historic vehicles is proposed. A unique database of vehicle images was prepared, including real and scale vehicles. A software was developed to automatical...
详细信息
ISBN:
(数字)9788362065424
ISBN:
(纸本)9788362065424
In this article, a system for automatic recognition of Polish historic vehicles is proposed. A unique database of vehicle images was prepared, including real and scale vehicles. A software was developed to automatically classify vehicle models and assess their condition using artificial neural networks. Training, validation and test were performed for various datasets and neural network models. In the best case the VGG16 network reached 95% of vehicle model classification accuracy. In addition, a web application visualizing vehicle recognition is offered.
This paper compares the performance of two control techniques on a DC motor-driven differential drive robot. Regular cascaded PI controller is the first scheme. In the second scheme, the outer PI controller is replace...
详细信息
ISBN:
(纸本)9781665482370
This paper compares the performance of two control techniques on a DC motor-driven differential drive robot. Regular cascaded PI controller is the first scheme. In the second scheme, the outer PI controller is replaced by direct model reference adaptive control (MRAC). The main objective of the motor controller is that it follows a desired angular speed. Both schemes are compared in terms of average absolute speed error and power consumption. Preliminary simulations are performed on a DC motor model;whereas, experimental runs are done on a differentially driven robot powered by DC motors, to compare the performance of both control laws. It is found that the control scheme with MRAC has lesser speed error and power consumption compared to the PI controller, both in simulation and the experimental investigation.
暂无评论