In order to ensure the power supply reliability of the power system, most of the power line maintenance operations use live work to complete wire breaking, wiring, replacement of fall insurance and other work. The dis...
ISBN: (print) 9798350344868; 9798350344851
This paper proposes SPBNet, a structure that enhances binarized convolutional neural networks (BCNNs) with a low-cost 1-D spatial attention block. Attention blocks can compensate for the performance drop in BCNNs, but the hardware overhead of complex attention blocks can be a significant burden. The proposed attention block consists of low-cost height-wise and width-wise 1-D convolutions and has an attention bias that adjusts the effect of attended features within a range of 0.5x to 1.5x. In experiments, using the proposed block in ResNet18-based BCNNs improves Top-1 accuracy by up to 2.7% over a baseline ReActNet on the CIFAR-100 dataset. Notably, without teacher-student training, the proposed structure achieves performance comparable to the baseline ReActNetA trained with teacher-student training.
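The abstract does not spell out the block's internals, so the following is only a minimal sketch of one way a 1-D height-wise and width-wise attention with a 0.5x to 1.5x scaling range could work. The mean pooling, the sigmoid, the averaging of the two branches, and all function names are assumptions made for illustration, not the authors' design.

```python
import math

def conv1d_same(values, kernel):
    """Zero-padded 1-D convolution with an odd-length kernel; output length equals input length."""
    half = len(kernel) // 2
    out = []
    for i in range(len(values)):
        acc = 0.0
        for k, weight in enumerate(kernel):
            j = i + k - half
            if 0 <= j < len(values):
                acc += weight * values[j]
        out.append(acc)
    return out

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def spatial_attention(fmap, kernel_h, kernel_w, bias=0.5):
    """Scale a single-channel H x W feature map by a factor in (0.5, 1.5).

    Assumed scheme: pool along each axis, run a 1-D convolution per axis,
    average the two sigmoid responses (range (0, 1)), then add bias=0.5.
    """
    h, w = len(fmap), len(fmap[0])
    row_profile = [sum(row) / w for row in fmap]                       # one value per row
    col_profile = [sum(fmap[y][x] for y in range(h)) / h for x in range(w)]  # one value per column
    att_h = [sigmoid(v) for v in conv1d_same(row_profile, kernel_h)]
    att_w = [sigmoid(v) for v in conv1d_same(col_profile, kernel_w)]
    return [[fmap[y][x] * ((att_h[y] + att_w[x]) / 2.0 + bias) for x in range(w)]
            for y in range(h)]

# Binarized activations (+1 / -1) and hypothetical 3-tap kernels.
features = [[1, -1, 1], [-1, -1, 1]]
scaled = spatial_attention(features, [0.5, 1.0, 0.5], [0.5, 1.0, 0.5])
```

Because the sigmoid output lies strictly between 0 and 1, adding a bias of 0.5 keeps every scale factor between 0.5 and 1.5, which is the property the abstract describes.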
To reduce the cost of tracking moving targets by robots and enhance the accuracy of target recognition and tracking, a vision-based tracking robot is designed. The onboard camera captures image data, which is initiall...
ISBN: (print) 9798350373981; 9798350373974
Induction generators are subject to various electrical and mechanical faults that affect wind turbine reliability. As only a few papers have studied the electrical rotor imbalance fault in the wound rotor induction generator, we present results of a motor current signature analysis (MCSA) technique for this fault using Blackman windows and show the resulting improvement in spectral accuracy. We also apply signal processing methods such as FFT, EMD, and RLMD in the context of fault diagnosis, highlighting the effect of rotor imbalance on the operation of the DFIG.
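To illustrate why a Blackman window can improve spectral accuracy, the sketch below compares spectral leakage with and without the window on a synthetic current signal. The 256 Hz sample rate, the 50.5 Hz tone, and the function names are hypothetical choices for the demonstration; the naive DFT stands in for an FFT only to keep the example dependency-free.

```python
import cmath
import math

def blackman_window(n):
    """Classic Blackman window: 0.42 - 0.5*cos(2*pi*i/(n-1)) + 0.08*cos(4*pi*i/(n-1))."""
    m = n - 1
    return [0.42 - 0.5 * math.cos(2 * math.pi * i / m)
            + 0.08 * math.cos(4 * math.pi * i / m) for i in range(n)]

def magnitude_spectrum(samples):
    """Naive O(N^2) DFT magnitudes for bins 0..N/2."""
    n = len(samples)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(samples)))
            for k in range(n // 2 + 1)]

def leakage_ratio(spectrum, far_bin):
    """Magnitude at a bin far from the tone, relative to the spectral peak."""
    return spectrum[far_bin] / max(spectrum)

SAMPLE_RATE_HZ = 256   # hypothetical; gives 1 Hz per bin with N = 256
N = 256
# A fundamental that falls between two bins, which is the worst case for leakage.
current = [math.sin(2 * math.pi * 50.5 * i / SAMPLE_RATE_HZ) for i in range(N)]

rect = magnitude_spectrum(current)
windowed = magnitude_spectrum([w * x for w, x in zip(blackman_window(N), current)])
print(leakage_ratio(rect, 80), leakage_ratio(windowed, 80))
```

With the window applied, the energy that leaks from the fundamental into distant bins is markedly lower, so weak fault-related components are less likely to be masked. The price is a wider main lobe, that is, coarser frequency resolution near the fundamental.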
ISBN: (print) 9783031299698; 9783031299704
In this paper, we propose a real-time FPGA implementation of the Semi-Global Matching (SGM) stereo vision algorithm. The designed module supports a 4K/Ultra HD (3840 x 2160 pixels @ 30 frames per second) video stream in a 4 pixel per clock (ppc) format and a 64-pixel disparity range. The baseline SGM implementation had to be modified to process pixels in the 4 ppc format and meet the timing constraints; nevertheless, our version provides results comparable to the original design. The solution was evaluated successfully on the Xilinx VC707 development board with a Virtex-7 FPGA device.
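For readers unfamiliar with SGM, the core step is cost aggregation along a path with a small penalty for one-level disparity changes and a larger penalty for bigger jumps. The sketch below shows that standard recurrence for a single left-to-right path in software; it is not the paper's FPGA design, and the function name, the penalty values `p1` and `p2`, and the toy costs are illustrative assumptions.

```python
def aggregate_path(cost_row, p1=1, p2=8):
    """Aggregate matching costs along one scanline path.

    cost_row[x][d] is the matching cost of pixel x at disparity d.
    Recurrence: L(x, d) = C(x, d)
                          + min(L(x-1, d), L(x-1, d-1) + p1, L(x-1, d+1) + p1,
                                min_k L(x-1, k) + p2)
                          - min_k L(x-1, k)
    """
    n_disp = len(cost_row[0])
    prev = list(cost_row[0])          # the first pixel has no predecessor
    aggregated = [prev]
    for x in range(1, len(cost_row)):
        prev_min = min(prev)
        cur = []
        for d in range(n_disp):
            candidates = [prev[d], prev_min + p2]
            if d > 0:
                candidates.append(prev[d - 1] + p1)
            if d < n_disp - 1:
                candidates.append(prev[d + 1] + p1)
            # Subtracting prev_min keeps the values from growing along the path.
            cur.append(cost_row[x][d] + min(candidates) - prev_min)
        aggregated.append(cur)
        prev = cur
    return aggregated

# Toy example: pixel 0 clearly prefers disparity 0, pixel 1 is ambiguous.
costs = [[0, 9, 9], [4, 3, 9]]
result = aggregate_path(costs, p1=2, p2=8)
best = [row.index(min(row)) for row in result]
```

In this toy case the raw cost of the second pixel slightly favors disparity 1, but the aggregated cost favors disparity 0, consistent with its neighbor. Full SGM sums such aggregated costs over several path directions before picking the minimum.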
ISBN: (digital) 9781665453493
ISBN: (print) 9781665453493
For a robot to compete at air hockey, it must track the fast-moving puck and its control system must react quickly. Event cameras can solve the visual tracking task while overcoming the motion blur and/or high processing requirements that come with traditional RGB cameras. Each pixel of an event camera responds independently to changes in light, resulting in a high-frequency, low-latency update of the puck position. Vision-in-the-loop robot control can then maintain stability with much faster movements. In this paper, we introduce the control loop that lets an iCub robot follow the position of the puck with its head motion. We evaluate the accuracy and stability of the iCub motion as the latency of the tracked position is varied from 1 ms to 30 ms, a range achievable in real time with the event camera, with increasing latency eventually resulting in control failure. We conclude that the event-driven tracking paradigm is an enabling technology for unlocking smooth, dynamic robot motions from vision, including for tasks beyond air hockey.
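The link between latency and control failure can be illustrated with a toy simulation. The sketch below is not the iCub controller: it assumes a one-dimensional head position driven by a proportional velocity command computed from a visually measured error that arrives late. The gain, the sinusoidal puck motion, and the function name are all hypothetical.

```python
import math

def peak_tracking_error(latency_ms, gain=60.0, duration_s=4.0, dt=0.001, puck_hz=1.0):
    """Peak |puck - head| during the final second of a simulated tracking run.

    The head velocity is gain * (visual error observed latency_ms ago),
    integrated with a simple Euler step.
    """
    delay_steps = max(1, round(latency_ms / 1000.0 / dt))
    n_steps = round(duration_s / dt)
    tail_start = n_steps - round(1.0 / dt)
    head = 0.0
    error_history = []
    peak = 0.0
    for n in range(n_steps):
        puck = math.sin(2 * math.pi * puck_hz * n * dt)
        error = puck - head
        error_history.append(error)
        # The controller only sees the error as it was delay_steps ago.
        delayed = error_history[n - delay_steps] if n >= delay_steps else 0.0
        head += gain * delayed * dt
        if n >= tail_start:
            peak = max(peak, abs(error))
    return peak

for latency in (1, 10, 30):
    print(latency, "ms ->", peak_tracking_error(latency))
```

For this simple delayed-feedback model, the loop becomes unstable once the product of gain and delay exceeds pi/2, so with the assumed gain of 60 per second the 30 ms case diverges while the 1 ms case tracks with a small lag. A real robot has additional dynamics, so the actual failure threshold will differ.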
ISBN: (print) 9798350349405; 9798350349399
Post-training quantization (PTQ) efficiently compresses vision models, but it comes with a certain degree of accuracy degradation. Reconstruction methods aim to enhance model performance by narrowing the gap between the quantized model and the full-precision model, often yielding promising results. However, efforts to significantly improve the performance of PTQ through reconstruction in the vision Transformer (ViT) have shown limited efficacy. In this paper, we conduct a thorough analysis of the reasons for this limited effectiveness and propose MGRQ (Mixed Granularity Reconstruction Quantization) to address the issue. Unlike previous reconstruction schemes, MGRQ introduces a mixed granularity reconstruction approach. Specifically, MGRQ enhances the performance of PTQ by introducing Extra-Block Global Supervision and Intra-Block Local Supervision, building upon Optimized Block-wise Reconstruction. Extra-Block Global Supervision considers the relationship between block outputs and the model's output, aiding block-wise reconstruction through global supervision. Meanwhile, Intra-Block Local Supervision reduces generalization errors by aligning the distribution of outputs at each layer within a block. MGRQ is then further optimized for reconstruction through Mixed Granularity Loss Fusion. Extensive experiments conducted on various ViT models illustrate the effectiveness of MGRQ. Notably, MGRQ demonstrates robust performance in low-bit quantization, thereby enhancing the practicality of the quantized model.
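As background for the gap that reconstruction methods try to narrow, the sketch below measures the output error of a single linear layer after generic symmetric uniform weight quantization. It does not implement MGRQ or any of its supervision terms; the quantizer, the toy weights and inputs, and the function names are assumptions chosen for illustration.

```python
def fake_quantize(values, n_bits):
    """Symmetric uniform quantization to n_bits, followed by dequantization."""
    q_max = 2 ** (n_bits - 1) - 1
    largest = max(abs(v) for v in values)
    if largest == 0:
        return list(values)
    scale = largest / q_max
    return [max(-q_max, min(q_max, round(v / scale))) * scale for v in values]

def reconstruction_error(weights, inputs, n_bits):
    """Mean squared difference between full-precision and quantized layer outputs."""
    quantized = fake_quantize(weights, n_bits)
    total = 0.0
    for x in inputs:
        full = sum(w * xi for w, xi in zip(weights, x))
        approx = sum(w * xi for w, xi in zip(quantized, x))
        total += (full - approx) ** 2
    return total / len(inputs)

# Hypothetical weights and calibration inputs for one output neuron.
weights = [0.9, -0.31, 0.07, 0.52]
inputs = [[1.0, 2.0, 3.0, 4.0], [0.5, -1.0, 2.0, 0.0]]
for bits in (8, 4, 2):
    print(bits, "bits ->", reconstruction_error(weights, inputs, bits))
```

The error grows sharply as the bit width shrinks, which is why low-bit settings benefit most from reconstruction. Reconstruction-based PTQ methods typically minimize an output-matching error of this kind over calibration data, at the granularity of layers or blocks.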
ISBN: (print) 9783031612800; 9783031612817
Understanding the user's internal state is indispensable for human-robot interaction in social signal processing. To mitigate the bias of sentiments observed by third-party annotators, the importance of sentiments self-reported by the users themselves has recently been pointed out. However, a self-reported internal state is not expressed through similar multimodal behaviors across individuals, which leads to a performance gap between self-reported and third-party sentiment estimation. Speaker adaptation for social signal processing (SASSP) is necessary to learn individual social signal characteristics and mitigate individual differences. For effective adaptation to speakers with different characteristics, the influence of individual differences on internal state estimation must be clarified, but this has not yet been done. To address this problem, this study conducts an empirical analysis by training and testing models on multimodal data from a group of speakers. We then analyze the relationships between the best model's performance and speaker characteristics, including age, gender, personality, and the speaker's expectations before the human-robot interaction experiment. The results show that all of these aspects influence estimation performance in SASSP due to differences in expression. This study provides suggestions and directions for setting SASSP policies for self-reported internal state estimation.
In order to meet the intelligent and automatic requirements of spinning factories, this paper proposes a design scheme of online package identification system based on machine vision in No.1 middle school, and systema...
In the process of material sorting, the traditional manual sorting method is not only inefficient, but also has huge security risks. Affected by the types, shapes, quality and other factors of different materials, its...