Due to the similarity in mushroom features and the difficulty in distinguishing between poisonous and nonpoisonous varieties, mushrooms pose a threat to human health. To address the challenge of mushroom classificatio...
Due to the similarity in mushroom features and the difficulty in distinguishing between poisonous and nonpoisonous varieties, mushrooms pose a threat to human health. To address the challenge of mushroom classification and identification, this paper proposes a mushroom classification method based on residual networks. Firstly, a network architecture with multiple residual blocks is designed, and it is trained using an image dataset. Then, a transfer learning strategy is employed to initialize the network parameters from a pre-trained model, followed by fine-tuning to adapt to the mushroom classification task. Finally, multiple testing experiments are conducted to evaluate the effectiveness of the proposed method. The experimental results demonstrate excellent performance of the proposed method in mushroom classification tasks. Compared to traditional feature extraction methods, it can better capture the details and texture features of mushrooms, thereby improving classification accuracy. In conclusion, the mushroom classification method based on residual networks exhibits high accuracy and generalization capability. This method has potential applications in the field of mushroom classification, aiding in the better identification and differentiation of poisonous mushrooms, thereby protecting human health.
Hypergraph, an expressive structure with flexibility to model the higher-order correlations among entities, has recently attracted increasing attention from various research domains. Despite the success of Graph Neura...
详细信息
Learning reliable motion representation between consecutive frames, such as optical flow, has proven to have great promotion to video understanding. However, the TV-L1 method, an effective optical flow solver, is time...
详细信息
Arbitrary shape scene text detection becomes a challenge task due to its background complexity and shape diversity. In this paper, we propose a dual-branch multi-resolution feature-aware enhancement network (DMFE), th...
详细信息
Learning reliable motion representation between consecutive frames, such as optical flow, has proven to have great promotion to video understanding. However, the TV-L1 method, an effective optical flow solver, is time...
详细信息
ISBN:
(纸本)9781728176055;9781728176062
Learning reliable motion representation between consecutive frames, such as optical flow, has proven to have great promotion to video understanding. However, the TV-L1 method, an effective optical flow solver, is time-consuming and expensive in storage for caching the extracted optical flow. To fill the gap, we propose UF-TSN, a novel end-to-end action recognition approach enhanced with an embedded lightweight unsupervised optical flow estimator. UF-TSN estimates motion cues from adjacent frames in a coarse-to-fine manner and focuses on small displacement for each level by extracting pyramid of feature and warping one to the other according to the estimated flow of the last level. Due to the lack of labeled motion for action datasets, we constrain the flow prediction with multi-scale photometric consistency and edge-aware smoothness. Compared with state-of-the-art unsupervised motion representation learning methods, our model achieves better accuracy while maintaining efficiency, which is competitive with some supervised or more complicated approaches.
Data-free knowledge distillation aims to learn a compact student network from a pre-trained large teacher network without using the original training data of the teacher network. Existing collection-based and generati...
详细信息
Optical flow estimation is an essential step for many real-world computer vision tasks. Existing deep networks have achieved satisfactory results by mostly employing a pyramidal coarse-to-fine paradigm, where a key pr...
详细信息
Gait planning of quadruped robots plays an important role in achieving less walking, including dynamic and static gait. In this article, a static and dynamic gait control method based on center of gravity stability ma...
Gait planning of quadruped robots plays an important role in achieving less walking, including dynamic and static gait. In this article, a static and dynamic gait control method based on center of gravity stability margin is proposed. Firstly, the robot model and kinematics modeling are introduced. Secondly, the robot’s foot static and dynamic gait were planned and the foot trajectory was designed. Finally, two types of gait of the robot were simulated using Vrep simulation software, and the differences in stability and speed between the coordinated gait with speed and stability in the static and dynamic gait of a 12 degree of freedom robot were analyzed, verifying the effectiveness of the gait control method proposed in this paper.
Remote sensing object detection is an important research area in computer vision, widely applied in both military and civilian domains. However, challenges in remote sensing image object detection such as large image ...
Remote sensing object detection is an important research area in computer vision, widely applied in both military and civilian domains. However, challenges in remote sensing image object detection such as large image sizes, complex backgrounds, and significant variations in target scales are prevalent. To address these issues, this paper proposes a new Feature Denoising and Fusion Module (FDFM) aimed at enhancing the accuracy and robustness of object detection. This module comprises a Multi-Scale Denoising Submodule(MDS) and an Attention Optimization Submodule(AOS). The Multi-Scale Denoising Module aims to suppress lower-level texture noise by utilizing higher-level semantic features before the fusion process, reducing the impact of lower-level noise on subsequent multi-scale feature fusion. Meanwhile, the Attention Optimization Module seeks to enhance the precision of self-attention computations within the Multi-Scale Denoising Module without increasing the parameter count. The efficacy of this method was evaluated on public datasets DOTA, VisDrone, VOC and COCO, showing improvements in comparison to baseline models.
Constructing the pyramidal architecture for the feature is currently a very effective way to obtain feature information of objects at different scales. Although the feature pyramid can realize the recognition and dete...
详细信息
ISBN:
(数字)9781728180281
ISBN:
(纸本)9781728180298
Constructing the pyramidal architecture for the feature is currently a very effective way to obtain feature information of objects at different scales. Although the feature pyramid can realize the recognition and detection of multi-scale objects in the object detection task well, it still has some limitations. Since the feature information of different levels is often not from the same layer of the network, it is difficult to obtain the feature of different objects information at a certain scale from a certain level feature map of the pyramid network. To solve this problem, we present a novel object detection architecture, named Enhanced Multi-scale Feature Fusion Pyramid Network (EMFFPNet). Our network consists of Enhanced Multi-scale Feature Fusion Module (EMFFM) and Predictor Optimization Module (POM). In EMFFM, Features at different levels can be fused into the Enhanced features as outputs, which are more representative and deterministic. In order to enable the enhanced features to play their respective roles in the pyramid network, we assign different weights to fusion features of different levels in POM. We perform the experiments on the COCO detection benchmark. The experimental results indicate that the performance of our model is much better than the state-of-the-art model.
暂无评论