The core of natural language processing is the science of how computers understand and respond to the influence of human language. This is also the main research direction in the field of machine intelligence developm...
详细信息
In the realm of intelligent vehicles, the evolution of object detection algorithms is of paramount importance. Current deep learning-based methodologies excel in identifying medium to large-sized objects but often fal...
详细信息
To address the issue of challenging adjustments and the lack of online adaptability for traditional analog PID controllers when dealing with varying controlled objects, an innovative fuzzy multiterminal memristor devi...
详细信息
The vulnerability of deep learning to adversarial attacks has brought many security risks to its development. However, the currently proposed adversarial attack detection methods are ineffective in defending large-siz...
详细信息
Traditional target detection methods have poor accuracy when processing lower resolution images with many pedestrians, particularly for small and blurry targets whose features are not manifested as significantly as th...
详细信息
Using computervision for the classification of an object39;s 3D position using a 2D camera is a topic that has received some attention from researchers over the years. Visual data is interpreted by the computer to ...
详细信息
ISBN:
(纸本)9783031530357;9783031530364
Using computervision for the classification of an object's 3D position using a 2D camera is a topic that has received some attention from researchers over the years. Visual data is interpreted by the computer to recognize the objects found. In addition, it is possible to infer their orientation, evaluating their spatial arrangement, rotation, or alignment in the scene. The work presented in this paper describes the training and selection of a siamese neural network for classifying the 3D orientation of cars using 2D images. The neural network is composed of an initial phase for feature selection through convolutional neural networks followed by a dense layer for embedding generation. For feature selection, four architectures were tested: VGG16, VGG19, ResNet18 and ResNet50. The best result of 95.8% accuracy was obtained with the VGG16 and input images preprocessed for background removal.
Image denoising (DN), demosaicing (DM) and super-resolution (SR) are the key tasks of the low-level vision. Joint demosaicing, denoising and Super-resolution (JDDSR) can effectively improve the image quality. However,...
详细信息
Aiming at the identification problem of medicine box traceability code, according to the principle that the relative position of each character in the medicine box remains unchanged, this paper proposes an efficient a...
详细信息
In this paper, a neural network-based nonlinear model predictive control strategy for carbon fiber angle link weaving machine tension is proposed. Firstly, the tension nonlinear model considering the opening disturban...
详细信息
2D/3D image registration is one of the key technologies to realize pose estimation in computer-aided surgery. In order to improve the global and local search performance of the model in the pose parameter space, an im...
详细信息
暂无评论