The kinematic analysis of the patrol robot is based on Ackermann theory to derive the relationship between front wheel deflection angle, rear wheel speed and robot target speed. The fitted curve is obtained by plottin...
详细信息
It is a new trend to fine-tune Large Multimodal Models (LMMs) to adapt to specific visual tasks through task-related conversation data. This approach provides a new paradigm for solving various vision-language tasks, ...
详细信息
With the development of science and technology, table tennis robots are becoming more and more common in the daily competition and training of athletes. This study is mainly for the research and development of the vis...
详细信息
Transparent parts are widely used in instrumentation, national defense, aerospace and other fields. These parts have strict operating environment and high optical performance requirements, and in-place measurement is ...
详细信息
This paper presents a fully autonomous assembly line for custom cubes with two sub-block options (plastic and aluminium). Users can design their cubes through an Android app or website interface. The system retrieves ...
详细信息
ISBN:
(纸本)9798350385939;9798350385922
This paper presents a fully autonomous assembly line for custom cubes with two sub-block options (plastic and aluminium). Users can design their cubes through an Android app or website interface. The system retrieves orders, processes them based on deadlines, and dispenses the required sub-blocks. A quality inspection station with computer vision and machine learning ensures that defect-free parts are used. Cobots handle sub-block transportation and manipulation throughout the assembly process, with a dedicated station for inserting pins when necessary. A digital Shadow model facilitates performance analysis and optimization. Cloud-based platforms manage data, order processing, and remote access. Finally, an Automated Storage and Retrieval System (ASRS) manages cube storage and delivery. This research demonstrates the potential of combining automation, machine learning, and real-time monitoring for efficient custom product assembly. The approach offers applications in various industries and lays the groundwork for future research on expanding customization options and production flow optimization.
Research hotspots in food science and dental studies currently focus on analyzing food texture, preparing food boluses, and studying the interaction between food and the chewing system during the mastication process u...
详细信息
ISBN:
(纸本)9798350325621
Research hotspots in food science and dental studies currently focus on analyzing food texture, preparing food boluses, and studying the interaction between food and the chewing system during the mastication process using in vitro experiments. Several food repositioning mechanisms have been developed and applied to the chewing robots for different applications, but they are normally actively actuated by motors or pneumatic/hydraulic valves, which increase the complexity of the system. This study aims to develop a food repositioning mechanism to mimic the function of tongue and cheek muscles during food masticatory process with no actuation. The technical requirements of the new food repositioning mechanism are summarized by analyzing the pros and cons of the existing mechanisms. A new food repositioning mechanism based on spring and V-shaped inclined plates are developed, hinge structure is applied to support the inclined plates, and its performance is validated by using simulations.
Recently, a growing number of work design unsupervised paradigms for point cloud processing to alleviate the limitation of expensive manual annotation and poor transferability of supervised methods. Among them, CrossP...
详细信息
ISBN:
(纸本)9798350323658
Recently, a growing number of work design unsupervised paradigms for point cloud processing to alleviate the limitation of expensive manual annotation and poor transferability of supervised methods. Among them, CrossPoint follows the contrastive learning framework and exploits image and point cloud data for unsupervised point cloud understanding. Although the promising performance is presented, the unbalanced architecture makes it unnecessarily complex and inefficient. For example, the image branch in CrossPoint is similar to 8.3x heavier than the point cloud branch leading to higher complexity and latency. To address this problem, in this paper, we propose a lightweight vision-and-Pointcloud Transformer (ViPFormer) to unify image and point cloud processing in a single architecture. ViPFormer learns in an unsupervised manner by optimizing intra-modal and cross-modal contrastive objectives. Then the pretrained model is transferred to various downstream tasks, including 3D shape classification and semantic segmentation. Experiments on different datasets show ViPFormer surpasses previous state-of-the-art unsupervised methods with higher accuracy, lower model complexity and runtime latency. Finally, the effectiveness of each component in ViPFormer is validated by extensive ablation studies. The implementation of the proposed method is available at https: //***/auniquesun/ViPFormer.
The proliferation of counterfeit pharmaceuticals presents a critical challenge to public health and safety worldwide. A common method to differentiate authentic from counterfeit medicine is visually inspect the medici...
详细信息
ISBN:
(纸本)9798350370058;9798350370164
The proliferation of counterfeit pharmaceuticals presents a critical challenge to public health and safety worldwide. A common method to differentiate authentic from counterfeit medicine is visually inspect the medicine's packaging. However, medicine packaging is susceptible and easy to replication. This research investigates an image-based classification method using machine learning and computer vision to differentiate between counterfeit and authentic medicine packaging. This research aims to develop a system that can authenticate local paracetamol and Biogesic based on support vector machines and convolutional neural networks. This can be done through these specific objectives: (1) To be able to capture PNG images of the packaging of the paracetamol using a Raspberry Pi camera;(2) To be able to use Support Vector Machines for feature extraction and Convolutional Neural Network models for the classification;(3) To be able to verify the reliability of the system using a confusion matrix. The classification of this study was evaluated using a confusion matrix. The resulting accuracy of the system is 88.75 %. Despite the promising results yielded by the developed system, the researchers recommend considering a wider database of medicines as well as features to consider.
Domain knowledge exists in various forms, including text, ontologies, graphs, images, audio, and videos. In plant disease detection, most works solely utilize images with disease labels, neglecting textual description...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Domain knowledge exists in various forms, including text, ontologies, graphs, images, audio, and videos. In plant disease detection, most works solely utilize images with disease labels, neglecting textual descriptions of visual disease symptoms used by human experts for diagnosis. These text descriptions and sample images aid expert identification of visual symptoms. We propose a novel method that leverages text descriptions and image data by modeling domain-specific knowledge about visual symptoms in leaf images as separate feature channels. Each channel corresponds to specific features whose absence or presence in the image influences model predictions. We introduce a channel attention-guided fusion module for weighting each channel based on the input and corresponding output. The combined feature channels are transformed into a standardized 3-channel input format, which can then be processed by any pre-trained convolutional neural network (CNN) as input for feature extraction and subsequent classification. Furthermore, intermediate activations of the channel attention layer combined with the weights from the fusion layer make model predictions explainable. Experimental results on three publicly available datasets of apple and cucumber leaf diseases demonstrate improvements of up to 5% utilizing various state-of-the-art CNN architectures, indicating the efficacy of incorporating textual disease descriptions using the proposed approach.
Manipulating objects is one of the most basic human interactions. This involves holding, shifting, lifting, and passing objects to other people. These interactions can be replicated for human assistive robot systems i...
详细信息
ISBN:
(纸本)9798350322996;9798350323009
Manipulating objects is one of the most basic human interactions. This involves holding, shifting, lifting, and passing objects to other people. These interactions can be replicated for human assistive robot systems in robotics, which can help the elderly with their daily tasks. Many researchers have developed 2D image-based approaches to produce hand landmark detection to recognize hand shapes and gestures. However, there are shortcomings in the resulting data output, namely that it does not provide the characteristics of the distance and orientation of the hand being detected. As a result, it cannot be utilized as input to control the robot manipulator's movements to get closer to the region of the human hand. We have designed a hand position and orientation recognition system with a stereo camera to anticipate its disparity map, which includes distance prediction data for every map pixel, to overcome this challenge. The disparity map is projected onto a point cloud using a hand landmark detection technique, and the points that match the hand landmarks are chosen. We employ the Singular Value Decomposition (SVD) mathematical technique to ascertain the hand position and orientation. We verified the sequence of tasks completed to determine the position and orientation of the stereo camera. Furthermore, our approach is user-friendly and can be easily applied across a range of simpler systems, achieving a processing speed of approximately 10.03 FPS.
暂无评论