With the increasing complexity of machinevision algorithms and growing applications of imageprocessing, how do computers without a dedicated graphics processor perform? This research discusses the computational abil...
详细信息
ISBN:
(纸本)9781665414906
With the increasing complexity of machinevision algorithms and growing applications of imageprocessing, how do computers without a dedicated graphics processor perform? This research discusses the computational abilities of two low-cost single board computers (SBCs) by subjecting them to various Visual Inertial Odometry (VIO) algorithms. The end goal of this research is to identify a SBC which meets the requirements of being employed on an Unmanned Aerial System for autonomous navigation.
It is often difficult to obtain the high-precision inner cavity contour size and 3D model of parts and components in reverse engineering. This paper proposes a method that uses a sequence of section images of a part t...
详细信息
It is often difficult to obtain the high-precision inner cavity contour size and 3D model of parts and components in reverse engineering. This paper proposes a method that uses a sequence of section images of a part to reconstruct their 3D models. This method cuts the part layer by layer to obtain the sectional images and extracts the 3D information of the sectional image contours to generate point clouds. These point clouds are then used to reconstruct a 3D model of the part. High contrast material is used to embed the target part for pre-processing. A machining centre was used to mill the part layer by layer vertically to acquire high precision section profile images. The improved Canny edge detection operator was combined with the spatial moment sub-pixel subdivision algorithm to improve the edge detection accuracy. The camera imaging model algorithm transforms the coordinates of the image edge position to obtain a high-precision 3D point cloud of the part. The 3D solid model of the target part was obtained using NURBS surface reconstruction. The results show that the 3D model reconstruction method using the profile sequence of the cross-sectional images is independent of the complexity of the part's structure and the complete internal structure of the part can be obtained. The proposed edge detection algorithm significantly refines the edge position of the contours in the cross-sectional image and the measurement accuracy was improved. This method improves the minimum deviation to 50 mu m. The shape accuracy of roundness, cylindricity and perpendicularity of the structure is high. The proposed method can meet the reverse precision requirements in general precision machining.
The stability and reliability of the brake system are critically affected by the concentricity error of automotive brake piston components. Traditional contact-based concentricity measurement methods are inefficient. ...
详细信息
It’s great to see the potential of deep learning being applied to medical imaging for the diagnosis of Alzheimer’s disease. The challenges of small data size and overfitting are common issues in many deep learning a...
详细信息
ISBN:
(数字)9798350306545
ISBN:
(纸本)9798350306552
It’s great to see the potential of deep learning being applied to medical imaging for the diagnosis of Alzheimer’s disease. The challenges of small data size and overfitting are common issues in many deep learning applications, and it’s encouraging to see that the ADNI dataset was utilized to address these challenges. The use of MeshCNN network with residual connections borrowed from ResNet is a clever approach to improve the classification accuracy. The Mesh data representation of brain surfaces as triangular meshes is an interesting and innovative technique to incorporate into the classification model. The improvements in accuracy of 0.01 for AD / MCI / NC and 0.02 for AD / NC may seem small, but in the context of Alzheimer’s disease diagnosis, even small improvements in accuracy can have significant implications for early intervention and treatment. Overall, this study demonstrates the potential of deep learning to advance research on Alzheimer’s disease diagnosis and underscores the importance of continued innovation in medical imaging techniques to improve the accuracy and effectiveness of diagnosis and treatment.
Continuous belt monitoring is of utmost importance since wears on its surface can develop into tears and even rupture. It can causes the interruption of the conveyor, and consequently, loss of capital, or even worse, ...
详细信息
ISBN:
(纸本)9789897584886
Continuous belt monitoring is of utmost importance since wears on its surface can develop into tears and even rupture. It can causes the interruption of the conveyor, and consequently, loss of capital, or even worse, serious or fatal accidents. This paper proposes a laser-based machinevision method for detecting defects in conveyor belts to solve the monitoring problem. The approach transforms an image of a laser line into a one-dimensional signal, then analyzes it to detect defects, considering that variations in this signal are caused by defects/imperfections on the belt surface. Differently from previous works, the proposed method can identify a defect through a 2D reconstruction of it. The results reveal that the proposed method was capable to detect superficial imperfections in simulated conveyor belt experiments, achieving high values in metrics such as precision and recall.
imageprocessing plays a vital role in artificial visual systems, which have diverse applications in areas such as biomedical imaging and machinevision. In particular, optical analog imageprocessing is of great inte...
imageprocessing plays a vital role in artificial visual systems, which have diverse applications in areas such as biomedical imaging and machinevision. In particular, optical analog imageprocessing is of great interest because of its parallel processing capability and low power consumption. Here, we present ultra-compact metasurfaces performing all-optical geometric image transformations, which are essential for imageprocessing to correct image distortions, create special image effects, and morph one image into another. We show that our metasurfaces can realize binary image transformations by modifying the spatial relationship between pixels and converting binary images from Cartesian to log-polar coordinates with unparalleled advantages for scale- and rotation-invariant image preprocessing. Furthermore, we extend our approach to grayscale image transformations and convert an image with Gaussian intensity profile into another image with flat-top intensity profile. Our technique will potentially unlock new opportunities for various applications such as target tracking and laser manufacturing. Metasurfaces enable all-optical geometric coordinate transformations, converting images with altered pixel spatial relations, which can facilitate fast, energy-efficient preprocessing for tasks like object tracking, or aid in laser manufacturing.
image recognition, powered by machine learning (ML), has significantly advanced applications in both dance movement recognition and robotic vision. This review examines key ML techniques, including Convolutional Neura...
详细信息
ISBN:
(数字)9798350356755
ISBN:
(纸本)9798350356762
image recognition, powered by machine learning (ML), has significantly advanced applications in both dance movement recognition and robotic vision. This review examines key ML techniques, including Convolutional Neural Networks (CNNs), Deep Neural Networks (DNNs), Self-Organizing Maps (SOMs), and Long Short-Term Memory (LSTM) networks, alongside pose estimation methods like OpenPose and Part Affinity Fields (PAFs). These techniques enhance dance classification, real-time feedback, and motion analysis, with OpenPose + LSTMs and PAFs + LSTMs demonstrating the highest accuracy. Notwithstanding progress, obstacles such as high computational costs, data dependency, and real-time implementation challenges persist. Beyond dance, these methods are critical in robotic vision, intelligent automation, and industrial imageprocessing, enabling autonomous robotic navigation, defect detection in manufacturing, and AI-driven motion tracking. By leveraging human movement analysis for robotics, ML improves human-robot interaction, robotic-assisted rehabilitation, and industrial automation. Despite progress, challenges such as high computational demands, data dependency, and real-time constraints remain. This review explores future directions, including multimodal data fusion, hybrid AI models, and real-time optimization, bridging the gap between AI-driven motion systems and intelligent automation to enhance adaptability and efficiency across domains.
LiDAR semantic segmentation plays a crucial role in enabling autonomous driving and robots to understand their surroundings accurately and robustly. A multitude of methods exist within this domain, including point-bas...
详细信息
ISBN:
(数字)9798350365474
ISBN:
(纸本)9798350365481
LiDAR semantic segmentation plays a crucial role in enabling autonomous driving and robots to understand their surroundings accurately and robustly. A multitude of methods exist within this domain, including point-based, range-image-based, polar-coordinate-based, and hybrid strategies. Among these, range-image-based techniques have gained widespread adoption in practical applications due to their efficiency. However, they face a significant challenge known as the "many-to-one" problem caused by the range image ’s limited horizontal and vertical angular resolution. As a result, around 20% of the 3D points can be occluded. In this paper, we present TFNet, a range-image-based LiDAR semantic segmentation method that utilizes temporal information to address this issue. Specifically, we incorporate a temporal fusion layer to extract useful information from previous scans and integrate it with the current scan. We then design a max-voting-based post-processing technique to correct false predictions, particularly those caused by the "many-to-one" issue. We evaluated the approach on two benchmarks and demonstrated that the plugin post-processing technique is generic and can be applied to various networks.
In the field of imageprocessing and computer vision (CV), machine learning (ML) architectures are widely used. image compression problems can be solved using convolutional neural networks (CNNs). As a result of bandw...
详细信息
In the field of imageprocessing and computer vision (CV), machine learning (ML) architectures are widely used. image compression problems can be solved using convolutional neural networks (CNNs). As a result of bandwidth and memory constraints, compression of images is a necessity. There are three types of information found in images: useful, redundant, and irrelevant. In this survey, we will discuss how ML is used to compress lossy images. Firstly, we describe the background of lossy image compression. Next, we classify ML-based image compression frameworks into subgroups based on their architectures. Auto-encoders (AEs), variational auto-encoders (VAEs), CNNs, recurrent neural networks (RNNs), long short-term memories (LSTMs), gated recurrent units (GRUs), generative adversarial networks (GANs), transformers, principal component analysis (PCA) and fuzzy means clustering are among these subgroups. By analyzing learning-driven image compression frameworks, we present pros and cons of each subgroup. Lastly, we outline several research gaps and future research directions in the field of ML-based image compression.
In response to the problems of uneven quality and unstable accuracy caused by manual grading in the grading process of Korla pear, a Korla pear grading method based on random forest and MLP is proposed, aiming to furt...
详细信息
ISBN:
(数字)9798350360240
ISBN:
(纸本)9798350384161
In response to the problems of uneven quality and unstable accuracy caused by manual grading in the grading process of Korla pear, a Korla pear grading method based on random forest and MLP is proposed, aiming to further optimize the above problems using machinevision methods. This paper collects left and right views and top views of pear images. Using a combination of K-means clustering and threshold segmentation to achieve background separation; This paper constructs the minimum bounding rectangle for the left and top views of Xiangli to obtain its longitudinal and transverse diameters; Based on the measurement difficulty of pear weight, a pear weight prediction model based on random forest model is constructed using pear diameter parameters; This paper constructs a pear grading model using MLP neural network. The experimental results show that the proposed Korla pear grading method based on random forest and MLP has an accuracy of 99.69% in the training set and 98.75
%
in the test set, verifying the feasibility and accuracy of this method.
暂无评论