Accurate estimation of food weight is of utmost importance in the assessment of diet in nutrition, healthcare, and wellness. Although great progress has been achieved with machine learning and computervision in image...
详细信息
ISBN:
(数字)9798331519582
ISBN:
(纸本)9798331519599
Accurate estimation of food weight is of utmost importance in the assessment of diet in nutrition, healthcare, and wellness. Although great progress has been achieved with machine learning and computervision in image-based food recognition, the task of accurate weight estimation is still a challenging one. This review summarizes the current methodologies that address the gap, focusing on the reduction of errors and novel innovations. The review compares 2D and 3D computational methods, emphasizes the advantages of hybrid multimodal models, and assesses their strengths and weaknesses. The role of datasets and annotation techniques is analyzed in detail, highlighting the importance of having diverse and accurate datasets for model improvement. Standard evaluation metrics and benchmarks are addressed to evaluate accuracy and comparability across methods. Realism emerging trends of self-supervised learning, multimodal data integration, and real-time edge computing applications are searched for as answers to these practical constraints found in reality: variability in food, occlusion, environmental factors all challenge; solution proposed here. This review is aimed at outlining future research guidance by summarizing recent innovations, addressing gaps, and providing a roadmap for practical, scalable, and accurate dietary assessment systems.
Due to the development of electronic technology, computer technology, sensor technology, power electronics technology, brushless DC motor and its control technology has been a breakthrough progress [1]. This paper sel...
Due to the development of electronic technology, computer technology, sensor technology, power electronics technology, brushless DC motor and its control technology has been a breakthrough progress [1]. This paper selects double closed-loop control technology for brushless DC motor to regulate speed. Firstly, the principle of brushless DC motor is introduced; secondly, the model of brushless DC motor is established and mathematical analysis is carried out; thirdly, the double closed-loop PI speed control is adopted, and the design and improvement of PI controller are mainly carried out to eliminate the static difference in the steady state of the brushless DC motor[2]; lastly, based on the platform of MATLAB/SIMULINK, the simulation model of the control system is established to simulate the speed closed-loop control system of the brushless DC motor. Finally, based on MATLAB/SIMULINK platform, the simulation model of the control system is established to simulate the closed-loop control system of the brushless DC motor. The simulation results show that the model system response is fast and smooth [3]. The double closed-loop control of brushless DC motor adopts current hysteresis loop, which is simple in structure and fast in response, and has certain theoretical and application significance.
This study presents a comparative analysis of segmentation models based on the YOLO (You Only Look Once) architecture for the task of vocal cord detection in laryngoscopic images. The yolov5, yolov8, and yolov9 archit...
详细信息
In logistic regression, it is often desirable to utilize regularization to promote sparse solutions, particularly for problems with a large number of features compared to available labels. In this paper, we present sc...
详细信息
In autonomous driving scenarios, lidar-based object detectors are widely used in 3D object detection. For the sake of safety redundancy, some self-driving vehicles will be equipped with multiple lidars with different ...
In autonomous driving scenarios, lidar-based object detectors are widely used in 3D object detection. For the sake of safety redundancy, some self-driving vehicles will be equipped with multiple lidars with different beams, but training multiple lidar-based object detectors requires high costs. In our experiments, the performance of a lidar-based detector model trained on a high-beam lidar domain drops dramatically when transferred to a low-beam lidar domain. At present, there is no public 3D object detection dataset across-lidar-beams domains. In this paper, a 3D object detection dataset adapted across-lidar-beams domain is produced by the LGSVL simulator. The reason for the performance degradation of the lidar-based detectors across lidar-beams domains are mainly due to the large gap in the foreground object point count. To solve this problem, this paper proposed a semantic point generation method based on masked autoencoders, which can bridge the data gap between lidar beam-induced domains by generating more foreground semantic points so as to realize the migration of the detector model. In addition to solving the problem of lidar beam-induced domains migration, SP-MAE can also improve the performance of 3D lidar-based detectors. Experiments showed that the PVRCNN detector can improve 5.11 3D AP of pedestrian on the KITTI dataset after using SP-MAE.
Crowd tracking is an important research topic in the field of computervision. In crowd scenes, tracking pedestrians often results in missing individuals, i.e. individuals without trajectory information, and incorrect...
详细信息
ISBN:
(纸本)9798400713231
Crowd tracking is an important research topic in the field of computervision. In crowd scenes, tracking pedestrians often results in missing individuals, i.e. individuals without trajectory information, and incorrectly identifying pedestrian individuals, i.e. identifying non pedestrian targets as pedestrian individuals for tracking. Therefore, it limits the tracking accuracy of existing methods. In the past, most methods only considered pedestrian tracking research in sparse scenes, so they often overlooked the importance of these two issues on crowd tracking. To solve these two problems, this paper proposes a density map based crowd tracking method. This method includes three task branches, namely pedestrian individual detection branch, density map estimation branch, and re identification branch. The pedestrian individual detection branch is responsible for predicting the position information of pedestrian individuals. The density map estimation branch can obtain the number of pedestrian individuals in the image area, which is used to jointly construct the mean square error loss with the pedestrian individual detection branch. In crowded scenes, this method can generate a constraint on the number of pedestrian individuals in the detection branch by counting the population density map, which can help the pedestrian individual detection branch find missed individuals. Similarly, the heatmap of the pedestrian individual detection branch can be used to enhance the localization and counting ability of the population density map, forming a bidirectional constraint. The re identification task branch is responsible for extracting the apparent features of individual pedestrians, which are combined with their location information to establish continuous inter frame pedestrian connections. At the same time, in response to the problem of incorrect recognition of pedestrian individuals during the tracking process, which leads to incorrect association of the same individual bet
This paper presents an approach for learning-based discriminative 3D point cloud descriptor from RGB-D images for place recognition purposes in indoor environments. Existing methods, such as such as Point-NetVLAD, PCA...
详细信息
ISBN:
(纸本)9789897584886
This paper presents an approach for learning-based discriminative 3D point cloud descriptor from RGB-D images for place recognition purposes in indoor environments. Existing methods, such as such as Point-NetVLAD, PCAN or LPD-Net, are aimed at outdoor environments and operate on 3D point clouds from LiDAR. They are based on PointNet architecture and designed to process only the scene geometry and do not consider appearance (RGB component). In this paper we present a place recognition method based on sparse volumetric representation and processing scene appearance in addition to the geometry. We also investigate if using two modalities, appearance (RGB data) and geometry (3D structure), improves discriminativity of a resultant global descriptor.
A new adaptive engine employed for high-speed decision feedback equalizer (DFE) is proposed. The proposed adaptive engine utilizes the least mean square (LMS) of maximum jitter at the cross point of the received data ...
A new adaptive engine employed for high-speed decision feedback equalizer (DFE) is proposed. The proposed adaptive engine utilizes the least mean square (LMS) of maximum jitter at the cross point of the received data signal to update the feedback tap coefficients of the DFE while the eye-opening monitor (EOM) is used to optimize the adaptation process. Updating the tap coefficients relies on the tolerance of maximum jitter at the received data cross point to record the violations and generate two-direction error signals. These signals are used to control the adaptation process. The proposed adaptive engine overcomes the drawback of wasting energy after approaching the convergence time when LMS is utilized and eliminated the drawback of EOM for not handling the adaptation process in two directions. The proposed adaptive engine is verified by employing a serial link designed in an IBM 65 nm 0.8V CMOS technology. The simulation results show that the data link with variable channel lengths is explored utilizing BSIM4 device models with Cadence Design Systems.
As an important data analysis method, LR has been widely used in many fields. In applications to practical classification problems, LR tends to give good results. However, traditional LR has significant weaknesses in ...
详细信息
As an important data analysis method, LR has been widely used in many fields. In applications to practical classification problems, LR tends to give good results. However, traditional LR has significant weaknesses in overcoming the complexity and redundancy of the solution. To this end, many solutions have been proposed, which tend to be a common approach with few results. The main purpose of this paper is to research and design the debugging method of digital grid connection (GC) based on the logistic regression model (LRM). On this basis, this paper proposes an accurate autoregressive model and provides detailed information to ensure the reliability and excellence of the model, which is verified by experiments. The experimental results show that using different classification methods, the test error and selection of data sets have different results. It can be seen that the ten-fold cross-validation error for adaptive elastic net LR is 1/38 and the test error is 1/34. Compared with other methods, the classification performance of adaptive elastic net LR is also superior, even better than other classification methods.
As reported by the Centers for Disease control and Prevention (CDC), heart disease is the major cause of death for both sexes and members of most racial and ethnic groups. Cardiovascular disease claims one life in the...
详细信息
暂无评论