This research paper presents a novel pipeline for synthesizing high-quality, textured 3D models of retail products, addressing the limitations of existing datasets in the retail automation domain. As retail automation...
详细信息
ISBN:
(纸本)9783031441363;9783031441370
This research paper presents a novel pipeline for synthesizing high-quality, textured 3D models of retail products, addressing the limitations of existing datasets in the retail automation domain. As retail automation continues to revolutionize various aspects of business operations, there is a growing need for comprehensive and domain-specific datasets to enable the development and evaluation of robust perception systems. However, current datasets lack the necessary variety and size, often containing only a fraction of the products offered by retail companies and failing to capture domain-specific characteristics. To overcome these limitations, we introduce a modular pipeline that focuses on the synthesis of 3D models with high-resolution textures. Unlike previous methods that prioritize visually appealing shapes, our pipeline explicitly incorporates retail domain-specific details, including readable text, logos, codes, and nutrition tables, in the texture of the models. By disentangling the texturing process from shape augmentation, we ensure the visual quality of text areas, providing realistic and readable representations of retail products. In our methodology, we leverage state-of-the-art generative image models to add retail domain-specific details to the 3D models. Our approach enables the synthesis of diverse retail product packages with accurate textures, enhancing the realism and applicability of the generated models for the development and evaluation of perception systems in retail automation and robotics. Experimental results demonstrate the effectiveness and visual quality of models generated by our pipeline.
In this paper, the studies carried out to detect objectionable expressions in any text will be explained. Experiments were performed with Sentence transformers, supervised machine learning algorithms, and Bert transfo...
详细信息
ISBN:
(纸本)9798350345650
In this paper, the studies carried out to detect objectionable expressions in any text will be explained. Experiments were performed with Sentence transformers, supervised machine learning algorithms, and Bert transformer architecture trained in English, and the results were observed. To prepare the dataset used in the experiments, the natural language processing and machine learning methodologies of the toxic and non-toxic contents in the labeled text data obtained from the Kaggle platform are explained, and then the methods and performances of the models trained using this dataset are summarized in this paper.
Soft robots offer new opportunities because of their compliant physical structure and their wide range of applications. Currently the development of such robots is hampered by their low controllability. One of the mai...
详细信息
ISBN:
(数字)9781665468589
ISBN:
(纸本)9781665468596;9781665468589
Soft robots offer new opportunities because of their compliant physical structure and their wide range of applications. Currently the development of such robots is hampered by their low controllability. One of the main constituents of soft robots are soft actuators. The aim of this project is to improve the control of a non-linear system, the soft actuator, and its interaction with the environment, by training a long short-term memory (LSTM) neural network to accurately predict the actuator's position in space, its curvature, and the force applied by its end-effector on an external object. The increased performance of the trained network resulted in an error as low as 0.01 +/- 0.005 N in estimating the force applied by the end effector on the external object. The results show significantly superior performance (on the order of 10 times) in the positional and curvature predictions of the LSTM network when using one marker per air-chamber.
Recently, numerous institutions have been developing Large Language models (LLMs). This model is ushering in revolutionary changes in various fields including society, economy, and education. The LLM in education is e...
详细信息
ISBN:
(纸本)9798350307573;9798350307566
Recently, numerous institutions have been developing Large Language models (LLMs). This model is ushering in revolutionary changes in various fields including society, economy, and education. The LLM in education is expanding its use, and this expansion includes providing personalized learning experiences. However, the LLM currently being developed is a general model, rather than a model specialized for a specific subject or textbook. This may have limitations in its use by teachers and learners. Therefore, in this study, the LLM development, an open-source model, is being fine-tuned using a specific dataset. Before proceeding, it is necessary to develop a specific dataset. Human-generated datasets are expensive and subject-specific, thereby having disadvantages. Therefore, in this study, we propose a method of developing a textbook dataset by applying the self-instruct technique. It is expected that a textbook-specific dataset can be developed at a low cost through this.
The Harris Hip Score is very important in the clinical assessment of rehabilitation outcomes following Total Hip Arthroplasty (THA). Harris scales often have issues such as subjectivity and ceiling effects, it is diff...
详细信息
As consumers become more concerned about environmental protection, automakers are accelerating the transition from traditional fuel vehicles to hybrid and fully electric vehicles. However, the relatively short driving...
ISBN:
(纸本)9798350358513;9798350358520
As consumers become more concerned about environmental protection, automakers are accelerating the transition from traditional fuel vehicles to hybrid and fully electric vehicles. However, the relatively short driving range of contemporary electric vehicles and the resulting anxiety of running out of power still significantly inhibits the consumers' willingness to purchase them. Accurately predicting vehicle energy consumption has become a key factor in effectively alleviating consumers' range anxiety. This paper explores the use of Machine Learning methods combined with vehicle bus signal data to predict the low-voltage auxiliary component's consumption. It discusses the transferability of Machine Learning models between different car lines and different configurations from the perspectives of supervised and unsupervised learning. Additionally, we present a set of procedures for testing energy consumption of on-board low-voltage equipment, along with a workflow for efficient model transfer assessment. The research findings are expected to substantially decrease the data collection burden on developers and engineers, while expediting project implementation. Consequently, this will result in a more accurate range prediction tackling the consumers' range anxiety.
Based on the shortcomings of existing metal surface damages identification methods such as multiple limitations, and requiring a lot of manpower and resources, this paper intends to introduce deep learning algorithms ...
详细信息
ISBN:
(纸本)9798350352634;9798350352627
Based on the shortcomings of existing metal surface damages identification methods such as multiple limitations, and requiring a lot of manpower and resources, this paper intends to introduce deep learning algorithms into the field of steel surface damage identification, in order to find an efficient, simple, low-threshold and high-precision identification method for surface damages. After comparing existing convolutional neural network models such as VGG16, Resnet50, InceptionV3, DenseNet121, we optimized Resnet50 from the aspects of batchsize and optimizers. Based on the best settings and drawing on the structure of residual blocks in Resnet50, a new convolutional neural network model "Cbam Resnet" specifically designed for surface damage problem recognition was constructed. Finally, Cbam Resnet achieved an accuracy of 99.43% on the test set.
Collaborative robotic configurations for monitoring and tracking human beings for safety and efficiency have attracted interest in industrial revolution. The fusion of different types of sensors embedded in collaborat...
详细信息
ISBN:
(纸本)9798350360882;9798350360899
Collaborative robotic configurations for monitoring and tracking human beings for safety and efficiency have attracted interest in industrial revolution. The fusion of different types of sensors embedded in collaborative robotic systems significantly improve robotic perception. However, current methods have not deeply explored the capabilities of multi-sensory configurations including visible and thermal sensors. In this paper, we propose a contactless multi-sensor fusion including visible and thermal dual camera for collaborative robots to improve the robotic perception for human safety. Remote photoplethysmography detection and infrared thermal camera were used to measure the heart rate and body temperature.
Robotic object packing holds a wide array of practical applications across logistics and manufacturing sectors. The online 3D Bin Packing Problem (BPP) is a challenging task that involves online packing of three-dimen...
详细信息
ISBN:
(纸本)9798350358513;9798350358520
Robotic object packing holds a wide array of practical applications across logistics and manufacturing sectors. The online 3D Bin Packing Problem (BPP) is a challenging task that involves online packing of three-dimensional boxes into a container while considering constraints and objectives. Unlike the offline version, where all boxes are known in advance, the online variant requires decisions about how to pack each box as it arrives, without prior knowledge of upcoming boxes. To maximize space utilization, our study explores two distinct strategies: conventional heuristics-based algorithms and a deep reinforcement learning (DRL)-based approach. For the heuristic strategy, we propose four heuristic criteria alongside two variants of multi-objective optimization (MOO) algorithms. Quantitative experiments reveal that MOO outperforms singleobjective approaches for online bin packing. In our DRL-based approach, we introduce a framework that leverages a candidate map that indicates the potentially feasible placements, ensuring a balanced exploration and exploitation in the considerable discrete action space. Experiments demonstrate the superior performance of our DRL-based approach compared to both DRL-based baseline methods and conventional approaches. Additionally, we discuss the limitations of DRL-based methods and offer practical recommendations for real-world applications.
Automated Machine Learning (AutoML) aims to make machine learning accessible to non-experts by minimizing technical barriers and enabling the creation of high-performance models without extensive programming knowledge...
详细信息
ISBN:
(纸本)9798350389814;9798350389807
Automated Machine Learning (AutoML) aims to make machine learning accessible to non-experts by minimizing technical barriers and enabling the creation of high-performance models without extensive programming knowledge. Despite advances, existing AutoML solutions often struggle with complexity and inefficiency in handling intricate tasks. Herein, we present LLM2AutoML, a zero-code AutoML framework that leverages Large Language models (LLMs) to generate high-performance actionable ML models without coding and with explanations. LLM2AutoML enables bidirectional human-machine alignment by interpreting user intentions expressed in natural language, converting them into executable AutoML tasks, and delivering detailed analytical reports that foster user understanding and trust. We propose the template-bounded AutoML method to ensure the LLM-generated code is highly executable, enabling fully end-to-end automation, including intention parsing, model auto-selection, and hyperparameter auto-tuning. Additionally, we incorporate techniques such as adaptive loss functions and configuration recommendations to improve efficiency and performance. Experiments on the SECOM dataset from the semiconductor manufacturing industry demonstrate that LLM2AutoML enhances the automation and usability of AutoML and LLMs, achieves superior performance, and produces high-quality analytical reports. This framework presents a novel approach to advancing the effectiveness and capabilities of both AutoML and LLMs.
暂无评论