ISBN: 9798331536626 (digital)
ISBN: 9798331536633 (print)
Deep learning models for autonomous driving, encompassing perception, planning, and control, depend on vast datasets to achieve high performance. However, their generalization often suffers due to domain-specific data distributions, making an effective scene-based categorization of samples necessary to improve their reliability across diverse domains. Manual captioning, though valuable, is both labor-intensive and time-consuming, creating a bottleneck in the data annotation process. Large Visual Language Models (LVLMs) present a compelling solution by automating image analysis and categorization through contextual queries, often without requiring retraining for new categories. In this study, we evaluate the capabilities of LVLMs, including GPT-4 and LLaVA, to understand and classify urban traffic scenes on both an in-house dataset and BDD100K. We propose a scalable captioning pipeline that integrates state-of-the-art models, enabling flexible deployment on new datasets. Our analysis, combining quantitative metrics with qualitative insights, demonstrates the effectiveness of LVLMs in understanding urban traffic scenarios and highlights their potential as an efficient tool for data-driven advancements in autonomous driving.
Oral cancer remains a critical global health challenge, characterized by high morbidity and mortality due to late-stage diagnosis. This paper addresses the need for improved diagnostic accuracy by introducing a novel ...
Al2O3/SiO2 micro-stacked composite coatings with varying deposition sequences were fabricated on AISI 304 stainless steel to enhance their performance for applications exceeding 850 °C. Single coatings of Al2O3 a...
Hot water drilling is a drilling method that employs high-temperature and high-pressure hot water jetting to achieve ice melting drilling. Characterized by rapid drilling speed and large hole diameter, it is widely us...
Ti(C,N)-based cermets are composite materials that combine ceramic hardness with metallic toughness, making them ideal for cutting tools and wear-resistant applications. Microstructural refinement and secondary phases...
Aircraft cabins experience translational accelerations along three axes and rotational accelerations around three axes during flight, leading to uncomfortable motion and vibrations. To mitigate these effects, this stu...
Bolted joints are crucial in aerospace, machinery, and civil engineering, with failures severely affecting system reliability. Online monitoring of bolt torque is vital for ensuring structural safety. However, existin...
Surfaces with well-defined components and structures possess unique electronic, magnetic, optical and chemical properties. As a result, surface chemistry research plays a crucial role in various fields such as catalysis, energy, materials, quantum, and *** science mainly investigates the correspondence between surface property and structure. Scanning probe microscopy (SPM) techniques are important tools to characterize surface properties because of their capability for atomic-scale imaging, spectroscopy and manipulation at the single-atom level. In this review, we summarize recent advances in surface electronic, magnetic and optical properties characterized mainly by SPM-based techniques. We focus on elucidating the π-magnetism in graphene-based nanostructures, construction of spin qubits on surfaces, topological properties of surface organic structures, STM-based light emission, tip-enhanced Raman spectroscopy and integration of machine learning in SPM studies.
This article introduces a stationary wavelet transform-based positioning scheme that is grounded in an inline asymmetric Mach-Zehnder interferometer-based long-range distributed fiber optic disturbance sensing system....
In the process of power scaling large-area quantum cascade lasers (QCLs), challenges such as degradation of beam quality and emission of multilobed far-field modes are frequently encountered. These issues become pa...