ISBN (digital): 9798350353006
ISBN (print): 9798350353013
Large language models have demonstrated impressive universal capabilities across a wide range of open-ended tasks and have extended their utility to encompass multi-modal conversations. However, existing methods encounter challenges in effectively handling both image and video understanding, particularly with limited visual tokens. In this work, we introduce Chat-UniVi, a Unified Vision-language model capable of comprehending and engaging in conversations involving images and videos through a unified visual representation. Specifically, we employ a set of dynamic visual tokens to uniformly represent images and videos. This representation framework empowers the model to efficiently utilize a limited number of visual tokens to simultaneously capture the spatial details necessary for images and the comprehensive temporal relationship required for videos. Moreover, we leverage a multi-scale representation, enabling the model to perceive both high-level semantic concepts and low-level visual details. Notably, Chat-UniVi is trained on a mixed dataset containing both images and videos, allowing direct application to tasks involving both mediums without requiring any modifications. Extensive experimental results demonstrate that Chat-UniVi consistently outperforms even existing methods exclusively designed for either images or videos. Code is available at https://***/PKu-Yuan Group/Chat-UniVi.
Early recognition of clinical deterioration (CD) has vital importance in patients' survival from exacerbation or death. Electronic health records (EHRs) data have been widely employed in Early Warning Scores (EWS)...
This article presents a dataset of oil palm Fresh Fruit Bunch (FFB) images from commercial plantations in Central Kalimantan, Indonesia, focusing on five maturity stages: Unripe, Underripe, Ripe, Flower, and Abnormal. The data collection involved smartphone video recordings of unharvested trees from multiple angles under varying conditions. Video frames were extracted and annotated by experts using the Computer Vision Annotation Tool (CVAT), with annotations exported in Common Objects in Context (COCO) format suitable for object detection tasks. The dataset contains 10,207 images in its training set, 2,896 in the validation set, and 1,400 in the test set, supplemented with data augmentation to handle class imbalance and increase variation. The images have real-world complications arising from partial visibility, low contrast, occlusion, and blurriness. The dataset is intended to support the development of deep learning models for FFB detection and classification, particularly for monitoring harvest times, predicting yield, and optimizing resources in plantation operations.
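Since the abstract above names COCO as the export format and class imbalance as a concern, a reader might check per-class annotation counts roughly as follows. This is a minimal sketch, not the authors' tooling: the function name `count_per_category` and the toy annotation dictionary are hypothetical, and only the standard COCO keys (`categories`, `annotations`, `category_id`) are assumed.

```python
from collections import Counter


def count_per_category(coco: dict) -> dict:
    """Count annotations per category name in a COCO-style dictionary."""
    # Map numeric category ids to their human-readable names.
    id_to_name = {c["id"]: c["name"] for c in coco["categories"]}
    counts = Counter(id_to_name[a["category_id"]] for a in coco["annotations"])
    # Report every category, including those with zero annotations.
    return {name: counts.get(name, 0) for name in id_to_name.values()}


# Toy data (hypothetical): category names follow the five maturity stages
# listed in the abstract; the images and boxes are invented for illustration.
toy_coco = {
    "images": [{"id": 1, "file_name": "frame_0001.jpg"}],
    "categories": [
        {"id": 1, "name": "Unripe"},
        {"id": 2, "name": "Underripe"},
        {"id": 3, "name": "Ripe"},
        {"id": 4, "name": "Flower"},
        {"id": 5, "name": "Abnormal"},
    ],
    "annotations": [
        {"id": 1, "image_id": 1, "category_id": 3, "bbox": [10, 20, 50, 60]},
        {"id": 2, "image_id": 1, "category_id": 3, "bbox": [80, 25, 40, 55]},
        {"id": 3, "image_id": 1, "category_id": 1, "bbox": [5, 5, 30, 30]},
    ],
}

print(count_per_category(toy_coco))
```

On a real export, the dictionary would come from loading the annotation JSON file with `json.load`; the file path is not given in the source.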
With the sudden attack of the Corona Virus Disease 2019 (COVID-19), some 300 million children in the world have been forced to study at home because of the mass closure of schools that began on March 18, 2020. This ha...
Pain is something anyone can experience, regardless of age or gender. Facial pain tracking technology is an effective tool because it is user-friendly and highly precise. Automatic pain monitoring supports patients and care professionals, including physicians and nurses. This paper proposes pain perception from 2D facial expressions and movement, using deep learning approaches with data augmentation. We used approximately 50,000 sequential UNBC photos in this study. Deep learning is applied to the training data, along with an activity approach to assist patient orientation. Our method separates pain into three levels: painless, beginning to be painful, and painful. Our work offers a standard method for detecting discomfort before heading to the hospital. It is easy, cost-effective, and readily understood by the general public and healthcare professionals.
Future wireless communication systems will evolve toward multi-functional integrated systems to improve spectrum utilization and reduce equipment sizes. A joint radar and communication (JRC) system, which can support ...
Classification is an important technique in data mining to create patterns and data modeling. Hepatitis is a disease that is dangerous to humans as this disease affects the human liver which is a vital organ. Early de...
ISBN (digital): 9781728174150
ISBN (print): 9781728174167
Cloud computing is a technology widely used in academia and industry, providing varied services on demand. Blockchain technology was initially developed to create a cryptocurrency and is now being explored for several other applications, such as health, agriculture, IoT, and education. Some initiatives already integrate these two technologies, either for research or for cloud service provision. This article presents a preliminary discussion of some aspects of the integration between blockchain and cloud computing. Contributions of this paper include: (i) a presentation of two integrated commercial cloud computing and blockchain environments; and (ii) some research opportunities on the use of both environments.
Ease of transportation is one of the most critical concerns in a city with a large population like Jakarta. The population of Jakarta is growing rapidly. The wage that Many transportati...
Nowadays, many people in Indonesia, particularly young people, are interested in playing badminton, but at the end of the day, the game becomes boring for young people, particularly with the booming o...