In today’s digital age, technology has advanced to the point where it is difficult to distinguish be-tween genuine and forged media content. It was intended to be used for entertainment, but it is now being used to d...
详细信息
This study introduces a novel framework for video-based 3D human pose and shape estimation, termed Selective sampling and Temporal Positional Encoding (STPE). Our method leverages selective sampling and advanced posit...
详细信息
Recently, there has been a lot of research on the Text2Image generative model as social and technological interest in generative models has increased. In addition, the interest in Korean contents (K-Contnets) has incr...
详细信息
ISBN:
(纸本)9791188428137
Recently, there has been a lot of research on the Text2Image generative model as social and technological interest in generative models has increased. In addition, the interest in Korean contents (K-Contnets) has increased. In this paper, we propose "K-Contents Specialized Text2Image Pipeline". "K-Contents Specialized Text2Image Pipelilne" is specialized for understanding Korean prompts and aims to generate images specialized for various K-Contents. To build the pipeline, we collect and preprocess text and image data. Then, we train the Korean Text Encoder on the text data and train three specialized K-Contents generation pipelines on the image data. We also propose to use the SDXL model’s Img2Img Refiner with the SD model for time-efficient image generation. After that, we use the learned pipelines to generate real images and analyze their performance. Finally, we discuss the utilization of the above pipelines, limitations, and future research. Copyright 2025 Global IT research Institute (GIRI). All rights reserved.
Unintentional electromagnetic leakage is generated during the use of various components of computer equipment. These leaked electromagnetic signals contain a significant amount of useful information, which can be capt...
详细信息
In the field of construction, the application of Building information Modeling (BIM) technology combined with computer vision offers a promising avenue for enhancing the quality control and intelligent monitoring of v...
详细信息
Evaluation of marketing stimuli such as static advertisements, video advertisements, promotions, etc. is an important part of marketing research. Traditionally, the evaluation is done through large surveys, focus grou...
详细信息
The increasing complexity of VLSI physical design has elevated the need for advanced CAD tools capable of efficiently managing tasks such as floor planning, placement, routing, and optimization. This study presents a ...
详细信息
With the advancement of digital era comes an increased value for cybersecurity, and these need to be dealt with using advanced techniques. Deep learning - metaheuristic optimization hybrid model for advanced threat de...
详细信息
In recent years, societal changes have led to a growing prominence of pets in people's lives. However, uncontrolled pet reproduction in urban areas has given rise to a significant issue of stray animals, posing se...
详细信息
ISBN:
(数字)9781510686731
ISBN:
(纸本)9781510686724
In recent years, societal changes have led to a growing prominence of pets in people's lives. However, uncontrolled pet reproduction in urban areas has given rise to a significant issue of stray animals, posing serious threats to human health and the environment. Conventional manual methods for counting stray animals face challenges in terms of efficiency and the risk of disease transmission. With technological advancements, image recognition, and sound identification, among other techniques, have emerged as crucial tools to address this issue. Image recognition, leveraging intuitive statistics based on external features, combined with the low-power attributes of sound identification and the health assessment capabilities of thermal imaging, collectively provide comprehensive technological support for stray animal population statistics. In the realm of image algorithms, both traditional target detection algorithms and deep learning methods such as RCNN and Faster RCNN employ convolutional neural networks to accurately identify and locate stray animals. Regarding sound algorithms, traditional Gaussian mixture models and hidden Markov models, as well as deep learning techniques involving convolutional neural networks, have effectively enhanced the accuracy of stray animal sound recognition. The integration of image and audio in a hybrid method significantly enhances stray animal monitoring. Employing advanced techniques in video tracking and sound recognition, this approach offers an efficient and practical solution, crucial for wildlife ecosystem surveillance and conservation. research indicates that the application of deep learning methods in the domains of image and sound has significantly advanced compared to traditional approaches. In terms of image processing, I utilized the YOLO algorithm to perform grid division, feature extraction, and loss computation steps to achieve stray animal detection, demonstrating outstanding performance. Through the application of t
Measurable quantitative information is one of typical quantitative information in unstructured texts. It consists of entities related to numerics, units and their relationships. How to accurately and efficiently extra...
详细信息
ISBN:
(纸本)9789819770069;9789819770076
Measurable quantitative information is one of typical quantitative information in unstructured texts. It consists of entities related to numerics, units and their relationships. How to accurately and efficiently extract measurable quantitative information from unstructured texts remains a challenge. This paper aims to propose an Enhanced span-based joint model named ESert, which uses an n-gram encoder to identify word boundaries with a positive and negative sampling mechanism to extract measurable quantitative information. The experiments evaluate the ESert based on a clinical quantification information dataset containing 1359 Chinese electronic medical records. The results show that our model achieves F1 scores of 97.97% and 97.28% in measurable quantitative information recognition and association, respectively, indicating the effectiveness of the proposed model.
暂无评论