检索结果-内蒙古大学图书馆

IAENG International Journal of Computer Science 2024年第9期51卷 1385-1392页

作者： Yang, Zhen Zhou, Ziwei Wang, Chaoyang Xu, Liang School of Applied Technology University of Science and Technology Liaoning Anshan China School of Computer and Software Engineering University of Science and Technology Liaoning Anshan China School of Computer and Software Engineering University of Science and Technology Liaoning Anshan China

This paper introduces a new network model - the Image Guidance Encoder-Decoder Model (IG-ED), designed to enhance the efficiency of image captioning and improve predictive accuracy. IG-ED, a fusion of the convolutional network VGGNet-16 and the long short-term memory network (LSTM), is designed based on the encoder-decoder structure. The image captioning performance sees significant enhancements when leveraging the IG-ED network model. The network training process unfolds in a series of steps. Initially, the input image undergoes convolution via the VGGNet-16 network, producing a 512-dimensional vector. Concurrently, each word in the image's caption is encoded to generate a corresponding 512-dimensional vector consistent with the image feature dimension. These two vectors form the input for the decoding process. Subsequently, the vectors are fed into the redesigned fusion LSTM (F-LSTM) network at different time steps to gradually train the parameters of the IG-ED framework. The training process is completed by utilizing a loss function for determining convergence. Evaluation of the IG-ED model's performance is conducted using CIDEr and seven other evaluation metrics on the MSCOCO 2014 dataset. The results exhibit substantial improvements over the "Adaptive Attention Mode" network and "Neural Talk" network. Additionally, the parameter count of the IG-ED architecture is significantly reduced compared to the "Adaptive Attention Mode" network, leading to decreased computational resource requirements and enabling edge computing on the neural network. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Long short-term memory

来源：评论

学校读者我要写书评

暂无评论

Development of jute-polyethylene nonwoven fabric for sustainable packaging application

引用

Materials Research Innovations 2025年第2期29卷 85-92页

作者： Habib, Md. Ahasan Shahid, Md. Abdus Bhuiyan, Anamul Hoque Akter, Habiba Department of Textile Engineering BGMEA University of Fashion and Technology Dhaka Bangladesh Department of Textile Engineering Dhaka University of Engineering and Technology Gazipur Bangladesh

Environmental concerns promote demand for biodegradable packaging on a global scale. Jute fiber packaging could be a viable and sustainable alternative to pure synthetic materials. In this study, sustainable antimicrobial jute-polyethylene nonwoven fabric is developed by the heat press of jute web and polyethylene pellets. The performance of the developed samples was evaluated by analyzing their morphological, mechanical, thermal, moisture management, and antibacterial properties. SEM confirmed the homogeneous interfacial adhesion between jute-polyethylene. FTIR spectra proved the existence of jute, polyethylene, and peppermint oil in the developed samples. Mechanical property was investigated using a universal strength tester while tensile strength and elongation (%) were sufficient. The low thermal conductivities were observed in the samples. The moisture management tester confirmed the unavailability of the moisture in the inner surface from the outer surface. Furthermore, the samples exhibited significant antibacterial properties because of the application of peppermint essential oil. © 2024 Informa UK Limited, trading as Taylor & Francis Group.

关键词： Polyethylenes

来源：评论

学校读者我要写书评

暂无评论

Small Object Detection in Aerial Drone Imagery based on YOLOv8

IAENG International Journal of Computer Science

引用

IAENG International Journal of Computer Science 2024年第9期51卷 1346-1354页

作者： Pan, Junyu Zhang, Yujun School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

In recent years, the utilization of unmanned aerial vehicles (UAVs) for aerial target detection has gained significant attention due to their high-altitude perspective and maneuverability, which offer novel opportunities and tremendous potential in this field. However, detecting targets in UAV aerial images remains highly challenging due to the presence of numerous small targets with limited feature information, as well as issues like target occlusion and complex backgrounds that severely impact detection accuracy. To address these challenges, we propose a detection model called BDC-YOLOv8 that aims to enhance accuracy for small targets while minimizing computational complexity. Specifically, we augment the YOLOv8 architecture by incorporating a dedicated detection head tailored for small targets to improve performance when encountering such objects. Additionally, we restructure the neck network of the model to better extract and fuse feature information from targets with significant scale variations. Furthermore, we introduce the concept of DynamicHead to enhance the detection head by incorporating various attention mechanisms suitable for our task ahead of the original detection head, thereby enhancing the model’s capability to detect objects of different scales and complex backgrounds. Moreover, we introduce Convolutional Block Attention Module (CBAM) to identify regions of interest in densely populated areas. Extensive experiments conducted on the VisDrone2019 dataset yield promising results where our model achieves a mean Average Precision (mAP) score of 38% and an AP50 score of 59.6%. Compared to the original YOLOV8 model, improvements are observed with increases in mAP by 2.5% and AP50 by 3.7%, respectively. Notably, our model demonstrates a significant enhancement in detecting small targets with an increase in APs evaluation metric by 4.1%. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Aerial photography

来源：评论

学校读者我要写书评

暂无评论

Emotional Engagement with Haptic Feedback in Virtual Scenarios: A Literature Review

Emotional Engagement with Haptic Feedback in Virtual Scenari...

引用

2024 International Conference on engineering and Computing, ICECT 2024

作者： Qaisar, Taimoor Mumtaz, Mamoona Raza, Hamda Ali, Mir Mutabassim Khalid, Kainaat Bajwa, Ibrahim University of Management and Technology Department of Software Engineering Pakistan

ISBN: (纸本)9798350349719

Virtual reality (VR) is a simulated environment that computer technology generates. Haptic feedback uses touch sensations to enhance user interaction within a VR. Exploring emotional engagement in VR has witnessed a surge in interest, particularly concerning the integration of haptic feedback. This study systematically reviews 26 primary studies published from 2013 to 2023. We identify nineteen emotional engagement techniques and support leveraging the impact of emotional engagement in collaboration with haptics. Then, we develop a relationship based on emotional engagement and haptic feedback, emphasizing the significance of sensory integration and haptic elements in augmenting emotional connections in a VR environment. The investigation unveils various techniques, encompassing multi-sensory emotion recognition, immersive VR environments, interactive digital narratives, and haptic feedback strategies. Our findings suggest that the researchers need to explore emotional engagement in multiple virtual environments for synchronous user interaction to advance immersive technologies. © 2024 IEEE.

关键词： Virtual reality

来源：评论

学校读者我要写书评

暂无评论

Intelligent design of mechanical metamaterials: a GCNN-based structural genome database approach

引用

National Science Review 2025年第4期12卷 268-281页

作者： Wenyu Hao Zongliang Du Xiuquan Hou Yilin Guo Chang Liu Weisheng Zhang Huajian Gao Xu Guo State Key Laboratory of Structural Analysis Optimization and CAE Software for Industrial EquipmentDepartment of Engineering Mechanics Dalian University of Technology Ningbo Institute of Dalian University of Technology Institute of Artificial Intelligence and Robotics Xi'an Jiaotong University Mechano-X Institute Applied Mechanics LaboratoryDepartment of Engineering Mechanics Tsinghua University

The reciprocal mapping between the geometry and properties of a unit cell is crucial for the intelligent and inverse design of advanced materials and structural *** classical homogenization-based numerical methods,this paper presents an efficient and accurate mapping between the geometry and properties of a class of unit cells described by moving morphable components,achieved via a graph convolutional neural *** leads to a structural genome database(SGD) approach for the intelligent design of mechanical *** the SGD approach,metamaterials exhibiting the Hashin-Shtrikman upper bound of bulk modulus,auxetic behavior and the unimodal property have been created,with design efficiency improved by 3-4 orders of ***,transfer learning and a small amount of training data allow the SGD to predict non-local behaviors beyond a unit cell,such as optimized unit cells with critical buckling strength enhanced by nearly 200% and a bandgap metamaterial with a relative bandgap width of 51%.Experimentally validated optimized metamaterials demonstrate auxetic behavior and superior buckling *** proposed SGD approach holds promise for the advanced design of multi-scale and multi-physics systems.

关键词： mechanical metamaterial structural genome database moving morphable component graph convolutional neural network structure-property mapping

来源：评论

学校读者我要写书评

暂无评论

Brain magnetic resonance image (MRI) segmentation using multimodal optimization

引用

Multimedia Tools and Applications 2025年第16期84卷 16971-17020页

作者： Akan, Taymaz Oskouei, Amin Golzari Alp, Sait Bhuiyan, Mohammad Alfrad Nobel Department of Medicine Louisiana State University Health Sciences Center ShreveportLA71103 United States Software Engineering Department Istanbul Topkapi University Istanbul34020 Turkey Faculty of Information Technology and Computer Engineering Azarbaijan Shahid Madani University Tabriz Iran Department of Software Engineering İstinye University İstanbul Turkey Department of Artificial Intelligence Engineering Trabzon University Trabzon61335 Turkey

One of the highly focused areas in the medical science community is segmenting tumors from brain magnetic resonance imaging (MRI). The diagnosis of malignant tumors at an early stage is necessary to provide treatment for patients. The patient’s prognosis will improve if it is detected early. Medical experts use a manual method of segmentation when making a diagnosis of brain tumors. This study proposes a new approach to simplify and automate this process. In recent research, multi-level segmentation has been widely used in medical image analysis, and the effectiveness and precision of the segmentation method are directly tied to the number of segments used. However, choosing the appropriate number of segments is often left up to the user and is challenging for many segmentation algorithms. The proposed method is a modified version of the 3D Histogram-based segmentation method, which can automatically determine an appropriate number of segments. The general algorithm contains three main steps: The first step is to use a Gaussian filter to smooth the 3D RGB histogram of an image. This eliminates unreliable and non-dominating histogram peaks that are too close together. Next, a multimodal particle swarm optimization method identifies the histogram’s peaks. In the end, pixels are placed in the cluster that best fits their characteristics based on the non-Euclidean distance. The proposed algorithm has been applied to a Cancer Imaging Archive (TCIA) and brain MRI Images for brain Tumor detection dataset. The results of the proposed method are compared with those of three clustering methods: FCM, FCM_FWCW, and FCM_FW. In the comparative analysis of the three algorithms across various MRI slices. Our algorithm consistently demonstrates superior performance. It achieves the top mean rank in all three metrics, indicating its robustness and effectiveness in clustering. The proposed method is effective in experiments, proving its capacity to find the proper clusters. © The Auth

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

Improved Infrared Road Object Detection Algorithm Based on Attention Mechanism in YOLOv8

IAENG International Journal of Computer Science

引用

IAENG International Journal of Computer Science 2024年第6期51卷 673-680页

作者： Luo, Zilong Tian, Ying School of Computer Science and Software Engineering University of Science and Technology Liaoning Liaoning114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

In Currently, research in the field of infrared road object detection is primarily focused on enhancing model performance and robustness to address the challenges posed by complex real-world driving scenarios. In response to these challenges, this paper proposes an infrared road object detection algorithm based on an attention mechanism. By incorporating the CPCA module, which utilizes attention mechanisms, into the YOLOv8s model, the algorithm enhances the model’s focus on unobstructed areas and highly illuminated sections, extracting crucial feature information to improve both accuracy and robustness. Additionally, the original model’s downsampling layer is replaced with the Context Grided Network Block Downsampling (CGBD) module, which not only preserves feature edge information but also effectively handles local and contextual features, thereby enhancing the overall feature capturing capabilities of the model. To address the issue of equal aspect ratios in the model’s original loss function, the proposed algorithm adopts the superior Weighted Intersection over Union (WIoU). This not only addresses the shortcomings of the original loss function (CIoU) but also demonstrates increased sensitivity in classification tasks. Experimental results show that the improved algorithm, compared to YOLOv8s, achieves a 1.4% increase in mean average precision (mAP), along with notable improvements in precision and recall. Furthermore, when compared to mainstream model algorithms, the enhanced model significantly outperforms in infrared road object detection tasks, providing validation of its effectiveness. © (2024), (International Association of Engineers). All rights reserved.

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

Person Re-Identification Algorithm Based on Improved ResNet

引用

IAENG International Journal of Applied Mathematics 2024年第5期54卷 894-901页

作者： Shen, Wenrui Wang, Zhifeng School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

Person Re-Identification falls within the scope of computer vision, acting a technique to ascertain the presence of a specified pedestrian within a video or image library. The related research is of great significance in real-world environments such as criminal investigation and statistical analysis of commercial foot traffic and has received extensive attention from the academic community. However, traditional methods such as manual extraction cannot adapt to large-scale data volumes, and deep learning-based methods at this stage suffer from interference in complex environments such as similar costumes, perspective changes, and occlusion. Therefore, in this paper, we investigate the above problems. Firstly, we expand the dataset by introducing random erasure-based preprocessing of pedestrian images to enhancing the robustness and generalization capability of neural networks. Secondly, a composite attention mechanism is introduced after the network residual layer to enhance the spatial information capability and feature expression. Finally, the union loss composed of Circle Loss, Ternary Loss, and Cross Entropy Loss was chosen for network training in the loss optimization phase. Findings from the experiments reveal that the improved method proposed in this experiment achieves 96.0%Rank-1 and 88.3%mAP in Market1501, which reflects the validity of the approach proposed in this manuscript, and provides valuable reference suggestions for Person Re-Identification related research. © (2024), (International Association of Engineers). All Rights Reserved.

关键词： Image enhancement

来源：评论

学校读者我要写书评

暂无评论

Deep visual-linguistic fusion network considering cross-modal inconsistency for rumor detection

引用

Science China(Information Sciences) 2023年第12期66卷 16-32页

作者： Yang YANG Ran BAO Weili GUO De-Chuan ZHAN Yilong YIN Jian YANG School of Computer Science and Engineering Nanjing University of Science and Technology National Key Laboratory for Novel Software Technology Nanjing University School of Software Shandong University

With the development of the Internet, users can freely publish posts on various social media platforms, which offers great convenience for keeping abreast of the world. However, posts usually carry many rumors, which require plenty of manpower for monitoring. Owing to the success of modern machine learning techniques, especially deep learning models, we tried to detect rumors as a classification problem automatically. Early attempts have always focused on building classifiers relying on image or text information, i.e., single modality in posts. Thereafter, several multimodal detection approaches employ an early or late fusion operator for aggregating multiple source information. Nevertheless, they only take advantage of multimodal embeddings for fusion and ignore another important detection factor, i.e., the intermodal inconsistency between modalities. To solve this problem, we develop a novel deep visual-linguistic fusion network(DVLFN) considering cross-modal inconsistency, which detects rumors by comprehensively considering modal aggregation and contrast information. Specifically, the DVLFN first utilizes visual and textual deep encoders, i.e., Faster R-CNN and bidirectional encoder representations from transformers, to extract global and regional embeddings for image and text modalities. Then, it predicts posts' authenticity from two aspects:(1) intermodal inconsistency, which employs the Wasserstein distance to efficiently measure the similarity between regional embeddings of different modalities, and(2) modal aggregation, which experimentally employs the early fusion to aggregate two modal embeddings for prediction. Consequently, the DVLFN can compose the final prediction based on the modal fusion and inconsistency measure. Experiments are conducted on three real-world multimedia rumor detection datasets collected from Reddit, Good News, and Weibo. The results validate the superior performance of the proposed DVLFN.

关键词： multimodal learning Wasserstein distance rumor detection

来源：评论

学校读者我要写书评

暂无评论

Object Detection Model for Remote Sensing Images Based on YOLOv9

IAENG International Journal of Computer Science

引用

IAENG International Journal of Computer Science 2025年第3期52卷 840-847页

作者： Hou, Donghao Zhang, Yujun School of Computer and Software Engineering University of Science and Technology Liaoning Anshan114051 China

In the field of object detection for remote sensing images, especially in applications such as environmental monitoring and urban planning, significant progress has been made. This paper addresses the common challenges faced by traditional object detection methods in remote sensing images, such as the large number of targets and complex backgrounds, by proposing a novel network based on YOLOv9. The network innovatively introduces the C3_CD_CGA module, an enhanced module based on Cascaded Group Attention, designed to reduce computational redundancy and increase attention diversity, and enhances the processing capability of multi-scale information through the CD module. The C3 module employs deep asymmetric convolution to mitigate information loss and increase the receptive field. Additionally, the network integrates DSConv with the RepNCSPELAN4 module to adaptively focus on and precisely capture the features of elongated and curved local structures, such as vehicles. The introduction of the CARAFE module further improves the spatial resolution of the feature maps, significantly enhancing performance across various visual tasks. Experimental results show that the improved YOLOv9 achieves a mean average precision (mAP) of 88% on the SIMD dataset, which is an improvement of 1.6% compared to the baseline YOLOv9 model and 1.5% higher than the state-of-the-art YOLO-SE model. This model not only achieves more effective multi-target recognition in complex backgrounds but also strikes a good balance between accuracy and efficiency. © (2025), (International Association of Engineers). All rights reserved.

关键词： Urban planning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：