检索结果-内蒙古大学图书馆

Towards machine-learning-driven effective mashup recommendations from big data in mobile networks and the Internet-of-Things

引用

Digital Communications and Networks 2023年第1期9卷 138-145页

作者： Yueshen Xu Zhiying Wang Honghao Gao Zhiping Jiang Yuyu Yin Rui Li School of Computer Science and Technology Xidian UniversityXi'an710126China School of Computer Engineering and Science Shanghai UniversityShanghai200444China School of Computer Science and Technology Hangzhou Dianzi UniversityHangzhou310018China

A large number of Web APIs have been released as services in mobile communications,but the service provided by a single Web API is usually *** enrich the services in mobile communications,developers have combined Web APIs and developed a new service,which is known as a *** emergence of mashups greatly increases the number of services in mobile communications,especially in mobile networks and the Internet-of-Things(IoT),and has encouraged companies and individuals to develop even more mashups,which has led to the dramatic increase in the number of *** a trend brings with it big data,such as the massive text data from the mashups themselves and continually-generated usage ***,the question of how to determine the most suitable mashups from big data has become a challenging *** this paper,we propose a mashup recommendation framework from big data in mobile networks and the *** proposed framework is driven by machine learning techniques,including neural embedding,clustering,and matrix *** employ neural embedding to learn the distributed representation of mashups and propose to use cluster analysis to learn the relationship among the *** also develop a novel Joint Matrix Factorization(JMF)model to complete the mashup recommendation task,where we design a new objective function and an optimization *** then crawl through a real-world large mashup dataset and perform *** experimental results demonstrate that our framework achieves high accuracy in mashup recommendation and performs better than all compared baselines.

关键词： Mashup recommendation Big data Machine learning Mobile networks Internet-of-Things

来源：评论

学校读者我要写书评

暂无评论

A review of explainable AI in medical imaging: implications and applications

引用

International Journal of computers and Applications 2024年第11期46卷 983-997页

作者： Kinger, Shakti Kulkarni, Vrushali Department of Computer Engineering and Technology WPU School of Computer Science & Engineering Dr. Vishwanath Karad MIT World Peace University Maharashtra India

Deep learning approaches have attained remarkable success across various artificial intelligence applications, spanning healthcare, finance, and autonomous vehicles, profoundly impacting human existence. However, their black-box nature, lack of transparency, and inability to elucidate conclusions have hindered their adoption in high-risk applications. Explainable Artificial Intelligence (XAI) encompasses a suite of tools, approaches, and algorithms aimed at furnishing highly accurate explanations while preserving robust accuracy. This research paper investigates the intricate and evolving domain of Explainable Artificial Intelligence (XAI), particularly its implications in healthcare, with the objective of fostering trust in AI utilization within this domain. It encompasses a spectrum of topics, including a compendium of tasks XAI should fulfil in medical imaging, an examination of current methodologies for yielding transparent and understandable outcomes in medical imaging, criteria for assessing AI system explainability, and recommendations for integrating XAI into medical imaging practices. This review facilitates the selection of suitable and efficient XAI techniques for medical imaging while aiding developers in grasping the fundamentals of these methodologies. © 2024 Informa UK Limited, trading as Taylor & Francis Group.

关键词： Medical imaging

来源：评论

学校读者我要写书评

暂无评论

Automatic summarization of cooking videos using transfer learning and transformer-based models

引用

Discover Artificial Intelligence 2025年第1期5卷 1-20页

作者： Sadique, P. M. Alen Aswiga, R.V. School of Computer Science and Engineering Vellore Institute of Technology Tamil Nadu Chennai600127 India

The proliferation of cooking videos on the internet these days necessitates the conversion of these lengthy video contents into concise text recipes. Many online platforms now have a large number of cooking videos, in which, there is a challenge for viewers to extract comprehensive recipes from lengthy visual content. Effective summary is necessary in order to translate the abundance of culinary knowledge found in videos into text recipes that are easy to read and follow. This will make the cooking process easier for individuals who are searching for precise step by step cooking instructions. Such a system satisfies the needs of a broad spectrum of learners while also improving accessibility and user simplicity. As there is a growing need for easy-to-follow recipes made from cooking videos, researchers are looking on the process of automated summarization using advanced techniques. One such approach is presented in our work, which combines simple image-based models, audio processing, and GPT-based models to create a system that makes it easier to turn long culinary videos into in-depth recipe texts. A systematic workflow is adopted in order to achieve the objective. Initially, Focus is given for frame summary generation which employs a combination of two convolutional neural networks and a GPT-based model. A pre-trained CNN model called Inception-V3 is fine-tuned with food image dataset for dish recognition and another custom-made CNN is built with ingredient images for ingredient recognition. Then a GPT based model is used to combine the results produced by the two CNN models which will give us the frame summary in the desired format. Subsequently, Audio summary generation is tackled by performing Speech-to-text functionality in python. A GPT-based model is then used to generate a summary of the resulting textual representation of audio in our desired format. Finally, to refine the summaries obtained from visual and auditory content, Another GPT-based model is used

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Comparative analysis of twelve transfer learning models for the prediction and crack detection in concrete dams,based on borehole images

引用

Frontiers of Structural and Civil engineering 2024年第10期18卷 1507-1523页

作者： Umer Sadiq KHAN Muhammad ISHFAQUE Saif Ur Rehman KHAN Fang Xu Lerui CHEN Yi LEI School of Computer and Information Science Hubei Engineering UniversityXiaogan 432000China Institute for AI Industrial Technology Research Hubei Engineering UniversityXiaogan 432000China College of Water Conservancy and Hydropower Engineering Hohai UniversityNanjing 210098China School of Computer Science and Engineering Central South UniversityChangsha 410083China College of Aviation Zhongyuan University of TechnologyZhengzhou 451191China School of Civil Engineering Central South UniversityChangsha 410083China

Disaster-resilient dams require accurate crack detection,but machine learning methods cannot capture dam structural reaction temporal patterns and *** research uses deep learning,convolutional neural networks,and transfer learning to improve dam crack *** deep-learning models are trained on 192 crack *** research aims to provide up-to-date detecting techniques to solve dam crack *** finding shows that the EfficientNetB0 model performed better than others in classifying borehole concrete crack surface tiles and normal(undamaged)surface tiles with 91%*** study’s pre-trained designs help to identify and to determine the specific locations of cracks.

关键词： concrete dam borehole closed-circuit television deep learning models crack detection water resources management management

来源：评论

学校读者我要写书评

暂无评论

Robust style injection for person image synthesis

引用

CAAI Transactions on Intelligence Technology 2025年第2期10卷 402-414页

作者： Yan Huang Jianjun Qian Shumin Zhu Jun Li Jian Yang School of Computer Science and Engineering Nanjing University of Science and TechnologyNanjingChina School of Fashion and Textiles The Hong Kong Polytechnic UniversityHong KongChina

Person Image Synthesis has been widely used in fashion with extensive application *** point of this task is how to synthesise person image from a single source image under arbitrary *** methods generate the person image with target pose well;however,they fail to preserve the fine style details of the source *** address this problem,a robust style injection(RSI)model is proposed,which is a coarse-to-fine framework to synthesise target the person *** develops a simple and efficient cross-attention based module to fuse the features of both source semantic styles and target pose for achieving the coarse aligned *** adaptive instance normalisation is employed to enhance the aligned features in conjunction with source semantic ***,source semantic styles are further injected into the positional normalisation scheme to avoid the fine style details erosion caused by massive *** training losses,optimal transport theory in the form of energy distance is introduced to constrain data distribution to refine the texture style ***,the authors’model is capable of editing the shape and texture of garments to the target style *** experiments demonstrate that the authors’RSI achieves better performance over the state-of-art methods.

关键词： computer vision image reconstruction virtual try‐on

来源：评论

学校读者我要写书评

暂无评论

Advancing differential diagnosis: a comprehensive review of deep learning approaches for differentiating tuberculosis, pneumonia, and COVID-19

引用

Multimedia Tools and Applications 2025年第13期84卷 11871-11906页

作者： Kansal, Kajal Chandra, Tej Bahadur Singh, Akansha School of Computer Science Engineering and Technology Bennett University Uttar Pradesh Greater Noida India

In the realm of medical diagnostics, particularly in differential diagnosis, where differentiating between illnesses or ailments with comparable symptoms is essential, deep learning has gained importance. Recent developments in deep learning have demonstrated considerable promise for revolutionizing medical diagnostics by using the ability of artificial intelligence (AI) to accurately interpret radiological images. We examine the most cutting-edge deep learning techniques currently being utilized for the differential diagnosis of tuberculosis, pneumonia, and COVID-19 in this in-depth review. The study presents an in-depth critical review of several SOTA (state-of-the-art) studies used for differential diagnosis of different respiratory abnormalities like TB, Pneumonia, and COVID-19. In addition, an overview of various approaches, datasets employed in each method, various diagnosis tests, used assessment measures, and obtained performance is summarized and comprehensively compared to assist future research. We suggest a pathway for future research and development of deep learning solutions for differential diagnosis by critically analyzing the current literature and outlining the limitations and potential in this sector. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： COVID-19

来源：评论

学校读者我要写书评

暂无评论

Transformer-Based Person Re-Identification: A Comprehensive Review

IEEE Transactions on Intelligent Vehicles

引用

IEEE Transactions on Intelligent Vehicles 2024年第7期9卷 1-19页

作者： Sarker, Prodip Kumar Zhao, Qingjie Uddin, Md. Kamal School of Computer Science and Technology Beijing Institute of Technology China Department of Computer Science and Telecommunication Engineering Noakhali Science and Technology University Bangladesh

In the evolving landscape of surveillance and security applications, the task of person re-identification(re-ID) has significant importance, but also presents notable difficulties. This task entails the process of accurately matching and identifying persons across several camera views that do not overlap with one another. This is of utmost importance to video surveillance, public safety, and person-tracking applications. However, vision-related difficulties, such as variations in appearance, occlusions, viewpoint changes, cloth changes, scalability, limited robustness to environmental factors, and lack of generalizations, still hinder the development of reliable person re-ID methods. There are few approaches have been developed based on these difficulties relied on traditional deep-learning techniques. Nevertheless, recent advancements of transformer-based methods, have gained widespread adoption in various domains owing to their unique architectural properties. Recently, few transformer-based person re-ID methods have developed based on these difficulties and achieved good results. To develop reliable solutions for person re-ID, a comprehensive analysis of transformer-based methods is necessary. However, there are few studies that consider transformer-based techniques for further investigation. This review proposes recent literature on transformer-based approaches, examining their effectiveness, advantages, and potential challenges. This review is the first of its kind to provide insights into the revolutionary transformer-based methodologies used to tackle many obstacles in person re-ID, providing a forward-thinking outlook on current research and potentially guiding the creation of viable applications in real-world scenarios. The main objective is to provide a useful resource for academics and practitioners engaged in person re-ID. IEEE

关键词： Cameras

来源：评论

学校读者我要写书评

暂无评论

Brain tumor segmentation and classification using transfer learning based CNN model with model agnostic concept interpretation

引用

Multimedia Tools and Applications 2025年第5期84卷 2509-2538页

作者： Nancy, A. Maria Maheswari, R. School of Computer Science and Engineering Vellore Institute of Technology Tamil Nadu Chennai632014 India

In recent decades, brain tumors have been regarded as a severe illness that causes significant damage to the health of the individual, and finally it results to death. Hence, the Brain Tumor Segmentation and Classification (BTSC) has gained more attention among researcher communities. BTSC is the process of finding brain tumor tissues and classifying the tissues based on the tumor types. Manual tumor segmentation from is prone to error and a time-consuming task. A precise and fast BTSC model is developed in this manuscript based on a transfer learning-based Convolutional Neural Networks (CNN) model. The utilization of a variant of CNN is because of its superiority in distinct tasks. In the initial phase, the Magnetic Resonance Imaging (MRI) brain images are acquired from the Brain Tumor Image Segmentation Challenge (BRATS) 2019, 2020 and 2021 databases. Then the image augmentation is performed on the gathered images by using zoom-in, rotation, zoom-out, flipping, scaling, and shifting methods that effectively reduce overfitting issues in the classification model. The augmented images are segmented using the layers of the Visual-Geometry-Group (VGG-19) model. Then feature extraction using An Attribute Aware Attention (AWA) methodology is carried out on the segmented images following the segmentation block in the VGG-19 model. The crucial features are then selected using the attribute category reciprocal attention phase. These features are inputted to the Model Agnostic Concept Extractor (MACE) to generate the relevance score between the features for assisting in the final classification process. The obtained relevance scores from the MACE are provided to the max-pooling layer of the VGG-19 model. Then, the final classified output is obtained from the modified VGG-19 architecture. The implemented Relevance score with the AWA-based VGG-19 model is used to classify the tumor as the whole tumor, enhanced tumor, and tumor core. In the classification section, the proposed

关键词： Magnetic resonance imaging

来源：评论

学校读者我要写书评

暂无评论

A vision-based hybrid ensemble learning approach for classification of gait disorders

引用

Multimedia Tools and Applications 2025年第17期84卷 17597-17644页

作者： Kour, Navleen Gupta, Sunanda Arora, Sakshi School of Computer Science and Engineering Shri Mata Vaishno Devi University Katra182320 India

computer vision-based (VB) gait analysis has become the popular platform for detecting Knee Osteoarthritis (KOA) and Parkinson’s disease (PD). The scrutinization of the literature revealed the heavy usage of sensor and markerless platforms but involved certain issues such as exposure to harmful radiations, wearing discomfort, a requirement of background, etc. Further, some aspects are lacking in the previous studies including the exploration of the marker-based (MB) approach, experimentation on disease severity levels using enhanced learning techniques, comparison of abnormal and normal (NM) gait, etc. Therefore, this research aims to predict the pathological and NM gait based on the marker-based (MB) VB platform. In this paper, first, a VB gait dataset is used namely "KOA-PD-NM" which includes three stages: KOA i.e. Early (EL), Moderate (MD), Severe (SV);PD i.e. Mild (ML), MD, SV, and NM subjects, thus, forming a total of seven labels. Then, an improved technique namely Color Segmentation based Fractional Order Darwinian Particle Swarm Optimization (CS-FODPSO) is employed to segment the region of interest (ROI). Next, a hybrid ensemble using k-nearest neighbor (KNN), Decision tree (DT), and Naive Bayes (NB) is proposed to predict the gait patterns of the considered groups. The efficiency of the proposed methodology is evaluated based on performance metrics. The evaluation results achieved provided the highest results using the presented segmentation and hybrid ensemble approaches within less time in comparison to other techniques as well as state-of-the-art. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Gait analysis

来源：评论

学校读者我要写书评

暂无评论

Message from Guest Editors of the CVM 2024 Special Issue

引用

Computational Visual Media 2024年第4期10卷 611-612页

作者： Andrei Sharf Fang-Lue Zhang Department of Computer Science Ben-Gurion University Beer-Sheva 84105 Israel School of Engineering and Computer Science Victoria University of Wellington Wellington6012 NewZealand

The Computational Visual Media(CVM)conference series is intended to provide a prominent international forum for exchanging innovative research ideas and significant computational methodologies that either underpin or ... 详细信息

关键词： CVM Guest Media

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：