Search Results - Inner Mongolia University Library

Controllable multi-domain semantic artwork synthesis

Computational Visual Media, 2024, Vol. 10, No. 2, pp. 355-373

Authors: Yuantian Huang, Satoshi Iizuka, Edgar Simo-Serra, Kazuhiro Fukui. Affiliations: Department of Computer Science, University of Tsukuba, Tsukuba 305-8577, Japan; Department of Computer Science and Engineering, Waseda University, Tokyo 169-8050, Japan

We present a novel framework for the multi-domain synthesis of artworks from semantic *** of the main limitations of this challenging task is the lack of publicly available segmentation datasets for art *** address this problem, we propose a dataset called ArtSem that contains 40,000 images of artwork from four different domains, with their corresponding semantic label *** first extracted semantic maps from landscape photography and used a conditional generative adversarial network (GAN)-based approach for generating high-quality artwork from semantic maps without requiring paired training ***, we propose an artwork-synthesis model using domain-dependent variational encoders for high-quality multi-domain ***, the model was improved and complemented with a simple but effective normalization method based on jointly normalizing semantics and style, which we call spatially style-adaptive normalization (SSTAN). Compared to the previous methods, which only take semantic layout as the input, our model jointly learns style and semantic information representation, improving the generation quality of artistic *** results indicate that our model learned to separate the domains in the latent ***, we can perform fine-grained control of the synthesized artwork by identifying hyperplanes that separate the different ***, by combining the proposed dataset and approach, we generated user-controllable artworks of higher quality than that of existing approaches, as corroborated by quantitative metrics and a user study.

Keywords: semantic artwork synthesis; generative adversarial network (GAN); datasets; non-photorealistic rendering

DeepGAN: Utilizing generative adversarial networks for improved deep learning

International Journal of Knowledge-Based and Intelligent Engineering Systems, 2024, Vol. 28, No. 4, pp. 732-748

Authors: V, Edward Naveen; A, Jenefa; T.M, Thiyagu; A, Lincy; Taurshia, Antony. Affiliations: Department of Computer Science and Engineering, Sri Shakthi Institute of Engineering and Technology, India; Department of Computer Science and Engineering, Karunya Institute of Technology and Sciences, India; Division of Computer Science and Engineering, Karunya Institute of Technology and Sciences, India; Department of Computer Science and Engineering, National Engineering College, India

In the realm of deep learning, Generative Adversarial Networks (GANs) have emerged as a topic of significant interest for their potential to enhance model performance and enable effective data augmentation. This paper addresses the existing challenges in synthesizing high-quality data and harnessing the capabilities of GANs for improved deep learning outcomes. Unlike traditional approaches that heavily rely on manually engineered data augmentation techniques, our work introduces a novel framework that leverages DeepGANs to autonomously generate diverse and high-fidelity data. Our experiments encompass a diverse spectrum of datasets, including images, text, and time series data. In the context of image classification tasks, we conduct experiments on the widely recognized CIFAR-10 dataset, which consists of 50,000 image samples. Our results demonstrate the remarkable efficacy of DeepGANs in enhancing model performance across various data domains. Notably, in image classification using the CIFAR-10 dataset, our innovative approach achieves an impressive accuracy of 97.2%. This represents a substantial advancement beyond conventional CNN models, underscoring the profound impact of DeepGANs in the realm of deep learning. In summary, this research sheds light on DeepGANs as a fundamental component in the pursuit of enhanced deep learning performance. Our framework not only overcomes existing limitations but also heralds a new era of data augmentation, with generative adversarial networks leading the way. The attainment of an accuracy rate of 97.2% on CIFAR-10 serves as a compelling testament to the transformative potential of DeepGANs, solidifying their pivotal role in the future of deep learning. This promises the development of more robust, adaptive, and accurate models across a myriad of applications, marking a significant contribution to the field. © 2024 – IOS Press. All rights reserved.

Keywords: Generative adversarial networks

Advancing mental health detection in texts via multi-task learning with soft-parameter sharing transformers

Neural Computing and Applications, 2025, Vol. 37, No. 5, pp. 3077-3110

Authors: Kodati, Dheeraj; Tene, Ramakrishnudu. Affiliations: Department of Computer Science and Engineering, National Institute of Technology Warangal, India; Department of Computer Science and Engineering, Mahindra University, Hyderabad, India

In recent years, mental health issues have profoundly impacted individuals’ well-being, necessitating prompt identification and intervention. Existing approaches grapple with the complex nature of mental health, facing challenges like task interference, limited adaptability, and difficulty in capturing nuanced linguistic expressions indicative of various conditions. In response to these challenges, our research presents three novel models employing multi-task learning (MTL) to understand mental health behaviors comprehensively. These models encompass soft-parameter sharing-based long short-term memory with attention mechanism (SPS-LSTM-AM), SPS-based bidirectional gated neural networks with self-head attention mechanism (SPS-BiGRU-SAM), and SPS-based bidirectional neural network with multi-head attention mechanism (SPS-BNN-MHAM). Our models address diverse tasks, including detecting disorders such as bipolar disorder, insomnia, obsessive-compulsive disorder, and panic in psychiatric texts, alongside classifying suicide or non-suicide-related texts on social media as auxiliary tasks. Emotion detection in suicide notes, covering emotions of abuse, blame, and sorrow, serves as the main task. We observe significant performance enhancement in the primary task by incorporating auxiliary tasks. Advanced encoder-building techniques, including auto-regressive-based permutation and enhanced permutation language modeling, are recommended for effectively capturing mental health contexts’ subtleties, semantic nuances, and syntactic structures. We present the shared feature extractor called shared auto-regressive for language modeling (S-ARLM) to capture high-level representations that are useful across tasks. Additionally, we recommend soft-parameter sharing (SPS) subtypes (fully sharing, partial sharing, and independent layer) to minimize tight coupling and enhance adaptability. Our models exhibit outstanding performance across various datasets, achieving accuracies of 96.9%, 97.

Keywords: Multi-task learning

An Adaptive Features Fusion Convolutional Neural Network for Multi-Class Agriculture Pest Detection

Computers, Materials & Continua, 2025, Vol. 83, No. 6, pp. 4429-4445

Authors: Muhammad Qasim, Syed M. Adnan Shah, Qamas Gul Khan Safi, Danish Mahmood, Adeel Iqbal, Ali Nauman, Sung Won Kim. Affiliations: Department of Computer Science, University of Engineering and Technology, Taxila 47050, Pakistan; Department of Computer Science, Shaheed Zulfikar Ali Bhutto Institute of Science and Technology, Islamabad 44000, Pakistan; School of Computer Science and Engineering, Yeungnam University, Gyeongsan-si 38541, Republic of Korea

Grains are the most important food consumed globally, yet their yield can be severely impacted by pest *** this issue, scientists and researchers strive to enhance the yield-to-seed ratio through effective pest detection *** approaches often rely on preprocessed datasets, but there is a growing need for solutions that utilize real-time images of pests in their natural *** study introduces a novel two-step approach to tackle this ***, raw images with complex backgrounds are *** the subsequent step, feature extraction is performed using both hand-crafted algorithms (Haralick, LBP, and Color Histogram) and modified deep-learning *** propose two models for this purpose: PestNet-EF and ***-EF uses an early fusion technique to integrate handcrafted and deep learning features, followed by adaptive feature selection methods such as CFS and Recursive Feature Elimination (RFE). PestNet-LF utilizes a late fusion technique, incorporating three additional layers (fully connected, softmax, and classification) to enhance *** models were evaluated across 15 classes of pests, including five classes each for rice, corn, and *** performance of our suggested algorithms was tested against the IP102 *** demonstrates that the PestNet-EF model achieved an accuracy of 96%, and the PestNet-LF model with majority voting achieved the highest accuracy of 94%, while PestNet-LF with the average model attained an accuracy of 92%. Also, the proposed approach was compared with existing methods that rely on hand-crafted and transfer learning techniques, showcasing the effectiveness of our approach in real-time pest detection for improved agricultural yield.

Keywords: Artificial neural network (ANN); support vector machine (SVM); deep neural network (DNN); transfer learning (TL)
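
The abstract above reports PestNet-LF results "with majority voting" and "with the average model" but does not say exactly what is combined. As a rough illustration only, the sketch below assumes that the outputs of several classifiers for one image are fused either by voting on their predicted labels or by averaging their class-probability vectors. The function names, labels, and numbers are hypothetical and are not taken from the paper.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label predicted most often; ties go to the label seen first."""
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


def average_fusion(prob_vectors):
    """Average per-class probability vectors; return (winning class index, averages)."""
    n = len(prob_vectors)
    num_classes = len(prob_vectors[0])
    avg = [sum(p[c] for p in prob_vectors) / n for c in range(num_classes)]
    best_class = max(range(num_classes), key=lambda c: avg[c])
    return best_class, avg


# Hypothetical outputs of three classifiers for a single image (made-up values).
# Class index 0 stands for "rice_pest", index 1 for "corn_pest".
labels = ["rice_pest", "corn_pest", "rice_pest"]
probs = [[0.60, 0.40], [0.30, 0.70], [0.55, 0.45]]

voted_label = majority_vote(labels)
averaged_class, averaged_probs = average_fusion(probs)
print(voted_label, averaged_class, averaged_probs)
```

In this made-up case the two rules disagree: two of three classifiers vote for class 0, while the averaged probabilities favour class 1 because the dissenting classifier is more confident. That is one reason the two fusion variants can score differently on the same data.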

The superalignment of superhuman intelligence with large language models

Science China (Information Sciences), 2025, Vol. 68, No. 6, pp. 101-111

Authors: Minlie HUANG, Yingkang WANG, Shiyao CUI, Pei KE, Jie TANG. Affiliations: The CoAI Group, Department of Computer Science and Technology, Tsinghua University; Laboratory of Intelligent Collaborative Computing, University of Electronic Science and Technology of China; Knowledge Engineering Group, Department of Computer Science and Technology, Tsinghua University

We have witnessed the emergence of superhuman intelligence thanks to the fast development of large language models (LLMs) and multimodal language models. As the application of such superhuman models becomes increasingly popular, a critical question arises: how can we ensure they still remain safe, reliable, and aligned well with human values encompassing moral values, Schwartz's Values, ethics, and many more? In this position paper, we discuss the concept of superalignment from a learning perspective to answer this question by outlining the learning paradigm shift from large-scale pretraining and supervised fine-tuning, to alignment training. We define superalignment as designing effective and efficient alignment algorithms to learn from noisy-labeled data (point-wise samples or pair-wise preference data) in a scalable way when the task is very complex for human experts to annotate and when the model is stronger than human experts. We highlight some key research problems in superalignment, namely, weak-to-strong generalization, scalable oversight, and evaluation. We then present a conceptual framework for superalignment, which comprises three modules: an attacker which generates the adversary queries trying to expose the weaknesses of a learner model, a learner which refines itself by learning from scalable feedbacks generated by a critic model with minimal human experts, and a critic which generates critics or explanations for a given query-response pair, with a target of improving the learner by criticizing. We discuss some important research problems in each component of this framework and highlight some interesting research ideas that are closely related to our proposed framework, for instance, self-alignment, self-play, self-refinement, and more. Last, we highlight some future research directions for superalignment, including the identification of new emergent risks and multi-dimensional alignment.

Keywords: superalignment; superhuman intelligence; large language models; scalable feedback; weak-to-strong generalization

Hyperspectral image restoration using noise gradient and dual priors under mixed noise conditions

CAAI Transactions on Intelligence Technology, 2025, Vol. 10, No. 1, pp. 72-93

Authors: Hazique Aetesam, Suman Kumar Maji, V. B. Surya Prasath. Affiliations: Computer Science and Engineering, Birla Institute of Technology Mesra, Bihar, India; Computer Science and Engineering, Indian Institute of Technology Patna, Bihar, India; Department of Computer Science, University of Cincinnati, Cincinnati, Ohio, USA

Images obtained from hyperspectral sensors provide information about the target area that extends beyond the visible portions of the electromagnetic ***, due to sensor limitations and imperfections during the image acquisition and transmission phases, noise is introduced into the acquired image, which can have a negative impact on downstream analyses such as classification, target tracking, and spectral *** in hyperspectral images (HSI) is modelled as a combination from several sources, including Gaussian/impulse noise, stripes, and *** HSI restoration method for such a mixed noise model is ***, a joint optimisation framework is proposed for recovering hyperspectral data corrupted by mixed Gaussian-impulse noise by estimating both the clean data as well as the sparse/impulse noise ***, a hyper-Laplacian prior is used along both the spatial and spectral dimensions to express sparsity in clean image ***, to model the sparse nature of impulse noise, an $\ell_1$-norm over the impulse noise gradient is *** the proposed methodology employs two distinct priors, the authors refer to it as the hyperspectral dual prior (HySpDualP) *** the best of authors' knowledge, this joint optimisation framework is the first attempt in this *** handle the non-smooth and nonconvex nature of the general $\ell_p$-norm-based regularisation term, a generalised shrinkage/thresholding (GST) solver is ***, an efficient split-Bregman approach is used to solve the resulting optimisation *** results on synthetic data and real HSI datacube obtained from hyperspectral sensors demonstrate that the authors' proposed model outperforms state-of-the-art methods, both visually and in terms of various image quality assessment metrics.

Keywords: hyper-Laplacian prior; hyperspectral images; image restoration; mixed noise; variational approach

Refinement modeling and verification of secure operating systems for communication in digital twins

Digital Communications and Networks, 2024, Vol. 10, No. 2, pp. 304-314

Authors: Zhenjiang Qian, Gaofei Sun, Xiaoshuang Xing, Gaurav Dhiman. Affiliations: School of Computer Science and Engineering, Changshu Institute of Technology, Suzhou 215500, China; University Centre for Research and Development, Department of Computer Science and Engineering, Chandigarh University, Mohali 140413, India; Department of Computer Science and Engineering, Graphic Era Deemed to be University, Dehradun 248002, India

In traditional digital twin communication system testing, we can apply test cases as completely as possible in order to ensure the correctness of the system implementation, and even then, there is no guarantee that the digital twin communication system implementation is completely *** verification is currently recognized as a method to ensure the correctness of software systems for communication in digital twins because it uses rigorous mathematical methods to verify the correctness of systems for communication in digital twins and can effectively help system designers determine whether the system is designed and implemented *** this paper, we use the interactive theorem proving tool Isabelle/HOL to construct the formal model of the X86 architecture, and to model the related assembly *** verification result shows that the system states obtained after the operations of relevant assembly instructions are consistent with the expected states, indicating that the system meets the design expectations.

Keywords: Theorem proving; Isabelle/HOL; Formal verification; System modeling; Correctness verification

An Encoding-Decoding Framework Based on CNN for circRNA-RBP Binding Sites Prediction

Chinese Journal of Electronics, 2024, Vol. 33, No. 1, pp. 256-263

Authors: Yajing GUO, Xiujuan LEI, Yi PAN. Affiliations: School of Computer Science, Shaanxi Normal University; Faculty of Computer Science and Control Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences; Department of Computer Science, Georgia State University

Predicting RNA binding protein (RBP) binding sites on circular RNAs (circRNAs) is a fundamental step to understand their interaction mechanism. Numerous computational methods are developed to solve this problem, but they cannot fully learn the features. Therefore, we propose circ-CNNED, a convolutional neural network (CNN)-based encoding and decoding framework. We first adopt two encoding methods to obtain two original matrices. We preprocess them using CNN before fusion. To capture the feature dependencies, we utilize temporal convolutional network (TCN) and CNN to construct encoding and decoding blocks, respectively. Then we introduce global expectation pooling to learn latent information and enhance the robustness of circ-CNNED. We perform circ-CNNED across 37 datasets to evaluate its effect. The comparison and ablation experiments demonstrate that our method is superior. In addition, motif enrichment analysis on four datasets helps us to explore the reason for performance improvement of circ-CNNED.

Keywords: Circular RNAs (circRNAs); RNA binding proteins; Convolutional neural network; Temporal convolutional network; Encoder-decoder network

Image-based rice leaf disease detection using CNN and generative adversarial network

Neural Computing and Applications, 2025, Vol. 37, No. 1, pp. 439-456

Authors: Ramadan, Syed Taha Yeasin; Islam, Md Shafiqul; Sakib, Tanjim; Sharmin, Nusrat; Rahman, Md. Mokhlesur; Rahman, Md. Mahbubur. Affiliation: Department of Computer Science and Engineering, Military Institute of Science and Technology, Dhaka, Bangladesh

Rice is a major crop and staple food for more than half of the world’s population and plays a vital role in ensuring food security as well as the global economy. Pests and diseases pose a threat to the production of rice and have a substantial impact on the yield and quality of the crop. In recent times, deep learning methods have gained prominence in predicting rice leaf diseases. Despite the increasing use of these methods, there are notable limitations in existing approaches. These include a scarcity of extensive and diverse collections of leaf disease images, lower accuracy rates, higher time complexity, and challenges in real-time leaf disease detection. To address the limitations, we explicitly investigate various data augmentation approaches using different generative adversarial networks (GANs) for rice leaf disease detection. Along with the GAN model, advanced CNN-based classifiers have been applied to classify the images with improving data augmentation. Our approach involves employing various GANs to generate high-quality synthetic images. This strategy aims to tackle the challenges posed by limited and imbalanced datasets in the identification of leaf diseases. The key benefit of incorporating GANs in leaf disease detection lies in their ability to create synthetic images, effectively augmenting the dataset’s size, enhancing diversity, and reducing the risk of overfitting. For dataset augmentation, we used three distinct GAN architectures, namely simple GAN, CycleGAN, and DCGAN. Our experiments demonstrated that models utilizing the GAN-augmented dataset generally outperformed those relying on the non-augmented dataset. Notably, the CycleGAN architecture exhibited the most favorable outcomes, with the MobileNet model achieving an accuracy of 98.54%. These findings underscore the significant potential of GAN models in improving the performance of detection models for rice leaf diseases, suggesting their promising role in the future research within this doma

Keywords: Generative adversarial networks

Explainable cost-sensitive deep neural networks for brain tumor detection from brain MRI images considering data imbalance

Multimedia Tools and Applications, 2025, pp. 1-28

Authors: Shawon, Md Tanvir Rouf; Shibli, G. M. Shahariar; Ahmed, Farzad; Joy, Sajib Kumar Saha. Affiliation: Department of Computer Science and Engineering, Ahsanullah University of Science and Technology, Dhaka, Bangladesh

This paper presents a research study on the use of Convolutional Neural Network (CNN), ResNet50, InceptionV3, EfficientNetB0 and NASNetMobile models to efficiently detect brain tumors in order to reduce the time required for manual review of the report and create an automated system for classifying brain tumors. An automated pipeline is proposed, which encompasses five models: CNN, ResNet50, InceptionV3, EfficientNetB0 and NASNetMobile. The performance of the proposed architecture is evaluated on a balanced dataset and found to yield an accuracy of 99.33% for fine-tuned InceptionV3 model. Furthermore, Explainable AI approaches are incorporated to visualize the model’s latent behavior in order to understand its black box behavior. To further optimize the training process, a cost-sensitive neural network approach has been proposed in order to work with imbalanced datasets which has achieved almost 4% more accuracy than the conventional models used in our experiments. The cost-sensitive InceptionV3 (CS-InceptionV3) and CNN (CS-CNN) show a promising accuracy of 92.31% and a recall value of 1.00 respectively on an imbalanced dataset. The proposed models have shown great potential in improving tumor detection accuracy and must be further developed for application in practical solutions. We have provided the datasets and made our implementations publicly available at https://***/shahariar-shibli/Explainable-Cost-Sensitive-DeepNeural-Networks-for-Brain-Tumor-Detection-from-Brain-MRI-Images. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.

Keywords: Deep neural networks
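
The abstract above mentions a cost-sensitive neural network for imbalanced data without stating how the costs are set. One common way to make a classifier cost-sensitive, assumed here purely for illustration, is to weight each class inversely to its frequency and use those weights in the cross-entropy loss. The helper names and toy numbers below are hypothetical and are not the authors' implementation.

```python
import math


def inverse_frequency_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    n, k = len(labels), len(counts)
    return {c: n / (k * m) for c, m in counts.items()}


def weighted_cross_entropy(probs, labels, weights):
    """Weighted mean of -log p(true class), normalised by the total weight."""
    total = sum(weights[y] * -math.log(p[y]) for p, y in zip(probs, labels))
    return total / sum(weights[y] for y in labels)


# Toy imbalanced set: nine samples of class 0 and one of class 1 (made-up data).
y_true = [0] * 9 + [1]
# A model that always leans towards the majority class.
y_prob = [[0.9, 0.1] for _ in y_true]

class_weights = inverse_frequency_weights(y_true)
uniform_weights = {0: 1.0, 1: 1.0}

plain_loss = weighted_cross_entropy(y_prob, y_true, uniform_weights)
cost_sensitive_loss = weighted_cross_entropy(y_prob, y_true, class_weights)
print(class_weights, plain_loss, cost_sensitive_loss)
```

With these weights the single minority-class error contributes as much to the loss as all nine majority samples together, so a model that ignores the minority class is penalised far more than under the unweighted loss.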
