Search Results - Inner Mongolia University Library

Journal of Artificial Intelligence and Consciousness, 2024, Vol. 11, No. 1, pp. 1-16

Authors: Tait, Izak; Bensemann, Joshua; Wang, Ziqi. Affiliations: Computer Science and Software Engineering Department, Auckland University of Technology; NAOInstitute, The University of Auckland

GPT-4 (Generative Pre-Trained Transformer 4) is often heralded as a leading commercial AI offering, sparking debates over its potential as a steppingstone toward Artificial General Intelligence. But does it possess consciousness? This paper investigates this key question using the nine qualitative measurements of the Building Blocks theory. GPT-4's design, architecture, and implementation are compared to each of the building blocks of consciousness to determine whether it has achieved the requisite milestones to be classified as conscious or, if not, how close to consciousness GPT-4 is. Our assessment is that, while GPT-4 in its native configuration is not currently conscious, current technological research and development is sufficient to modify GPT-4 to have all the building blocks of consciousness. Consequently, we argue that the emergence of a conscious AI model is plausible in the near term. The paper concludes with a comprehensive discussion of the ethical implications and societal ramifications of engineering conscious AI entities. © 2024 World Scientific Publishing Company.

Keywords: Artificial Intelligence; Consciousness; Ethics; GPT-4; Philosophy of Mind; Sentience

GazeREC-Net: Advancing Gaze Restoration in Low-Light Conditions

IAENG International Journal of Computer Science, 2024, Vol. 51, No. 12, pp. 2034-2042

Authors: Ku, Jiayin; Wang, Li. Affiliations: School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China; College of Computer Science and Technology, Liaoning, Anshan 114051, China

Gaze estimation technology is essential for applications such as human-computer interaction, augmented reality, and virtual reality. However, its accuracy is significantly compromised in low-light conditions due to degraded image quality. To address this, we developed GazeREC-Net, an innovative gaze restoration method. We simulated low-light conditions on the MPIIFaceGaze and ColumbiaGaze datasets, creating a specialized degraded dataset for training and testing our model. GazeREC-Net combines Fourier transform techniques with advanced image restoration algorithms, significantly enhancing image quality and optimizing gaze information recovery and extraction in low-light environments. Our evaluations using Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM) metrics demonstrated a PSNR improvement of 7.64% and an SSIM improvement of 5.75% compared to HINet. Additionally, GazeREC-Net outperformed CFTNet and other existing gaze estimation models, including Gaze-TR, Dilated-Net, CA-Net, L2CS-Net, and MTGLS, in reducing gaze estimation error. These findings validate the effectiveness of GazeREC-Net in low-light conditions and offer new research directions for applying gaze estimation technologies in complex lighting environments. © (2024), (International Association of Engineers). All rights reserved.

Keywords: Augmented reality

Multi-Task Visual Semantic Embedding Network for Image-Text Retrieval

Journal of Computer Science & Technology, 2024, Vol. 39, No. 4, pp. 811-826

Authors: Xue-Yang Qin; Li-Shuang Li; Jing-Yao Tang; Fei Hao; Mei-Ling Ge; Guang-Yao Pang. Affiliations: School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China; School of Computer Science, Shaanxi Normal University, Xi’an 710119, China; School of Computer Engineering, Weifang University, Weifang 261061, China; Guangxi Colleges and Universities Key Laboratory of Intelligent Industry Software, Wuzhou University, Wuzhou 543002, China

Image-text retrieval aims to capture the semantic correspondence between images and texts, which serves as a foundation and crucial component in multi-modal recommendations, search systems, and online *** mainstream methods primarily focus on modeling the association of image-text pairs while neglecting the advantageous impact of multi-task learning on image-text *** this end, a multi-task visual semantic embedding network (MVSEN) is proposed for image-text ***, we design two auxiliary tasks, including text-text matching and multi-label classification, for semantic constraints to improve the generalization and robustness of visual semantic embedding from a training ***, we present an intra- and inter-modality interaction scheme to learn discriminative visual and textual feature representations by facilitating information flow within and between ***, we utilize multi-layer graph convolutional networks in a cascading manner to infer the correlation of image-text *** results show that MVSEN outperforms state-of-the-art methods on two publicly available datasets, Flickr30K and MSCOCO, with rSum improvements of 8.2% and 3.0%, respectively.

Keywords: image-text retrieval; cross-modal retrieval; multi-task learning; graph convolutional network

Robust steganographic approach using generative adversarial network and compressive autoencoder

Multimedia Tools and Applications, 2024, pp. 1-38

Authors: Qasaimeh, Malik; Qtaish, Alaa Abu; Aljawarneh, Shadi. Affiliations: Department of Computer Information Systems, Faculty of Computer and Information Technology, Jordan University of Science and Technology, Irbid, Jordan; Department of Software Engineering co-joint with the Department of Cyber Security, Faculty of Computer and Information Technology, Jordan University of Science and Technology, Irbid, Jordan

Nowadays, social media applications and websites have become a crucial part of people’s lives: for sharing their moments, contacting their families and friends, or even for their jobs. However, the fact that these valuable data are transferred via the Internet and open channels, which are vulnerable to attacks and espionage, requires using defense methods to improve the security of data transmission. Recently, the progress of Deep Learning (DL) inspired researchers to use it with security methods such as steganography, the art of hiding secret data in unrelated content. DL and steganography combined enhanced the hiding properties, especially in the Coverless Image Steganography (CIS) methods. In this paper, we propose a Compressive Coverless Image Steganography (CCIS) model, which is a generated-based CIS, to improve the capacity and robustness of Steganography. This model uses the compressive autoencoder to compress the message, thus increasing the capacity, and uses the Generative Adversarial Network (GAN) to generate a stego image from the compressed vector, thus enhancing concealment ability, besides using a regression model and optimization method to extract the data. Another version was designed to enable sending binary messages by replacing the compressive autoencoder network with a robust mapping rule. Experiments show that the proposed model could improve the characteristics of Steganography. Furthermore, the proposed model could extract binary messages with 256 bits from attacked stego images with 99% recovery accuracy. Thus, capacity and robustness were enhanced using our model with both images and text as secret messages. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

Keywords: Generative adversarial networks

Review of cervical cell segmentation

Multimedia Tools and Applications, 2024, pp. 1-40

Authors: Huang, Qian; Zhang, Wei; Chen, Yulin; Chen, Junzhou; Yang, Zheng. Affiliations: College of Computer Science and Software Engineering, Hohai University, Nanjing, China; Nanjing Huiying Electronic Technology Corporation, Nanjing, China

Cervical cell segmentation is a significant task in medical image analysis and can be used for screening various cervical diseases. In recent years, substantial progress has been made in cervical cell segmentation techniques, leading to notable improvements in the performance of cervical cancer auxiliary diagnostic systems. This review summarizes and analyzes the recent research on cervical cell segmentation. The main contents include an introduction to cervical cell segmentation datasets, commonly used evaluation metrics, and various segmentation methods. Currently, mainstream segmentation methods can be classified into two categories: traditional and deep learning-based. Building upon this foundation, we unfold according to the context of segmentation objectives, evaluating the performance of each method in achieving specific segmentation objectives and exploring the relationships among different methods. Through this review, other researchers can clearly understand the development of cervical cell segmentation technology and future trends, and explore new methods and technologies based on integrating and sorting out existing technologies, so as to help cervical cancer auxiliary diagnosis systems achieve more accurate cell image segmentation. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

Keywords: Diagnosis

Conditional Semantic Textual Similarity via Conditional Contrastive Learning

31st International Conference on Computational Linguistics, COLING 2025

Authors: Liu, Xinyue; Qin, Zeyang; Wang, Zeyu; Liang, Wenxin; Zong, Linlin; Xu, Bo. Affiliations: School of Software, Dalian University of Technology, China; School of Computer Science and Technology, Dalian University of Technology, China

ISBN: (print) 9798891761964

Conditional semantic textual similarity (C-STS) assesses the similarity between pairs of sentence representations under different conditions. The current method encounters the overestimation issue of positive and negative samples. Specifically, the similarity within positive samples is excessively high, while that within negative samples is excessively low. In this paper, we focus on the C-STS task and develop a conditional contrastive learning framework that constructs positive and negative samples from two perspectives, achieving the following primary objectives: (1) adaptive selection of the optimization direction for positive and negative samples to solve the overestimation problem, (2) full balancing of the effects of hard and false negative samples. We validate the proposed method with five models based on bi-encoder and tri-encoder architectures; the results show that our proposed method achieves state-of-the-art performance. The code is available at https://***/qinzeyang0919/CCL. © 2025 Association for Computational Linguistics.

Keywords: Contrastive Learning

Generative adversarial networks based motion learning towards robotic calligraphy synthesis

CAAI Transactions on Intelligence Technology, 2024, Vol. 9, No. 2, pp. 452-466

Authors: Xiaoming Wang; Yilong Yang; Weiru Wang; Yuanhua Zhou; Yongfeng Yin; Zhiguo Gong. Affiliations: Department of Computer and Information Science, University of Macao, Macao, China; School of Software, Beihang University, Beijing, China; Department of Computer Science and Technology, Faculty of Information Technology, Beijing University of Technology, Beijing, China; School of Foreign Languages, Guangzhou Huashang College, Guangzhou, China

Robot calligraphy visually reflects the motion capability of robotic *** traditional researches mainly focus on image generation and the writing of simple calligraphic strokes or characters, this article presents a generative adversarial network (GAN)-based motion learning method for robotic calligraphy synthesis (Gan2CS) that can enhance the efficiency in writing complex calligraphy words and reproducing classic calligraphy *** key technologies in the proposed approach include: (1) adopting the GAN to learn the motion parameters from the robot writing operation; (2) converting the learnt motion data into the style font and realising the transition from static calligraphy images to dynamic writing demonstration; (3) reproducing high-precision calligraphy works by synthesising the writing motion data *** this study, the motion trajectories of sample calligraphy images are firstly extracted and converted into the robot *** robot performs the writing with motion planning, and the writing motion parameters of calligraphy strokes are learnt with *** the motion data of basic strokes is synthesised based on the hierarchical process of ‘stroke-radicalpart-character’. And the robot re-writes the synthesised characters whose similarity with the original calligraphy characters is *** calligraphy characters have been tested in the experiments for method validation and the results validated that the robot can actualise the robotic calligraphy synthesis of writing motion data with GAN.

Keywords: calligraphy synthesis; generative adversarial networks; motion learning; robot writing

LogCSS: Log anomaly detection based on BERT-CNN with context-semantics-statistics features

Journal of Intelligent and Fuzzy Systems, 2024, Vol. 46, No. 4, pp. 7659-7676

Authors: Li, Zhongliang; Tu, Xuezhen; Gao, Hong; Huang, Shiyue; Ma, Zongmin. Affiliations: College of Computer Science & Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China; School of Computer Science, Peking University, Beijing, China; Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing, China

With the development of artificial intelligence, deep-learning-based log anomaly detection proves to be an important research topic. In this paper, we propose LogCSS, a novel log anomaly detection framework based on the Context-Semantics-Statistics Convolutional Neural Network (CSSCNN). It is the first model that uses BERT (Bidirectional Encoder Representation from Transformers) and CNN (Convolutional Neural Network) to extract the semantic, temporal, and correlational features of the logs. We combine the features with the statistic information of log templates for the classification model to improve the accuracy. We also propose a technique, DOOT (Deals with the Out-Of-Templates), for online template matching. The experimental research shows that our framework improves the average F1 score of the six best algorithms in the industry by more than 5% on the open-source dataset HDFS and by more than 8% on the BGL dataset. LogCSS also performs better than other similar methods on our own constructed dataset. © 2024 – IOS Press.

Keywords: Semantics

HyperHatePrompt: A Hypergraph-based Prompting Fusion Model for Multimodal Hate Detection

31st International Conference on Computational Linguistics, COLING 2025

Authors: Xu, Bo; Yu, Erchen; Zhou, Jiahui; Lin, Hongfei; Zong, Linlin. Affiliations: School of Computer Science and Technology, Dalian University of Technology, China; School of Software, Dalian University of Technology, China

ISBN: (print) 9798891761964

Multimodal hate detection aims to identify hate content across multiple modalities for promoting a harmonious online environment. Despite promising progress, three critical challenges, the absence of implicit hateful cues, the cross-modal-induced hate, and the diversity of hate target groups, inherent in the multimodal hate detection task, have been overlooked. To address these challenges, we propose a hypergraph-based prompting fusion model. Our model first uses tailored prompts to infer implicit hateful cues. It then introduces hyperedges to capture cross-modal-induced hate and applies a diversity-oriented hyperedge expansion strategy to account for different hate target groups. Finally, hypergraph convolution fuses diverse hateful cues, enhancing the exploration of cross-modal hate and targeting specific groups. Experimental results on two benchmark datasets show that our model achieves state-of-the-art performance in multimodal hate detection. © 2025 Association for Computational Linguistics.

Keywords: Computational linguistics

Color Image Compression and Encryption Algorithm Based on 2D Compressed Sensing and Hyperchaotic System

Computers, Materials & Continua, 2024, Vol. 78, No. 2, pp. 1977-1993

Authors: Zhiqing Dong; Zhao Zhang; Hongyan Zhou; Xuebo Chen. Affiliations: School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China; School of Electronic and Information Engineering, University of Science and Technology Liaoning, Anshan 114051, China

With the advent of the information security era, it is necessary to guarantee the privacy, accuracy, and dependable transfer of *** study presents a new approach to the encryption and compression of color *** is predicated on 2D compressed sensing (CS) and the hyperchaotic ***, an optimized Arnold scrambling algorithm is applied to the initial color images to ensure strong ***, the processed images are concurrently encrypted and compressed using 2D *** them, chaotic sequences replace traditional random measurement matrices to increase the system’s ***, the processed images are re-encrypted using a combination of permutation and diffusion *** addition, the 2D projected gradient with an embedding decryption (2DPG-ED) algorithm is used to reconstruct *** with the traditional reconstruction algorithm, the 2DPG-ED algorithm can improve security and reduce computational ***, it has better *** experimental outcome and the performance analysis indicate that this algorithm can withstand malicious attacks and prove the method is effective.

Keywords: Image encryption; image compression; hyperchaotic system; compressed sensing
