ISBN: (Print) 9798400702174
Recently, many large language models (LLMs) have been proposed, showing advanced proficiency in code generation. Meanwhile, many efforts have been dedicated to evaluating LLMs on code generation benchmarks such as HumanEval. Although very helpful for comparing different LLMs, existing evaluation focuses on a simple code generation scenario (i.e., function-level or statement-level code generation), which mainly asks LLMs to generate one single code unit (e.g., a function or a statement) for a given natural language description. Such evaluation focuses on generating independent and often small-scale code units, thus leaving it unclear how LLMs perform in real-world software development scenarios. To fill this knowledge gap, we make the first attempt to evaluate LLMs in a more challenging code generation scenario, i.e., class-level code generation. Compared with existing code generation benchmarks, it better reflects real-world software development because it comprises broader contextual dependencies and multiple, interdependent units of code. We first manually construct ClassEval, the first class-level code generation benchmark, consisting of 100 class-level Python code generation tasks and built with approximately 500 person-hours. Based on the new benchmark ClassEval, we then perform the first study of 11 state-of-the-art LLMs on class-level code generation. Based on our results, we find that all LLMs perform much worse on class-level code generation than on method-level code generation. While GPT models still dominate other LLMs on class-level code generation, the performance rankings of the other models on method-level code generation no longer hold for class-level code generation. Besides, most models (except GPT models) perform better when generating the class method by method; and they have limited ability to generate dependent code. Based on our findings, we call for software engineering (SE) researchers' expertise to build more LLM benchmarks based on practical and com...
With the development of Service Oriented Architecture (SOA), the number of Web services on the Internet is also growing rapidly. Classifying Web services accurately and efficiently is helpful to improve the quality of...
The Wuling Mountains Area(WMA) is one of the important ecological protection areas in central China, known for its rich biodiversity and unique ecological environment. To effectively protect the species resources in t...
The escalating parking space problem at Nile University of Nigeria poses a growing threat due to rising enrollments. The absence of a structured parking management system compounds this issue. This project aims to dev...
Fusing vocabulary features into a pre-trained model is the mainstream data feature processing method for sequence labelling tasks. In general, the feature fusion methods proposed so far either fuse features directly outside the pre-trained model or fuse lexical features using an attention mechanism. However, our study found that this style of vocabulary enhancement does not conform to the word formation rules of modern Chinese: with the above feature processing methods, it is easy to fuse irrelevant or even incorrect lexical features into the sequence, which harms the results of Chinese sequence labelling tasks. To solve these problems, we propose a Cosine Similarity Adapter to process lexical features in Chinese sequence labelling tasks. CSBERT, a hybrid model built on BERT using this structure, conforms to the word formation rules of modern Chinese to a certain extent. It can fuse the features of a word into a character, or eliminate the word's features from the character, according to the cosine similarity between the character vector and the word vector. The experimental results show that CSBERT labels Chinese sequences better than the benchmark model: it achieves the best F1-scores on 7 open datasets and the best multi-label classification ability, which demonstrates the model's practical value.
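The similarity-gated fusion described in this abstract can be sketched in a few lines. This is an illustrative toy only: the vectors, the fixed threshold, and the additive fusion rule are assumptions for demonstration, not the paper's actual learned adapter.

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def gate_word_feature(char_vec, word_vec, threshold=0.5):
    """Fuse the word vector into the character vector when the two are
    similar enough; otherwise discard the word feature.

    Hypothetical stand-in for the Cosine Similarity Adapter: the real
    model learns this behaviour end-to-end; the threshold here is fixed
    purely for illustration.
    """
    sim = cosine_similarity(char_vec, word_vec)
    if sim >= threshold:
        # Fuse: similarity-weighted addition of the word feature.
        return [c + sim * w for c, w in zip(char_vec, word_vec)]
    # Eliminate: keep the character representation unchanged.
    return list(char_vec)
```

A word vector nearly parallel to the character vector is blended in; an orthogonal (irrelevant) word vector leaves the character representation untouched, which is the behaviour the abstract attributes to the adapter.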
Author:
Xie, Pengcheng
State Key Laboratory of Scientific and Engineering Computing, Institute of Computational Mathematics and Scientific/Engineering Computing, Academy of Mathematics and Systems Science, Chinese Academy of Sciences; University of Chinese Academy of Sciences, ZhongGuanCun East Road No. 55, Beijing, China
Optimization methods play a crucial role in various fields and applications. In some optimization problems, the derivative information of the objective function is unavailable. Such black-box optimization problems nee...
ISBN: (Print) 9783031791635; 9783031791642
In Tunisia, citizens use social media platforms as a space to exercise freedom of speech. However, unchecked and complete freedom of expression can fuel the spread of hateful speech, which is devastating not only for those targeted but also for society as a whole. This alarming situation calls for limiting the spread of hateful content by working on hate speech detection in "Derja", the Tunisian dialect. Used as a means of communication in daily life and on social media platforms, this dialect is a mixture of many languages, including Arabic, French, and Amazighi, and it can be written using Arabic letters. Due to the complexity of this language, there is a significant lack of publicly available, large, annotated datasets for hate speech detection in the Tunisian dialect written in Arabic letters, making "Tunisian Derja" an underrepresented dialect. In this paper, we introduce the largest publicly available dataset of its kind, consisting of more than 12k comments manually annotated as Hate or Neutral. We also provide an in-depth explanation of the data collection, annotation, and pre-processing processes. Moreover, we undertake a comprehensive evaluation of the dataset's efficacy with various machine learning models, including Support Vector Machines (SVM), Random Forest, and XGBoost.
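Evaluations of classifiers such as SVM, Random Forest, and XGBoost on a binary Hate/Neutral task are typically reported with per-class precision, recall, and F1. A minimal stdlib sketch of that metric (the label lists here are illustrative, not drawn from the dataset):

```python
def f1_score(y_true, y_pred, positive="Hate"):
    """F1 for the positive class, computed from scratch.

    tp: predicted positive and actually positive
    fp: predicted positive but actually negative
    fn: predicted negative but actually positive
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

In practice one would use a library implementation (e.g. scikit-learn's `f1_score`); the point of the sketch is only to make the reported metric concrete.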
The dynamic knowledge graph is a data structure that adds temporal information to the nodes and edges of a traditional knowledge graph. It describes the changing processes of entities and relationships over time, ther...
Misleading headlines are part of the disinformation problem. A headline should give a concise summary of the news story that helps the reader decide whether to read the body text of the article, which is why headline accuracy is a crucial element of a news story. This work focuses on detecting misleading headlines through the automatic identification of contradiction between the headline and body text of a news item. When a contradiction is detected, the reader is alerted to the lack of precision or trustworthiness of the headline in relation to the body text. To facilitate the automatic detection of misleading headlines, a new Spanish dataset (ES_Headline_Contradiction) is created for the purpose of identifying contradictory information between a headline and its body text. This dataset annotates the semantic relationship between headlines and body text by categorising the relation between the texts as compatible, contradictory, or unrelated. Another novel aspect of this dataset is that it distinguishes between different types of contradiction, thereby enabling a more fine-grained identification of them. The dataset was built via a novel semi-automatic methodology, which resulted in a more cost-efficient development process. The results of the experiments show that pre-trained language models can be fine-tuned with this dataset, producing very encouraging results for detecting incongruity or lack of relation between headline and body text.
Federated Learning (FL) recently emerges as a paradigm to train a global machine learning model across distributed clients without sharing raw data. Knowledge Graph (KG) embedding represents KGs in a continuous vector...