检索结果-内蒙古大学图书馆

Overview of the Tenth Dialog system technology Challenge: DSTC10

IEEE/ACM Transactions on Audio Speech and Language Processing 2024年 32卷 765-778页

作者： Yoshino, Koichiro Chen, Yun-Nung Crook, Paul Kottur, Satwik Li, Jinchao Hedayatnia, Behnam Moon, Seungwhan Fei, Zhengcong Li, Zekang Zhang, Jinchao Feng, Yang Zhou, Jie Kim, Seokhwan Liu, Yang Jin, Di Papangelis, Alexandros Gopalakrishnan, Karthik Hakkani-Tur, Dilek Damavandi, Babak Geramifard, Alborz Hori, Chiori Shah, Ankit Zhang, Chen Li, Haizhou Sedoc, Joao D'haro, Luis F. Banchs, Rafael Rudnicky, Alexander Guardian Robot Project R-IH RIKEN 2-2-2 Hikaridai Seika Shoraku619-0288 Japan Information Science Nara Institute of Science and Technology Ikoma630-0101 Japan Computer Science and Information Engineering National Taiwan University Taipei10617 Taiwan Inc. Palo AltoCA95054 United States Alexa AI *** Inc. SunnyvaleCA94089 United States Meta Seattle RedmondWA98052 United States Institute of Computing Technology Chinese Academy of Sciences Beijing100190 China Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing100190 China Tencent AI Lab Beijing Beijing China Kexueyuan South Road Zhongguancun Beijing100190 China Beijing 100190 China Alexa AI *** Inc. SunnyvaleCA United States 1120 Enterprise way Sunnyvale94089 United States *** Inc. SeattleWA United States Menlo Park CA United States Audio and Speech Group Mitsubishi Electric Research Laboratories CambridgeMA02139-1955 United States Carnegie Mellon University Department of Language and Information Technologies or just Carnegie Mellon University Pittsburgh United States National University of Singapore Singapore Singapore Department of Electrical and Computer Engineering National University of Singapore Singapore Singapore Shenzhen Research Institute of Big Data School of Data Science Chinese University of Hong Kong Shenzhen518172 China New York University New YorkNY United States ETSI de Telecomunicacion - Speech Technology and Machine Learning Group Universidad Politecnica de Madrid Ciudad Universitaria Madrid28040 Spain Nanyang Technological University Singapore Singapore Carnegie Mellon University PittsburghPA United States

This article introduces the Tenth Dialog system technology Challenge (DSTC-10). This edition of the DSTC focuses on applying end-to-end dialog technologies for five distinct tasks in dialog systems, namely 1. Incorporation of Meme images into open domain dialogs, 2. Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations, 3. Situated Interactive Multimodal dialogs, 4. Reasoning for Audio Visual Scene-Aware Dialog, and 5. Automatic Evaluation and Moderation of Open-domainDialogue systems. This article describes the task definition, provided datasets, baselines, and evaluation setup for each track. We also summarize the results of the submitted systems to highlight the general trends of the state-of-the-art technologies for the tasks. © 2023 The Authors.

关键词： Job analysis

来源：评论

学校读者我要写书评

暂无评论

Size-invariance matters: rethinking metrics and losses for imbalanced multi-object salient object detection 24

Size-invariance matters: rethinking metrics and losses for i...

引用

Proceedings of the 41st International Conference on Machine Learning

作者： Feiran Li Qianqian Xu Shilong Bao Zhiyong Yang Runmin Cong Xiaochun Cao Qingming Huang Institute of Information Engineering Chinese Academy of Sciences Beijing China and School of Cyber Security University of Chinese Academy of Sciences Beijing China Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing China School of Computer Science and Technology University of Chinese Academy of Sciences Beijing China Institute of Information Science Beijing Jiaotong University Beijing China and School of Control Science and Engineering Shandong University Jinan China and Key Laboratory of Machine Intelligence and System Control Ministry of Education Jinan China School of Cyber Science and Tech. Shenzhen Campus Sun Yat-sen University School of Computer Science and Technology University of Chinese Academy of Sciences Beijing China and Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing China and Key Laboratory of Big Data Mining and Knowledge Management Chinese Academy of Sciences Beijing China

This paper explores the size-invariance of evaluation metrics in Salient Object Detection (SOD), especially when multiple targets of diverse sizes co-exist in the same image. We observe that current metrics are size-sensitive, where larger objects are focused, and smaller ones tend to be ignored. We argue that the evaluation should be size-invariant because bias based on size is unjustified without additional semantic information. In pursuit of this, we propose a generic approach that evaluates each salient object separately and then combines the results, effectively alleviating the imbalance. We further develop an optimization framework tailored to this goal, achieving considerable improvements in detecting objects of different sizes. Theoretically, we provide evidence supporting the validity of our new metrics and present the generalization analysis of SOD. Extensive experiments demonstrate the effectiveness of our method. The code is available at https://***/Ferry-Li/SI-SOD.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Size-Invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

arXiv

引用

arXiv 2024年

作者： Li, Feiran Xu, Qianqian Bao, Shilong Yang, Zhiyong Cong, Runmin Cao, Xiaochun Huang, Qingming Institute of Information Engineering Chinese Academy of Sciences Beijing China School of Cyber Security University of Chinese Academy of Sciences Beijing China Key Laboratory of Intelligent Information Processing Institute of Computing Technology Chinese Academy of Sciences Beijing China School of Computer Science and Technology University of Chinese Academy of Sciences Beijing China Institute of Information Science Beijing Jiaotong University Beijing China School of Control Science and Engineering Shandong University Jinan China Key Laboratory of Machine Intelligence and System Control Ministry of Education Jinan China School of Cyber Science and Tech. Sun Yat-Sen University Shenzhen Campus China Key Laboratory of Big Data Mining and Knowledge Management Chinese Academy of Sciences Beijing China

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

Disentangled Cascaded Graph Convolution Networks for Multi-Behavior Recommendation

arXiv

引用

arXiv 2024年

作者： Cheng, Zhiyong Dong, Jianhua Liu, Fan Zhu, Lei Yang, Xun Wang, Meng School of Computer Science and Information Engineering Hefei University of Technology No. 485 Danxia Road Anhui Hefei230009 China Shandong Artificial Intelligence Institute Qilu University of Technology Shandong Academy of Sciences No. 19 Keyuan Road Shandong Jinan250014 China School of Computing National University of Singapore 21 Lower Kent Ridge Road Singapore119077 Singapore School of Electronic and Information Engineering University of Tongji No. 4800 Caoan Road Shanghai201804 China School of Information Science and Technology University of Science and Technology of China No. 443 Huangshan Road Anhui Hefei230027 China Key Laboratory of Knowledge Engineering with Big Data Hefei University of Technology Institute of Artificial Intelligence Hefei Comprehensive National Science Centery No. 485 Danxia Road Anhui Hefei230009 China

Multi-behavioral recommender systems have emerged as a solution to address data sparsity and cold-start issues by incorporating auxiliary behaviors alongside target behaviors. However, existing models struggle to accurately capture varying user preferences across different behaviors and fail to account for diverse item preferences within behaviors. Various user preference factors (such as price or quality) entangled in the behavior may lead to sub-optimization problems. Furthermore, these models overlook the personalized nature of user behavioral preferences by employing uniform transformation networks for all users and items. To tackle these challenges, we propose the Disentangled Cascaded Graph Convolutional Network (Disen-CGCN), a novel multi-behavior recommendation model. Disen-CGCN employs disentangled representation techniques to effectively separate factors within user and item representations, ensuring their independence. In addition, it incorporates a multi-behavioral meta-network, enabling personalized feature transformation across user and item behaviors. Furthermore, an attention mechanism captures user preferences for different item factors within each behavior. By leveraging attention weights, we aggregate user and item embeddings separately for each behavior, computing preference scores that predict overall user preferences for items. Our evaluation on benchmark datasets demonstrates the superiority of Disen-CGCN over state-of-the-art models, showcasing an average performance improvement of 7.07% and 9.00% on respective datasets. These results highlight Disen-CGCN’s ability to effectively leverage multi-behavioral data, leading to more accurate recommendations. Copyright © 2024, The Authors. All rights reserved.

关键词： Recommender systems

来源：评论

学校读者我要写书评

暂无评论

Transferring knowledge distillation for multilingual social event detection

arXiv

引用

arXiv 2021年

作者： Ren, Jiaqian Peng, Hao Jiang, Lei Wu, Jia Tong, Yongxin Wang, Lihong Bai, Xu Wang, Bo Yang, Qiang Institute of Information Engineering Chinese Academy of Sciences The School of Cyber Security University of Chinese Academy of Sciences Beijing100041 China Beijing Advanced Innovation Center for Big Data and Brain Computing Beihang University Beijing100083 China The School of Cyber Science and Technology Beihang University Beijing100083 China Institute of Information Engineering Chinese Academy of Sciences Beijing100041 China The Department of Computing Macquarie University Sydney Australia The State Key Laboratory of Software Development Environment The School of Computer Science and Engineering Beihang University Beijing100083 China The National Computer Network Emergency Response Technical Team Coordination Center of China Beijing100029 China The Department of Computer Science and Engineering Hong Kong University of Science and Technology Hong Kong AI Group WeBank Co. Ltd. China

Recently published graph neural networks (GNNs) show promising performance at social event detection tasks. However, most studies are oriented toward monolingual data in languages with abundant training samples. This has left the more common multilingual settings and lesser-spoken languages relatively unexplored. Thus, we present a GNN that incorporates cross-lingual word embeddings for detecting events in multilingual data streams. The first exploit is to make the GNN work with multilingual data. For this, we outline a construction strategy that aligns messages in different languages at both the node and semantic levels. Relationships between messages are established by merging entities that are the same but are referred to in different languages. Non-English message representations are converted into English semantic space via the cross-lingual word embeddings. The resulting message graph is then uniformly encoded by a GNN model. In special cases where a lesser-spoken language needs to be detected, a novel cross-lingual knowledge distillation framework, called CLKD, exploits prior knowledge learned from similar threads in English to make up for the paucity of annotated data. Experiments on both synthetic and real-world datasets show the framework to be highly effective at detection in both multilingual data and in languages where training samples are scarce. © 2021, CC BY.

关键词： Embeddings

来源：评论

学校读者我要写书评

暂无评论

Contrastive Learning based Speech Spoofing Detection for Multimedia Security in Edge Intelligence

引用

ACM Transactions on Multimedia computing, Communications, and Applications 1000年

作者： Jiaqi Sun Xianjun Deng Shenghao liu Xiaoxuan Fan Yongling Huang Yuanyuan He Celimuge Wu Jong Hyuk Park Hubei Key Laboratory of Distributed System Security Hubei Engineering Research Center on Big Data Security School of Cyber Science and Engineering Huazhong University of Science and Technology China The University of Electro-Communications Japan Seoul National University of Science and Technology South Korea

Artificial intelligence (AI) empowered edge computing has given rise to a new paradigm and effectively facilitated the promotion and development of multimedia applications. The speech assistant is one of the significant services provided by multimedia applications, which aims to offer intelligent interactive experiences between humans and machines. However, malicious attackers may exploit spoofed speeches to deceive speech assistants, posing great challenges to the security of multimedia applications. The limited resources of multimedia terminal devices hinder their ability to effectively load speech spoofing detection models. Furthermore, processing and analyzing speech in the cloud can result in poor real-time performance and potential privacy risks. Existing speech spoofing detection methods rely heavily on annotated data and exhibit poor generalization capabilities for unseen spoofed speeches. To address these challenges, this paper first proposes the Coordinate Attention Network (CA2Net) that consists of coordinate attention blocks and Res2Net blocks. CA2Net can simultaneously extract temporal and spectral speech feature information and represent multi-scale speech features at a granularity level. Besides, a contrastive learning-based speech spoofing detection framework named GEMINI is proposed. GEMINI can be effectively deployed on edge nodes and autonomously learn speech features with strong generalization capabilities. GEMINI first performs data augmentation on speech signals and extracts conventional acoustic features to enhance the feature robustness. Subsequently, GEMINI utilizes the proposed CA2Net to further explore the discriminative speech features. Then, a tensor-based multi-attention comparison model is employed to maximize the consistency between speech contexts. GEMINI continuously updates CA2Net with contrastive learning, which enables CA2Net to effectively represent speech signals and accurately detect spoofed speeches. Extensive experiments on

关键词： Edge intelligence Multimedia applications Speech spoofing detection Contrastive learning Coordinate attention

来源：评论

学校读者我要写书评

暂无评论

Two Efficient Beamforming Methods for Hybrid IRS-aided AF Relay Wireless Networks

arXiv

引用

arXiv 2023年

作者： Wang, Xuehui Shu, Feng Huang, Mengxing Zhou, Fuhui Chen, Riqing Pan, Cunhua Wu, Yongpeng Wang, Jiangzhou The School of Information and Communication Engineering Hainan University Haikou570228 China The School of Information and Communication Engineering and Collaborative Innovation Center of Information Technology Hainan University Haikou570228 China The School of Electronic and Optical Engineering Nanjing University of Science and Technology Nanjing210094 China The College of Electronic and Information Engineering Nanjing University of Aeronautics and Astronautics Nanjing210000 China The Key Laboratory of Dynamic Cognitive System of Electromagnetic Spectrum Space Nanjing University of Aeronautics and Astronautics Nanjing210000 China The Ministry of Industry and Information Technology Nanjing211106 China The Digital Fujian Institute of Big Data for Agriculture Fujian Agriculture and Forestry University Fuzhou350002 China National Mobile Communications Research Laboratory Southeast University Nanjing211111 China The Shanghai Key Laboratory of Navigation and Location Based Services Shanghai Jiao Tong University Minhang200240 China The School of Engineering University of Kent CanterburyCT2 7NT United Kingdom

Due to the "double fading" effect caused by conventional passive intelligent reflecting surface (IRS), the signal via the reflection link is weak. To enhance the received signal, active elements with the ability to amplify the reflected signal are introduced to the passive IRS forming hybrid IRS. In this paper, we propose a hybrid IRS-aided amplify-and-forward (AF) relay wireless network, where an optimization problem is formulated, which is subject to the constraints of transmit power budgets at the source/AF relay/hybrid IRS and that of unit modulus for passive IRS elements. By alternately designing the beamforming matrix at AF relay and the reflecting coefficient matrices at IRS, signal-to-noise ratio can be maximized. To achieve high rate performance and extend the coverage range, a high-performance method based on semidefinite relaxation and fractional programming (HP-SDR-FP) algorithm is presented. Due to its extremely high complexity, a low-complexity method based on whitening filter, general power iterative and generalized Rayleigh-Ritz (WF-GPI-GRR) is proposed, which is different from HP-SDR-FP method. It is assumed that the amplifying coefficient of each active IRS element is equal, and the corresponding analytical solution of the amplifying coefficient can be obtained according to the transmit powers at AF relay and hybrid IRS. Simulation results show that the proposed two methods can greatly improve the rate performance compared to the existing networks, such as the passive IRS-aided AF relay and only AF relay network. In particular, a 50.0% rate gain over the existing networks is approximately achieved in the high power budget region of hybrid IRS. Moreover, it is verified that the proposed HP-SDR-FP method perform better than WF-GPI-GRR method in terms of rate performance. Copyright © 2023, The Authors. All rights reserved.

关键词： Wireless networks

来源：评论

学校读者我要写书评

暂无评论

Errata to “On-Edge Multi-Task Transfer Learning: Model and Practice With data-Driven Task Allocation”

引用

IEEE Transactions on Parallel and Distributed systems 2020年第11期31卷 2569-2569页

作者： Qiong Chen Zimu Zheng Chuang Hu Dan Wang Fangming Liu National Engineering Research Center for Big Data Technology and System Huazhong University of Science and Technology Wuhan China Edge Cloud Innovation Lab Technical Innovation Department Cloud BU Huawei Technologies Company Ltd. Shenzhen China Department of Computing Hong Kong Polytechnic University Kowloon Hong Kong

Presents corrections to author information for the above named paper.

关键词： Task analysis Resource management Technological innovation Computational modeling big data Service computing Computer science

来源：评论

学校读者我要写书评

暂无评论

Ti interstitial flows giving rutile TiO2 reoxidation process enhanced in (001) surface

arXiv

引用

arXiv 2019年

作者： Ichibha, Tom Benali, Anouar Hongo, Kenta Maezono, Ryo School of Information Science JAIST 1-1 Asahidai NomiIshikawa923-1292 Japan Computational Science Division Argonne National Laboratory 9700 Cass Avenue LemontIL60439 United States Research Center for Advanced Computing Infrastructure JAIST 1-1 Asahidai NomiIshikawa923-1292 Japan Center for Materials Research by Information Integration Research and Services Division of Materials Data and Integrated System National Institute for Materials Science 1-2-1 Sengen Tsukuba305-0047 Japan PRESTO Japan Science and Technology Agency 4-1-8 Honcho Kawaguchi-shi Saitama322-0012 Japan Computational Engineering Applications Unit RIKEN 2-1 Hirosawa Wako Saitama351-0198 Japan

We revisited ab initio evaluations of the energy barriers along the possible diffusion paths of the defects in rutile TiO2. By using a method carefully considering the cancellation of the self-interaction, Ti interstitials hopping along c-axis are identified as the major diffusion directing to [001] surface. The conclusion is contradicting to any of previous theoretical works, and the discrepancy is explained by the overestimation of the radius of defects due to the poor cancellations in the previous works. The updated prediction here can explain the superior photocatalysis activity in [001] surface to [110]. Copyright © 2019, The Authors. All rights reserved.

关键词： Titanium dioxide

来源：评论

学校读者我要写书评

暂无评论

Reports of the AAAI 2014 conference workshops

Reports of the AAAI 2014 conference workshops

引用

作者： Albrecht, Stefano V. Barreto, André M. S. Braziunas, Darius Buckeridge, David L. Cuayáhuitl, Heriberto Dethlefs, Nina Endres, Markus Farahmand, Amir-Massoud Fox, Mark Frommberger, Lutz Ganzfried, Sam Guillet, Sébastien Gil, Yolanda Hunter, Lawrence E. Jhala, Arnav Kersting, Kristian Konidaris, George Lecue, Freddy McIlraith, Sheila Natarajan, Sriraam Noorian, Zeinab Poole, David Ronfard, Rémi Saffiotti, Alessandro Shaban-Nejad, Arash Srivastava, Biplav Tesauro, Gerald Uceda-Sosa, Rosario Van Den Broeck, Guy Van Otterlo, Martijn Wallace, Byron C. Weng, Paul Wiens, Jenna Zhang, Jie School of Informatics University of Edinburgh United Kingdom Department of Applied and Computational Mathematics Brazilian National Laboratory Brazil Big Data Group Kobo Inc. Brazil Department of Epidemiology and Biostatistics McGill University Canada School of Mathematical and Computer Sciences Heriot-Watt University in Edinburgh United Kingdom University of Augsburg Germany Carnegie Mellon University United States University of Toronto Canada Cognitive Systems Group University of Bremen Germany LIARA UQAC Chicoutimi Canada Information Sciences Institute University of Southern California Computer Science Department United States Center for Computational Pharmacology University of Colorado School of Medicine United States Department of Computational Media University of California Santa Cruz United States Computer Science Department Technical University of Dortmund Germany MIT Computer Science and Artificial Intelligence Laboratory CambridgeMA United States IBM Research - Smarter Cities Technology Centre Dubline Ireland School of Informatics and Computing Indiana University United States Department of Computer Science University of Saskatchewan Canada University of British Columbia Canada INRIA University of Grenoble France Orebro University Sweden McGill Clinical and Health Informatics Group McGill University Canada IBM Research United States Cognitive Computing Department IBM Research Yorktown HeightsNY United States Computer Science Department Katholieke Universiteit Leuven Belgium Cognitive Artificial Intelligence Department Radboud University Nijmegen Netherlands University of Texas Austin United States Pierre and Marie Curie University Paris France University of Michigan United States School of Computer Engineering Nanyang Technological University Singapore

The AAAI-14 Workshop program was held Sunday and Monday, July 27-28, 2014, at the Québec City Convention Centre in Québec, Canada. The AAAI-14 workshop program included 15 workshops covering a wide range of topics in artificial intelligence. The titles of the workshops were Artificial Intelligence and Robotics;Artificial Intelligence Applied to Assistive Technologies and Smart Environments;Cognitive computing for Augmented Human Intelligence;Computer Poker and Imperfect Information;Discovery Informatics;Incentives and Trust in Electronic Communities;Intelligent Cinematography and Editing;Machine Learning for Interactive systems: Bridging the Gap Between Perception, Action, and Communication;Modern Artificial Intelligence for Health Analytics;Multiagent Interaction Without Prior Coordination;Multidisciplinary Workshop on Advances in Preference Handling;Semantic Cities - Beyond Open data to Models, Standards, and Reasoning;Sequential Decision Making with big data;Statistical Relational AI;and the World Wide Web and Public Health Intelligence. This article presents short summaries of those events. © 2015, Association for the Advancement of Artificial Intelligence. All rights reserved.

关键词： Decision making

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：