Tables,typically two-dimensional and structured to store large amounts of data,are essential in daily activities like database queries,spreadsheet manipulations,Web table question answering,and image table information...
详细信息
Tables,typically two-dimensional and structured to store large amounts of data,are essential in daily activities like database queries,spreadsheet manipulations,Web table question answering,and image table information *** these table-centric tasks with Large Language Models(LLMs)or Visual Language Models(VLMs)offers significant public benefits,garnering interest from academia and *** survey provides a comprehensive overview of table-related tasks,examining both user scenarios and technical *** covers traditional tasks like table question answering as well as emerging fields such as spreadsheet manipulation and table data *** summarize the training techniques for LLMs and VLMs tailored for table ***,we discuss prompt engineering,particularly the use of LLM-powered agents,for various tablerelated ***,we highlight several challenges,including diverse user input when serving and slow thinking using chainof-thought.
Multi-label stream classification aims to address the challenge of dynamically assigning multiple labels to sequentially arrived instances. In real situations, only partial labels of instances can be observed due to t...
详细信息
Graph convolutional network(GCN)as an essential tool in human action recognition tasks have achieved excellent performance in previous ***,most current skeleton-based action recognition using GCN methods use a shared ...
详细信息
Graph convolutional network(GCN)as an essential tool in human action recognition tasks have achieved excellent performance in previous ***,most current skeleton-based action recognition using GCN methods use a shared topology,which cannot flexibly adapt to the diverse correlations between joints under different motion *** video-shooting angle or the occlusion of the body parts may bring about errors when extracting the human pose coordinates with estimation *** this work,we propose a novel graph convolutional learning framework,called PCCTR-GCN,which integrates pose correction and channel topology refinement for skeleton-based human action ***,a pose correction module(PCM)is introduced,which corrects the pose coordinates of the input network to reduce the error in pose feature ***,channel topology refinement graph convolution(CTR-GC)is employed,which can dynamically learn the topology features and aggregate joint features in different channel dimensions so as to enhance the performance of graph convolution networks in feature ***,considering that the joint stream and bone stream of skeleton data and their dynamic information are also important for distinguishing different actions,we employ a multi-stream data fusion approach to improve the network’s recognition *** evaluate the model using top-1 and top-5 classification *** the benchmark datasets iMiGUE and Kinetics,the top-1 classification accuracy reaches 55.08%and 36.5%,respectively,while the top-5 classification accuracy reaches 89.98%and 59.2%,*** the NTU dataset,for the two benchmark RGB+Dsettings(X-Sub and X-View),the classification accuracy achieves 89.7%and 95.4%,respectively.
作者:
Zheng, TaoHou, QiyuChen, XingshuRen, HaoLi, MengLi, HongweiShen, ChangxiangSichuan University
School of Cyber Science and Engineering Chengdu610065 China Sichuan University
School of Cyber Science and Engineering Cyber Science Research Institute Key Laboratory of Data Protection and Intelligent Management Ministry of Education Chengdu610065 China Hefei University of Technology
Key Laboratory of Knowledge Engineering with Big Data Ministry of Education Intelligent Interconnected Systems Laboratory of Anhui Province School of Computer Science and Information Engineering Hefei230002 China University of Padua
Department of Mathematics HIT Center Padua35131 Italy University of Electronic Science and Technology of China
School of Computer Science and Engineering Chengdu611731 China Sichuan University
Cyber Science Research Institute Key Laboratory of Data Protection and Intelligent Management Ministry of Education Chengdu610065 China
Android malware authors often use packers to evade analysis. Although many unpacking tools have been proposed, they face two significant challenges: 1) They are easily impeded by anti-analysis techniques employed by p...
详细信息
Federated learning-based Named Entity Recognition (FNER) has attracted widespread attention through decentralized training on local clients. However, most FNER models assume that entity types are pre-fixed, so in prac...
详细信息
Represented by evolutionary algorithms and swarm intelligence algorithms, nature-inspired metaheuristics have been successfully applied to recommender systems and amply demonstrated effectiveness, in particular, for m...
详细信息
Breast cancer is a serious and high morbidity disease in women,and it is the main cause of cancer death in ***,getting tested and diagnosed early can reduce the risk of *** present,there are clinical examinations,imag...
详细信息
Breast cancer is a serious and high morbidity disease in women,and it is the main cause of cancer death in ***,getting tested and diagnosed early can reduce the risk of *** present,there are clinical examinations,imaging screening and biopsies,among which histopathological examination is the gold ***,the process is complicated and time-consuming,and misdiagnosis may *** paper puts forward a classification framework based on deep learning,introducing multi-attention mechanism,selecting kernel convolution instead of ordinary convolution,and using different weights and combinations to pay attention to the accuracy index and growth rate of the *** addition,we also compared the learning rate *** function can fine-tune the learning rate to achieve good performance,using label softening to reduce the loss error caused by model error recognition in the label,and assigning different category weights in the loss function to balance the positive and negative *** used the BreakHis data set to automatically classify histological images into benign and malignant,four categories and eight *** results showed that the accuracy of binary classifications ranged from 98.23%to 98.83%,and that of multiple classifications ranged from 97.89%to 98.11%.
Kolmogorov-Arnold Networks (KAN) is an emerging neural network architecture in machine learning. It has greatly interested the research community about whether KAN can be a promising alternative to the commonly used M...
详细信息
Query optimization is a critical task in database systems, focused on determining the most efficient way to execute a query from an enormous set of possible strategies. Traditional approaches rely on heuristic search ...
详细信息
Current inference scaling methods, such as Self-consistency and Best-of-N, have proven effective in improving the accuracy of LLMs on complex reasoning tasks. However, these methods rely heavily on the quality of cand...
详细信息
暂无评论