Current metagenomic tools can fail to identify highly divergent RNA *** developed a deep learning algorithm,termed LucaProt,to discover highly divergent RNA-dependent RNA polymerase(RdRP)sequences in 10,487 metatransc...
详细信息
Current metagenomic tools can fail to identify highly divergent RNA *** developed a deep learning algorithm,termed LucaProt,to discover highly divergent RNA-dependent RNA polymerase(RdRP)sequences in 10,487 metatranscriptomes generated from diverse global *** integrates both sequence and predicted structural information,enabling the accurate detection of RdRP *** this approach,we identified 161,979 potential RNA virus species and 180 RNA virus supergroups,including many previously poorly studied groups,as well as RNA virus genomes of exceptional length(up to 47,250 nucleotides)and genomic complexity.
Nowadays, vision-based computing tasks play an important role in various real-world applications. However, many vision computing tasks, e.g. semantic segmentation, are usually computationally expensive, posing a chall...
详细信息
WORLD HERITAGE AND SPACE TECHNOLOGY The Convention Concerning the Protection of the World Cultural and Natural Heritage(WHC),adopted by United Nations educational,Scientific and Cultural Organization(UNESCO)on Novembe...
详细信息
WORLD HERITAGE AND SPACE TECHNOLOGY The Convention Concerning the Protection of the World Cultural and Natural Heritage(WHC),adopted by United Nations educational,Scientific and Cultural Organization(UNESCO)on November 16,1972,aims to ensure the identification,protection,conservation,presentation,and transmission to future generations of the world’s cultural and natural *** WHC works toward these goals by emphasizing the Outstanding Universal Value(OUV)of heritage sites and the unique contribution such places can make to conservation and human development agendas.1 As of the end of January 2023,theWHC has been signed by 194 state parties,covering 1,157 sites(including 900 cultural,218 natural,and 39 mixed properties),55 of which are considered to be in *** sites,totaling an area of more than 370 million hectares are designated as World Heritage(WH)sites(https://***/en/list/).WH sites have played a significant role in the sustainable development of society globally and helped effectively maintain and preserve the cultural diversity and global biodiversity of the Earth.
Deep learning (DL), a pivotal technology in artificial intelligence, has recently gained substantial traction in the domain of dental auxiliary diagnosis. However, its application has predominantly been confined to im...
详细信息
Cross-silo federated learning (FL) enables multiple institutions (clients) to collaboratively build a global model without sharing private data. To prevent privacy leakage during aggregation, homomorphic encryption (H...
详细信息
The rapid and unrestrained advancement of generative artificial intelligence (AI) presents a double-edged sword: while enabling unprecedented creativity, it also facilitates the generation of highly convincing decepti...
详细信息
The rapid and unrestrained advancement of generative artificial intelligence (AI) presents a double-edged sword: while enabling unprecedented creativity, it also facilitates the generation of highly convincing deceptive content, undermining societal trust. As image generation techniques become increasingly sophisticated, detecting synthetic images is no longer just a binary task—it necessitates interpretable, context-aware methodologies that enhance trustworthiness and transparency. However, existing detection models primarily focus on classification, offering limited explanatory insights into image authenticity. In this work, we propose FakeScope, an expert multimodal model (LMM) tailored for AI-generated image forensics, which not only identifies AI-synthetic images with high accuracy but also provides rich, interpretable, and query-driven forensic insights. To this end, we first construct FakeChain dataset that contains linguistic authenticity reasoning based on visual trace evidence, developed through a novel human-machine collaborative framework. Building upon it, we further present FakeInstruct, the largest multimodal instruction tuning dataset containing 2 million visual instructions tailored to enhance forensic awareness in LMMs. Leveraging the knowledge of FakeInstruct, FakeScope achieves state-of-the-art performance in both closed-ended and open-ended forensic scenarios. It can distinguish synthetic images with high accuracy while offering coherent and insightful explanations, free-form discussions on fine-grained forgery attributes, and actionable enhancement strategies. Notably, despite being trained exclusively on qualitative hard labels, FakeScope demonstrates remarkable zero-shot quantitative capability on detection, enabled by our proposed token-based probability estimation strategy. Furthermore, FakeScope exhibits strong generalization and in-the-wild ability, ensuring its applicability in real-world scenarios. The data, model, and demo will be publ
Named entity recognition (NER) is a fundamental task in the natural language processing (NLP) area. Recently, representation learning methods (e.g., character embedding and word embedding) have achieved promising reco...
详细信息
Short text classification methods have achieved significant progress and wide application on text data such as Twitter and Weibo. However, the extremely short chinese texts like tax invoice data are different with tra...
详细信息
Short text classification methods have achieved significant progress and wide application on text data such as Twitter and Weibo. However, the extremely short chinese texts like tax invoice data are different with traditional short texts in lackness of contextual semantic information, feature sparseness and extremely short length. The existing short text classification methods are difficult to achieve a satisfactory performance in these texts. To address these problems, this paper proposes a text classification method based on bidirectional semantic extension for extremely short texts like Chinese tax invoice data. More specifically, firstly, the Chinese knowledge graph is introduced for extending bidirectional semantic of texts and label data to expand the extremely short texts and ease the problem of feature sparseness;secondly,the hash vectorization is used to avoid the semantic problem caused by the lackness of contextual information. Experimental results conducted the real tax invoice dataset demonstrate the effectiveness of our proposed method.
Artificial intelligence-enabled database technology, known as AI4DB (Artificial Intelligence for databases), is an active research area attracting significant attention and innovation. This survey first introduces the...
详细信息
Artificial intelligence-enabled database technology, known as AI4DB (Artificial Intelligence for databases), is an active research area attracting significant attention and innovation. This survey first introduces the background of learning-based database techniques. It then reviews advanced query optimization methods for learning databases, focusing on four popular directions: cardinality/cost estimation, learning-based join order selection, learning-based end-to-end optimizers, and text-to-SQL models. Cardinality/cost estimation is classified into supervised and unsupervised methods based on learning models, with illustrative examples provided to explain the working mechanisms. Detailed descriptions of various query optimizers are also given to elucidate the working mechanisms of each component in learning query optimizers. Additionally, we discuss the challenges and development opportunities of learning query optimizers. The survey further explores text-to-SQL models, a new research area within AI4DB. Finally, we consider the future development prospects of learning databases.
The potential pattern changes in brain micro-structure can be used for the brain development assessment in children and adolescents by MRI scans. In this paper, we propose a highly accurate and efficient end-to-end br...
详细信息
ISBN:
(数字)9781538693308
ISBN:
(纸本)9781538693315
The potential pattern changes in brain micro-structure can be used for the brain development assessment in children and adolescents by MRI scans. In this paper, we propose a highly accurate and efficient end-to-end brain age estimation network (BAENET) on T1-weighted MRI images. On the network, 3D skipping and outlier constraint loss are designed to accommodate deeper network and increase the robustness. Besides, we incorporate the neuroimaging domain knowledge into stratified sampling for better generalization ability for datasets with different age distributions, and gender learning for more gender-specific features during modeling. We verify the effectiveness of the proposed method on the public ABIDE2 and ADHD200 benchmark, consisting of 382 and 378 normal children scans respectively. Our BAENET achieves MAE of 1.11 and 1.16, significantly outperforming the best reported methods by 5.1 % and 9.4%.
暂无评论