检索结果-内蒙古大学图书馆

2021 International Symposium on Artificial Intelligence and Intelligent Manufacturing, AIIM 2021

作者： Zhu, Yuchen College of Artificial Intelligence and Data Science Hebei University of Technology TianJin China

As an important research direction of computer vision, target detection has been widely used in face recognition, intelligent driving, robot navigation and other fields. In recent years, with the deepening research on deep learning, great progress has been made in the field of computer vision, such as image acquisition, image processing and target detection. Compared with the traditional target detection algorithm based on candidate regions, it has the problems of poor timeliness and slow detection speed. Recently, the popular target detection algorithm based on regression realizes the real sense of end-to-end detection and greatly improves the detection efficiency. However, the accuracy of small target detection and dense target detection has not been solved. In the future, we still need to improve the efficiency and accuracy of recognition on the existing basis, and solve the problem of small target and dense target detection to make it more widely used in practical application scenarios. In this paper, the principle, advantages and disadvantages, accuracy and other aspects of the above algorithms are introduced in detail, the problems existing in the target detection algorithm are summarized, and the future development direction has been prospected. In short, both algorithms have advantages and disadvantages, but the regression-based target detection algorithm has better practicality and development prospects. © Published under licence by IOP Publishing Ltd.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction

arXiv

引用

arXiv 2024年

作者： Shi, Yuchen Jiang, Guochao Qiu, Tian Yang, Deqing School of Data Science Fudan University Shanghai China

The relation extraction (RE) in complex scenarios faces some challenges such as diverse relation types and ambiguous relations between entities within a single sentence, leading to the poor performance of pure "text-in, text-out" language models (LMs). To address these challenges, in this paper we propose an agent-based RE framework, namely AgentRE, which employs a large language model (LLM) as the agent interacting with some modules to achieve complex RE tasks. Specifically, three major modules are built in AgentRE serving as the tools to help the agent acquire and process various useful information, thereby obtaining improved RE performance. Our extensive experimental results upon two datasets in English and Chinese, respectively, demonstrate our AgentRE’s superior performance, especially in low-resource scenarios. Additionally, the trajectories generated by AgentRE can be refined to construct a high-quality training dataset incorporating different reasoning methods, which can be used to fine-tune smaller models.1 Copyright © 2024, The Authors. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Combinative Bio-feature Proteins Generation via Pre-trained Protein Large Language Models

Combinative Bio-feature Proteins Generation via Pre-trained ...

引用

IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

作者： Xin Sun Yuhao Wu Center for Data Science Zhejiang University Hangzhou China

ISBN: (数字)9798350386226

ISBN: (纸本)9798350386233

Proteins serve as the functional building blocks of life, facilitating critical tasks such as signaling, catalysis, and structural support in all living organisms. Designing proteins with targeted biological features or domains is of utmost importance. Traditional wet-lab experiments are time-consuming and resource-intensive, which makes deep learning (DL) methods ideal alternatives. However, existing DL methods predominantly focus on generating new proteins with the same biological domain as the training data, and overlook some scenarios where designers expect to combine proteins from different biological domains to create novel proteins with both features, which can show better fits for practical purpose. To fill this gap, in this paper, we present ComProtein, a novel framework further exploiting the potential of pre-trained protein large language models, which is the first work aiming to generate innovative proteins with combinative biological features from two different domains. This process is performed by a cycle-consistent generative adversarial approach, leveraging insights from the latent space. It enables the transformation of protein representations from one biological domain to another, while preserving their intrinsic features. Additionally, we introduce new evaluative metrics, namely Shortest Target Neighbor Distance (STND), Mutual Root Mean Square Deviation (MRMSD) and Sequence Diversity (SD) on the evaluation of biological representations, protein structure and sequence quality, respectively to complement the existing measures. Our experimental results demonstrate that our proposed method performs better and has great potential in biological representations, structure similarity, homology relationships, and sequence quality.

关键词： Proteins Deep learning Large language models Biological system modeling Training data Feature extraction Protein sequence Organisms Root mean square Periodic structures

来源：评论

学校读者我要写书评

暂无评论

An exploration of research trends on metaverse: topic modeling with latent dirichlet allocation

引用

Quality and Quantity 2025年第1期59卷 233-252页

作者： Park, Hyejin Ahn, Buyoung Kim, Taejong Science Data Education Center Korea Institute of Science and Technology Information Daejeon South Korea AI Meteorological Research Division National Institute of Meteorological Sciences 33 Seohobuk-ro Seogwipo-si Jeju-do 63568 South Korea

Online platforms have supported users in collaborating and communicating with each other distantly. Adopting online platforms interconnected with the virtual world, especially the metaverse, has fostered interactive activities in diverse sectors. However, a deep understanding of how such platforms have been used and discussed in academia needs to be enriched. To tackle this issue, this study investigates the research trends of the metaverse. The search period was set to include all publication dates up to the analysis point on January 9th, 2022, so the research data of the papers published in the database until then was obtained. A total of 451 publications were collected and analyzed through the Latent Dirichlet Allocation algorithm of the topic modeling technique, exploring conspicuous topics, keywords, and pertinent publications over time. As a result, six topics were determined: Topic 1 (engaging in real and virtual worlds);Topic 2 (crypto marketplace);Topic 3 (teaching and learning in a virtual educational environment);Topic 4 (a figure of oneself traveling in a virtual world);Topic 5 (virtual marketplace reshaping retail);and Topic 6 (game-mediated activity in a virtual world community). The time series change in the number of publications on each topic was tracked, and an apparent increase was found since 2007. Further, keywords in each topic and relevant publications were obtained based on probability values and then elaborated. The findings illustrate what researchers have discussed regarding metaverse and suggest the direction for future study in the field. © The Author(s), under exclusive licence to Springer Nature B.V. 2024.

关键词： Latent dirichlet allocation LDA Metaverse Research trends Topic modeling

来源：评论

学校读者我要写书评

暂无评论

Optimizing the ALMA Research Proposal Process with Machine Learning

Optimizing the ALMA Research Proposal Process with Machine L...

引用

IEEE Symposium on Systems and Information Engineering Design, SIEDS

作者： Arnav Boppudi Ryan Lipps Noah McIntire Kaleigh O’Hara Brendan Puglisi Antonios Mamalakis School of Data Science University of Virginia Charlottesville Virginia

ISBN: (数字)9798350385144

ISBN: (纸本)9798350385151

Every year, astronomers from around the world submit research proposals to the Atacama Large Millimeter Array (ALMA), the largest radio telescope array in the world. The aim of the current work is to streamline the proposal process for astronomers submitting projects to ALMA by suggesting frequency ranges that may be relevant to their research based on their proposal text. We introduce a pipeline of supervised and unsupervised machine learning models, each using various representations of the title and abstract of an incoming proposal. First, a logistic regression filters out proposed projects that are not expected to need specific technical setups. Second, if a technical setup is deemed necessary, our pipeline assigns an incoming project to one of 50 "similar project" groups, defined by topics generated from Latent Dirichlet Allocation (LDA). Third, we apply Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) to mine patterns in measurements ("areas of interest") made in previous projects, for each one of the 50 "similar project" groups. In parallel to the aforementioned topic modeling and HDBSCAN mining, we employ a Multinomial Naive Bayes classifier to predict the broad frequency range defined by the technical limitations of ALMA (frequency band) that we expect a project to make measurements in. Finally, we offer researchers a list of the mined "areas of interest" filtered by the predictions of the Multinomial Naive Bayes classifier. Ultimately, given a proposed project title and abstract, our pipeline generates several recommended "areas of interest" that one should consider measuring *** the performance of our models, we find that 67.17% of test projects match at least one of the recommended "areas of interest", with an average hit rate of 44.72% across measurements within each test project, when limiting to the top two band predictions. When we disregard band predictions, 88.81% of test projects match at least one recomm

关键词： Radio astronomy Area measurement Pipelines Machine learning Predictive models Extraterrestrial measurements Frequency measurement

来源：评论

学校读者我要写书评

暂无评论

Anomaly Detection Through Graph Autoencoder-Based Learning of Screenshot Image Logs

Anomaly Detection Through Graph Autoencoder-Based Learning o...

引用

International Conference on Semantic Computing

作者： Yuki Ohkawa Takafumi Nakanishi Department of Data Science Musashino University Tokyo Japan

IT controls in information systems play an important role for companies. One common control is the management and verification of daily logs. Text-based logs (keystrokes, communication history, and application information) are often used to verify that the system is operating properly. However, some systems can only record PC screenshot image logs to prioritize stable operation. In such systems, checking the logs is time consuming, making it difficult to check the logs on a daily basis. In addition, if an auditor wants to detect anomalous operations, the auditor needs to know the correct operation of the system, which becomes very difficult when targeting a large number of systems. In this study, we aimed to convert user operations from screenshot images of PCs into graph structures and use the features of the graph structures for anomaly detection. The proposed method groups image features from screenshot images based on similarity, transforms feature transitions into graph structures, and detects anomalous operations using graph autoencoder-based learning. We demonstrate that the proposed method can detect anomalous operations with a recall rate of over 70%.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Deduplication and Approximate Analytics for Encrypted IoT data in Fog-Assisted Cloud Storage 24th

Deduplication and Approximate Analytics for Encrypted IoT ...

引用

24th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2024

作者： Wang, Rongxi Ha, Guanxiong Jia, Chunfu Li, Ruiqi Su, Zhen The College of Cyber Science Nankai University Tianjin China Tianjin Key Laboratory of Network and Data Security Technology Tianjin China The College of Safety Science and Engineering Civil Aviation University of China Tianjin China

ISBN: (纸本)9789819615476

With the advancement of the Internet of Things (IoT) technologies, there has been a rapid increase in the volume of IoT data, leading to escalating costs in storage, transmission, and analytics. The benefits of conventional data deduplication schemes are diminishing when applied to IoT data that is similar but distinct, necessitating the development of new approaches to accommodate these new scenarios. This paper proposes a deduplication and approximate analytics scheme for encrypted IoT data in fog-assisted cloud storage. The scheme is based on Generalized Deduplication (GD), Message Locked Encryption (MLE), Homomorphic Encryption (HE), and ciphertext conversion techniques. We employ GD to divide similar but distinct IoT data into bases and deviations, and perform deduplication on the encrypted base to achieve efficient storage while protecting data privacy. Additionally, we utilize Hybrid Homomorphic Encryption (HHE) techniques to convert the symmetric ciphertext of IoT data into homomorphic ciphertext, facilitating approximate analytics while ensuring privacy protection of IoT data in fog-assisted cloud storage and reducing the computation overhead on IoT devices. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Cloud storage

来源：评论

学校读者我要写书评

暂无评论

Fake News Detection and Classify the Category

Fake News Detection and Classify the Category

引用

2022 IEEE International Conference on Trends in Quantum Computing and Emerging Business Technologies, TQCEBT 2022

作者： Mitra, Pritha Jacob, Lija Christ University Pune Lavasa Campus Department of Data Science Pune India

ISBN: (纸本)9781665453615

Everyone depends on numerous sources of E-news in today's world when the internet is ubiquitous. Online content abounds, especially social media feeds, many of which are unreliable and may not always be factual. For people to utilise social media platforms like Facebook, Twitter, and others, fake news is a topic that may be studied through Natural Language Processing techniques. Using ideas from natural language processing and machine learning applied to social media, our goal in this work is to conduct categorization of different news items that are available online. Our intention is to empower the user to utilise NLP (Natural Language Processing) methods to identify 'fake news,' which refers to misinformed material that may be categorised as genuine or false using software like Python. The model focuses on identifying false news sources based on several articles from a website, categorising the news as false or true, and determining its veracity using unreliable sources like scikit-learn and NLP for textual analysis of the website distributing the news. When a source is identified as a publisher of false news, which can be predicted with high vectorization and also suggested using the Python scikit-learn module to do tokenization and feature development, biased viewpoints may be identified and categorised in any subsequent articles from that source. © 2022 IEEE.

关键词： Python

来源：评论

学校读者我要写书评

暂无评论

Semi-supervised Model-Based Clustering for Ordinal data 21st

Semi-supervised Model-Based Clustering for Ordinal Data

引用

21st Australasian Conference on data science and Machine Learning, AusDM 2023

作者： Cui, Ying McMillan, Louise Liu, Ivy School of Mathematics and Statistics Victoria University of Wellington Wellington New Zealand Centre for Data Science and Artificial Intelligence Victoria University of Wellington Wellington New Zealand

ISBN: (纸本)9789819986958

This paper introduces a semi-supervised learning technique for model-based clustering. Our research focus is on applying it to matrices of ordered categorical response data, such as those obtained from the surveys with Likert scale responses. We use the proportional odds model, which is popular and widely used for analyzing such data, as the model structure. Our proposed technique is designed for analyzing datasets that contain both labeled and unlabeled observations from multiple clusters. The model fitting is performed using the expectation-maximization (EM) algorithm, incorporating the labeled cluster memberships, to cluster the unlabeled observations. To evaluate the performance of our proposed model, we conducted a simulation study in which we tested the model from eight different scenarios, each with varying combinations and proportions of known and unknown cluster memberships. The fitted models accurately estimate the parameters in most of the designed scenarios, indicating that our technique is effective in clustering partially-labeled data with ordered categorical response variables. © 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： clustering EM algorithm Likert scale data ordinal data proportional odds model semi-supervised learning

来源：评论

学校读者我要写书评

暂无评论

An Encoder-Agnostic Weakly Supervised Method For Describing Textures

An Encoder-Agnostic Weakly Supervised Method For Describing ...

引用

IEEE Workshop on Applications of Computer Vision (WACV)

作者： Shangbo Mao Deepu Rajan College of Computing and Data Science Nanyang Technological University Singapore

ISBN: (数字)9798331510831

ISBN: (纸本)9798331510848

Recent advances in Large Language Models (LLMs) have enabled the semantic description of textures in natural language, aiming to capture them in richer detail. However, most methods are confined to either depending on supervised training with pairs of images and manually annotated visual attributes that most texture datasets lack or using Vision-Language Models (VLMs) such as CLIP. In this paper, we develop an encoder-agnostic Weakly supervised Texture Description Generator (WTDG) that employs a novel Scaled Ranked Kullback-Leibler divergence (SR-KL) loss between image and text modalities. Within the SR-KL loss formulation, we leverage category information, which is always available as ground-truths for all benchmark texture recognition datasets. We further extend our proposed WTDG to assist in texture recognition by using its generated texture descriptions. Thus, we develop a multimodal framework, called $T e x^2$ , which is adept at simultaneous generation of texture description and recognition. Our approach exhibits promising performance in describing and recognizing textures on benchmark datasets.

关键词： Training Visualization Computer vision Accuracy Large language models Semantics Natural languages Benchmark testing Generators

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：