Zero pronoun resolution aims at recognizing dropped pronouns and pointing out their anaphoric mentions, while non-zero coreference resolution targets at clustering mentions referring to the same entity. Existing effor...
Unbalanced walking is increasingly common among older adults;therefore, routinely assessing the balance of older adults is crucial. The traditional method of assessing balance uses scales, requires the supervision of ...
详细信息
This research presents an innovative method for blood bank management using Cloud-based Long Short-Term Memory (LSTM) models for precise inventory forecasting and optimization. The objective of this research is to inc...
详细信息
Future food security is a major concern of the 21st century with the growing global population and climate changes. In addressing these challenges, protected cropping ensures food production year-round and increases c...
详细信息
Cell instance segmentation is a fundamental task in analyzing microscopy images, with applications in computer-aided biomedical research. In recent years, deep learning techniques have been widely used in this field. ...
详细信息
Aiming at the problem that existing image description models cannot model high-order multimodal feature interaction, this paper introduces the X-Linear attention mechanism, which uses bilinear pooling and ELU activati...
详细信息
We study the problem of robust multivariate polynomial regression: let p: Rn → R be an unknown n-variate polynomial of degree at most d in each variable. We are given as input a set of random samples (xi, yi) ∈ [−1,...
详细信息
Malaria is a communicable disease with half of the global population at risk due to its high morbidity and mortality rates. A massive number of studies are dedicated to malaria research, so it plays a key role in form...
详细信息
ISBN:
(纸本)9781665473286
Malaria is a communicable disease with half of the global population at risk due to its high morbidity and mortality rates. A massive number of studies are dedicated to malaria research, so it plays a key role in formulating the proper prevention strategy and effective malaria treatment. With the overwhelming number of updated publications in the field, an unsupervised text mining approach such as topic modeling may provide an alternative method for the malaria researcher to keep pace with new insights. In this work, we collect metadata of malaria publications from the PubMed database to perform BERT-based topic modeling to find well-defined topics regarding malaria research. The method is largely based on the popular BERTopic pipeline. We compare the performance of three different language models to generate document embeddings from the data. The dimension reduction and the density-based clustering algorithm are used to cluster the embeddings. The topic representation is computed based on the semantic similarity of the class TF-IDF representation. The substance of the resulting topics is then manually annotated based on the top words of each topic. We demonstrate that by merging initial topics into larger topics using hierarchical clustering and manual content-based examination, the evaluated coherence measure can be further improved, thus enhancing the topic's interpretability. Our modeling result is able to extract ten major topics recurring in the malaria research publication published from 2017–2022. The result provides preliminary insight to understand the dynamics and patterns of malaria research over the years
Deep hashing is an appealing approach for large-scale image retrieval. Most existing supervised deep hashing methods learn hash functions using pairwise or triple image similarities in randomly sampled mini-batches. T...
Deep hashing is an appealing approach for large-scale image retrieval. Most existing supervised deep hashing methods learn hash functions using pairwise or triple image similarities in randomly sampled mini-batches. They suffer from low training efficiency, insufficient coverage of data distribution, and pair imbalance problems. Recently, central similarity quantization (CSQ) attacks the above problems by using “hash centers” as a global similarity metric, which encourages the hash codes of similar images to approach their common hash center and distance themselves from other hash centers. Although achieving SOTA retrieval performance, CSQ falls short of a worst-case guarantee on the minimal distance between its constructed hash centers, i.e. the hash centers can be arbitrarily close. This paper presents an optimization method that finds hash centers with a constraint on the minimal distance between any pair of hash centers, which is non-trivial due to the non-convex nature of the problem. More importantly, we adopt the Gilbert-Varshamov bound from coding theory, which helps us to obtain a large minimal distance while ensuring the empirical feasibility of our optimization approach. With these clearly-separated hash centers, each is assigned to one image class, we propose several effective loss functions to train deep hashing networks. Extensive experiments on three datasets for image retrieval demonstrate that the proposed method achieves superior retrieval performance over the state-of-the-art deep hashing methods.
Feature discretization can improve the processing efficiency of remote sensing big data. However, the distribution of target attribute values is often difficult to ascertain, and there are complex correlations between...
详细信息
暂无评论