Text-to-Music Retrieval, finding music based on a given naturallanguage query, plays a pivotal role in content discovery within extensive music databases. To address this challenge, prior research has predominantly f...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Text-to-Music Retrieval, finding music based on a given naturallanguage query, plays a pivotal role in content discovery within extensive music databases. To address this challenge, prior research has predominantly focused on a joint embedding of music audio and text, utilizing it to retrieve music tracks that exactly match descriptive queries related to musical attributes (i.e. genre, instrument) and contextual elements (i.e. mood, theme). However, users also articulate a need to explore music that shares similarities with their favorite tracks or artists, such as I need a similar track to Superstition by Stevie Wonder. To address these concerns, this paper proposes an improved Text-to-Music Retrieval model, denoted as TTMR++, which utilizes rich text descriptions generated with a finetuned large language model and metadata. To accomplish this, we obtained various types of seed text from several existing music tag and caption datasets and a knowledge graph dataset of artists and tracks. The experimental results show the effectiveness of TTMR++ in comparison to state-of-the-art music-text joint embedding models through a comprehensive evaluation involving various musical text queries. (1)
In recent years, language models have undergone significant advancements with models like GPT-3, showcasing impressive abilities in naturallanguageprocessing and generation. However, these models often experience fr...
详细信息
Recent advances in naturallanguageprocessing (NLP) have produced state of the art results on several sequence to sequence (seq2seq) tasks. Enhancements in embedders and their training methodologies have shown signif...
详细信息
ISBN:
(纸本)9798350324471
Recent advances in naturallanguageprocessing (NLP) have produced state of the art results on several sequence to sequence (seq2seq) tasks. Enhancements in embedders and their training methodologies have shown significant improvement on downstream tasks. Word vector models like Word2Vec, FastText & Glove were widely used over one-hot encoded vectors for years until the advent of deep contextualized embedders. Protein sequences consist of 20 naturally occurring amino acids that can be treated as the language of nature. These amino acids in combinations with each other makeup the biological functions. The choice of vector representation and architecture design for a biological task is highly dependent upon the nature of the task. We utilize unlabelled protein sequences to train a Convolution and Gated Recurrent Network (CGRN) embedder using Masked language Modeling (MLM) technique that shows significant performance boost under resource constraint setting on two downstream tasks i.e., F1-score(Q8) of 73.1% on Secondary Structure Prediction (SSP) & F1-score of 84% on Intrinsically Disordered Region Prediction (IDRP). We also compare different architectures on downstream tasks to show the impact of the nature of biological task on the performance of the model.
The advancements in artificial intelligence (AI) have propelled the domain of text similarity, where sophisticated algorithms harness the power of naturallanguageprocessing to analyze and compare textual content. In...
详细信息
Prompt recovery, a crucial task in naturallanguageprocessing, entails the reconstruction of prompts or instructions that language models use to convert input text into a specific output. Although pivotal, the design...
详细信息
With the increasing population in India and the limited availability of medical professionals, time efficiency in diagnosis and prescription generation has become crucial. This paper proposes an automated system lever...
详细信息
In the world of naturallanguageprocessing (NLP), developing structures that not simplest carry out nicely but additionally offer apparent reasoning for their selections is paramount. Traditional gadget mastering fas...
详细信息
In the real world, knowledge comes from books and papers. Now that information only reaches to those with clear vision. In the community there are a part of people suffering either from poor eyesight or blindness. Bra...
详细信息
This research delves into the difficulty of summarizing legal documents using naturallanguageprocessing. It examines how cutting-edge models like XLNet and BART can be used for abstractive summarization specifically...
详细信息
The proceedings contain 39 papers. The topics discussed include: adaptive optimal output regulation: a parametric Lyapunov equation approach;evaluation on parameter-efficient continual instruction tuning of large lang...
ISBN:
(纸本)9781510685864
The proceedings contain 39 papers. The topics discussed include: adaptive optimal output regulation: a parametric Lyapunov equation approach;evaluation on parameter-efficient continual instruction tuning of large language models;research on enhancing forecast accuracy of gold futures using LSTM neural networks across various time periods;FedGKD: personalized federated learning through grouping and distillation;the development of a question-answering system for learning article using naturallanguageprocessing;KG-CQAM: knowledge graph and mind mapping-based complex question answering for large language models;and PCEM-SQL: enhancing text-to-SQL capabilities by large language models and prompt engineering with self-consistency and self-evaluation mechanism.
暂无评论