The rise of social networking services has increased text-based communication, often leading to misunderstandings. This study aims to develop a system using large language models (LLMs) like ChatGPT to provide real-ti...
详细信息
Many critical tasks such as document approval and banking services, which are now hosted on cloud infrastructure. This transformation introduces stress on cloud security from the physical layer of the data center to t...
详细信息
ISBN:
(数字)9781665457279
ISBN:
(纸本)9781665457279
Many critical tasks such as document approval and banking services, which are now hosted on cloud infrastructure. This transformation introduces stress on cloud security from the physical layer of the data center to the application layer of web application. All data access and service access need to be monitored and responded to in real-time. In this paper, we study methods to detect anomaly incidents such as spikes from network volume, malicious incidents from API scanning, error messages from internal systems and timeout from Slowloris attack[1]. We select machine learning based anomaly detection algorithms, such as LOF, Isolation Forest and Elliptic Envelope, to find suitable methods to detect incidents in real-time using stream processing tools including Kafka and message ingression. The result shows that LOF is fast and robust in most of the cases. However, when log messages have unseen words, which normally need to be hashed to preprocess, the Isolation Forest shows better results. This study shows the possibility of applying stream processing with machine learning to detect anomaly behavior for cloud services.
Checkpoint averaging is a simple and effectivemethod to boost the performance of convergedneural machine translation models. The calculation is cheap to perform and the fact thatthe translation improvement almost come...
详细信息
With the widespread use of large language models (LLMs) in naturallanguageprocessing, traditional evaluation methods based on static datasets have become inadequate to fully capture their performance and generalizat...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
With the widespread use of large language models (LLMs) in naturallanguageprocessing, traditional evaluation methods based on static datasets have become inadequate to fully capture their performance and generalization capabilities. To address this challenge, we propose an Iterative Dynamic Evaluation (IDE) framework, which utilizes a multi-agent system to systematically evaluate LLMs. The innovation of IDE lies in its iterative enhancement and comparative selection mechanisms. Through successive rounds of data augmentation, the framework emulates evolutionary processes by progressively increasing sample complexity, while competitive selection ensures only the optimal samples are retained. This iterative optimization and filtering produce increasingly challenging and diverse datasets. Experimental results demonstrate that IDE more effectively exposes the limitations of models in complex tasks compared to traditional methods, providing a more comprehensive foundation for the evaluation, optimization, and application of LLMs.
Recent advances in naturallanguageprocessing (NLP) and Diffusion Models (DMs) are leading to a significant change in the way architecture is conceived. With capabilities that surpass those of current generative mode...
详细信息
Current approaches for open-vocabulary scene graph generation (OVSGG) use vision-language models such as CLIP and follow a standard zero-shot pipeline - computing similarity between the query image and the text embedd...
A study of a person's attitude in terms of using several unstructured texts is denoted as Sentimental analysis or opinion mining. Opinion mining or sentimental analysis distinguishes as the degree of polarity disc...
A study of a person's attitude in terms of using several unstructured texts is denoted as Sentimental analysis or opinion mining. Opinion mining or sentimental analysis distinguishes as the degree of polarity discover. The estimation of tweet review topics and a product is a high-grade sentimental analysis. naturallanguage understanding was essential for such data; many challenges were present in the naturallanguageprocessing field for sentimental analysis. Nowadays, many pieces of research consider deep learning-based techniques for sentimental analysis in the naturallanguageprocessing field. In this study, 25 papers were reviewed through deep learning-based approaches. Measures, as well as achievements attained by various methods, were simplified. The survey described the improvements and a limitation of each method as well as it regards the challenges and future potential research which is to acquire high accuracy and precision in sentimental analysis. Taxonomy represents the study gap and it elaborates on the various approaches.
The proceedings contain 57 papers. The special focus in this conference is on Software Technologies. The topics include: Product-Line Engineering for Smart Manufacturing: A Systematic Mapping Study on Security Concept...
ISBN:
(纸本)9789897587061
The proceedings contain 57 papers. The special focus in this conference is on Software Technologies. The topics include: Product-Line Engineering for Smart Manufacturing: A Systematic Mapping Study on Security Concepts;diagnosis Automation Using Similarity Analysis: Application to Industrial Systems;improving Robustness of Satellite Image processing Using Principal Component Analysis for Explainability;multimodal Approach Based on Autistic Child Behavior Analysis for Meltdown Crisis Detection;Towards Accurate Cervical Cancer Detection: Leveraging Two-Stage CNNs for Pap Smear Analysis;feature Extraction, Learning and Selection in Support of Patch Correctness Assessment;Accurate Recommendation of EV Charging Stations Driven by Availability Status Prediction;RLHR: A Framework for Driving Dynamically Adaptable Questionnaires and Profiling People Using Reinforcement Learning;optimizing Intensive Database Tasks Through Caching Proxy Mechanisms;An Evaluation of Risk Management Standards and Frameworks for Assuring Data Security of Medical Device Software AI Models;a Systematic Mapping Study on Impact Analysis;HybridCRS-TMS: Integrating Collaborative Recommender System and TOPSIS for Optimal Transport Mode Selection;automated Generation of Web Application Front-end Components from User Interface Mockups;a Webcam Artificial Intelligence-Based Gaze-Tracking Algorithm;an empirical Examination of the Technical Aspects of Data Sovereignty;asmDocGen: Generating Functional naturallanguage Descriptions for Assembly Code;Integrating a LLaMa-based Chatbot with Augmented Retrieval Generation as a Complementary Educational Tool for High School and College Students;artificial Intelligence-Based Detection and Prediction of Giant African Snail (Lissachatina Fulica) Infestation in the Galapagos Islands;six-Layer Industrial Architecture Applied to Predictive Maintenance;Smart Blockchain-Based Information Flow Control for SOA;logging Hypercalls to Learn About the Behavior of Hyper-V.
This paper presents a language-powered paradigm for ordinal regression. Existing methods usually treat each rank as a category and employ a set of weights to learn these concepts. These methods are easy to overfit and...
详细信息
ISBN:
(纸本)9781713871088
This paper presents a language-powered paradigm for ordinal regression. Existing methods usually treat each rank as a category and employ a set of weights to learn these concepts. These methods are easy to overfit and usually attain unsatisfactory performance as the learned concepts are mainly derived from the training set. Recent large pre-trained vision-language models like CLIP have shown impressive performance on various visual tasks. In this paper, we propose to learn the rank concepts from the rich semantic CLIP latent space. Specifically, we reformulate this task as an image-language matching problem with a contrastive objective, which regards labels as text and obtains a language prototype from a text encoder for each rank. While prompt engineering for CLIP is extremely time-consuming, we propose OrdinalCLIP, a differentiable prompting method for adapting CLIP for ordinal regression. OrdinalCLIP consists of learnable context tokens and learnable rank embeddings. The learnable rank embeddings are constructed by explicitly modeling numerical continuity, resulting in well-ordered, compact language prototypes in the CLIP space. Once learned, we can only save the language prototypes and discard the huge language model, resulting in zero additional computational overhead compared with the linear head counterpart. Experimental results show that our paradigm achieves competitive performance in general ordinal regression tasks, and gains improvements in few-shot and distribution shift settings for age estimation. The code is available at https://***/xk-huang/OrdinalCLIP.
Text generation is of great importance to many naturallanguageprocessing applications. However, maximization-based decoding methods (e.g., beam search) of neural language models often lead to degenerate solutions-th...
详细信息
ISBN:
(纸本)9781713871088
Text generation is of great importance to many naturallanguageprocessing applications. However, maximization-based decoding methods (e.g., beam search) of neural language models often lead to degenerate solutions-the generated text is unnatural and contains undesirable repetitions. Existing approaches introduce stochasticity via sampling or modify training objectives to decrease the probabilities of certain tokens (e.g., unlikelihood training). However, they often lead to solutions that lack coherence. In this work, we show that an underlying reason for model degeneration is the anisotropic distribution of token representations. We present a contrastive solution: (i) SimCTG, a contrastive training objective to calibrate the model's representation space, and (ii) a decoding method-contrastive search-to encourage diversity while maintaining coherence in the generated text. Extensive experiments and analyses on three benchmarks from two languages demonstrate that our proposed approach significantly outperforms current state-of-the-art text generation methods as evaluated by both human and automatic metrics.
暂无评论