检索结果-内蒙古大学图书馆

International Conference on Computing and Networking technology (ICCNT)

作者： Shanmukha Aditya G Kruthika B Venkata Sneha G Deepa Gupta T. V. Smitha Asha S Manek Dept. of Computer Science and Engineering Amrita School of Computing Amrita Vishwa Vidyapeetham Bengaluru India Dept. of Mathematics Amrita School of Engineering Amrita Vishwa Vidyapeetham Bengaluru India Dept. of Computer Science and Engineering T. John Institute of Technology Bengaluru India

The COVID-19 pandemic has been associated with several manifestations that affect an individual’s visual health. To address these concerns, we propose a classification model to predict the impact of COVID-19 on vision. The model is built on a dataset corresponding to visual symptoms and other factors. The proposed classification model will classify people into three categories: those with no impact on vision, those with mild-to-moderate impact, and those with severe impact. Some machine learning algorithms, such as logistic regression, decision trees, random forests, etc., were applied by us to build the model to identify the optimal algorithm for the task. The results obtained from the model show that it has high accuracy, precision, and recall. The model can predict the severity of visual symptoms in people due to COVID-19 with an accuracy of over 84%. The study findings are entirely focused on the E-learning or online learning caused due to COVID-19 pandemic and their impact on vision of people from different age groups.

关键词：

来源：评论

学校读者我要写书评

暂无评论

ADAMAX-Based Optimization of Efficient Net V2 for NSFW Content Detection

ADAMAX-Based Optimization of Efficient Net V2 for NSFW Conte...

引用

Contemporary Computing and Communications (InC4), IEEE International Conference on

作者： Chayanika Arora Gaurav Raj Akshat Ajit Aditya Saxena Dept. of Computer Science and Engineering Sharda School of Engineering and Technology Sharda University Greater Noida Greater Noida India

In recent years, the need for automatic detection of Not Safe for Work (NSFW) content on social media platforms has increased dramatically. In this study, we contrasted the presentation of five optimizers, namely ADAM, ADAMAX, ADAMW, SGD, and ADAGRAD, on the EfficientNet-V2M and V2L models for NSFW content detection. We used a dataset consisting of NSFW images for training and testing the models. The results show that the ADAM optimizer performed better than the other optimizers with an accuracy of 98.80% for training and that for testing is 95.60% for the EfficientNet-V2L model. However, all the optimizers performed reasonably well with same parameters. This study provides valuable insights into the selection of optimizers for NSFW content detection using deep learning models. The dataset used in the research consists of a large number of images with explicit and non-explicit content. The results were evaluated based on accuracy, precision, recall, F1-score, loss, and area under the curve (AUC). The findings indicate that ADAM, ADAMAX, and ADAMW outperform the other optimizers in terms of all evaluation metrics. With the easy availability of explicit content online, it has become a concern for society, especially for children who are easily influenced by such content on various platforms. Exposure to unfiltered and inappropriate content can have a negative impact on young minds. To safeguard against such content, the authors of this paper review and analyze different approaches for detecting and filtering pornographic and NSFW content. The objective of this paper is to provide a filtered and safe content environment for the community, especially for children and teenagers who are the most vulnerable.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Unlocking the Mysteries of Speech: A Hybrid RNN-CNN Model for Vocal Language Identification

Unlocking the Mysteries of Speech: A Hybrid RNN-CNN Model fo...

引用

IEEE Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)

作者： Ganeshayya Shidaganti Pathanjali Harebailu Kishore Kumar K Mohammad Akmal Nagesh D Dept. of Computer Science and Engineering M.S. Ramaiah Institute of Technology Bangalore Karnataka India

Speech is the primary form of communication, and being able to identify the language used in an audio sample is crucial in numerous fields. Language identification serves a purpose in speech recognition, language translation, voice assistants, and many other domains. Traditional language identification systems rely on statistical or acoustic models, which frequently require extensive domain-specific knowledge and have limitations in accuracy and resilience. By employing a hybrid Recurrent neural network and Convolutional Neural Network, this paper aims to develop a novel method of language identification. The suggested approach entails spectrogram preprocessing of audio samples, Mel-Frequency cepstrum coefficient extraction, and Convolutional Neural Network and Recurrent neural network architecture model training on a sizable dataset. Recurrent neural networks are good at capturing temporal dependencies in data, while Convolutional Neural Networks are better at capturing spatial patterns. This methodology successfully identifies the languages with an overall accuracy of 93%, proving the efficacy of the proposed model.

关键词：

来源：评论

学校读者我要写书评

暂无评论

MultiLingualSync: A Novel Method for Generating Lip-Synced Videos in Multiple Languages

MultiLingualSync: A Novel Method for Generating Lip-Synced V...

引用

Innovation in technology (ASIANCON), Asian Conference on

作者： Gundu Nanditha Kaushik Varma Datla Gopanpally Kevin Ramavath Nikitha Lanke Pallavi Chunduri Madhu Babu Dept. of Computer Science Engineering B V Raju Institute of Technology Narsapur Medak Telangana India

We present a Multi Lingual Sync model for generating lip-synced videos in multiple languages. The model consists of Lingua Speak for translation and Wav2Lip for lip synchronization. The workflow involves extracting audio from a video, translating it using Lingua Speak, converting the translated text to speech, and then using Wav2Lip to generate synchronized lip movements. Wav2Lip utilizes a Generator with an identity encoder, speech encoder, and face decoder, along with a pre-trained lip-sync discriminator called SyncNet. The proposed model enables the creation of accurate and efficient multilingual videos with synchronized speech and video.

关键词：

来源：评论

学校读者我要写书评

暂无评论

On enabling remote hands-on computer Networking Education: the NITOS testbed approach

On enabling remote hands-on Computer Networking Education: t...

引用

IEEE Integrated STEM Education Conference (ISEC)

作者： Nikos Makris Virgilios Passas Apostolos Apostolaras Theodoros Tsourdinis Ilias Chatzistefanidis Thanasis Korakis Dept. of Electrical and Computer Engineering University of Thessaly Greece Centre for Research and Technology Hellas CERTH Greece

Education in recent years has slowly transitioned to an online model, allowing massive access to online courses virtually from anywhere. The adoption of such educational models was boosted by the global pandemic in 2020, with universities and other degree programs quickly transitioning to such schemes. Although such a model is apt for lecture-based courses, hands-on training remains a puzzle on how it can transition to remote learning. In this work, we describe and evaluate our scheme for integrating testbed resources in online-taught networking-related courses in University of Thessaly, Greece. The framework is based on Kubernetes and is able to deliver hands-on labs related to networking as micro-services over the testbed architecture with minimal overhead on the lab setup from the instructor. The proposed approach has been applied in the networking-related courses of the curriculum during the 2020-2021 and 2021-2022 academic years, educating more than 800 students on computer networking concepts in practice. The paper describes the framework and a benchmarking evaluation, which proves the capacity of the framework to serve up to 5 times higher numbers of students, compared to prior methodologies and practices, without any infrastructure upgrades.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Direct data-driven state-feedback control of general nonlinear systems

arXiv

引用

arXiv 2023年

作者： Verhoek, Chris Koelewijn, Patrick J.W. Haesaert, Sofie Tóth, Roland The Dept. of Electrical Engineering The Eindhoven University of Technology Netherlands The Institute for Computer Science and Control Budapest Hungary

Through the use of the Fundamental Lemma for linear systems, a direct data-driven state-feedback control synthesis method is presented for a rather general class of nonlinear (NL) systems. The core idea is to develop a data-driven representation of the so-called velocity-form, i.e., the time-difference dynamics, of the NL system, which is shown to admit a direct linear parameter-varying (LPV) representation. By applying the LPV extension of the Fundamental Lemma in this velocity domain, a state-feedback controller is directly synthesized to provide asymptotic stability and dissipativity of the velocity-form. By using realization theory, the synthesized controller is realized as a NL state-feedback law for the original unknown NL system with guarantees of universal shifted stability and dissipativity, i.e., stability and dissipativity w.r.t. any (forced) equilibrium point, of the closed-loop behavior. This is achieved by the use of a single sequence of data from the system and a predefined basis function set to span the scheduling map. The applicability of the results is demonstrated on a simulation example of an unbalanced disc. © 2023, CC BY-NC-SA.

关键词： Linear systems

来源：评论

学校读者我要写书评

暂无评论

UrduSpeakXLSR: Multilingual Model for Urdu Speech Recognition

UrduSpeakXLSR: Multilingual Model for Urdu Speech Recognitio...

引用

International Conference on Emerging Technologies, ICET

作者： Hira Mohiuddin Zahoor Ahmed Maha Kasi Bakhtiar Kasi Dept. of Computer Science Balochistan University of Information Technology Engineering and Management Sciences (BUITEMS) Quetta Pakistan

Speech recognition, a disruptive technology, has revolutionized human-machine interaction. While numerous Automatic Speech Recognition (ASR) models are publicly available via HuggingFace, the majority cater to English language. For Urdu, however, models are scarce or closed-source, with open-sourced ones often lack the robustness. Our research addresses this scarcity, focusing on the challenges posed by dialects, slangs, and accents. In this work, we introduce an innovative ASR model leveraging the pretrained XLS-R model based on Wav2Vec2.0 architecture, trained on CommonVoice corpus V11. Our approach outperforms other deep learning-based techniques both qualitatively and quantitatively, offering promising results for an under-resourced language.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Optimized IoT Security with Attention-Based Hybrid Deep Learning Approach

Optimized IoT Security with Attention-Based Hybrid Deep Lear...

引用

International Conference on computer and Information technology (ICCIT)

作者： Rahman Shiddiqur Jiancheng Wu Mehedi Hasan Shakil Liaoyuan Zeng School of Information and Communication Engineering University of Electronic Science and Technology of China Chengdu China Dept. of Computer Science and Engineering Chittagong University of Engineering & Technology Chattagram Bangladesh Yibin Institute of UESTC

ISBN: (数字)9798331519094

ISBN: (纸本)9798331519100

Maintaining strong network security in the quickly developing Internet of Things (IoT) space is crucial because IoT traffic is so varied and complicated. By focusing on the TON IoT dataset, this study introduces a novel method for intrusion detection in IoT networks. Bidirectional Gated Recurrent Units (Bi-GRU) were utilized in a deep learning architecture to capture temporal dependencies in the data, while Convolutional Neural Networks (CNN) were utilized for effective feature extraction. By including a Multi-Head Attention layer, the model was able to select and pay attention to important characteristics, which enhanced its ability to focus on important patterns. A hybrid feature selection approach incorporating MI, Lasso, and RFECV was employed to generate a refined feature set, maintaining critical information for improved model performance. With a 99.73% binary classification accuracy and a 99.68% multi-class classification accuracy, our work outperformed the state-of-the-art techniques. These findings highlight how well CNN, Bi-GRU, and Attention mechanisms work together to detect intricate intrusion patterns in IoT environments, which advances the security measures required to shield IoT networks from dynamic cyber attacks.

关键词： Deep learning Attention mechanisms Accuracy Intrusion detection Network security Feature extraction Real-time systems Convolutional neural networks Internet of Things Low latency communication

来源：评论

学校读者我要写书评

暂无评论

Performance Evaluation of Machine Learning Algorithms for Lung Nodule Detection using Multi-Modal Imaging - A Hybrid Approach

Performance Evaluation of Machine Learning Algorithms for Lu...

引用

International Conference on Edge Computing and Applications (ICECAA)

作者： Puneeth R P Amogha Mayya K S Agnesh Shetty Akash Rao Aditya A Sooda Dept. of computer science and engineering NITTE (Deemed to be University) NMAM Institute of Technology Nitte Karnataka India

Lung cancer is a major contributor to global mortality rates and identification is critical to improve patient outcomes. In recent years, machine learning algorithms have demonstrated promising results in identifying lung nodules from medical images. The most compelling area of research for scientists is the early detection of lung cancer. This study is a method for lung nodule detection using CT images. The study incorporates a hybrid model that combines multiple machine learning algorithms including CNN, SVM, DTC, ANN, and KNN to improve the accuracy of nodule detection. The hybrid model demonstrated high accuracy in identifying various types of lung nodules, including Adenocell carcinoma, squamous cell carcinoma, and large cell carcinoma. Specifically, the model achieved an accuracy rate of over 90% in detecting and differentiating normal lung tissue and Adenocell carcinomas. Accuracy graphs and priority setting were utilized to assess the model's capability in accurately predicting the presence of lung cancer. Additionally, the efficiency of the hybrid model was compared with other machine learning algorithms, including SVM, Random Forest, and Decision Trees. A large dataset of CT scans was collected for training and evaluation purposes. The results demonstrated the advantages of the suggested hybrid model in terms of accuracy and efficiency. This study highlights the importance of early lung nodule identification using CT scans and demonstrates the effectiveness of the hybrid model in accurately identifying different types of lung nodules.

关键词：

来源：评论

学校读者我要写书评

暂无评论

QVD-Querying Video Databases for Event Related Frames Using Text Keywords

QVD-Querying Video Databases for Event Related Frames Using ...

引用

2023 International Conference on Self Sustainable Artificial Intelligence Systems, ICSSAS 2023

作者： Kumar, Ravi Ranjan Kumar, Kamal Jain, Anuj Kumar Sharma, Vikrant Sharma, Neha Jain, Nitin Chitkara University Institute of Engineering and Technology Chitkara University Vlsi Centre of Excellence Punjab India National Institute of Technology Uttarakhand Dept. of Computer Science and Engineering Uttarakhand Srinagar India Chitkara University Institute of Engineering and Technology Chitkara University Punjab India Graphic Era Hill University Uttarakhand Dehradun India Career Skills Iilm University Haryana Gurugram India

ISBN: (纸本)9798350300857

With increasing camera enabled devices huge video data is generated every second. With unlimited storage offered by cloud, most of this video data is moved to cloud storage directly or indirectly. But manually querying such a huge data is challenging for most of the people. We have proposed an indexing method where from tokens of query we can find related frames and videos in a short amount of time. Queries could be very specific e.g. 'car accident in area ∗ at time*' Thus video summarization is used to produce short summary of long videos. Manually checking these summaries is again challenging task. This work aims to generate algorithm to query any video database in fraction of time, with few keywords related to video. Aim of generating a comprehensive algorithm is achieved by creating summary of each video. And the aim is also to achieve to Generate caption for each frame and then generating cumulative summary from summarized video frames using LSTM. This is achieved by searching user keywords in these summaries and ranking related summaries using CNN. © 2023 IEEE.

关键词： Video recording

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：