检索结果-内蒙古大学图书馆

10th International Conference on Virtual Reality, ICVR 2024

作者： Damastuti, Fardani Annisa Firmansyah, Kenan Arif, Yunifa Miftachul Barakbah, Ali Ridho Hariadi, Mochamad Institut Teknologi Sepuluh Nopember Department of Electrical Engineering Surabaya Indonesia Electronic Engineering Polytechnic Institute of Surabaya Department of Creative Multimedia Technology Surabaya Indonesia Universitas Islam Negeri Maulana Malik Ibrahim Department of Informatics Engineering Malang Indonesia Electronic Engineering Polytechnic Institute of Surabaya Department of Informatics and Computer Engineering Surabaya Indonesia Institut Teknologi Sepuluh Nopember Department of Computer Engineering Surabaya Indonesia

ISBN: (纸本)9798350364231

The immersive nature of virtual reality (VR) gaming is significantly improved by the implementation of dynamic environmental systems. This paper focuses on the development and impact of a dynamic day and night cycle in a serious VR game. This feature is intended to simulate a full 24-hour cycle within a 10-minute gameplay session, resulting in a seamless transition between different times of day that impacts both the visual aesthetics and gameplay mechanics. The day and night cycle introduces strategic depth by requiring players to adjust their management strategies according to the time of day, which affects resource availability, operational efficiency, and conditions. This system not only enhances visual fidelity but also directly impacts gameplay, necessitating that player modify their strategies in response to the changing environment. Visual aesthetics are expected to increase by 15%, strategic profundity will be improved by 20%, and player immersion will be enhanced by 25%, according to the research findings. This investigation underscores the importance of natural cycles in enhancing the strategic complexity, interactivity, and realism of virtual reality environments, thereby offering valuable insights for the creation of future virtual reality games. © 2024 IEEE.

关键词： Virtual environments

来源：评论

学校读者我要写书评

暂无评论

A transformer-based approach to Nigerian Pidgin text generation

引用

International Journal of Speech Technology 2024年第4期27卷 1027-1037页

作者： Garba, Kabir Kolajo, Taiwo Agbogun, Joshua B. Department of Computer Science Federal University Lokoja P.M.B 1154 Kogi State Lokoja Nigeria Department of Informatics Faculty of Engineering Built Environment & IT University of Pretoria Pretoria South Africa

This paper describes the development of a transformer-based text generation model for Nigerian Pidgin also known as Naijá, a popular language in West Africa. Despite its wide use, Nigerian Pidgin remains under-resourced, particularly in areas related to text generation and natural language processing. These difficulties are primarily due to technological constraints rather than the language’s fundamental attributes. There is currently a demand for Nigerian Pidgin-specific solutions because it is used in everyday communication and has a unique linguistic blend. This paper aims to close this gap by exploring the application of state-of-the-art transformer technology to develop a text generation model for Nigerian Pidgin. This work uses the public Afriberta-corpus dataset to optimize the Generative Pre-trained Transformer (GPT-2) model across a sizeable dataset. The performance evaluators, BLEU and Perplexity metrics provide a detailed breakdown of the model’s text quality and predictive accuracy. Despite the difficulties caused by a limited amount of training data, preliminary evaluations show that the model can generate coherent Nigerian Pidgin text. The performance evaluation yielded perplexity scores of 43.56 for variable target reference length and 43.26 for fixed text length. BLEU scores of 0.15 for fixed max length and 0.56 for variable reference target length. This highlights the quality of generated text and the significant improvement when the generated text length is aligned with the reference target. Our work was benchmarked against African American Vernacular (AAVE) revealing that BLEU scores for AAVE are significantly lower than those for Standard American English, with BLEU given as 0.26. Our Nigerian Pidgin model, with a BLEU score of 0.56, shows a better performance. However, both results suggest that both dialects are challenging for language models. Leveraging the pre-trained transformer-based language model and evaluation metrics, we showcase the mo

关键词： Controllable text Generation Natural Language Generation Natural Language Processing Nigerian pidgin Pre-trained Language models Transformers

来源：评论

学校读者我要写书评

暂无评论

Optical Intra- and Inter-Rack Switching Architecture for Scalable, Low-Latency Data Center Networks 28

Optical Intra- and Inter-Rack Switching Architecture for Sca...

引用

28th IEEE Symposium on computers and Communications, ISCC 2023

作者： Drainakis, Georgios Baziana, Peristera Bogris, Adonis School of Electrical and Computer Engineering National Technical University of Athens Athens Greece University of Thessaly Department of Informatics and Telecommunications Lamia Greece University of West Attica Department of Informatics and Computer Engineering Egaleo Greece

ISBN: (纸本)9798350300482

In this paper we propose a DC network (DCN) architecture that interconnects servers in the intra-rack and inter-rack domain, utilizing optical switching at each domain. The proposed interconnection techniques are studied as an intermediate step before migrating the entire DCN to all-optical schemes. Unlike other studies, we study the server-to-server communication across the whole DCN. For the performance evaluation we produce numerical results for throughput and end-to-end delay for three traffic classes co-existing in DCN s. The numerical analysis reveals that bandwidth utilization reaches 90% and 100% in the intra- and inter- domain respectively. Meanwhile, the maximum end-to-end delay for the highest priority packets under congested load is lower than 0.56 and 0.41 μs for the two examined intra-rack capacity scenarios of 400 and 600 Gbps respectively. A comparative study shows that our solution can effectively interconnect up to 10000 servers with lower environmental footprint and end-to-end delay than other DCN s. © 2023 IEEE.

关键词： Network architecture

来源：评论

学校读者我要写书评

暂无评论

Face image authentication scheme based on MTCNN and SLT

引用

Multimedia Tools and Applications 2025年 1-43页

作者： Thabit, Rasha Al-Askari, Mohanad A. Mohammed, Dunya Zeki Anaam, Elham Abdulwahab Mahmood, Zainab H. Jabbar, Dina Jamal Salih, Zahraa Aqeel Information System Department College of Computer Sciences and Information Technology University of Al-Anbar Al-Anbar Iraq Computer Engineering Department College of Engineering Al-Iraqia University Baghdad Iraq Electronic and Communications Engineering Department College of Engineering Gilgamesh University Baghdad Iraq Faculty of Computing and Informatics Multimedia University Selangor Cyberjaya Malaysia Department of Cybersecurity Engineering College of Information Engineering Al-Nahrain University Baghdad Iraq Electrical Engineering Technical Collage Middle Technical University Baghdad Iraq Department of Business Administration Alamaal University College Baghdad Iraq

DeepFakes and face image manipulation methods have been widely distributed in the last few years and several techniques have been presented to check the authenticity of the face image and detect manipulation if exists. Most of the available manipulation detection techniques have been successfully applied to reveal one type of manipulation under specific conditions, however, many limitations and challenges can be encountered in this field. To overcome some limitations and challenges, this paper presents a new face image authentication (FIA) scheme based on Multi-Task Cascaded Conventional Neural Networks (MTCNN) and watermarking in Slantlet transform (SLT) domain. The proposed FIA scheme has three main algorithms that are face detection and selection, embedding, and extraction algorithms. Different block sizes have been used to divide the image into non-overlapping blocks followed by classifying them into two groups that are blocks from face area (FA) and blocks from the remaining area (RA) of the image. In the embedding algorithms, the authentication information is generated from FA blocks and embedded in the RA blocks. In the extraction algorithms, the embedded information is extracted from RA blocks and compared with the calculated data from FA blocks to reveal manipulations and localize the manipulated blocks if exist. Extensive experiments have been conducted to evaluate the performance of the proposed FIA scheme for different face images. The experimental work included tests for payload, capacity, visual quality, time complexity, and localization of manipulations. The results proved the efficiency of the proposed scheme in detecting and localizing different face image manipulations such as attributes attacks, retouching attacks, expression swap, face swap, and morphing attacks. The proposed scheme overcomes many limitations and it is 100% accurate in localizing the tampered blocks which makes it a better candidate for practical applications. © The Author(s), un

关键词： Authentication

来源：评论

学校读者我要写书评

暂无评论

Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents 30

Towards Human-Like Machine Comprehension: Few-Shot Relationa...

引用

Joint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024

作者： Wang, Hao Li, Tang Chu, Chenhui Zhu, Nengjun Wang, Rui Zhu, Pinpin School of Computer Engineering and Science Shanghai University China Graduate School of Informatics Kyoto University Japan Department of Computer Science and Engineering Shanghai Jiao Tong University China

ISBN: (纸本)9782493814104

Key-value relations are prevalent in Visually-Rich Documents (VRDs), often depicted in distinct spatial regions accompanied by specific color and font styles. These non-textual cues serve as important indicators that greatly enhance human comprehension and acquisition of such relation triplets. However, current document AI approaches often fail to consider this valuable prior information related to visual and spatial features, resulting in suboptimal performance, particularly when dealing with limited examples. To address this limitation, our research focuses on few-shot relational learning, specifically targeting the extraction of key-value relation triplets in VRDs. Given the absence of a suitable dataset for this task, we introduce two new few-shot benchmarks built upon existing supervised benchmark datasets. Furthermore, we propose a variational approach that incorporates relational 2D-spatial priors and prototypical rectification techniques. This approach aims to generate relation representations that are more aware of the spatial context and unseen relation in a manner similar to human perception. Experimental results demonstrate the effectiveness of our proposed method by showcasing its ability to outperform existing methods. This study also opens up new possibilities for practical applications. © 2024 ELRA Language Resource Association: CC BY-NC 4.0.

关键词： Variational techniques

来源：评论

学校读者我要写书评

暂无评论

A Graph Neural Network Based Learning Model for Urban Metro Flow Prediction 22

A Graph Neural Network Based Learning Model for Urban Metro ...

引用

22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023

作者： Drosouli, Ifigenia Voulodimos, Athanasios Mastorocostas, Paris Miaoulis, Georgios Ghazanfarpour, Djamchid Engineering University of West Attica Department of Informatics & Computer Athens Greece School of Electrical & Computer Engineering National Technical Univ. of Athens Athens Greece University of Limoges Department of Informatics Limoges France

ISBN: (纸本)9798350345346

- Abstract-Transport data with dynamic spatial-temporal dependencies elevates transportation flow forecasting to a significant issue for operational planning, managing passenger flow, and arranging for individual travel in a smart city. The task is challenging due to the composite spatial dependency on transportation networks and the non-linear temporal dynamics with mobility conditions changing over time. To address these challenges, we propose a Spatial- Temporal Graph Convolutional Recurrent Network that learns from both the spatial stations network data and time-series of historical mobility changes so as to predict urban metro flow at a future time. The model is based on Graph Convolutional Networks (GCN) and Long Short-Term Memory (LSTM) in order to further improve the estimation accuracy. Extensive experiments on a real-world dataset of Hangzhou metro system prove the effectiveness of the proposed model. © 2023 IEEE.

关键词： Graph Convolutional Networks spatial-temporal dependencies urban metro flow prediction

来源：评论

学校读者我要写书评

暂无评论

Exploring Art in the Digital Era: Creating and Deploying an Immersive Virtual Gallery Experience 25

Exploring Art in the Digital Era: Creating and Deploying an ...

引用

25th International Electronics Symposium, IES 2023

作者： Rante, Hestiasari Hanifati, Kirana Miranto, Cahya Tan, Toni Politeknik Elektronika Negeri Department of Informatics and Computer Engineering Surabaya Indonesia University of Bremen Department of Mathematics and Computer Science Germany

ISBN: (纸本)9798350314731

Art galleries play a vital role in preserving and promoting artistic heritage, but the current state of art galleries in Indonesia, particularly in attracting the younger generation, raises concerns. The general public, especially the younger generation, has shown the lack of interest in visiting art galleries, particularly in Indonesia. In light of these challenges, it becomes imperative to explore alternative approaches to engage the younger generation in the realm of art. Developing a virtual gallery platform that offers immersive and interactive experiences can help bridge the gap between the digital preferences of the younger generation and the traditional art world. By leveraging technology, this platform can provide a convenient and accessible medium for the younger generation to explore, appreciate, and interact with art, that can trigger their interest in the artistic heritage of Indonesia. This research explores the current state of art galleries in Indonesia and highlights the lack of interest among the younger generation in visiting physical art galleries. To counter this trend, a virtual gallery platform is developed to engage the younger generation and stimulate their interest in art. The platform offers an immersive experience, replicating the gallery environment virtually. Through software testing and initial user testing, the platform's performance and usability are evaluated. The findings present the potential of this platform in revitalizing the younger generation's interest in art. This web-based platform provides accessibility and convenience, allowing users to explore and appreciate art from anywhere. © 2023 IEEE.

关键词： Software testing

来源：评论

学校读者我要写书评

暂无评论

QuarantivityVR: Supporting Self-Embodiment for Non-HMD Users in Asymmetric Social VR Games

引用

i-com 2022年第1期21卷 55-70页

作者： Yassien, Amal Soliman, Mohamed Ahmed Abdennadher, Slim German International University Department of Informatics and Computer Science Cairo Egypt German University in Cairo Department of Computer Science and Engineering Cairo Egypt

The prevalence of immersive head-mounted display (HMD) social virtual reality (VR) applications introduced asymmetric interaction among users within the virtual environment (VE). Therefore, researchers opted for (1) exploring the asymmetric social VR interaction dynamics in only co-located setups, (2) assigning interdependent roles to both HMD and non-HMD users, and (3) representing non-HMD users as abstract avatars in the VE. Therefore, we investigate the feasibility of supporting Self-Embodiment in an asymmetric VR interaction mode in a remote setup. To this end, we designed an asymmetric social VR game, QuarantivityVR, to (1) support sense of self-embodiment for non-HMD users in a remote setting by representing them as realistic full-body avatars within the VE, (2) augment visual-motor synchrony for the non-HMD users to increase their sense of agency and presence by detecting their motion through Kinect sensor and laptop's webcam. During the game, each player performs three activities in succession, namely movie-guessing, spelling-bee, and answering mathematical questions. We believe that our work will act as a step towards the inclusion of a wide spectrum of users that can not afford full immersion and will aid researchers in creating enjoyable interactions for both users in the physical and virtual spaces. © 2022 Walter de Gruyter GmbH, Berlin/Boston.

关键词： Virtual reality

来源：评论

学校读者我要写书评

暂无评论

Unlocking Domain Specificity: Fine-Tuning Llama 2 for Enhanced Performance on Custom Datasets

Unlocking Domain Specificity: Fine-Tuning Llama 2 for Enhanc...

引用

International IoT, Electronics and Mechatronics Conference, IEMTRONICS 2024

作者： Bhattacharya, Swarnadwip Bhattacharjee, Anindita Das Roy, Pranab Singha Samanta, Tapas Department of Computer Science and Engineering Institute of Engineering and Management Kolkata700091 India Department of CSE IEM Centre of Excellence for InnovAI Institute of Engineering and Management Kolkata700091 India Department of Computer and Informatics Variable Energy Cyclotron Centre Kolkata700064 India

ISBN: (纸本)9789819747795

This paper aims to improve Llama 2’s performance by using personalized and modified datasets. Despite the impressive capabilities of large language models (LLMs) such as Llama 2, their effectiveness may be limited in specialized domains. The proposed method entails fine-tuning Llama 2 on custom datasets to optimize performance efficiently. The study focuses on the impact of quantization-aware low-rank adapter layers (QLoRA) on a single GPU’s resource-efficient fine-tuning performance. Furthermore, the study looks into the design of instruction datasets to guide the model toward desired behaviors. When Llama 2 is fine-tuned with QLoRA, performance improves significantly across tasks such as text summarization, question answering, and natural language generation in a variety of domains. The paper concludes by highlighting the broader implications of the findings. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Custom datasets Fine-tuning Instruction datasets Large language models (LLMs) Llama 2 Personalized datasets QLoRA Quantization-aware low-rank adapter layers (QLoRA) Resource-efficient fine-tuning Specialized domains

来源：评论

学校读者我要写书评

暂无评论

Human-Robot Interaction on Elderly Companion Robot Development Using Dual Intent Entity Transformer 7

Human-Robot Interaction on Elderly Companion Robot Developme...

引用

7th International Conference on Information Technology, Information Systems and Electrical engineering, ICITISEE 2023

作者： Rauf, Naufal Haidar Santoso, Heru Agus Universitas Dian Nuswantoro Faculty of Computer Science Department of Informatics Engineering Semarang Indonesia

ISBN: (纸本)9798350382266

The growing elderly population and the increasing demand for nursing home care have led to a need to improve the quality of life for residents. A popular solution is developing a companion robot to assist residents with various tasks and provide human-like interaction. In this paper, we present the prototype of a companion robot currently website-based, equipped with a Dual Intent Entity Transformer to understand what the user wants. The results of the study show the prototype is capable of classifying user messages according to their appropriate intent with precision, recall, F-score, and accuracy on average 91 %, 91 %, 90%, and 87% respectively using privately owned dataset. Future works such as expanding the robot's insight and tuning the model would improve the service provided by the companion robot. © 2023 IEEE.

关键词： Human robot interaction

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：