Search Results - Inner Mongolia University Library

Computational robotics, Testing and Engineering Evaluation (ICCRTEE), International Conference on

Authors: R. Raja Subramanian T.V. Sreevastha P. Thisyanth P. Venkateswara Rao P. Ruthik Department of Computer Science and Engineering Kalasalingam Academy of Research and Education Madurai India

ISBN: (digital) 9798331597092

ISBN: (print) 9798331597108

Addressing the growing need for immediate sports analysis and automated content creation, a new smart system detects basketball highlights in real time. This deep learning framework uses a custom-trained YOLOv8 model for precise detection and tracking of the basketball. Key capabilities include automated camera movement and viewport smoothing, improved by speed-based motion prediction and confidence-aware tracking. Adaptive stabilization minimizes jitter while following the action. A sliding-window buffer records video before and after significant events, such as successful shots, to generate complete highlight clips immediately. Tested on live and recorded footage, the system operates at real-time speed and dynamically crops videos to a mobile-friendly 9:16 aspect ratio. The results demonstrate robust performance in automating highlight generation, significantly reducing the need for manual editing. This enables instant replay creation useful for broadcasting, coaching analysis, and improving fan interaction.

Keywords: YOLO; Deep learning; Computer vision; Analytical models; Computational modeling; Streaming media; Cameras; Real-time systems; Monitoring; Sports
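
The sliding-window buffer described in this abstract can be illustrated with a small sketch. The abstract gives no implementation details, so everything here is an assumption made for illustration: the class name `HighlightBuffer`, the `pre` and `post` frame counts, and the choice to ignore events that arrive while a clip is still being collected. Frames are stand-in integers rather than real video frames.

```python
from collections import deque


class HighlightBuffer:
    """Keep the most recent `pre` frames; after an event, collect `post` more.

    Hypothetical helper, not the paper's implementation.
    """

    def __init__(self, pre, post):
        if pre < 1 or post < 1:
            raise ValueError("pre and post must be at least 1")
        self.pre_frames = deque(maxlen=pre)  # rolling window of recent frames
        self.post = post
        self.clip = None      # clip currently being assembled, or None
        self.remaining = 0    # post-event frames still to collect

    def push(self, frame, event=False):
        """Feed one frame; return a finished clip (a list) or None."""
        if self.clip is not None:
            # Collecting post-event frames; new events are ignored here.
            self.clip.append(frame)
            self.remaining -= 1
            if self.remaining == 0:
                done, self.clip = self.clip, None
                return done
            return None
        self.pre_frames.append(frame)
        if event:
            # Freeze the pre-event window (it includes the event frame).
            self.clip = list(self.pre_frames)
            self.remaining = self.post
            self.pre_frames.clear()
        return None


buf = HighlightBuffer(pre=3, post=2)
clips = []
for i in range(10):
    clip = buf.push(i, event=(i == 5))  # pretend a shot is detected at frame 5
    if clip is not None:
        clips.append(clip)
print(clips)  # [[3, 4, 5, 6, 7]]
```

Using `deque(maxlen=pre)` keeps memory bounded because old frames are dropped automatically, which is what lets a clip include footage from before the event was detected.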

Exploring Deep Learning and Word Embedding Techniques for Sentiment Analysis on Diverse Textual Data

3rd International Conference on Machine vision and Augmented Intelligence, MAI 2023

Authors: Lavanya, B.N. Rathnam, K. V. Anitha Kiran, K. Appaji, Abhishek Shenoy, P. Deepa Venugopal, K.R. Department of Computer Science and Engineering University of Visvesvaraya College of Engineering Bangalore University Bengaluru India Department of Medical Electronics BMS College of Engineering Bangalore India

ISBN: (print) 9789819743582

The rapid expansion of data poses a significant challenge for analyzing sentiments. The importance of user-generated reviews highlights the need to carefully curate and evaluate text data to extract opinions. This research delves into the world of deep learning for sentiment analysis, exploring models such as Bi-LSTM, CNN, and GRU along with various word embeddings like N-grams, Keras embedding, BERT, RoBERTa, CT-BERT, and ELMo. It effectively categorizes sentiments and detects sarcasm in Twitter and IMDB data, making it a vital tool for applying deep learning to real-world sentiment analysis tasks. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Sentiment analysis

Automated Food Intake Monitoring System to Prevent Malnutrition Using the Tiago Robot Camera

2023 IEEE EMBS Special Topic Conference on Data science and Engineering in Healthcare, Medicine and Biology, IEEECONF 2023

Authors: Konstantakopoulos, Fotios S. Plati, Daphni K. Pliakou, Labrina A. Di Luzio, Francesco Scotto Tagliamonte, Nevio Luigi Zollo, Loredana Tsiknakis, Manolis Fotiadis, Dimitrios I. University of Ioannina Unit of Medical Technology and Intelligent Information Systems Materials Science and Engineering Department Greece Forth Biomedical Research Institute Ioannina GR 45110 Greece Campus Bio-Medico University Research Unit of Advanced Robotics and Human-Centred Technologies Rome Italy Forth Institute of Computer Science Crete Heraklion GR 70013 Greece

ISBN: (print) 9798350383386

The management of a daily diet is a significant concern among individuals in modern culture. The utilization of dietary assessment systems has significantly contributed to the efficient management of malnutrition and dietary habits over a period of time. In order to determine the nutritional value of the food that individuals consume, this study introduces a novel food monitoring system and its architecture. By creating a dataset of food images in a tray and using image processing, machine learning, and computer vision techniques, the nutrients and calories consumed by a patient are calculated. The proposed system achieves a 95.6% Intersection over Union score in the segmentation task, 92.3% top-1 accuracy in the classification task, and 4.6 g mean weight absolute error in the weight estimation *** Relevance: The proposed system has the ability to calculate the nutritional composition, encompassing calories, proteins, fats, and carbs, of food consumed by a patient, making it suitable for nutritional monitoring applications and healthcare systems that aim to monitor malnutrition in hospital patients. © 2023 IEEE.

Keywords: Nutrition
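
The two kinds of metric quoted in this abstract, Intersection over Union for segmentation and mean absolute error for weight estimation, can be sketched in a few lines. This is a minimal illustration using their standard definitions; the tiny masks and the weights below are made-up values, not data from the paper, and the function names are hypothetical.

```python
def iou(pred, truth):
    """Intersection over Union of two same-sized binary masks (lists of 0/1 rows)."""
    inter = union = 0
    for row_p, row_t in zip(pred, truth):
        for p, t in zip(row_p, row_t):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Two empty masks agree perfectly by convention.
    return inter / union if union else 1.0


def mean_absolute_error(predicted, actual):
    """Average absolute difference, e.g. between estimated and true weights in grams."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


pred_mask = [[1, 1, 0],
             [0, 1, 0]]
true_mask = [[1, 0, 0],
             [0, 1, 1]]
print(iou(pred_mask, true_mask))                  # 0.5 (2 shared pixels / 4 in the union)
print(mean_absolute_error([100, 52], [104, 50]))  # 3.0
```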

3D Facial Expressions through Analysis-by-Neural-Synthesis

Conference on computer vision and Pattern Recognition (CVPR)

Authors: George Retsinas Panagiotis P. Filntisis Radek Daněček Victoria F. Abrevaya Anastasios Roussos Timo Bolkart Petros Maragos Institute of Robotics Athena Research Center Maroussi Greece MPI for Intelligent Systems Tübingen Germany Institute of Computer Science (ICS) Foundation for Research & Technology - Hellas (FORTH) Greece School of Electrical & Computer Engineering National Technical University of Athens Greece

ISBN: (digital) 9798350353006

ISBN: (print) 9798350353013

While existing methods for 3D face reconstruction from in-the-wild images excel at recovering the overall face shape, they commonly miss subtle, extreme, asymmetric, or rarely observed expressions. We improve upon these methods with SMIRK (Spatial Modeling for Image-based Reconstruction of Kinesics), which faithfully reconstructs expressive 3D faces from images. We identify two key limitations in existing methods: shortcomings in their self-supervised training formulation, and a lack of expression diversity in the training images. For training, most methods employ differentiable rendering to compare a predicted face mesh with the input image, along with a plethora of additional loss functions. This differentiable rendering loss not only has to provide supervision to optimize for 3D face geometry, camera, albedo, and lighting, which is an ill-posed optimization problem, but the domain gap between rendering and input image further hinders the learning process. Instead, SMIRK replaces the differentiable rendering with a neural rendering module that, given the rendered predicted mesh geometry, and sparsely sampled pixels of the input image, generates a face image. As the neural rendering gets color information from sampled image pixels, supervising with neural rendering-based reconstruction loss can focus solely on the geometry. Further it enables us to generate images of the input identity with varying expressions while training. These are then utilized as input to the reconstruction model and used as supervision with ground truth geometry. This effectively augments the training data and enhances the generalization for diverse expressions. Our qualitative, quantitative and particularly our perceptual evaluations demonstrate that SMIRK achieves the new state-of-the-art performance on accurate expression reconstruction. For our method's source code, demo video and more, please visit our project webpage: https://***/smirk/.

Keywords: Training; Geometry; Solid modeling; Three-dimensional displays; Accuracy; Face recognition; Training data

Grasp Approach Under Positional Uncertainty Using Compliant Tactile Sensing Modules and Reinforcement Learning

Canadian Conference on Electrical and computer Engineering (CCECE)

Authors: Viral Rasik Galaiya Thiago Eustaquio Alves De Oliveira Xianta Jiang Vinicius Prado Da Fonseca Department of Computer Science Robotics and AI Lab Memorial University of Newfoundland and Labrador St. John’s Canada Department of Computer Science Ubiquitous Computing and Machine Learning Lab Memorial University of Newfoundland and Labrador St. John’s Canada Department of Computer Science Haptics and Robots Research Group Lakehead University Thunder Bay Canada

ISBN: (digital) 9798350371628

ISBN: (print) 9798350371635

Object grasping is a complex task that requires high environmental awareness. While vision generally provides highly detailed environmental information, light changes, object transparency, camera resolution, and other factors such as occlusion and clutter affect its perception of object pose. Due to these limitations, there may be some deviation between the estimated and actual object pose in unstructured environments. The use of compliant tactile sensors relaxes the requirement of strict finger position planning while providing essential information regarding contact with the target object. Therefore, under positional uncertainty, the robotic system may use compliant tactile sensors to perform multiple attempts before a successful grasp. In the present paper, we investigate using reinforcement learning and compliant tactile sensors to provide adaptive grasping under pose uncertainty. Here, we identify a policy that models an object position estimation error while minimizing the exploratory sensor contact before obtaining a grasp. Our method was able to perform a successful grasp while reducing the number of attempts from an average of five to an average of two per episode.

Keywords: Estimation error; Uncertainty; Contacts; Fingers; Tactile sensors; Reinforcement learning; Grasping

CDQN: Context infused Sequential Object Detection with Deep Reinforcement Learning in Aerial Images

3rd IEEE India Geoscience and Remote Sensing Symposium, InGARSS 2023

Authors: Gandikota, Rohit Mishra, Deepak Northeastern University Department of Computer Science Boston United States Indian Institute of Space Science and Technology Department of Avionics Thiruvananthapuram India

ISBN: (print) 9798350325591

Object detection in satellite imagery is challenging due to small object scale. Traditional one-shot and region proposal methods struggle with accuracy and computational costs. We propose a novel deep reinforcement learning approach using a hierarchical agent. It sequentially zooms into fixed sub-regions, starting from the full image and repeating until terminating in a final bounding box. Our method also optimizes computations by leveraging multi-resolution images. The agent begins with lower resolution, increasing based on state context for *** addition, we compare various deep Q-network (DQN) settings and show that adding a history vector of actions to the state is suboptimal, consistent with prior research. We demonstrate that our proposed approach outperforms traditional one-shot object detection and region-based proposal methods on satellite images. It is widely applicable to real-world uses like geospatial analysis, disaster management, and urban planning. Code, models, and results are publicly available at https://***/rohitgandikota/cdqn-detect. © 2023 IEEE.

Keywords: Object detection
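
The sequential zoom described in this abstract (start from the full image, repeatedly pick a fixed sub-region, stop at a final bounding box) can be sketched as a simple loop. This is only an illustration under stated assumptions: the abstract does not say how many sub-regions there are or how they are laid out, so the four-quadrant split is assumed, and the trained DQN agent is replaced by a hypothetical toy policy (`make_oracle_policy`) that already knows where the target is.

```python
def sub_regions(box):
    """Split (x0, y0, x1, y1) into four quadrants (assumed layout)."""
    x0, y0, x1, y1 = box
    mx, my = (x0 + x1) / 2, (y0 + y1) / 2
    return [(x0, y0, mx, my), (mx, y0, x1, my),
            (x0, my, mx, y1), (mx, my, x1, y1)]


def zoom_search(image_box, policy, max_steps=10):
    """Zoom into sub-regions until the policy returns None (terminate)."""
    box = image_box
    for _ in range(max_steps):
        action = policy(box)      # index of the sub-region to zoom into, or None
        if action is None:
            break
        box = sub_regions(box)[action]
    return box


def make_oracle_policy(target, min_size):
    """Toy stand-in for a learned agent: follows a known target point."""
    tx, ty = target

    def policy(box):
        x0, y0, x1, y1 = box
        if x1 - x0 <= min_size:   # small enough: emit the final bounding box
            return None
        mx, my = (x0 + x1) / 2, (y0 + y1) / 2
        return (1 if tx >= mx else 0) + (2 if ty >= my else 0)

    return policy


policy = make_oracle_policy(target=(300, 100), min_size=64)
print(zoom_search((0, 0, 512, 512), policy))  # (256.0, 64.0, 320.0, 128.0)
```

In a learned version, `policy` would map the current view (and, per the abstract, its resolution context) to an action; the loop structure stays the same.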

DynamicVisionCore: A Predictive Object Tracking Framework for Real-Time Robotics Applications

8th International Conference on Trends in Electronics and Informatics, ICOEI 2025

Authors: Suneel, Sajja Rallan, Reema Jayakarthik, R. Naveenkumar, R. Dubey, Alok Shivaputra Telangana Hyderabad 500043 India Lovely Professional University Department of Computer Application Punjab Phagwara India Saveetha College of Liberal Arts and Sciences SIMATS Dept of Computer Science India Chandigarh Colleges of Engineering Chandigarh Group of Colleges Department of Computer Science and Engineering Punjab Mohali 140307 India College of Dentistry Jazan University Department of Preventive Dental Sciences Jazan Saudi Arabia Dr. Ambedkar Institute of Technology Department of Electronics and Communication Engineering Bengaluru India

ISBN: (print) 9798331544607

DynamicVisionCore is a novel predictive object tracking framework designed for real-time robotics applications. The system integrates YOLOv8 for object detection and DeepSORT for multi-object tracking, ensuring high accuracy and low latency. The framework achieves an impressive MOTA of 72.5% and MOTP of 80.1% on the MOT17 dataset, outperforming traditional methods such as SORT and ByteTrack. Motion prediction is enhanced using a hybrid Kalman Filter and LSTM-based model, reducing RMSE from 9.7 to 6.3 in occlusion scenarios. Additionally, the system ensures robust performance under challenging conditions, achieving 91.3% accuracy in normal lighting and 85.6% in low-light environments. The implementation is optimized for edge computing platforms like Jetson Xavier, where it achieves real-time processing at 9.7 ms per frame. The system's robustness is further validated against varying lighting conditions, occlusions, motion blur, and extreme environments. This research demonstrates the efficacy of DynamicVisionCore in real-time robotic vision applications, ensuring reliable object tracking for autonomous systems. Future work will explore adaptive reinforcement learning strategies and improved sensor fusion techniques to further enhance tracking robustness. © 2025 IEEE.

Keywords: Object detection
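
The abstract reports MOTA without defining it. Under the standard CLEAR MOT definition, MOTA is one minus the ratio of all tracking errors (misses, false positives, identity switches) to the number of ground-truth objects, summed over all frames. The sketch below computes it from error counts; the counts are made-up illustrative values, not figures from the paper, and the function name is hypothetical.

```python
def mota(false_negatives, false_positives, id_switches, num_ground_truth):
    """Multiple Object Tracking Accuracy (CLEAR MOT), as a fraction.

    All arguments are totals accumulated over every frame of a sequence.
    The value can be negative when errors outnumber ground-truth objects.
    """
    if num_ground_truth <= 0:
        raise ValueError("num_ground_truth must be positive")
    errors = false_negatives + false_positives + id_switches
    return 1.0 - errors / num_ground_truth


# Illustrative counts only: 420 errors against 2000 ground-truth boxes.
score = mota(false_negatives=300, false_positives=100,
             id_switches=20, num_ground_truth=2000)
print(f"MOTA = {score:.1%}")  # MOTA = 79.0%
```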

Face Detection and Recognition in Near Infra-Red Image

6th IEEE International Conference on Computational System and Information Technology for Sustainable Solutions, CSITSS 2022

Authors: Shet, Athreya V Bs, Chinmay Shetty, Anvithkumar A Shankar, T. Hemavathy, R. Ramakanth, P. R V College of Engineering Computer Science and Engineering Department Bengaluru India R V College of Engineering Research and Projects Center for CCTV Research Bengaluru India

ISBN: (print) 9781665456982

Face detection and recognition is an extensively researched topic in Artificial Intelligence. The use of AI in detecting and mapping faces or other objects can reduce the time spent in video auditing. A face recognition system maps facial traits from a picture or video using biometrics. To identify a match, it compares the data with a database of recognized faces. In any situation, facial recognition technology can intelligently assist in confirming a person's identity. Infrared cameras, or night vision in security cameras, use infrared light to capture near-infrared images in the dark and also through fog, dust, and smoke, so that the camera works in all conditions. Cameras operating in the visible spectrum work well only when imaging conditions are good, whereas IR cameras can capture quality images at all times. The paper presents our work in creating a facial recognition system for near-infrared images from a surveillance camera. A database was created from live Closed-Circuit Television footage of different individuals and tested with different face detection techniques, and images were classified using facial embeddings from the VGG-Face model to identify the faces. © 2022 IEEE.

Keywords: Face recognition
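
Matching a face against a database of known faces, as this abstract describes, is commonly done by comparing embedding vectors. The sketch below shows one plausible way to do that. The abstract does not state which classifier or distance the authors used, so cosine similarity, the 0.8 threshold, and the function names are all assumptions, and the three-dimensional vectors are toy stand-ins for real VGG-Face embeddings.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify(query, gallery, threshold=0.8):
    """Return the best-matching name in `gallery`, or None if nothing is close enough.

    `gallery` maps a person's name to a stored embedding (hypothetical layout).
    """
    best_name, best_score = None, threshold
    for name, embedding in gallery.items():
        score = cosine_similarity(query, embedding)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


gallery = {"alice": [1.0, 0.0, 0.0], "bob": [0.0, 1.0, 0.0]}
print(identify([0.9, 0.1, 0.0], gallery))  # alice
print(identify([0.5, 0.5, 0.7], gallery))  # None (no stored face is similar enough)
```

The threshold trades false matches against missed matches and would need tuning on real near-infrared data.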

Temporally Consistent Referring Video Object Segmentation with Hybrid Memory

arXiv, 2024

Authors: Miao, Bo Bennamoun, Mohammed Gao, Yongsheng Shah, Mubarak Mian, Ajmal The Department of Computer Science and Software Engineering The University of Western Australia Perth Crawley WA 6009 Australia The School of Engineering Griffith University Brisbane QLD 4111 Australia The Center for Research in Computer Vision University of Central Florida Orlando FL 32816 United States

Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining consistent object segmentation due to temporal context variability and the presence of other visually similar objects. We propose an end-to-end R-VOS paradigm that explicitly models temporal instance consistency alongside the referring segmentation. Specifically, we introduce a novel hybrid memory that facilitates inter-frame collaboration for robust spatio-temporal matching and propagation. Features of frames with automatically generated high-quality reference masks are propagated to segment the remaining frames based on multi-granularity association to achieve temporally consistent R-VOS. Furthermore, we propose a new Mask Consistency Score (MCS) metric to evaluate the temporal consistency of video segmentation. Extensive experiments demonstrate that our approach enhances temporal consistency by a significant margin, leading to top-ranked performance on popular R-VOS benchmarks, i.e., Ref-YouTube-VOS (67.1%) and Ref-DAVIS17 (65.6%). The code is available at https://***/bo-miao/HTR. © 2024, CC BY.

Keywords: Video analysis

Manta Ray Foraging Optimizer with Deep Learning Assisted Automated Fabric Defect Detection

1st IEEE International Conference on Cognitive robotics and Intelligent Systems, ICC - ROBINS 2024

Authors: Sajitha, N. Prasanna Priya, S. Annamalai University Faculty of Science Department of Computer and Information Science India Thiru A. Govindasamy Govt Arts College Tindivanam 604 307 India

ISBN: (print) 9798350372748

Detecting defects in fabric is a crucial step in the textile industry. The intricate nature of textile structures often poses a challenge for the automated identification of fabric damage. Fabric Defect Detection (FDD) relies on sophisticated methods such as computer vision and machine learning to automatically identify and categorize faults or irregularities in fabric materials. By employing high-resolution imaging methods and advanced models, these systems analyze textures, patterns, and colour differences in textiles to identify faults such as holes, stains, or abnormalities. FDD utilizing Deep Learning (DL) leverages the power of neural networks to automatically recognize and label deficiencies in fabric materials. Through Convolutional Neural Networks (CNNs) or other DL architectures, these techniques are trained on large image datasets that contain defective as well as non-defective textile samples. This research develops a novel Manta Ray Foraging Optimizer with a Deep Learning-Assisted Automated Fabric Defect Detection (MRFODL-AFDD) approach. The MRFODL-AFDD approach uses an Inception v3 feature extractor to improve the efficacy of feature representation in textile images. In addition, the presented approach uses the MRFO model for hyperparameter tuning, enhancing the performance of the defect detection method. Finally, a deep Long Short-Term Memory (LSTM) detection model is implemented to capture temporal dependencies in consecutive fabric data. Experimental results demonstrate the efficiency of the developed framework in attaining precise and robust FDD. The integration of Inception v3, MRFO, and deep LSTM classification shows a promising improvement in automating quality control procedures within the textile industry. © 2024 IEEE.

Keywords: Feature extraction
