Search Results - Inner Mongolia University Library

A Content-Based Image Retrieval Method Based on the Google Cloud Vision API and WordNet

9th Asian Conference on Intelligent Information and Database Systems (ACIIDS)

Authors: Chen, Shih-Hsin; Chen, Yi-Hui. Affiliations: Cheng Shiu Univ, Dept Informat Management, 840 Chengcing Rd, Kaohsiung 83347, Taiwan; Asia Univ, Dept M Commerce & Multimedia Applicat, 500 Lioufeng Rd, Taichung 41354, Taiwan

ISBN: (digital) 9783319544724

ISBN: (print) 9783319544724

Content-Based Image Retrieval (CBIR) methods analyze the content of an image and extract features that describe it, known as image annotations (or image labels). A machine learning (ML) algorithm is commonly used to obtain the annotations, but this is a time-consuming process. In addition, the semantic gap is another problem in image labeling. To overcome the first difficulty, the Google Cloud Vision API is a solution because it saves much computational time. To resolve the second problem, a transformation method is defined for mapping undefined terms by using WordNet. In the experiments, a well-known dataset, Pascal VOC 2007, with 4,952 test images is used, and image labeling with the Cloud Vision API is implemented in the R language. At most ten labels are kept for each image, provided their scores are over 50. Moreover, we compare the Cloud Vision API with well-known ML algorithms. This work found that the API yields 42.4% mean average precision (mAP) over the 4,952 images. Our proposed approach is better than three well-known ML algorithms. Hence, this work could be extended to test other image datasets and serve as a benchmark method when evaluating performance.

Keywords: Content Based Image Retrieval; Image annotation; Google Cloud Vision API; WordNet; Pascal VOC 2007
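
The label-selection rule in the abstract above (at most ten labels per image, scores over 50) and the idea of mapping undefined terms onto known classes can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the 0-100 score scale, the function names, and the tiny hand-written hypernym table standing in for WordNet are all assumptions made for the example.

```python
# Hypothetical input: (description, score) pairs, with scores assumed to be on a 0-100 scale.

def select_labels(labels, min_score=50.0, max_labels=10):
    """Keep labels scoring above min_score, highest first, capped at max_labels."""
    kept = [(desc, score) for desc, score in labels if score > min_score]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept[:max_labels]


# Toy stand-in for a WordNet hypernym lookup (assumed data, not real WordNet access).
TOY_HYPERNYMS = {"puppy": "dog", "kitten": "cat", "sedan": "car"}
TARGET_CLASSES = {"dog", "cat", "car"}


def map_to_target(term, hypernyms=TOY_HYPERNYMS, targets=TARGET_CLASSES, max_steps=5):
    """Walk up the toy hypernym chain until a target class is reached, else return None."""
    current = term.lower()
    for _ in range(max_steps + 1):
        if current in targets:
            return current
        if current not in hypernyms:
            return None
        current = hypernyms[current]
    return None


if __name__ == "__main__":
    raw = [("dog", 97.0), ("puppy", 60.0), ("grass", 40.0)]
    for desc, score in select_labels(raw):
        print(desc, score, "->", map_to_target(desc))
```

A real pipeline would replace the toy table with an actual WordNet lookup; the abstract does not say which relation the transformation method uses, so the hypernym walk here is only one plausible choice.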

Smart Vision and Intelligent Object Recognition System for the Visually Impaired

15th International Conference on Soft Computing and Pattern Recognition, SoCPaR 2023 and 14th World Congress on Nature and Biologically Inspired Computing, NaBIC 2023

Authors: Suresh, Merugu; Sandhyarani; Shaik, Abdul Subhani; Premalatha, B.; Ghinea, George; Reddy, Avala Raji; Raju, K. Srujan; Kumar, Voruganti Naresh. Affiliations: Department of ECE, CMR College of Engineering and Technology, Telangana, Hyderabad, India; CMR Technical Campus, Telangana, Hyderabad, India; Mulsemedia Computing, Brunel University London, Wilfred Brown Building 215, Uxbridge, United Kingdom

ISBN: (print) 9783031810824

In an effort to increase functional independence for visually impaired people, the authors identified the disadvantages and drawbacks of existing solutions and tried to address the gaps in those systems. The problems identified and addressed include object recognition, identification of packaged goods, identification of currency coins and notes, and the ability to read text and recognize people and faces. This paper aims to provide complete support to visually impaired users. The idea of this paper is a pair of augmented reality wearable spectacles together with a wearable smart vision band on the wrist. The spectacles are equipped with a camera, earphones, a GPS module, and an Internet connection (Wi-Fi & 4G). The wearable smart vision system contains ultrasonic sensors and IR sensors to recognize objects in front of or behind the visually impaired person and can alert the wearer using vibration. The spectacles and the band are connected using Bluetooth. The output of the assistive device takes the form of an audible signal delivered through the earphones, which a visually impaired user can readily use. The work is based on computer vision for image processing using OpenCV and the Google Cloud Vision API. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords: Augmented Reality; Bluetooth; Computer vision; Google Cloud Vision API; GPS module; IR sensor; OpenCV; Ultrasonic sensor

A Vision Sensor Network to Study Viewers' Visible Behavior of Art Appreciation

Japanese-Society-of-Artificial-Intelligence 10th International Symposium on Artificial Intelligence (JSAI-isAI)

Authors: Wu, Yilang; Huang, Luyi; Wei, Zhongyu; Cheng, Zixue. Affiliations: Univ Aizu, Aizu Wakamatsu, Fukushima, Japan; Fudan Univ, Shanghai, Peoples R China

ISBN: (print) 9783030316051; 9783030316044

Since empathic processes are essential to the aesthetic experience, empathy-enabling technology for behavioral sensing is gaining popularity in support of the study of anonymized viewers' cognition in art appreciation. Because such behavior is highly dynamic and divergent among viewers, it is a challenge to observe the multiple dynamic features from the streaming data. In this study, we propose a vision sensor network (VSN) to support the visual interpretation of viewers' appreciation of visual arts. It first annotates the features in the captured frames using a cloud API (here the Google Cloud Vision API), and second, queries on nested documents in MongoDB provide universal access to the annotated features. Compared with traditional approaches that rely on subjective evidence, such as questionnaires or social listening methods, the proposed VSN can interpret the visible behavior of viewers in real time. In addition, it has less selection bias because more objective evidence is captured.

Keywords: Aesthetic empathy; Vision Sensor Network; Google Cloud Vision API; Real-time image annotation; Query on nested documents
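
The "query on nested documents" step mentioned in the abstract above can be illustrated without a database. The sketch below mimics dotted-path matching over nested, per-frame annotation documents in plain Python. The document layout, field names, and helper names are hypothetical, and no MongoDB connection is involved; it only shows the shape of the idea.

```python
def get_path(doc, dotted):
    """Yield every value reachable through a dotted path, descending into lists."""
    parts = dotted.split(".")

    def walk(node, i):
        if i == len(parts):
            yield node
        elif isinstance(node, list):
            # A list is transparent: try the same path segment on each element.
            for item in node:
                yield from walk(item, i)
        elif isinstance(node, dict) and parts[i] in node:
            yield from walk(node[parts[i]], i + 1)

    yield from walk(doc, 0)


def find(frames, dotted, value):
    """Return the frames in which some value at the dotted path equals `value`."""
    return [frame for frame in frames if value in get_path(frame, dotted)]


# Hypothetical annotated frames; the field names are assumptions for illustration.
FRAMES = [
    {"frame_id": 1,
     "annotations": {"labels": [{"description": "painting", "score": 0.93}],
                     "faces": [{"joy": "LIKELY"}]}},
    {"frame_id": 2,
     "annotations": {"labels": [{"description": "sculpture", "score": 0.88}],
                     "faces": []}},
]

if __name__ == "__main__":
    hits = find(FRAMES, "annotations.faces.joy", "LIKELY")
    print([frame["frame_id"] for frame in hits])
```

In a document store, the same dotted path would typically be passed as a query key, so one query form can reach any annotated feature regardless of nesting depth.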

Implementation of Machine Learning for Gender Detection using CNN on Raspberry Pi Platform

2nd International Conference on Inventive Systems and Control (ICISC)

Authors: Gauswami, Mitulgiri H.; Trivedi, Kiran R. Affiliations: Shantilal Shah Engn Coll, Elect & Commun Engn, Commun Syst Engn, Bhavnagar, Gujarat, India; Gujarat Technol Univ, Ahmadabad, Gujarat, India; Shantilal Shah Engn Coll, Elect & Commun Engn, Bhavnagar, Gujarat, India

ISBN: (print) 9781538608074

Gender detection has numerous applications in authentication, security and surveillance systems, social platforms, and social media. The proposed system performs gender detection based on a computer vision and machine learning approach using a Convolutional Neural Network (CNN), which is used to extract various facial features. First, facial feature extraction is investigated and the best features are introduced, which are useful for training and testing on the dataset. This learned representation is obtained through the use of a convolutional neural network. The proposed system is tested across face datasets of various challenging levels and gives excellent performance, with a gender detection rate reported for each of the databases. The whole system is realized as a simple hardware implementation on a Raspberry Pi programmed using Python.

Keywords: Machine Learning; Gender Detection; Google Cloud Vision API; Raspberry Pi; Convolutional Neural Networks (CNN); Artificial Intelligence; Linux Platform; Embedded System

Enriching social analytics with latent Twitter image information

15th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP)

Authors: Razis, Gerasimos; Theofilou, Georgios; Anagnostopoulos, Ioannis. Affiliations: Univ Thessaly, Comp Sci & Biomed Informat Dept, Lamia, Greece; Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens, Greece

ISBN: (print) 9781728159195

In this paper, we propose a framework that uses latent information from Twitter images by employing the Google Cloud Vision API platform, aiming at enriching social analytics with semantics and textual information. Our study reveals that user-generated content, linked data, and hidden concepts and textual information from social images merit strong consideration for enriching social analytics. Finally, we publish our annotated dataset for further use and evaluation by our research community.

Keywords: Social labeling; Twitter Images; Google Cloud Vision API; OCR

Latent Twitter Image Information for Social Analytics

INFORMATION, 2021, Vol. 12, No. 2, p. 49

Authors: Razis, Gerasimos; Theofilou, Georgios; Anagnostopoulos, Ioannis. Affiliations: Univ Thessaly, Comp Sci & Biomed Informat Dept, Lamia 35131, Greece; Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15780, Greece

The appearance of images in social messages is continuously increasing, along with user engagement with that type of content. Analysis of social images can provide valuable latent information, often not present in the social posts. In that direction, a framework is proposed exploiting latent information from Twitter images, by leveraging the Google Cloud Vision API platform, aiming at enriching social analytics with semantics and hidden textual information. As validated by our experiments, social analytics can be further enriched by considering the combination of user-generated content, latent concepts, and textual data extracted from social images, along with linked data. Moreover, we employed word embedding techniques for investigating the usage of latent semantic information towards the identification of similar Twitter images, thereby showcasing that hidden textual information can improve such information retrieval tasks. Finally, we offer an open enhanced version of the annotated dataset described in this study with the aim of further adoption by the research community.

Keywords: social labeling; Twitter images; Google Cloud Vision API; OCR; cosine similarity; Word2Vec
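
The record above names word embeddings and cosine similarity as the means of finding similar Twitter images from their extracted text. A minimal sketch of that comparison follows, under stated assumptions: the two-dimensional toy vectors stand in for a trained Word2Vec model, and averaging token vectors into one vector per image is an illustrative choice that the abstract does not confirm.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def mean_vector(tokens, embeddings, dim):
    """Average the embeddings of the known tokens; unknown tokens are skipped."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        return [0.0] * dim
    return [sum(column) / len(vectors) for column in zip(*vectors)]


# Toy embeddings (assumed values, not taken from any trained model).
TOY_EMBEDDINGS = {
    "beach": [1.0, 0.0],
    "sea": [0.9, 0.1],
    "invoice": [0.0, 1.0],
}

if __name__ == "__main__":
    image_a = mean_vector(["beach", "sea"], TOY_EMBEDDINGS, dim=2)
    image_b = mean_vector(["sea"], TOY_EMBEDDINGS, dim=2)
    image_c = mean_vector(["invoice"], TOY_EMBEDDINGS, dim=2)
    print(cosine_similarity(image_a, image_b) > cosine_similarity(image_a, image_c))
```

The tokens would in practice come from labels and OCR text extracted per image; here they are typed in by hand to keep the example self-contained.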
