Violent interaction detection is of vital importance in video surveillance scenarios such as railway stations, prisons or psychiatric centres. Existing vision-based methods rely mainly on hand-crafted features, such as statistical features between motion regions, leading to poor adaptability to other datasets. Inspired by the development of convolutional networks for common activity recognition, we construct a FightNet to represent complicated visual violent interactions. In this paper, a new input modality, the image acceleration field, is proposed to better extract motion attributes. Firstly, each video is decomposed into RGB frames. Secondly, the optical flow field is computed from consecutive frames, and the acceleration field is obtained from the optical flow field. Thirdly, FightNet is trained with three kinds of input modalities: RGB images for the spatial network, and optical flow images and acceleration images for the temporal networks. By fusing the results from the different inputs, we conclude whether a video contains a violent event or not. To provide researchers a common ground for comparison, we have collected a violent interaction dataset (VID) containing 2314 videos, of which 1077 show fights and 1237 do not. Experimental comparisons with other algorithms demonstrate that the proposed model achieves higher accuracy and better robustness for violent interaction detection.
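The acceleration-field construction described above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes OpenCV's Farneback dense optical flow and approximates the acceleration field as the difference between consecutive flow fields.

```python
# Minimal sketch (not the authors' code): dense optical flow with OpenCV's
# Farneback method, and an "acceleration field" approximated as the
# difference of consecutive flow fields.
import cv2

def acceleration_fields(video_path):
    cap = cv2.VideoCapture(video_path)
    ok, prev = cap.read()
    if not ok:
        return []
    prev_gray = cv2.cvtColor(prev, cv2.COLOR_BGR2GRAY)
    prev_flow = None
    accels = []
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        flow = cv2.calcOpticalFlowFarneback(prev_gray, gray, None,
                                            0.5, 3, 15, 3, 5, 1.2, 0)
        if prev_flow is not None:
            accels.append(flow - prev_flow)  # acceleration ~ temporal derivative of flow
        prev_gray, prev_flow = gray, flow
    cap.release()
    return accels
```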
It is challenging to capture a high-dynamic-range (HDR) scene using a low-dynamic-range (LDR) camera. This paper presents an approach for improving the dynamic range of cameras by using multiple exposure images of the same scene taken under different exposure times. First, the camera response function (CRF) is recovered by solving a high-order polynomial in which only the ratios of the exposures are used. Then, the HDR radiance image is reconstructed by a weighted summation of the individual radiance maps. After that, a novel local tone mapping (TM) operator is proposed for the display of the HDR radiance image. By solving the high-order polynomial, the CRF can be recovered quickly and easily. Taking the local image features and the characteristics of the histogram statistics into consideration, the proposed TM operator preserves local details efficiently. Experimental results demonstrate the effectiveness of our method, which outperforms other methods in terms of imaging quality.
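The radiance-map merging step can be illustrated by the following sketch. It is written under stated assumptions (a hat-shaped per-pixel weight and an already-recovered inverse CRF `inv_crf`), is not the paper's exact formulation, and omits both the polynomial CRF recovery and the tone mapping operator.

```python
# Sketch only: merge multiple exposures into an HDR radiance map by weighted
# averaging of per-exposure radiance estimates. `inv_crf` maps pixel values to
# relative radiance and would come from the recovered camera response function.
import numpy as np

def merge_hdr(images, exposure_times, inv_crf):
    """images: list of float arrays in [0, 1]; exposure_times: list of floats."""
    num = np.zeros_like(images[0], dtype=np.float64)
    den = np.zeros_like(images[0], dtype=np.float64)
    for img, t in zip(images, exposure_times):
        w = 1.0 - np.abs(2.0 * img - 1.0)    # hat weight: trust mid-tones most
        radiance = inv_crf(img) / t           # radiance estimate from this exposure
        num += w * radiance
        den += w
    return num / np.maximum(den, 1e-8)
```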
Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrot...
Arabic character recognition is a challenging problem in several artificial intelligence applications, especially when recognizing connected cursive letters. Another dimension of complexity is that Arabic characters may take various shapes depending on their positions in the word. As a result, unconstrained handwritten Arabic character recognition has not been well explored. In this study, we propose an efficient algorithm for Arabic character recognition. The new algorithm combines features extracted from the curvelet and spatial domains. The curvelet domain is multiscale and multidirectional, and is therefore efficient at representing edges and curves, while the spatial domain preserves the original aspects of the characters. The combined feature vector is then used to train a back-propagation neural network for the recognition task. The proposed algorithm is evaluated using a database containing 5,600 handwritten characters from 50 different writers, achieving a promising average success rate of 90.3%. The proposed algorithm is therefore suitable for unconstrained handwritten Arabic character recognition applications.
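The feature-combination and training stage might look like the following sketch. It is illustrative only: `curvelet_features` is a hypothetical helper standing in for a real curvelet transform, and scikit-learn's MLP is used as a stand-in for the back-propagation network described in the abstract.

```python
# Illustrative sketch: concatenate curvelet-domain and spatial-domain features,
# then train a network with back-propagation. `curvelet_features` is a
# hypothetical helper, not a real library call.
import numpy as np
from sklearn.neural_network import MLPClassifier

def spatial_features(char_img):
    # Simple spatial descriptor: the normalized character image, flattened.
    return (char_img.astype(np.float32) / 255.0).ravel()

def combined_features(char_img, curvelet_features):
    return np.concatenate([curvelet_features(char_img), spatial_features(char_img)])

def train(X, y):
    # One hidden layer trained by back-propagation (scikit-learn's MLP).
    clf = MLPClassifier(hidden_layer_sizes=(128,), max_iter=500)
    clf.fit(X, y)
    return clf
```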
Authors:
Feri Candra, Syed Abd. Rahman Abu-Bakar
Computer Vision, Video and Image Processing Research Lab, Electronics and Computer Engineering Department, Faculty of Electrical Engineering, Universiti Teknologi Malaysia, 81310 Johor Bahru, Malaysia
ISBN (Print): 9781479989973
Spectral imaging techniques such as hyperspectral and multispectral imaging combine imaging and spectroscopy. This powerful technique can provide spectral images of samples, which can be used to analyze a number of fruit properties. The aim of this study is to develop a calibration (predictive) model for determining the soluble solid content (SSC) of starfruit samples from their spectral images. Partial least squares regression (PLSR) and support vector regression (SVR) were applied to model the relationship between the mean spectral data and the reference values. The mean spectral data were extracted from the spectral images of each starfruit sample. The simple template for region-of-interest (ROI) selection and the five optimal wavelengths (565.2, 677.2, 736, 873.2 and 943.2 nm) proposed in a previous study were used to extract the mean spectral data. The results show that the calibration models built with PLSR and SVR performed better than the previous study, and that the SVR model gave the best performance for predicting the SSC value of starfruit.
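The calibration step can be sketched as follows, assuming `X` holds the mean spectra at the five selected wavelengths and `y` the reference SSC values; the hyperparameters (number of PLS components, SVR settings, split ratio) are assumptions, not values from the study.

```python
# Sketch of the calibration models under stated assumptions.
from sklearn.cross_decomposition import PLSRegression
from sklearn.svm import SVR
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score

def calibrate(X, y):
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
    pls = PLSRegression(n_components=3).fit(X_tr, y_tr)           # n_components is an assumption
    svr = SVR(kernel='rbf', C=10.0, epsilon=0.1).fit(X_tr, y_tr)  # illustrative settings
    return {
        'PLSR R2': r2_score(y_te, pls.predict(X_te)),
        'SVR R2': r2_score(y_te, svr.predict(X_te)),
    }
```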
This paper presents a new image abstraction approach aimed at improving typical image-related pattern recognition tasks such as segmentation, tracking, and classification. The proposed image abstraction framework performs image denoising and homogeneous region simplification, along with border and region enhancement. The framework consists of a novel generalized approach that combines common weighted-averaging denoising algorithms with Unsharp Masking (USM) border enhancement techniques so as to avoid typical USM artifacts such as ringing. Results of different configurations of the image abstraction framework in a cell tracking application are presented.
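The two ingredients the framework generalizes can be illustrated by the sketch below: a weighted-averaging (here Gaussian) denoising pass followed by classic unsharp masking. Parameter values are illustrative, and the framework's artifact-avoidance logic is not reproduced.

```python
# Minimal sketch: Gaussian weighted-averaging denoising followed by
# classic unsharp masking (USM) for border enhancement.
import cv2
import numpy as np

def abstract_image(img, sigma=2.0, amount=1.5):
    smoothed = cv2.GaussianBlur(img, (0, 0), sigma)             # homogeneous-region simplification
    detail = img.astype(np.float32) - smoothed.astype(np.float32)
    sharpened = img.astype(np.float32) + amount * detail        # USM border enhancement
    return np.clip(sharpened, 0, 255).astype(np.uint8)
```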
Authors:
S. Daliman, S. A. R. Abu-Bakar, S. H. Md Nor Azam
Computer Vision, Video and Image Processing (CvviP) Research Lab, Department of Electronics and Computer Engineering, Faculty of Electrical Engineering, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia; Sime Darby Research Sdn. Bhd., Jalan Pulau Carey, 42960 Pulau Carey, Selangor, Malaysia
This paper presents the development of Haar-based rectangular windows for the recognition of young oil palm trees in WorldView-2 imagery data. Haar-based rectangular windows, also known as Haar-like rectangular features, are popular in face recognition, as used in the Viola-Jones object detection framework. As in face recognition, oil palm tree recognition requires Haar-based rectangular windows that suit the characteristics of the oil palm tree. A set of seven Haar-based rectangular windows has been designed specifically to match young oil palm trees, whose crowns are much smaller than those of mature trees. Determining suitable features for the oil palm tree is essential to ensure a high rate of correct detections. Furthermore, features that characterize an oil palm tree should distinguish it from other objects in the image such as buildings, roads and drainage. These features are trained using a support vector machine (SVM) to model the oil palm tree for classifying the testing set and subimages of the WorldView-2 imagery data. The resulting classification of young oil palm trees, with a sensitivity of 98.58% and an accuracy of 92.73%, is a promising result towards the development of automatic young oil palm tree counting.
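A single Haar-like rectangle response and the SVM training step can be sketched as follows. The two-rectangle layout is a placeholder, not one of the seven windows designed in the paper, and the single-feature training is purely illustrative.

```python
# Illustrative sketch: a two-rectangle Haar-like response computed from an
# integral image, and an SVM trained on such features.
import numpy as np
from sklearn.svm import SVC

def rect_sum(ii, x, y, w, h):
    # Sum of pixels in the rectangle (x, y, w, h) using an integral image ii.
    return ii[y + h, x + w] - ii[y, x + w] - ii[y + h, x] + ii[y, x]

def haar_two_rect(patch):
    # Integral image with a zero top row and left column for easy indexing.
    ii = np.pad(patch.astype(np.float64), ((1, 0), (1, 0))).cumsum(0).cumsum(1)
    h, w = patch.shape
    left = rect_sum(ii, 0, 0, w // 2, h)
    right = rect_sum(ii, w // 2, 0, w - w // 2, h)
    return float(left - right)

def train_svm(patches, labels):
    X = np.array([[haar_two_rect(p)] for p in patches])
    return SVC(kernel='linear').fit(X, labels)
```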
Large scale digitization campaigns are simplifying the accessibility of a rapidly increasing number of images from cultural heritage. However, digitization alone is not sufficient to effectively open up these valuable...
Human object classification is an important problem for smart video surveillance applications. In this paper we have proposed a method for human object classification, which classify the objects into two classes: huma...
In this contribution, we present a segmentation algorithm based on thresholding to subdivide an intensity image into object and background regions. The optimal threshold is found by maximizing a likelihood function derived from a novel intensity probability density function model, which consists of the sum of two weighted four-parameter gamma distributions, as a more flexible alternative to the currently used models consisting of the sum of two weighted two-parameter Gaussian distributions. According to our experiments with 132 images, the proposed algorithm is on average slightly better than the best found in the scientific literature, performing particularly well on low-contrast images. The additional parameters and complexity of its likelihood function increase the processing time by a factor of 3, from 0.003 sec/image to 0.009 sec/image.
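The likelihood-maximization idea can be sketched as follows, with SciPy's three-parameter gamma standing in for the paper's four-parameter model; the candidate threshold grid and minimum class sizes are arbitrary choices for the sketch.

```python
# Sketch under simplifying assumptions: sweep candidate thresholds, fit a gamma
# distribution to each side, and keep the threshold that maximizes the
# class-weighted log-likelihood.
import numpy as np
from scipy import stats

def optimal_threshold(gray, candidates=range(10, 246, 5)):
    pixels = gray.ravel().astype(np.float64)
    n = len(pixels)
    best_t, best_ll = None, -np.inf
    for t in candidates:
        bg, fg = pixels[pixels <= t], pixels[pixels > t]
        if len(bg) < 50 or len(fg) < 50:
            continue
        ll = 0.0
        for cls in (bg, fg):
            w = len(cls) / n                                   # mixture weight of this class
            a, loc, scale = stats.gamma.fit(cls, floc=cls.min() - 0.5)
            ll += len(cls) * np.log(w) + stats.gamma.logpdf(cls, a, loc, scale).sum()
        if ll > best_ll:
            best_t, best_ll = t, ll
    return best_t
```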