Human pose estimation is an important research direction in the field of computer vision. At present, the mainstream human pose estimation algorithms have high complexity, large amount of calculation, and cannot be ru...
详细信息
With the continuous development of intelligent driving technology, vision-based navigation technology is becoming increasingly significant. Lane detection is the key part of the navigation technology. At present, most...
详细信息
ISBN:
(纸本)9789881563972
With the continuous development of intelligent driving technology, vision-based navigation technology is becoming increasingly significant. Lane detection is the key part of the navigation technology. At present, most of the researches focus on the identification of structured lane, such as highways. However, study on the detection of unstructured lane is relatively few, mainly because unstructured lanes have no obvious marks, which brings difficulties in identification. Therefore, unstructured lane detection still faces serious challenges now. In this paper, region of interest (ROI) of original RGB image is first selected. Next step is RGB image conversion to a grayscale. Then, Gaussian filter and mean filter are used to smooth the structured lane image and unstructured lane image, respectively. Finally, applying edge detection and Hough transform (HT) to extract the unstructured straight lane. Meanwhile, improved region growing and least square (LS) are used to extract unstructured curved lane. The experiment results on the roads of a campus show that the presented algorithms are effective, which can accurately extract the road area and road boundary.
Future lithography systems must produce more dense chips with smaller feature sizes, while maintaining throughput comparable to today's optical lithography systems. This places stringent data-handling requirements...
详细信息
ISBN:
(纸本)0819436151
Future lithography systems must produce more dense chips with smaller feature sizes, while maintaining throughput comparable to today's optical lithography systems. This places stringent data-handling requirements on the design of any maskless lithography system. Today's optical lithography systems transfer one layer of data from the mask to the entire wafer in about sixty seconds. To achieve a similar throughput for a direct-write maskless lithography system with a pixel size of 25 nm, data rates of about 10 Tb/s are required. In this paper, we propose an architecture for delivering such a data rate to a parallel array of writers. In arriving at this architecture, we conclude that pixel domain compression schemes are essential for delivering these high data rates. To achieve the desired compression ratios, we explore a number of binary lossless compression algorithms, and apply them to a variety of layers of typical circuits such as memory and control. The algorithms explored include the Joint Bi-Level imageprocessing Group (JBIG), Ziv-Lempel (LZ77) as implemented by ZIP, as well as our own extension of Ziv-Lempel to two-dimensions. For all the layouts we tested, at least one of the above schemes achieves a compression ratio of 20 or larger, demonstrating the feasibility of the proposed system architecture.
SAR processing usually requires very accurate navigation data, i.e. to form a focused image. The track must be measured within fractions of the centre wavelength. For high frequencies (e.g. X-band) this condition is t...
详细信息
ISBN:
(数字)9781510618060
ISBN:
(纸本)9781510618060
SAR processing usually requires very accurate navigation data, i.e. to form a focused image. The track must be measured within fractions of the centre wavelength. For high frequencies (e.g. X-band) this condition is too strict. Even with a cutting-edge motion measurement system, autofocus is a necessity. For low frequencies (e.g. VHF-band) a differential GPS (DGPS) is often an adequate solution (alone). However, for this case, it is actually conceivable to rely on autofocus capability over the motion measurement system. This paper describes how to form a SAR image without support from navigation data. That is within the scope of factorized geometrical autofocus (FGA). The FGA algorithm is a base-2 fast factorized back-projection realization with six free geometry parameters (per sub-aperture pair). These are tuned step-by-step until a sharp image is obtained. This procedure can compensate for an erroneous geometry (from a focus perspective). The FGA algorithm has been applied successfully on an ultra-wideband (UWB) data set, acquired at VHF-band by the CARABAS 3 system. The track is measured accurately by means of a DGPS. We however adopt and modify a basic geometry model. A linear equidistant flight path at fixed altitude is assumed and adjusted at several resolution levels. With this approach, we emulate a stand-alone processing chain without support from navigation data. The resulting FGA image is compared to a reference image and verified to be focused. This indicates that it is feasible to form a VHF-band SAR image without a motion measurement system.
Different from binary computation, stochastic computation (SC), as a new paradigm, uses stochastic bit stream (SBS) to encode data. By simplifying computing elements, the circuit area can be greatly reduced. SBS can b...
详细信息
ISBN:
(纸本)9781665450737
Different from binary computation, stochastic computation (SC), as a new paradigm, uses stochastic bit stream (SBS) to encode data. By simplifying computing elements, the circuit area can be greatly reduced. SBS can be generated by a stochastic number generator (SNG) with a variety of formats. In this work, we use unipolar (UP) and bipolar (BP) formats to optimize the traditional SC subtractor, which is named the UP-to-BP Subtractor (UBS). A new cross format coding (CFC) method is proposed for stochastic computing, which combines the UP and BP format, and is applied to Sobel edge detection in imageprocessingalgorithms. The fault tolerance and detection efficacy of the proposed CFC method and conventional binary computing are compared in this paper. By using the CFC method, the detected F-Score is improved by 0.15(23%). If the F-score remains unchanged, the processing speed can be about 10 times faster.
The paper presents the mathematical technique for calculation of the diffraction depth of focus of an optical system of a widefield microscope. The proposed technique applies the Rayleigh criterion based on evaluation...
详细信息
ISBN:
(纸本)9780819482822
The paper presents the mathematical technique for calculation of the diffraction depth of focus of an optical system of a widefield microscope. The proposed technique applies the Rayleigh criterion based on evaluation of the wave aberration appeared due to defocus in a high aperture optical system. The maximal value of a linear approximation of the defocus wave aberration is used to define the depth of focus. It is proven that in optical systems with numerical aperture higher than 0.5 have the diffraction depth of focus 25 - 40% smaller than the widely known formula defines. This fact is important for implementation of autofocus and digital focus extension algorithms. The non-sophisticated formula for calculation of the depth of focus is proposed. The results of experimental measurements of the depth of focus are presented and discussed.
End-to-end simulation of the influence of the optical train on the observed scene is important across optics and is particularly important for predicting the science yield of astronomical telescopes. As a consequence ...
详细信息
ISBN:
(纸本)9781510637757
End-to-end simulation of the influence of the optical train on the observed scene is important across optics and is particularly important for predicting the science yield of astronomical telescopes. As a consequence of their goal of suppressing starlight, coronagraphic instruments for high-contrast imaging have particularly complex field-dependent point-spread-functions (PSFs). The Roman Coronagraph Instrument (CGI), Hybrid Lyot Coronagraph (HLC) is one example. The purpose of the HLC is to image exoplanets and exozodiacal dust in order to understand dynamics of solar systems. This paper details how images of exoplanets and exozodiacal dust are simulated using some of the most recent PSFs generated for the CGI HLC imaging mode. First, PSFs are generated using physical optics propagation techniques. Then, the angular offset of pixels in image scenes, such as exozodiacal dust models, are used to create a library of interpolated PSFs using interpolation and rotation techniques, such that the interpolated PSFs correspond to angular offsets of the pixels. This means interpolation needs only be done once and an image can then be simulated by multiplying the vector array of the model astrophysical scene by the matrix array of the interpolated PSF data. This substantially reduces the time required to generate image simulations by reducing the process to matrix multiplication, allowing for faster scene analysis. We will detail the steps required to generate coronagraphic scenes, quantify the speed-up of our matrix approach versus other implementations, and provide example code for users who wish to simulate their own scenes using publicly available HLC PSFs.
This paper discusses key challenges of data processing in the field of artificial intelligence (AI), specifically in dealing with unstructured data and adapting to market changes. We propose a novel AI risk assessment...
详细信息
The continuing development of smaller electronic devices into the nanometer regime offers great possibilities of highly parallel computing systems, as it allows to reduce power consumption and device sizes and to incr...
详细信息
ISBN:
(纸本)9783642333507
The continuing development of smaller electronic devices into the nanometer regime offers great possibilities of highly parallel computing systems, as it allows to reduce power consumption and device sizes and to increase operating speed. Quantum-dot Cellular Automata (QCA) has been proposed as an alternative for nanoelectronic devices and introduces a new opportunity for the design of highly parallel algorithms and architectures. Its benefits are the fast speed, very small size, high density and low energy consumption. These advantages can be very useful for various real time imageprocessing applications. Complex imageprocessingalgorithms include in many cases the well-known binary median filter and mathematical morphology operations such as dilation and erosion. In this paper we propose and simulate two innovative QCA circuits which implement the dilation and the erosion.
Cross-modal matching, which aims to establish the correspondence between two different modalities, is fundamental to a variety of tasks such as cross-modal retrieval and vision-and-language understanding. Although a h...
ISBN:
(纸本)9781713845393
Cross-modal matching, which aims to establish the correspondence between two different modalities, is fundamental to a variety of tasks such as cross-modal retrieval and vision-and-language understanding. Although a huge number of cross-modal matching methods have been proposed and achieved remarkable progress in recent years, almost all of these methods implicitly assume that the multimodal training data are correctly aligned. In practice, however, such an assumption is extremely expensive even impossible to satisfy. Based on this observation, we reveal and study a latent and challenging direction in cross-modal matching, named noisy correspondence, which could be regarded as a new paradigm of noisy labels. Different from the traditional noisy labels which mainly refer to the errors in category labels, our noisy correspondence refers to the mismatch paired samples. To solve this new problem, we propose a novel method for learning with noisy correspondence, named Noisy Correspondence Rectifier (NCR). In brief, NCR divides the data into clean and noisy partitions based on the memorization effect of neural networks and then rectifies the correspondence via an adaptive prediction model in a co-teaching manner. To verify the effectiveness of our method, we conduct experiments by using the image-text matching as a showcase. Extensive experiments on Flickr30K, MS-COCO, and Conceptual Captions verify the effectiveness of our method. The code could be accessed from ***. me.
暂无评论