As a way to keep posts going, I am starting a short recap about interesting papers being published (or being discovered) every now and then. Probably I will write longer posts about some of them in the future.
Let’s get this thing going:
Two papers using ‘centroid estimation‘ to retrieve interesting information:
Mariko Akutsu, Yasuhiro Oikawa, and Yoshio Yamasaki, at The Journal of the Acoustical Society of America
Conversation is one of the most important channels for human beings. To help communications, speech recognition technologies have been developed. Above all, in a conversation, not only contents of utterances but also intonations and tones include important information regarding a speaker’s intention. To study the sphere of human speech, microphones are typically used to record voices. However, since microphones have to be set around a space, their existences affect a physical behavior of the sound field. To challenge this problem, we have suggested a recording method using a high-speed camera. By using a high-speed camera for recording sound vibrations, it can record two or more points within the range of the camera at the same time and can record from a distance, without interfering with the sound fields. In this study, we extract voice information using high-speed videos, which capture both a face and a cervical part of the subject. This method allows recording skin vibrations, which contain voices with individuality and extrapolating sound waves by using an image processing method. The result of the experiment shows that a high-speed camera is capable of recording voice information.
Dekel Raanan, Liqing Ren, Dan Oron, and Yaron Silberberg, at Optics Letters
Stimulated Raman scattering (SRS) has recently become useful for chemically selective bioimaging. It is usually measured via modulation transfer from the pump beam to the Stokes beam. Impulsive stimulated Raman spectroscopy, on the other hand, relies on the spectral shift of ultrashort pulses as they propagate in a Raman active sample. This method was considered impractical with low energy pulses since the observed shifts are very small compared to the excitation pulse bandwidth, spanning many terahertz. Here we present a new apparatus, using tools borrowed from the field of precision measurement, for the detection of low-frequency Raman lines via stimulated-Raman-scattering-induced spectral shifts. This method does not require any spectral filtration and is therefore an excellent candidate to resolve low-lying Raman lines (<200cm−1<200 cm−1), which are commonly masked by the strong Rayleigh scattering peak. Having the advantage of the high repetition rate of the ultrafast oscillator, we reduce the noise level by implementing a lock-in detection scheme with a wavelength shift sensitivity well below 100 fm. This is demonstrated by the measurement of low-frequency Raman lines of various liquid samples.
Machine learning keeps leaking into photonics. This time with a Compressive Sensing flavor and some holography:
The compressed sensing (CS) has been successfully applied to image compression in the past few years as most image signals are sparse in a certain domain. Several CS reconstruction models have been proposed and obtained superior performance. However, these methods suffer from blocking artifacts or ringing effects at low sampling ratios in most cases. To address this problem, we propose a deep convolutional Laplacian Pyramid Compressed Sensing Network (LapCSNet) for CS, which consists of a sampling sub-network and a reconstruction sub-network. In the sampling sub-network, we utilize a convolutional layer to mimic the sampling operator. In contrast to the fixed sampling matrices used in traditional CS methods, the filters used in our convolutional layer are jointly optimized with the reconstruction sub-network. In the reconstruction sub-network, two branches are designed to reconstruct multi-scale residual images and muti-scale target images progressively using a Laplacian pyramid architecture. The proposed LapCSNet not only integrates multi-scale information to achieve better performance but also reduces computational cost dramatically. Experimental results on benchmark datasets demonstrate that the proposed method is capable of reconstructing more details and sharper edges against the state-of-the-arts methods.
Yair Rivenson, Yibo Zhang, Harun Günaydın, Da Teng & Aydogan Ozcan, at Light: Science & Applications
Phase recovery from intensity-only measurements forms the heart of coherent imaging techniques and holography. In this study, we demonstrate that a neural network can learn to perform phase recovery and holographic image reconstruction after appropriate training. This deep learning-based approach provides an entirely new framework to conduct holographic imaging by rapidly eliminating twin-image and self-interference-related spatial artifacts. This neural network-based method is fast to compute and reconstructs phase and amplitude images of the objects using only one hologram, requiring fewer measurements in addition to being computationally faster. We validated this method by reconstructing the phase and amplitude images of various samples, including blood and Pap smears and tissue sections. These results highlight that challenging problems in imaging science can be overcome through machine learning, providing new avenues to design powerful computational imaging systems.
Last, single-pixel camera / ghost imaging being applied to x-ray tomography:
Andrew M. Kingston, Glenn R. Myers, Daniele Pelliccia, Imants D. Svalbe, David M. Paganin, at arXiv.org
Ghost imaging has recently been successfully achieved in the X-ray regime; due to the penetrating power of X-rays this immediately opens up the possibility of X-ray ghost tomography. No research into this topic currently exists in the literature. Here we present adaptations of conventional tomography techniques to this new ghost imaging scheme. Several numerical implementations for tomography through X-ray ghost imaging are considered. Specific attention is paid to schemes for denoising of the resulting tomographic reconstruction, issues related to dose fractionation, and considerations regarding the ensemble of illuminating masks used for ghost imaging. Each theme is explored through a series of numerical simulations, and several suggestions offered for practical realisations of X-ray ghost tomography.
Recently I have been reading a lot about Compressive Sensing strategies. One of the things we always want when we work in a single-pixel architecture is to project the lowest possible number of masks, because the projecting process is the longest in all the acquisition procedure (and it gets longer and longer when you increase the spatial resolution of your images).
In the past, several strategies haven been implemented to reduce that number of projections. From going fully random to partially scan a basis at random and at the low frequency region, each approach presents some benefits and more or less speed gain.
In this work by the group of K.F. Kelly, they explored a different approach. Instead of chosing one measurement basis and design a sensing strategy (picking random elements, or centering around the low frequency part of the basis, or a mix), they create a measurement basis by merging different functions. They call it hybrid patterns. The basic idea is to chose a low number of patterns which work well for recovering low frequency content of natural images, and also some other patterns which are good to recover high frequency content. The novel thing here is that they do not require the patterns to belong to the same orthogonal basis, thus being able to carefully design its measurement basis. This provides very good quality results with a low number of projections.
Another thing I liked a lot was the Principal Component Analysis (PCA) part of the paper. Basically, they gathered a collection of natural images and they generated an orthogonal basis by using PCA. This leads me to think of PCA as a way of obtaining orthogonal bases where objects have their sparsest representation (maybe I am wrong about that).
Realization of hybrid compressive imaging strategies,
(featured image exctracted from Fig.2 of the manuscript)
The tendency of natural scenes to cluster around low frequencies is not only useful in image compression, it also can prove advantageous in novel infrared and hyperspectral image acquisition. In this paper, we exploit this signal model with two approaches to enhance the quality of compressive imaging as implemented in a single-pixel compressive camera and compare these results against purely random acquisition. We combine projection patterns that can efficiently extract the model-based information with subsequent random projections to form the hybrid pattern sets. With the first approach, we generate low-frequency patterns via a direct transform. As an alternative, we also used principal component analysis of an image library to identify the low-frequency components. We present the first (to the best of our knowledge) experimental validation of this hybrid signal model on real data. For both methods, we acquire comparable quality of reconstructions while acquiring only half the number of measurements needed by traditional random sequences. The optimal combination of hybrid patterns and the effects of noise on image reconstruction are also discussed.
I just read on ArXiv.org that L. Bian and his colleagues made a cool comparison between several ways of performing single-pixel imaging. They have tested the performance on several recovery procedures, some quite familiar but others not so well stablished. I find both Table 1 and Fig. 7 extremely interesting. One sums up really well the different reconstruction approaches that can be used in single-pixel imaging (with or without using Compressive Sensing). The figure points out one thing that experience has told me: every problem you try to solve usually needs an specific solver if you want to get good and fast results (which is extremely important when you start to work with BIG objects, as I plan to write soon here).
Experimental comparison of single-pixel imaging algorithms,
(featured image extracted from Fig.7 of the manuscript)
Single-pixel imaging (SPI) is a novel technique capturing 2D images using a photodiode, instead of conventional 2D array sensors. SPI owns high signal-to-noise ratio, wide spectrum range, low cost, and robustness to light scattering. Various algorithms have been proposed for SPI reconstruction, including the linear correlation methods, the alternating projection method (AP), and the compressive sensing based methods. However, there has been no comprehensive review discussing respective advantages, which is important for SPI’s further applications and development. In this paper, we reviewed and compared these algorithms in a unified reconstruction framework. Besides, we proposed two other SPI algorithms including a conjugate gradient descent based method (CGD) and a Poisson maximum likelihood based method. Both simulations and experiments validate the following conclusions: to obtain comparable reconstruction accuracy, the compressive sensing based total variation regularization method (TV) requires the least measurements and consumes the least running time for small-scale reconstruction; the CGD and AP methods run fastest in large-scale cases; the TV and AP methods are the most robust to measurement noise. In a word, there are trade-offs between capture efficiency, computational complexity and robustness to noise among different SPI algorithms. We have released our source code for non-commercial use.