Volume 33,Issue 5,2018 Table of Contents

Blind Enhancement of Bone-Conducted Microphone Speech: Review and Prospects

Zhang Xiongwei , Zheng Changyan , Cao Tieyong , Yang Jibin , Xing Yibo

2018, 33(5):769-778. DOI: 10.16337/j.1004-9037.2018.05.001

Abstract (2683) HTML (2847) PDF 939.67 K (3460) Comment (0) Favorites

Abstract:As one kind of non-acoustic sensors,bone-conducted microphone has excellent anti-noise characteristics because its speech transmission channel naturally shields the influence of ambient noise.Recently,bone-conducted microphone has gradually played an important role in speech communication systems in various strong noise environments.However,due to the low-pass characteristic of human body and the limitations of sensor technology,bone-conducted speech sounds muffled and unclear. It is of great significance to improve the bone-conducted speech for more efficient speech communication in strong noise environments and wider application prospects of bone-conducted products.The blind enhancement algorithm means only the bone-conducted speech information can be acquired during the speech enhancement stage.This kind of direct enhancement algorithm is more applicable than the fusion technique with air-conducted speech.Here,the characteristics of bone conducted speech are firstly analyzed,then the existing methods including unsupervised bandwidth extension,equalization and spectral envelope transformation are introduced.Finally,we share some views of future research prospects.

Advance in Recorder Recognition for Audio Forensics

Bao Yongqiang , Liang Ruiyu , Wang Qingyun

2018, 33(5):779-792. DOI: 10.16337/j.1004-9037.2018.05.002

Abstract (2601) HTML (3460) PDF 880.18 K (3439) Comment (0) Favorites

Abstract:Extracting the characteristics of recording devices from audio signals is a frontier in judicial comparative research and audio forensics.As a research hotspot of audio forensics, recorder recognition technology is disturbed by environment, semantics, speaker and other factors. This paper introduces the development, basic theory and structure of recording equipment research. Especially, the research status of non-speech segment detection, feature parameters, recognition model and database construction is introduced and analyzed. Finally, the shortcomings and prospects of recorder identification are discussed. It is considered that the next stage should focus on how to speed up the database construction and the application of deep learning in recorder recognition.

An Advanced Speech Enhancement Algorithm Based on Deep Belief Network

Yu Hua , Tang Yufeng , Zhao Li

2018, 33(5):793-800. DOI: 10.16337/j.1004-9037.2018.05.003

Abstract (1519) HTML (1317) PDF 1.37 M (1571) Comment (0) Favorites

Abstract:A speech enhancement algorithm based on deep belief network is proposed and improved for its shortcomings.Since there are few types of noise in the training set and the noise characteristics are not rich enough, the noise spectrum is disturbed in the frequency domain to enrich the noise spectrum characteristics. Considering that the signals of different frequency points have different effects on the system error, the weight coefficient is combined with the absolute hearing threshold. Finally, the better LOG minumum mean square error (LOG-MMSE) in the traditional speech enhancement algorithm and the improved deep confidence network-based speech enhancement algorithm in the noise environment are compared and analyzed. The result shows that the speech enhancement algorithm of the deep belief network exhibits excellent performance, especially the enhanced voice quality compared with the LOG-MMSE.

Weakly-Supervised Multi-label-Classification-Based Attention Mechanism

Zhang Wen , Tan Xiaoyang

2018, 33(5):801-808. DOI: 10.16337/j.1004-9037.2018.05.004

Abstract (993) HTML (3377) PDF 1.18 M (1696) Comment (0) Favorites

Abstract:Deep learning has become new state-of-the-art framework in many task in big data circumstance.Most of methods need full annotated data or assume only an object in the image with simple background.However,complex background,more than one object in the image and expensive full annotation in the reality,object recognition becomes more challenging.Here,we propose a deep-model-based attention mechanism and recurrent neural network.It trains the network end-to-end on multi-label data with image-level label.The glimpses change along with stochastic gradient descent and focus on different local region in every step.Finally,the effectiveness of the proposed algorithm is verified on the PASCAL VOC 2007 and 2012 datasets.Results show that the network is easily interpretable than other methods.

Noise Estimation Based on Isotropic Homogeneous Region Detection in Hypersepctral Images

Sun Xin , Fu Peng , Sun Quansen

2018, 33(5):809-817. DOI: 10.16337/j.1004-9037.2018.05.005

Abstract (759) HTML (1432) PDF 2.71 M (2032) Comment (0) Favorites

Abstract:It often plays a key role to extract homogeneous regions in the existing noise estimating methods for hyperspectral images (HSI). An effective homogeneous region detection method can improve the accuracy of image noise estimation. An isotropic homogeneous region detection algorithm (IHRDA) is proposed by using spatial information and spectral information, where a new Lance-SAD metric (LSM) is constructed to distinguish the similarity of picture elements in the homogeneous regions; then the noise level of hyperspectral images is estimated using the optimal regions with decorrelation based on multivariable linear regression (MLR) model. In experiments, synthetic images with different structure under different signal to noise ratio (SNR) and true hyperspectral remote sensing images are both compared with many existing methods, which show that the proposed method is more accurate and stable for hyperspectral images with various complexities and different noise levels.

Nonlinear Blind Equalization Based on Adaptive Correlation Elimination Algorithm

Peng Xuan , Qiu Xin , Mu Fuqi , Leng Yongqing

2018, 33(5):818-825. DOI: 10.16337/j.1004-9037.2018.05.006

Abstract (490) HTML (1511) PDF 853.26 K (1378) Comment (0) Favorites

Abstract:The performance of modern wide band software radio receiver (SDR) is deteriorated by nonlinear receiver with memory effect. Thus this paper puts forward a blind nonlinear equalization method called adaptive correlation elimination algorithm to suppress the nonlinear effect of Wiener receiver. Owing to independence of signals, taking the orthogonal frequency division multiplexing (OFDM) signal as an example, the algorithm detects correlation between signal and its remaining harmonic so that it can adjust the weights of nonlinear compensation model adaptively. The simulation result suggests that the algorithm can suppress the nonlinear effect of receiver under the large interference, reduce the error rate of weak signal, and enhance sensitivity of receiver, with low complexity.

Computation Efficient Joint Angle and Doppler Frequency Estimation of Coherent Targets in Monostatic MIMO Radar

Cao Renzheng , Zhang Xiaofei

2018, 33(5):826-836. DOI: 10.16337/j.1004-9037.2018.05.007

Abstract (808) HTML (1441) PDF 576.38 K (1375) Comment (0) Favorites

Abstract:The problem of joint direction of arrival (DOA) and Doppler frequency estimation of coherent targets in a monostatic multiple-input multiple-output (MIMO) radar is addressed. Based on the propagator method (PM), an RD-FSS-PM algorithm is proposed, which can effectively estimate the DOA and Doppler frequency of coherent targets with low computational load. In the RD-FSS-PM algorithm, we firstly perform a reduced-dimension (RD) transformation on received signals to decrease the computational load, then use forward spatial smoothing (FSS) to decorrelate the coherent signals and apply the PM to estimate the DOA and Doppler frequency simultaneously, which are automatically paired. Compared with the conventional FSS-PM method, the RD-FSS-PM algorithm has much better DOA estimation performance, very close frequency estimation accuracy and much less complexity. The variance of the estimation error and the Cramer-Rao bound (CRB) of the DOA and frequency estimation are derived. Simulation results are presented to show the effectiveness and improvement of the new approach.

Transmitted Waveform Adaptive Generation Method for Airborne Radar Utilizing Prior Knowledge of Target and Clutters

Chen Xingbo , Zhou Jiabing , Liu Dingyun , Qiu Chaoyang , Wang Gang , Rao Nini

2018, 33(5):837-846. DOI: 10.16337/j.1004-9037.2018.05.008

Abstract (696) HTML (2110) PDF 2.33 M (1785) Comment (0) Favorites

Abstract:The adaptation of transmitted waveform can enhance radar performances in detection, track, anti-jamming, etc. But the performance of this technique is linked with the prior knowledge of the radar target and clutters. In theory, the adaptive method of transmitted waveform based on maximum output signal-to-clutter-noise ratio (SCNR) criterion has been proved to be feasible and effective. In this paper, for enhancing practical application value of this adaptive method, the performance of the transmitted waveform adaptation method was verified by utilizing the prior knowledge of targets and clutters provided by Swerling statistic model and the digital elevation model (DEM) in airborne phased array radar. The simulation results show that the transmitted waveform can adaptively vary with the change of target and clutter for achieving the optimal matching of the two. Therefore, this method is better than the traditional methods in increasing output SCNRs of radar. It has good practical application prospects.

Face Attributes Recognition by Multi-level Sub-network and Ranked Dropout Mechanism

Gao Shulei , Zhou Mian , Xue Yanbing , Xu Guangping , Gao Zan , Zhang Hua

2018, 33(5):847-854. DOI: 10.16337/j.1004-9037.2018.05.009

Abstract (606) HTML (1109) PDF 1.77 M (1426) Comment (0) Favorites

Abstract:How to improve the accuracy of face attributes recognition in natural environment or unrestricted environment is an important question in applying face attributes. In daily life, the uncontrollable factors, such as face postures and light, have a great influence on the recognition of human face attributes. How to improve the accuracy under the influence of the above factors is a key problem in the study of face attribute recognition. Given the success of convolutional neural network (CNN) in image classification, a new network structure is built by using multi-level sub-network and ranked Dropout mechanism algorithm. The structure has strong robustness to deal with face changes, thus achieving better results in the CelebA dataset and LFWA dataset, and reducing the network size significantly as well.

A Sparsity-Learning-Based Person Re-identification Algorithm

Zhang Wenwen , Wang Hongyuan , Wan Jianwu , Sun Jinyu , Ding Zongyuan

2018, 33(5):855-864. DOI: 10.16337/j.1004-9037.2018.05.010

Abstract (1055) HTML (1393) PDF 2.07 M (1739) Comment (0) Favorites

Abstract:Person re-identification is one of the important issues addressed in computer vision. Existing recognition system concerns the matching of pedestrians across over-lapping cameras. When assuming pedestrian images as one representation of the camera view, person re-identification can be considered as a multi-view learning problem directly. On the basis of this assumption, a pedestrian recognition algorithm is proposed via canonical correlation analysis. Since the canonical correlation analysis is a linear dimensionality reduction algorithm, it is hard to extract semantic information for person re-identification (such as low resolution of images, changing illumination and other factors). A sparsity learning based person re-identification algorithm (SLR) is proposed. First, SLR obtained the semantic information of each camera view by the sparse learning, and then mapped the high-level features t into a public hidden space in order to make characteristic distance between different views can be compared. SLR aims to obtain more discriminable public hidden space. Finally, improve the matching rate of person re-identification across disjoint camera views. Comparing the proposed method and other common methods on the VIPeR dataset and CUHK campus dataset, experimental results show that the proposed method has higher recognition efficiency.

Fast Sparse Decomposition of Linear FM Signal Using FFT

Ou Guojian , Zhang Shufang , Deng Jianxun , Jiang Qingping

2018, 33(5):865-871. DOI: 10.16337/j.1004-9037.2018.05.011

Abstract (870) HTML (1103) PDF 1.57 M (1492) Comment (0) Favorites

Abstract:This paper proposes a fast algorithm for sparse decomposition of linear FM (LFM) signal to solve the deficiency of traditional methods that there are a large number of atoms. The atoms in the over-complete dictionary are structured based on linear FM signal, and fast sparse decomposition of LFM is completed by using combined dictionary. By analysis, the number of atoms in two over-complete dictionaries is much smaller than that in one over-complete dictionaries, and the most matched atom in another dictionary can be found with the use of searching the maximum based on fast Fourier transform. Simulation results show that the computational efficiency of the proposed method is better than that of three other algorithms using one over-completed dictionary, and the sparsity is better.

SAR Image Target Recognition Based on Multi-linear Principal Component Analysis and Tensor Analysis

Huan Ruohong , Tao Yifan , Chen Yue , Yang Peng , Bao Shenglin

2018, 33(5):872-879. DOI: 10.16337/j.1004-9037.2018.05.012

Abstract (671) HTML (1225) PDF 788.28 K (1326) Comment (0) Favorites

Abstract:For enhancing the target recognition effect of synthetic aperture radar image, a method of synthetic aperture radar image target recognition based on multi-linear principal component analysis and tensor analysis is proposed in this paper. Firstly, a four-order tensor training sample is constructed. Then, multi-linear principal component analysis is used to get the multi-linear projection matrix, and the core tensor is obtained from the multi-linear projection matrix. Finally, linear discriminant analysis is used to train the core tensor and classify the test samples. In the experiments, the proposed multilinear principal component analysis and tensor analysis method in this paper is applied to MSTAR public database for recognition experiments, and compared with principal component analysis and two-dimensional principal component analysis in recognition rate. Experimental results show that the method effectively preserves the image structure information and improves the target recognition rate.

A Method for Removing Baseline Drift in ECG Signal Based on Improved EEMD

Lin Jinzhao , Liu Lele , Li Guoquan , Bai Tong , Wang Huiqian , Pang Yu

2018, 33(5):880-890. DOI: 10.16337/j.1004-9037.2018.05.013

Abstract (844) HTML (1759) PDF 6.95 M (1988) Comment (0) Favorites

Abstract:A method to eliminate baseline drift of ECG signal based on improved ensemble empirical mode decomposition is proposed for the disadvantage of poor filtering in traditional method. The method can weaken the mode mixing of empirical mode decomposition, and make up for the shortcomings of EEMD. It establishes the criterion for adding auxiliary white noise in EEMD method, and then determines the two important parameters,i.e., the magnitude of auxiliary white noise and the ensemble times. The method extracts the baseline drift signal from the noisy signal and then reconstructs intrinsic mode function to obtain the "clean" ECG signal, which provides a prerequisite for subsequent research. The experimental results show that the de-noising method, compared with the traditional method, can improve the SNR, reduce root mean square error, keep the characteristic of the waveform, and solve the problem of low frequency component loss.

Three-Way Decision Evaluation System Based on TOPSIS

Sun Zuoyu , Yu Hong , Wang Guoyin

2018, 33(5):891-899. DOI: 10.16337/j.1004-9037.2018.05.014

Abstract (855) HTML (1554) PDF 425.93 K (1266) Comment (0) Favorites

Abstract:Technique for order performance by similarity to ideal solution (TOPSIS) is one of the multi-attribute decision-making methods and is used to solve supplier selections and other questions generally. In the process of evaluation focusing on production, there are many evaluation objects and hierarchical structures among objects. Decision makers pay more attention to the classification of objects, namely:good, medium or poor, though the evaluation systems for above problem are very insufficient. Therefore, this paper proposes a new three-way decision evaluation system based on TOPSIS. First, the analytic hierarchy process(AHP) method is used to calculate the weight of attributes. Then, to solve the hierarchy relations among objects, a preliminary classification is added before the process of evaluation. Finally, the improved TOPSIS method is used to evaluate objects. Case analysis result shows that the proposed method is effective.

Wang Xin , Chen Xi , Qian Fulan , Zhang Yanping

2018, 33(5):900-910. DOI: 10.16337/j.1004-9037.2018.05.015

Abstract (818) HTML (2563) PDF 1.62 M (2193) Comment (0) Favorites

Abstract:Link prediction is an important research direction of complex networks,and the method based on the node similarity is one of the most popular methods. So far, most of the node similarity prediction methods using link density have not considered the difference of each common neighbor node, that is, the contribution of different nodes to the link is different. Therefore, this paper proposes a link prediction algorithm based on the node contribution and link density of the common neighbor nodes(LDNC). The algorithm first calculates the link information between the common neighbor nodes as the link density of the nodes, and then defines node-coupling clustering coefficient to describe the contribution of the common neighbor nodes, and finally combines the two parameters.Experiments based on the real-world datasets show that the LDNC is more accurate compared with four baseline link prediction algorithms (CN,AA,RA and Jaccard) and the CNBIDE algorithm based on the node link density.

Fisher Discriminative Constraint Dictionary Learning Algorithm Based on Profiles

Li Zhengming , Yang Nanyue , Cen Jian

2018, 33(5):911-920. DOI: 10.16337/j.1004-9037.2018.05.016

Abstract (597) HTML (1816) PDF 751.86 K (1520) Comment (0) Favorites

Abstract:To improve the discriminative ability of the coding coefficients, the Profiles (the line vectors of coding coefficients matrix) of Fisher discriminative dictionary learning (PFDDL) is proposed. Firstly, the Profiles can indicate the corresponding atoms which are used by the training samples to encode in the dictionary learning, and an adaptive method is proposed to construct the labels of atoms. Since there are one-to-one correspondences between the Profiles and atoms, then the Fisher discriminative criterion is imposed on the Profiles so that they have small within-class compactness but large between-class separability. Thus, it can encourage the atoms of the same class to reconstruct the training sample of the same class, and enhance the discriminative ability of the coding coefficients, then improve the performance of dictionary learning. Experimental results show that the PFDDL algorithm can achieve better classification performance than other sparse coding and dictionary learning algorithms on the three face and one handwriting databases.

Sentiment Analysis of Twitter Data Based on CNN

Wang Yuhan , Zhang Chunyun , Zhao Baolin , Xi Xiaoming , Geng Leilei , Cui Chaoran

2018, 33(5):921-927. DOI: 10.16337/j.1004-9037.2018.05.017

Abstract (945) HTML (3022) PDF 784.53 K (1848) Comment (0) Favorites

Abstract:With the increasing popularity of social networks, sentiment analysis based on Twitter text has become a hotspot in recent years. The sentiment tendencies contained in tweets are important for mining user needs and predicting major events. However, the existing sentiment classification methods are mostly based on hand-made text features, and it is hard to mine implicit deep semantics of texts. In addition, because of special characteristics, such as short text and arbitrariness of users' behavior, it is more difficult to improve performance of current sentiment classification. This paper presents a novel Twitter sentiment classification model based on convolutional neural network (CNN). In order to explore sentiment tendency of tweets, the proposed model utilizes a dynamic CNN architecture to learn deep semantics from tweets, which initializes input word embedding with word2vec method. Experimental results show that our proposed model can achieve a recall rate of 82.3%, which is much higher than performances of traditional classification methods.

Image Based Outlier Detection Algorithm for Point Cloud Reconstruction

Yang Yuwei , Li Xinggang , Zhang Yaping

2018, 33(5):928-935. DOI: 10.16337/j.1004-9037.2018.05.018

Abstract (698) HTML (2189) PDF 2.77 M (1573) Comment (0) Favorites

Abstract:A reconstructed 3D point cloud model based on image usually includes many outliers. These outliers may exist in isolation or dense aggregation forming point clusters, and they may also be distributed around the model or even on the mode surface.It is difficult to filter out outliers of various distribution states through one detection method. Therefore, a comprehensive outlier detection algorithm is proposed. Firstly, the outliers far from the main body of the model are eliminated by space distance, and the search speed of outliers is accelerated by constructing the spatial topological relationship.Then, the boundary matching method is used to filter out the outliers around the model by comparing the smaller clusters with the largest ones respectively. Finally, an improved K-means algorithm is used to cluster and classify the point cloud data according to RGB color value features, and the outliers adhering to the model surface are detected and filtered by combining the identified outliers.Simulation results show that the proposed algorithm can effectively filter out the outliers with multiple distribution states in the reconstructed point cloud.

Evaluation of Class Overlap Measures on Imbalanced Data Classification

Xing Yan , Chen Jiafeng , Jia Xiaoyan , Wang Xin

2018, 33(5):936-944. DOI: 10.16337/j.1004-9037.2018.05.019

Abstract (518) HTML (690) PDF 478.55 K (1342) Comment (0) Favorites

Abstract:Class overlap is defined as the overlay degree of data from different classes, quantified by the approaches of geometrical statistics and information theory, and it is used to measure the complexity of a classification. There are imbalanced data in the real world, and the great disparity of the sample amounts challenges classification. With the help of experiments, we evaluate the efficiency of the class overlap measures on imbalanced data classification. Firstly, focusing on two-class classification, the experiments are designed to evaluate the efficiency of the class overlap measures on synthetic unbalanced data, which are generated with various skewness, class boundary shapes, feature types and probability distributions. Secondly, according to the experimental results on the artificial data, the influence rules of the imbalanced ratio on the measures are analyzed, then the ways of the measures to guide unbalanced data classification are concluded. Finally, the conclusions are evaluated on the real-world imbalanced data sets. The experimental results demonstrate that those measures with higher robustness on data skeness can efficiently guide classifiers selection for imbalanced data classification.

For Authors

Quick search

Volume retrieval

External Links