I.INTRODUCTION
Because of the wide existence of nonstationary signals, the time–frequency (TF) analysis methods on analyzing these signals, such as the short-time Fourier transform (STFT) [1], the Wigner-Ville distribution (WVD) [2], and the continuous wavelet transform (CWT) [3], have been developed for a long time. Much work has been done by a number of researchers, aiming to obtain the TF representation (TFR) with high TF resolution. In general, these methods can be divided into three categories, which are the basis-based TF analysis methods, the decomposition-based TF analysis method, and the post-processing methods.
For the basis-based TF analysis methods, it is necessary to construct the basis function in advance. Here, several classic basis-based methods are introduced. The STFT is a type of linear transform that employs a window function to truncate the raw nonstationary signal to obtain a series of truncated signals. By regarding these truncated signals as stationary signals and operating with Fourier transform on them, the TFR is obtained. For any piece of truncated signal, because we operate with Fourier transform on the product of the truncated signal and the window function, the chosen window function has a significant effect on the final TFR. Furthermore, the STFT is limited by the uncertainty principle, that is, we cannot obtain favorable time resolution and frequency resolution concurrently. One is acquired at the cost of the other necessarily. The WVD is a type of bilinear transform that does not need a window function to truncate the raw signal. Therefore, the WVD can inherently obtain a TFR with high resolution. However, because of its bilinear property, the TFR obtained using WVD is interfered by the illusive frequency components that generate from the crossing terms while analyzing multi-component signals, which limits the application of WVD to some extent. CWT is a type of TFR method whose wavelet basis is generated by mother wavelet and father wavelet, not the traditional Fourier basis. Different from STFT which uses the same window width in whole frequency domain, CWT can adaptively change the window width, that is, CWT uses the wide window width in low frequency range and narrow window width in high frequency range. By doing this, CWT can obtain a TFR with high TF resolution. However, its parameters are difficult to set. The chirplet transform (CT) is a type of method that aims to analyze the chirp-like signals, which obtain a TFR with high TF resolution by rotating the chirplets to match the instantaneous frequency (IF) of target signal to make it more stationary [4]. However, it is only suitable for the mono-component signals with linear IF. To improve the limitations of the above methods, researchers have done a lot of work. In order to improve the STFT, STFT with adaptive window width based on the chirp rate (ASTFT) was proposed by adopting adaptive widow width [5,6]. By making the standard deviation of the Gaussian window function be a function of time and frequency instead of an unchanged parameter in traditional STFT, the window width in the ASTFT can adaptively vary with time and frequency. Hence, ASTFT can obtain different TF resolution at different time and frequency points. Although ASTFT can improve the TF resolution to some extent, its parameters are difficult to set. Furthermore, because it does not improve the parameter for window size, the inherent limitation resulted from the uncertainty principle remains. In order to improve the WVD, the pseudo-WVD (PWVD) was proposed to restrain the crossing terms by adding window function in time and frequency domain [7]. Although the PWVD improves the bad effect of crossing terms to some extent, this limitation is still not solved because of the bilinear property of WVD. Furthermore, the inherently high TF resolution is lost due to the added window function. In order to improve the CT, many extending methods are proposed. To expand the application scope, the polynomial CT (PCT) was proposed to analyze the signals with nonlinear IF using polynomial kernel to replace the linear kernel in tradition CT [8]. However, if the IF of target signal cannot be fitted by polynomial, the PCT is not effective. Therefore, the spline-kernel CT (SCT) was proposed using splined kernel, which can be suitable for any complex IF [9]. However, PCT and SCT can only adapt to mono-component signals. To make CT suitable for multi-component signals, general linear CT (GLCT) was proposed [10]. GLCT obtains a TFR with high TF resolution by generating a series of sub-TFRs and fusing them into one. Because fusing rule is to choose the maximum in all sub-TFRs at each time and frequency center in whole TF plane, it inevitably produces the fake frequency components. Furthermore, the smaller the frequency interval between adjacent frequency component, the more serious the illusive frequency component. To improve the smear effect, the velocity synchronous linear CT (VSLCT) was proposed by constructing the basis according to the rotating speed to make the chirplets match the IFs better [11]. In this way, VSLCT can obtain a TFR with high TF resolution. It is a good idea to vitiate the bad effect of the uncertainty principle by utilizing an adaptive window size. However, it performs not very well on analyzing the signals with the crossing IFs.
For the decomposition-based methods, the core idea of these methods is to decompose the multi-component signal into a series of mono-component signals without constructing basis function in advance. Here, several decomposition methods are introduced. Hilbert–Huang transform as well as the improved methods, such as ensemble empirical mode decomposition [12] and ensemble local mean decomposition [13], is a type of classic decomposition method [14], which needs no prior knowledge. However, it is only suitable for the signals whose IF varies with time slowly. Mode mixing is serious while analyzing the signals whose IF varies with time violently. Adaptive iterative generalized demodulation method is a type of decomposition-based method using surrogate test to decompose the multi-component signal [15]. It needs prior knowledge in advance, for example, the changing trend of IF with time. Furthermore, when the signals contain too many frequency components, it is not efficient. The variational nonlinear chirp mode decomposition is another type of decomposition-based method that considers the target signals as a whole and does not separate the sub-components successively [16]. Although its corresponding decomposition is efficient, it not only needs the number of frequency components in advance but also the changing trend with time for all frequency components to assign initial iterative values.
For the post-processing methods, such as reassignment method [17], synchro-extracting method [18], and synchro-squeezed wavelet transform [19], their performance mainly depends on the original TFR obtained by traditional methods, such as STFT and CWT. It is worth to mention that post-processing methods can obtain a TFR with high TF resolution, when traditional methods can obtain relatively nice results. However, if the TFR obtained by traditional methods is not in a favorable level, the results obtained using post-processing methods are not satisfying as well. Furthermore, for the signal with crossing IFs, there exists signal distortion at the intersection of IFs in the TFR obtained using post-processing methods.
In this paper, we propose a novel TF analysis method called OSTFT by replacing the even symmetric window function with odd symmetric function. By acquiring minimum amplitude of 0 at time and frequency center, OSTFT can obtain a TFR with high TF resolution. Different from the TFR obtained using STFT whose TF resolution deteriorates because of the energy leakage, the TFR obtained using OSTFT acquires high TF resolution by utilizing energy leakage. In addition, it is worth to mention that the proposed OSTFT can vitiate the effect of window size that we choose on the TFR obtained. Furthermore, it has a nice performance on the special signals, for example, the signals with crossing IFs. The effectiveness of OSTFT has been validated on two numerical signals with complex IFs and an experimental signal collected from a brown bat [20].
The rest of paper is structured as follows: A succinct description of the main limitation of windowed transforms and a detailed discussion of the proposed OSTFT are provided in Section II, the effectiveness of the proposed method is examined using numerical signals and an experiment signal in Section III and IV-A, respectively, a discussion part is displayed in Section IV-B, and the conclusions that can be drawn from this study are presented in Section V.
II.OSTFT
A.LIMITATION OF WINDOWED TRANSFORM
In this subsection, we begin our study with uncertainty principle limiting the performance of windowed transform, which is given as follows:
where x(t) is a certain signal, X(ω) is the Fourier transform (FT) of x(t), and Ex is the energy of signal x(t). where tc is time center and ωc is the frequency center. where σt is the time bandwidth and σω is the frequency bandwidth.By calculations, we can obtain
According to (4), we know that the time resolution and frequency resolution cannot obtain the best levels concurrently, that is, one achieves a high level at the cost of the other necessarily. Among most of applications, we always hope that high frequency resolution is obtained to recognize the significant frequency component. However, because of the nonstationarity of nonstationary signal, we analyze a series of windowed signals in a short time using a window function to truncate the raw signal. Therefore, high frequency resolution is difficult to obtain under this condition. Furthermore, it is worthy of mentioning that the bad resolution is shown as the form of energy leakage. Here, we use a numerical signal with a sampling frequency of 200 Hz to present it further, which is constructed as the following expressions: where v(t) is the IF, which is given as follows:In this case, we use STFT with window size of 128 to analyze this signal, whose corresponding analyzing results are shown in Fig. 1. Figure 1(a) shows the resulting TFR. Here, we take the time center at 1 s as an example. As per (6), the true IF at 1 s is 72 Hz. By observing Fig. 1(a), it can be seen that the leaked energy distributes in a range from to , where is a certain frequency interval. The amplitude spectrum at 1 s shown in Fig. 1(b) presents the same results clearly.
According to the above analysis, we can find that the leaked energy leads to the TFR with bad TF resolution. And in this paper, we mainly aim to resolve this problem to obtain a TFR with high TF resolution.
B.REVIEW OF STFT
In this subsection, we take the STFT as an example. By in-depth analysis of STFT, we can know the reason why STFT cannot obtain a TFR with high TF resolution. The STFT of a signal is represented as the following expression:
where is the time center, is the frequency center, j denotes ; and g denotes window function which is usually taken as Gaussian function, which is represented as the following expression: where σ is the standard deviation.Because STFT is a windowed transform, it can be explained in a single window. Therefore, we can obtain
where tc is the time center of a certain window and is the half window length.To clearly present our core idea, we first regard s(u), in (9), as a mono-component signal with IF of v(u). In a certain window (a short time), based on Taylor’s theorem, v(u) can be represented as
where v(tc) is the IF at time center tc; v′(tc) is the first-order derivative of v(u) at time center tc, and the remainder is ignored.According to (10), the phase function of s(u) in a certain window is obtained, which is given as the following expression:
Hence, s(u) can be represented as the following equivalent form:
where A(u) is the instantaneous amplitude and φ(u) is the instantaneous phase.According to Euler’s formula, (12) can be written as the following expression:
Substituting (11) into (13) producesFor the sake of the convenience in next deduction, we take some mathematical skills as follows:
Here, the reason why STFT cannot achieve a pretty favorable TFR for nonstationary signals is uncovered. According to the convolution transform [21], (9) can be written in the following form:
where * denotes convolution calculation, FT denotes Fourier transform, andAccording to (15), (16), and (18), based on the linear property of FT, we can obtain
where s1 is (16) and s2 is (18).Substituting (23) into (20) produces
Here, we first only consider the FTs1(ω) in (24). According to time and frequency shifting properties of FT, we can obtain where s1o is (17) and g(u) is (8).According to (24), (25), and (26), we can obtain
where s1 is (16) and s1o is (17).According to (27), we can obtain
Carrying out the same operations as FTs1(ω), we can get the similar result of FTs2(ω) in (24).
Because , , and are even symmetric functions with symmetric axes of ω=0, ω=v(tc), and ω=-v(tc), respectively, and are even symmetric functions with symmetric axes of ω=v(tc) and ω=-v(tc), respectively. And is composed by both two parts as shown in Fig. 2 clearly. The result shown in Fig. 2 is in accordance with the result shown in Fig. 1.
The above analysis mainly aims to explain mono-component signals. As for the multi-component signal, we can regard it as the summation of its every frequency component, which is represented as
where n is the number of intrinsic mode functions, and smono,I is the ith intrinsic mode function.Based on the linear property of FT, the FT of smulti is the summation of FT of smono,i. Therefore, the above explanation results of the mono-component signal are suitable for the multi-component signal as well.
By observing (28) and (29), we can find that the and mainly depend on , the FT of window function. And among nearly all windowed transforms, the even symmetric window function is employed, which inevitably leads to energy leakage resulting in TFR with bad TF resolution. Aiming to resolve this problem, we propose a novel algorithm called the OSTFT, whose detailed explanations are shown in Section II-C.
C.PROPOSED OSTFT APPROACH
To resolve the above problem, completely different from conventional TF analysis methods employing the even symmetric window function, we employ the odd symmetric window function. It is worthy of mentioning that the proposed method aims not to restrain the leaked energy but to utilize it. By doing this, OSTFT can obtain a TFR with high TF resolution. This idea is suitable for all TF analysis methods based on FT. Here, we still take STFT as an example.
There exist many suitable odd symmetric window functions. However, in this study, we take a constructed window function called Gaussian-like window function (GL-window function), which is presented as the following expression:
where a and b are positive parameters, respectively. In this study, a and b are taken as 200 and 10, respectively.According to (31), we can obtain the FT of l(t),
By observing (31) and (32), it can be seen that FTl(ω) has the same expression as l(t).
Here, replacing g(u-tc) with l(u-tc) in (9) produces
where l(u-tc) is an odd symmetric function with symmetric center (tc, 0).In addition, FT of l(u-tc) can also be calculated, which is given as follows:
where l(u) is (31) with symmetric center (0, 0).By the similar deductions shown in Section II-B, we can obtain the results with similar expression like (27) for OSTFT,
where s1 is (16), s1o is (17), s2 is (18), s2o is (19), and l(u) is (31).Based on (35) and (36), we can obtain
Because and are even symmetric functions with symmetric axes ω=v(tc) and ω=−v(tc), respectively, and is an odd function, and are odd symmetric functions with symmetric centers (v(tc), 0) and (−v(tc), 0), respectively. Hence, and are still even symmetric functions with symmetric axes ω=v(tc) and ω=−v(tc), respectively. But, completely different from conventional STFT whose amplitude functions, shown in (28) and (29), acquire maximums at ω=v(tc) and ω=−v(tc), respectively, the amplitude functions using OSTFT, shown in (37) and (38), acquire minimums of 0 at ω=v(tc) and ω=−v(tc), respectively, as shown clearly in Fig. 3. In this way, the TFR obtained using OSTFT can obtain the high TF resolution.
Here, we still use the signal of x1(t) to further explain the idea of OSTFT. The parameter for window size is set to 128. And the corresponding analyzing results are shown in Fig. 4. Figure 4(a) shows the TFR using OSTFT. According to Fig. 4(a), it can be seen that OSTFT can obtain a TFR with quite high TF resolution by utilizing the leaked energy. Furthermore, the estimated IF is nearly overlapped with the true IF. Figure 4(b) shows the amplitude spectrum at 1 s. From Fig. 4(b), the result is perfectly in accordance with the result shown in Fig. 3.
In fact, if the peak of each IF curve of the TFR using STFT can be well extracted, they are extremely close to the curves obtained by OSTFT. However, these curves should be extracted manually. The IF curves of the TFR of OSTFT can be easily contained because they are zeros in the TFR.
III.SIMULATION EVALUTION
In this section, two simulated numerical signals with complex IFs are employed to verify the performance of OSTFT.
A.CASE 1
In this subsection, the first multi-component signal with complex IFs is used to evaluate the performance of OSTFT, which is constructed as the following expression:
whereThe corresponding analyzing results are displayed in Fig. 5. Figure 5(a) and (b) shows the waveform with sampling rate of 1·8 kHz and the corresponding IFs, respectively. TFRs obtained using STFT and OSTFT with window size of 128 are shown in Fig. 5(c) and (e). According to Fig. 5(c), because of relatively serious energy leakage, the TFR obtained using STFT cannot obtain the high TF resolution. However, the TFR obtained using OSTFT shown in Fig. 5(e) shows sharp IF trajectories, which proves that OSTFT can obtain the TFR with high TF resolution. By observing estimated IFs (red curves) and true IFs (black curves) in Fig. 5(d), it can be seen that estimated IFs can track true IFs very well. Hence, the value of the Error is low. It proves that OSTFT can acquire satisfactory accuracy, even for this kind of complex signal.
B.CASE 2
In this subsection, the other numerical signal with crossing IFs is constructed to evaluate the ability of OSTFT in analyzing special signals, which is constructed as the following expression:
whereThe corresponding analyzing results are displayed in Fig. 6. Figure 6(a) and (b) shows the waveform with sampling rate of 100 Hz and the corresponding IFs, respectively. TFRs obtained using STFT and OSTFT with window size of 128 are displayed in Fig. 6(c) and (d), respectively. According to Fig. 6(c), TFR obtained using STFT is very blurred, which leads to bad TF resolution. Furthermore, because of the relatively energy leakage, it is difficult to determine whether the IFs are truly intersected or not. However, from the TFR obtained using OSTFT, shown in Fig. 6(d), the IF trajectories in the TFR are clear, sharp, and accurate by comparing with true IFs in Fig. 6(b). Furthermore, according to Fig. 6(d), it can be seen that the TFR using OSTFT can not only acquire the high TF resolution but also show the intersection of the IFs shown in the red square can be captured accurately.
Through the above analysis, we can know that, even for this type of special signal, OSTFT can have a favorable performance.
Based on the above analysis of two cases, OSTFT can acquire TFR with pretty satisfying TF resolution for multi-component signals with not only complex IFs but also crossing IFs, which cannot be well resolved by traditional STFT.
IV.EXPERIMENT
In this section, an echolocation signal emitted by a large brown bat collected from real life is employed to test the effectiveness of OSTFT in Section A. Furthermore, two significant points about OSTFT are discussed in Section B.
A.EXPERIMENTAL CASE 1
In this subsection, the signal collected from a brown bat is analyzed using OSTFT, which is a multi-component signal with four time-varying IFs. STFT is chosen as a comparison method. In this case, the parameter for window size is set to 128 for both methods. The corresponding analyzing results are displayed in Fig. 7. Figure 7(a) shows the waveform with sampling period of 7 μs. TFRs obtained using STFT and OSTFT are shown in Fig. 7(b) and (c), respectively. According to Fig. 7(b), we can find that IF trajectories in the TFR obtained using STFT are blurred, resulting in bad TF resolution. However, IF trajectories, deep blue curves indicated by red arrows shown in Fig. 7(c), are very clear and sharp, which proves that OSTFT can acquire a TFR with high TF resolution by utilizing the leaked energy. By the above analysis, it can be seen that OSTFT has a good performance on obtaining a TFR with a high TF resolution.
B.EXPERIMENTAL CASE 2
As rotation machineries become increasingly complex, TF analysis methods are an effective tool to diagnose faults in the machinery. A vibration signal collected from a water turbine is analyzed here. The sampling frequency of this signal is 16 Hz, and the number of sampling points is 1024. Figure 8(a) and (b) shows the corresponding waveform and rotation frequency, respectively. Then, the STFT and OSTFT are used for comparison. Figure 8(c) and (d) shows that the TFR is composed of several harmonics with regard to the rotation frequency; these harmonics can reveal the health condition of the water turbine.
It can be seen that more harmonics with higher resolution can be found in Fig. 8(d) compared with the STFT-based TFR shown in Fig. 8(c).
C.DISCUSSION
In this subsection, we mainly discuss two important issues. One is the effect of the parameter for window size we choose on the TFR obtained using OSTFT. The other is about the amplitude information at time and frequency centers ignored by OSTFT.
Firstly, we discuss the first significant issue. Based on the analysis in Section II, because the window function has a main effect on the energy leakage, and the OSTFT acquires the minimums of 0 at the time and frequency center, we can know that the TFR obtained using OSTFT is not sensitive to the window size, that is, the OSTFT vitiates the effect of window size on the TFR obtained. Here, we still use the numerical signal x2(t) to further explain it. The sampling rate of x2(t) is 1800 Hz, which lasts for 1 s. Hence, the total size of data points is 1800. Here, we use the OSTFT to analyze this signal by setting different window sizes as 100 and 300, respectively. The latter window size is three times as large as the former. In this case, we still take the STFT as a comparison method. The corresponding analyzing results are displayed in Fig. 9. By comparing TFRs obtained using STFT with window size of 100 and 300, respectively, as shown in Fig. 9(a) and (c), we can find that this parameter for window size has a serious effect on the TFRs. Furthermore, the larger the window size, the relatively higher the TF resolution. Although the large window size can help to acquire a relatively high TF resolution, it is not suitable for strong nonstationary signals. As shown in Fig. 9(c), the IF trajectory of v2 is more blurred than that of v1, because of their different levels of nonstationarity. However, through comparing Fig. 9(b) and (d), we can find that the TFRs obtained using OSTFT can acquire high TF resolution, whether the window size is set as 100 or 300. Furthermore, according to TFRs obtained using OSTFT, it can be also seen that the parameter for window size only has an effect on leaked energy. By the above analysis, we can know that OSTFT can vitiate the effect of window size on TFR obtained.
Secondly, we discuss the other significant point. According to the analysis shown in Section II-C, the OSTFT inevitably ignores the amplitude information at time and frequency centers, because OSTFT acquires the minimums of 0 at time and frequency centers in the TF plane. Although the OSTFT loses the amplitude information at time and frequency centers, the amplitude changing trend of different frequency components can be obtained according to that of leaked energy. Here, we construct a numerical signal similar with x2(t) to further explain it by changing the coefficients of different intrinsic modes of x2(t). And the rest of parameters of x2(t) remain unchanged. The constructed signal is given as the following expression:
where Ais are 5, 1, and 10, and when is are 1, 2, and 3, respectively.In this case, we still take the STFT as the comparison method. The corresponding analyzing results are displayed in Fig. 10. According to the TFR obtained using STFT shown in Fig. 10(a), we can obtain the amplitude changing trend of different IFs based on the color bar, for example, the amplitude of v3 is high than that of v1. As for the TFR obtained using OSTFT shown in Fig. 10(b), even though we cannot directly acquire the amplitude changing trend of different IFs at time and frequency centers, we can know their changing trend according to the amplitude changing trend of leaked energy of different IFs.
V.CONCLUSION
In this paper, we presented a novel TF analysis method called OSTFT by replacing the even symmetric window function with odd symmetric window function. Therefore, OSTFT can obtain a TFR with high TF resolution by acquiring minimum of 0 at time and frequency centers in the TF plane. It is worth to mention that the OSTFT can vitiate the effect of window size that we choose on the TFR obtained. Furthermore, the OSTFT has a satisfying performance on signals with complex IFs, even crossing IFs. The OSTFT has been validated by two numerical signals with complex IFs and two signals collected from a brown bat and from wind turbine vibration in the real life. The proposed OSTFT inevitably has its limitations. Because of the inherent property ignoring the amplitude information at time and frequency center, it is difficult for the OSTFT to reconstruct the signal.