BSS_WEIBULL_ICA_IJEST10-02-11-106

download BSS_WEIBULL_ICA_IJEST10-02-11-106

of 8

Transcript of BSS_WEIBULL_ICA_IJEST10-02-11-106

  • 8/7/2019 BSS_WEIBULL_ICA_IJEST10-02-11-106

    1/8

    M. El-Sayed Wahed / International Journal of Engineering Science and Technology

    Vol. 2(11), 2010, 6739-6746

    BLIND SOURCE SEPARATUION BASED

    ON THE WEIBULL MIXTURE MODELM. El-Sayed Wahed

    Department of Mathematics, Faculty of Science

    Zagazig University, Zagazig, (Egypt).

    E-mail: [email protected]

    Abstract.

    Modeling the distribution of the observed data with a parametric approach is an important tool of statistical

    signal processing. In applications where the data have many clusters, like image segmentation, a multi componentprobabilistic model representation such as mixture modeling is required. Actually, in many real applications in the

    field of image processing, Audio, blind equalization, there are a need to better approximate the observed data and

    the use of mixture approach. This paper aims to provide a realistic distribution based on Mixture of weilbull

    distribution, which has the advantage to characterize parameters estimation in each component in the mixture. Wepropose a formulation of the Nelder-Mead under weilbull distribution. For this, the method Likelihood function is

    used to update the mixture parameters. An application of this technique is considered for modeling signals. In which

    we used it as score function for blind signal separation (BSS), we resort to the Fast Independent Component

    Analysis (FICA) approach whilst to adaptively estimate the parameters of such score function.

    Keywords:

    Mixture Weilbull Distribution (MWD), Nelder-Mead (NM), Maximum Likelihood (ML), Independent

    Component Analysis (ICA), Blind Signal Separation (BSS).

    1. Introduction

    The problem of independent component analysis (ICA) has received wide attention in various fields such as

    biomedical signal analysis and processing (EEG, MEG, ECG), geophysical data processing, data mining, speech

    recognition, image recognition and wireless communications (Amari and Cichocki, 1998; Amari et al, 1997; Belland Sejnowski, 1995; Cichock et al., 1997; Gardner, 1991; Slock, 1996). In many applications, the sensorysignals(Observations obtained from multiple sensors) are generated by a linear generative model which is unknownto us . In other words, the observations are linear instantaneous mixtures of unknown source signals and the

    objective is to process the observations in such a way that the outputs correspond to the separate primary source

    signals.

    The mixture modeling technique has been widely used for the estimation of the probability density functionand has found significant applications in various domains (see for example, Refs. [10, 2,11]). We propose here the

    use of the weilbull distribution as the basic distribution in the mixture [9]. The weilbull distribution has been

    employed to model and detect Gaussian and non-Gaussian signals recently in speech modeling and image and video

    coding. We focus in this paper on the parameters estimation of the mixture weilbull model. To these end NM

    method used to obtain an ML (Maximum Likelihood) has been developed. In this method the shape parameters areupdated by numerical optimization of the Likelihood function. Applications of this technique in BSS are considered.

    The rest of this paper is organized as follows: In section 2 we introduce the Mixture of Weibull (MW)

    approach. In Section 3, we present our approaches for the parameters estimation. In section 4, we introduce BSS. Insection 5, we discuss the Application of the MW model in (BSS).

    2. The Mixture of Weibull Distribution

    A mixture of Weibull is a parametric statistical model which assumes that the data originates from aweighted sum of several Weibull sources. More specifically, a MW is defined as[12]:

    k

    i

    iiiii cxpwxp1

    ),,|()|( (1)

    ISSN: 0975-5462 6739

  • 8/7/2019 BSS_WEIBULL_ICA_IJEST10-02-11-106

    2/8

    M. El-Sayed Wahed / International Journal of Engineering Science and Technology

    Vol. 2(11), 2010, 6739-6746

    kicw iiii ,...,2,1,),,,( .

    k is the number of mixture density components.

    iw is the ith mixture weight and satisfies

    k

    i

    ii ww1

    1,0 .

    ),,|( iiii cxp is an individual density of the Weibull density which is characterized by the

    following probability density function [9].

    iiiiii cxp

    |c-x|-exp|cx|

    2

    1),,|( i

    1

    iiii (2)

    where ic is the location parameter, 0i is the scale parameter, 0i is the shape parameter .By varying theparameters of (Weibull), its possible to characterize a large class of distributions such as gaussian, subgaussian

    (more peaked, than gaussian, heavier tail) and supergaussian (flatter, more uniform).Fig 1 show some examples of

    pdfs for MW for k=2 ( k=1 is Weibull that introduce in[9]) .Thanks to the shape parameters the MW is more

    flexible and can approximate a large class of statistical distributions, this distribution requires to estimate k*4

    parameters, kicm iiii ,...,2,1),,,(

    . Particular we discuss the estimation of these parameters in detail inthe following sections.

    -4 -2 0 2 4 6 8 10 120

    0.1

    0.2

    0.3

    0.4

    0.5

    g

    f

    h

    w

    Fig 1:the probability density functions of MW

    for 9,7,2,4,3,4,5,2,3,7,2,4,7,2,2 21221 cck with different weight

    .3,0.4,0.60.7,01 m , 12 1 mm and 8,1,3,21 are ),,,( whfg respectively.

    3. The log-likelihood function to estimate MWD parameters

    The log-likelihood function of (2), given by [2].

    k

    1i

    1-N

    1j

    iiiiiji, ]),,c|(xpw[log)|L( jx (3)

    whereN the sample size and kiip j ,...,1()x|(ji, , )]1,0[ Nj represents the conditional

    expectation of ip given the observation jx ,means the posterior probability that jx belongs to the ith

    component.

    ISSN: 0975-5462 6740

  • 8/7/2019 BSS_WEIBULL_ICA_IJEST10-02-11-106

    3/8

    M. El-Sayed Wahed / International Journal of Engineering Science and Technology

    Vol. 2(11), 2010, 6739-6746

    In the case of Generalized Gamma distribution, if we substitute (2) into (3) and after some manipulation we obtain

    the following form of )|L( x :

    k

    i

    N

    j

    k

    i

    N

    j

    ijiijiiijii ccw1 1 1 1

    ,ji, x||x|log)1()2log()(log)(log)(log)|L( x

    (4)

    Accordingly, we obtain for ki ,...,2,1 the following nonlinear equation related to the estimated parameters byderivatives of the log-likelihood function with respect to iiic ,, and setting these derivatives to zeros we obtain

    the following equations.

    0)(

    ,,0)(

    ,0)(

    iii

    LL

    c

    L

    |x|x|x(5)

    N

    j

    iiiiiii

    i

    ij ccci

    1

    0|x|log|x||x|log1

    (6)

    N

    jiii

    iij

    i

    c1 0|x|

    1

    (7)

    N

    j

    iiiiiiii

    ii

    i

    ij csignccsignc

    i

    1

    10)x(|x|)x(

    |x|

    1

    (8)

    After a little mathematical manipulation, the ML estimate of i is obtained.

    0

    1|x|

    |x|

    ||log

    1

    1

    1

    1

    N

    j ijN

    k

    ikki

    N

    k

    ki

    ijji

    N

    j

    ji

    i

    i

    i

    c

    c

    cu

    (9)

    Where i is the estimator of i then the estimator of i and ic are given by the following equations:

    N

    k

    ikki

    N

    k

    ki

    i

    ic1

    1

    |x|

    (10)

    0|x|)x(|x|

    1

    1

    N

    j

    ijiiij

    ij

    i

    jiiccsign

    c

    (11)

    4. Blind Signal Separation (BSS).

    By definition, independent component analysis (ICA) is the statistical method that searches for a lineartransformation, which can effectively minimize the statistical dependence between its components [10]. Under the

    physically plausible assumption of mutual statistical independence between these components, the most application

    of ICA is blind signal separation (BSS). In its simplest form, BSS aims to recover a set of unknown signals, the so-

    called original sourcesT

    n21 (t)]s(t),....,s(t),[s(t)s n

    R , by relying exclusively on information that can be

    extracted from their linear and instantaneous mixturesT

    m21 (t)]x(t),....,x(t),[x(t) xm

    R , given by

    ISSN: 0975-5462 6741

  • 8/7/2019 BSS_WEIBULL_ICA_IJEST10-02-11-106

    4/8

    M. El-Sayed Wahed / International Journal of Engineering Science and Technology

    Vol. 2(11), 2010, 6739-6746

    m,2,1,,(t)(t) tsAx (12)

    where Amxn

    R is an unknown mixing matrix of full rank and m n. In doing so, BSS remains truly (blind) inthe sense that very little to almost nothing be known a priori for the mixing matrix or the original source signals.

    Often sources are assumed to be zero-mean and unit-variance signals with at most one having a Gaussian

    distribution. The problem of source estimation then boils down to determining the un-mixing matrix W

    nxm

    R

    such that the linear transformation of the sensor observation is

    (t)(t) xWu , t = 1,2,,n , (13)

    where u(t)=nT

    n21 (t)]u(t),....,u(t),[u R yield an estimate of vectors(t) corresponding to the original or truesources. In general, the majority of BSS approaches perform ICA, by essentially optimizing the negative log-likelihood (objective) function with respect to the un-mixing matrix W such that

    n

    l 1

    lu |)det(|log)](upE[log),L( l WWu , (14)

    where E[.] represents the expectation operator and )(up lu l is the model for the marginal probability density

    function (pdf) of lu , for all nl ,...,2,1 . Normally, matrix W is regarded as the parameter of interest and the pdfs of the sources are considered to be nuisance parameters. In effect, when correctly hypothesizing upon the

    distribution of the sources, the maximum likelihood (ML) principle leads to estimating functions, which in fact are

    the score functions of the sources [7].

    )(log)( lul

    ll updu

    du

    l (15)

    In principle, the separation criterion in (14) can be optimized by any suitable ICA algorithm where contrasts are

    utilized (see e.g., [7]). A popular choice of such a contrast-based algorithm is the so-called fast (cubic) converging

    Newton-type (fixed-point) algorithm, normally referred to as FastICA [1], based on

    ])((E[T

    k1k uuDWW klll uuEdiag W])))([( , (16)

    where, as defined in [8],

    )]))(uE[])u(udiag(1/(E[ ' lllll D , (17)

    withT

    nn2211 )](u),....,(u),(u[(t) being valid for all nl ,...,2,1 . In the ICA framework, accuratelyestimating the statistical model of the sources at hand is still an open and challenging problem [7]. Practical BSS

    scenarios employ difficult source distributions and even situations where many sources with very different pdfs are

    mixed together. Towards this direction, a large number of parametric density models have been made available in

    recent literature. Examples of such models include the generalized Gaussian density (GGD)[4], the generalizedlambda density(GLD) and the generalized beta distribution (GBD) or even combinations and generalizations such as

    super and generalized Gaussian mixture model (GMM) [5], the generalized gamma density (GGD) [3]. In the

    following section we show example Mixture Weibull Density (MWD) for signal modeling in blind signal

    separation.5. Application of MWD in blind signal separation

    Novel flexible score function is obtained, by substituting (1) into (15) for the source estimates

    nll ,...,2,1,u , it quickly become obvious that our proposed score function inherits a generalized parametricstructure ,which in turn can be attributed to the highly flexible MWD parent model . In this case, a simple calculus

    yield the flexible BSS score function

    ISSN: 0975-5462 6742

  • 8/7/2019 BSS_WEIBULL_ICA_IJEST10-02-11-106

    5/8

    M. El-Sayed Wahed / International Journal of Engineering Science and Technology

    Vol. 2(11), 2010, 6739-6746

    k

    i

    ii

    il

    i

    iliililiii

    lu

    li

    ii

    lcu

    cucusigncuwup 1

    22

    l||

    1||exp)(||

    2

    11)|(u

    (18)

    In principle )|( ll u .Is capable of modeling a large number of signals, such as speech or communication signals.

    This is due to the fact that its characterization depends explicitly on all parameters kicw iiii ,...,2,1,,,,

    Other commonly used score functions can be obtained by substituting appropriate values for iiii cm ,,, in

    (18), for instance, when 1k we have score function

    1||||

    )()|(u 1111

    1

    1

    l1

    cu

    cu

    cusignl

    l

    l

    l (19)

    When 11 and 11 we have a scaled form of the GGD-based score function constitutes such a special case of(17).

    1

    111||)()|(

    cucusignu llll (20)

    The function)|(

    ll ucould become singular, in some special cases, essentially those corresponding to heavy-

    tailed (or sparse) distribution defined for 0|| il cu . In practice, to circumvent such deficiency, the denominatorin (18) can be modified slightly to read function

    k

    i

    ii

    il

    i

    iliililiii

    lu

    li

    ii

    lcu

    cucusigncuwup 1

    22

    l||

    1||exp)(||

    2

    11)|(u

    (21)

    where is a small positive parameter (typically around 104) which, when put to use, can almost always guarantee

    that the discontinuity of (18) or values in or approaching the region 0|| il cu is completely avoided.

    6. Numerical experiments

    To investigate the separation performance of the proposed MWD-based FastICA BSS method, a set of

    numerical experiments preformed in which we consider two example one artificial at 2k and anther real (speechsignal), we illustrate this in the following two examples

    6.1. Example 1

    In this example 2k and the data set used consists different realizations of independent signals, thenumber of data samples has also been designed to be relatively small, for example, N = 250.The source signals are

    mixed (noiselessly) with randomly generated full rank mixing matrices A. The FastICA method is implemented in

    the so-called simultaneous separation mode, whereas the stopping criterion is set to410 . FastICA is executed

    using the flexible MWD model is used to model the distribution of the unknown sources, while (9)(11) are

    employed to adaptively calculate the necessary parameters of the MWD-based score function defined in (20).

    Now to show the performance in this case we consider three source signals (source) where this signals generatedrandom from MWD. Let the mixing matrix A and un-mixing matrix W are defined as follow

    ,

    .48.32.17

    .86.65.75

    .37.79.56

    A

    1.86871.8798-1.9485

    0.16460.1924-0.2184

    0.25660.2177-0.2128

    W

    By using the equation sAx we obtain mixed signals as show in Fig (2) where mixing signals in left and sourcesignals in right

    ISSN: 0975-5462 6743

  • 8/7/2019 BSS_WEIBULL_ICA_IJEST10-02-11-106

    6/8

    M. El-Sayed Wahed / International Journal of Engineering Science and Technology

    Vol. 2(11), 2010, 6739-6746

    0 100 200 3000

    500

    1000

    fisrt mixing

    mixing signals

    0 100 200 3000

    500

    1000

    second mixing

    0 100 200 3000

    200

    400

    third mixing

    0 100 200 3000

    500

    1000Source signals

    fisrt source

    0 100 200 3000

    5

    10

    second source

    0 100 200 3000

    200

    400

    third source

    Fig(2)

    After using FastICA we recover the sources and we show the estimated signals in left and original signals in right inFig (3) with different in scale only.

    0 100 200 3000

    5

    first estimated

    estimated Signals

    0 100 200 3000

    5

    10

    second estimated

    0 100 200 3000

    2

    4

    third estimated

    0 100 200 3000

    500

    1000Source Signals

    first source

    0 100 200 3000

    5

    10

    second source

    0 100 200 3000

    200

    400

    third source

    Fig (3)

    6.2. Example.1.

    Consider three speech signals as sources, mixing matrix A and demixing matrix W are given as follow

    .50.78.95

    .96.56.22

    .32.8977.

    A and

    .0589.0874.0429

    .0982.0467.0159

    .8760.0340.0987

    W

    ISSN: 0975-5462 6744

  • 8/7/2019 BSS_WEIBULL_ICA_IJEST10-02-11-106

    7/8

    M. El-Sayed Wahed / International Journal of Engineering Science and Technology

    Vol. 2(11), 2010, 6739-6746

    By using the equation sAx we obtain mixed signals as show in figure (4) where mixing signals in left andoriginal signals in right. We recover the source by using FastICA and we show the estimated signals in left withscales, permutation and original signals in right in figure (5).

    0 2000 4000 6000 8000-1

    0

    1

    fisrt mixing

    mixing signals

    0 2000 4000 6000 8000-1

    0

    1

    second mixing

    0 2000 4000 6000 8000-0.5

    0

    0.5

    third mixing

    0 2000 4000 6000 8000-1

    0

    1Source signals

    fisrt source

    0 2000 4000 6000 8000-0.5

    0

    0.5

    second source

    0 2000 4000 6000 8000-1

    0

    1

    third source

    Fig(4)

    6. Algorithm performance

    The separation performance for ICA algorithm is evaluated with the cross-talk error measure

    n

    j

    n

    i jlnl

    jin

    i

    n

    j linl

    ji

    p

    p

    p

    pPI

    1 1

    2

    1

    2

    1 1

    2

    1

    2

    1||max

    ||1

    ||max

    ||

    (20)

    Note that here jip represents the elements of the permutation matrix P = WA, which after assuming that all

    sources have been successfully separated should ideally reduce to a permuted and scaled version of the identity

    matrix. The separation performance for first example is dBPI 72.18 and for second example

    is dBPI 01.26

    0 2000 4000 6000 8000-5

    0

    5

    fisrt estimated

    estimated signals

    0 2000 4000 6000 8000-10

    0

    10

    second estimated

    0 2000 4000 6000 8000-5

    0

    5

    third estimated

    0 2000 4000 6000 8000-1

    0

    1Source signals

    fisrt source

    0 2000 4000 6000 8000-0.5

    0

    0.5

    second source

    0 2000 4000 6000 8000-1

    0

    1

    third source

    ISSN: 0975-5462 6745

  • 8/7/2019 BSS_WEIBULL_ICA_IJEST10-02-11-106

    8/8

    M. El-Sayed Wahed / International Journal of Engineering Science and Technology

    Vol. 2(11), 2010, 6739-6746

    Fig(5)

    5.Conclusion

    The main motivation of this study is to provide estimators for the MWD parameters. For this, a new

    formulation of the NM algorithm is conducted to include the parameters estimation. We have derived a novel

    parametric family of flexible score functions, based exclusively on the MWD model. To calculate the parameters of

    these functions in an adaptive BSS setup, we have chosen to maximize the ML equation with the NM optimization

    method. Simulation results show that the proposed approach is capable of separating mixtures of signals.

    Reference:

    [1] A.Hyvarinen, E.Oja, A fast fixed-point algorithm for independent component analysis, Neural computation, vol 9, no 7, October 1997, pp.1483-1492.

    [2] C. Tzagkarakis, A. Mouchtaris, P. Tsakalides; .Musical Genre Classification VIA Generalized Gaussian and Alpha-Stable Modeling,ICASSP 2006 Proceedings. Vol.5, 14-19 May 2006 pp 217.220.

    [3] E.W. Stacy, A generalization of the gamma distribution, Ann. Math. Stat. vol 33, no 3, September 1962, pp. 1187 -1192.[4] K. Kokkinakis, A.K. Nandi, Exponent parameter estimation for generalized Gaussian probability density functions with application to

    speech modeling, Signal Process. vol 85, no 9, September 2005,pp. 1852-1858.[5] J.A. Palmer, K. Kreutz-Delgado, S. Makeig, Super-Gaussian mixture source model for ICA, in: Proceedings of the International

    Conference on Independent Component Analysis and Blind Signal Separation, Charleston, SC, USA, March 5-8 ,2006, pp. 854-861.

    [6] J. Eriksson, J. Karvanen, V. Koivunen, Source distribution adaptive maximum likelihood estimation of ICA model, in: Proceedings of theSecond International Conference on ICA and BSS, Helsinki, Finland, June 19-22, 2000, pp. 227- 232.

    [7] J.-F. Cardoso, Blind signal separation: statistical principles, Proc. IEEE, vol 86, no 10, October 1998, pp. 2009- 2025.

    [8] J. Karvanen, V. Koivunen, Blind separation methods based on Pearson system and its extensions, Signal Process. vol 82, no 4, April 2002,pp. 663-673.

    [9] M. El-Sayed Wahed, (2009):Blind signal separation using an adaptive Weibull distribution. International Journal of Physical SciencesVol. 4(5), pp. 265-270.

    [10] Pekka Paalanen, Joni-Kristian Kamarainen, Jarmo Ilonen, Heikki Klviinen; .Feature representation and discrimination based on Gaussianmixture model probability densities Practices and algorithms. Pattern Recognition 39 (2006) pp. 1346-1358.

    [11] Vimal Bhatia, Bernard Mulgrew, .Non-parametric likelihood based channel estimator for Gaussian mixture noise, Signal Processing Vol.87 (2007) pp 2569-2586.

    [12] Wendy L. Martinezand Angel R. Martinez computational statistics with matlab,Boca Raton London New York Washington, D.C.

    ISSN: 0975-5462 6746