kb/data/en.wikipedia.org/wiki/Bayes_space-0.md

14 KiB
Raw Blame History

title chunk source category tags date_saved instance
Bayes space 1/1 https://en.wikipedia.org/wiki/Bayes_space reference science, encyclopedia 2026-05-05T13:54:32.727668+00:00 kb-cron

Bayes space is a function space defined as an equivalence class of measures with the same null-sets. Two measures are defined to be equivalent if they are proportional. The basic ideas of Bayes spaces have their roots in Compositional Data Analysis and the Aitchison geometry. Theoretical applications are mainly in statistics, specifically functional data analysis of density functions, aka density data analysis. Practical applications are in geochemistry, COVID-19 modelling, sediment analysis and developmental research. Alternative approaches to density analysis are based on the Wasserstein metric, often termed Wasserstein regression, have also been applied to medicine. The basic structure of the Bayes space is that of a vector space, with addition and multiplication being defined by perturbation and powering. The space is formed over a

    σ
  

{\displaystyle \sigma }

-finite reference/base measure, denoted

    λ
  

{\displaystyle \lambda }

or

    P
  

{\displaystyle P}

depending on whether it is infinite or finite. Densities are considered as Radon-Nikodym derivatives of the measures with same null-sets as the base measure, and are equivalent if they are proportional. In case of finite base measures, Hilbert space structure can be achieved by defining a centered log-ratio transformation on the measures, mapping them to a subset of

      L
      
        2
      
    
    (
    P
    )
  

{\displaystyle L^{2}(P)}

consisting of functions integrating to 0.

For data analysis, density functions are typically estimated using so-called ZB-splines to smooth over a histogram of the data, using Kernel density estimation, or using a combination of the two.

== Definitions and main results == Consider a finite base measure

    P
  

{\displaystyle P}

(not necessarily a probability measure) on a domain

    Ω
  

{\displaystyle \Omega }

. This may be a uniform distribution on a bounded interval, or it can be a Radon-Nikodym derivative of the Lebesgue measure (the Gaussian distribution, for example). In practice,

    Ω
  

{\displaystyle \Omega }

is often truncated to a bounded interval. If we take two densities

    f
    ,
    g
  

{\displaystyle f,g}

with respect to

    P
  

{\displaystyle P}

, they are said to be B-equivalent if there exists a

    c
    >
    0
  

{\displaystyle c>0}

s.t

    f
    (
    x
    )
    =
    c
    ⋅
    g
    (
    x
    )
  

{\displaystyle f(x)=c\cdot g(x)}

, denoted

    f
    
      =
      
        B
      
    
    g
  

{\displaystyle f=_{B}g}

(the convention

    c
    ⋅
    ∞
    =
    ∞
  

{\displaystyle c\cdot \infty =\infty }

is used in cases where a measure is infinite). It can be shown that

    (
    
      =
      
        B
      
    
    )
  

{\displaystyle (=_{B})}

is an equivalence relation. The Bayes space

    B
    (
    P
    )
  

{\displaystyle B(P)}

is defined as the quotient space of all measures with the same null-sets in

    Ω
  

{\displaystyle \Omega }

as

    P
  

{\displaystyle P}

under the equivalence relation

    (
    
      =
      
        B
      
    
    )
  

{\displaystyle (=_{B})}

. The first challenge to analysing density functions is that

    B
    (
    P
    )
  

{\displaystyle B(P)}

is not linear space under ordinary addition and multiplication since the ordinary difference between two densities would not be non-negative everywhere. Like in the Aitchison geometry for finite dimensional data, perturbation and powering is defined for densities: Perturbation

    (
    f
    ⊕
    g
    )
    (
    x
    )
    
      =
      
        B
      
    
    f
    (
    x
    )
    ⋅
    g
    (
    x
    )
  

{\textstyle (f\oplus g)(x)=_{B}f(x)\cdot g(x)}

Powering

    α
    ⊙
    f
    (
    x
    )
    
      =
      
        B
      
    
    f
    (
    x
    
      )
      
        α
      
    
  

{\textstyle \alpha \odot f(x)=_{B}f(x)^{\alpha }}

where

    f
    (
    x
    )
    ,
    
       
    
    g
    (
    x
    )
  

{\displaystyle f(x),{\text{ }}g(x)}

are densities in

    B
    (
    P
    )
  

{\displaystyle B(P)}

and

    α
  

{\displaystyle \alpha }

is some real number. It can be shown using the properties of multiplication and powering of real numbers that

    (
    B
    (
    P
    )
    ,
    ⊕
    ,
    ⊙
    )
  

{\displaystyle (B(P),\oplus ,\odot )}

forms a vector space over the real numbers. The definition of Bayes space does not strictly require a finite reference measure

    P
  

{\displaystyle P}

. If Bayes space is defined over an infinite reference measure

    λ
  

{\displaystyle \lambda }

, it must be

    σ
  

{\displaystyle \sigma }

-finite (like the Lebesgue measure). The finite reference measure is, however, necessary for adding Hilbert space structure to a subset of

    B
    (
    P
    )
  

{\displaystyle B(P)}

. Consider the subspace

      B
      
        p
      
    
    (
    P
    )
    =
    {
    f
    ∈
    B
    (
    P
    )
    
      |
    
    
      ∫
      
        Ω
      
    
    
      |
    
    log
    
    (
    f
    (
    x
    )
    )
    
      
        |
      
      
        p
      
    
    d
    P
    (
    x
    )
    <
    ∞
    }
  

{\textstyle B^{p}(P)=\{f\in B(P)|\int _{\Omega }|\log(f(x))|^{p}dP(x)<\infty \}}

. For

    p
    =
    2
  

{\displaystyle p=2}

, this is a linear subspace and isometrically isomorphic to the Hilbert space

      L
      
        0
      
      
        2
      
    
    (
    P
    )
    =
    {
    g
    ∈
    
      L
      
        2
      
    
    (
    P
    )
    :
    
      ∫
      
        Ω
      
    
    g
    (
    x
    )
    d
    P
    (
    x
    )
    =
    0
    }
  

{\textstyle L_{0}^{2}(P)=\{g\in L^{2}(P):\int _{\Omega }g(x)dP(x)=0\}}

via the centered log-ratio (clr) transformation

      clr
    
    (
    f
    (
    x
    )
    )
    =
    log
    
    (
    f
    (
    x
    )
    )
    
    
      
        1
        
          P
          (
          Ω
          )
        
      
    
    ∫
    log
    
    (
    f
    (
    x
    )
    )
    d
    P
    (
    x
    )
    ∈
    
      L
      
        0
      
      
        2
      
    
    (
    P
    )
  

{\displaystyle {\text{clr}}(f(x))=\log(f(x))-{\frac {1}{P(\Omega )}}\int \log(f(x))dP(x)\in L_{0}^{2}(P)}

. The subspace of log-square integrable functions is termed the Bayes Hilbert space. It can be shown that the clr transformation is a linear isomorphism between the two spaces. Defining an inner product on

      B
      
        2
      
    
    (
    P
    )
  

{\displaystyle B^{2}(P)}

as the inner product of the clr transformations will provide the Hilbert space structure for

      B
      
        2
      
    
    (
    P
    )
  

{\displaystyle B^{2}(P)}

, obtaining the centered log-ratio transformation as a linear isometry. Specifically, this results in the Aitchison distance between two densities in the same Bayes Hilbert space

    d
    (
    f
    ,
    g
    )
    =
    ∫
    (
    
      clr
    
    (
    f
    (
    x
    )
    
    
      clr
    
    (
    g
    (
    x
    )
    
      )
      
        2
      
    
    d
    P
    (
    x
    )
  

{\displaystyle d(f,g)=\int ({\text{clr}}(f(x)-{\text{clr}}(g(x))^{2}dP(x)}

which for densities with respect to the Lebesgue measure on an interval

    Ω
    =
    [
    a
    ,
    b
    ]
  

{\displaystyle \Omega =[a,b]}

"simplifies" to

    d
    (
    f
    ,
    g
    )
    =
    
      ∫
      
        a
      
      
        b
      
    
    
      
        (
        
          
            [
            
              
                log
              
              (
              f
              (
              x
              )
              
              
                
                  1
                  
                    b
                    
                    a
                  
                
              
              
                ∫
                
                  a
                
                
                  b
                
              
              
                log
              
              (
              f
              (
              x
              )
              )
              d
              x
            
            ]
          
          
          
            [
            
              
                log
              
              (
              g
              (
              x
              )
              
              
                
                  1
                  
                    b
                    
                    a
                  
                
              
              
                ∫
                
                  a
                
                
                  b
                
              
              
                log
              
              (
              g
              (
              x
              )
              )
              d
              x
            
            ]
          
        
        )
      
      
        2
      
    
    d
    x
  

{\displaystyle d(f,g)=\int _{a}^{b}\left(\left[{\text{log}}(f(x)-{\frac {1}{b-a}}\int _{a}^{b}{\text{log}}(f(x))dx\right]-\left[{\text{log}}(g(x)-{\frac {1}{b-a}}\int _{a}^{b}{\text{log}}(g(x))dx\right]\right)^{2}dx}

.

== Multivariate densities == The measure

    P
  

{\displaystyle P}

does not have to be univariate (one-dimensional), but can also be defined as a product measure on Cartesian products, characterising bivariate (two-dimensional) or multivariate densities. The geometric structure of Hilbert spaces can be used to decompose multivariate densities orthogonally into independent and interaction parts using the concept of "geometric marginals". This decomposition has relations to copula theory. The geometry in

      B
      
        2
      
    
    (
    P
    )
  

{\displaystyle B^{2}(P)}

defines norms on densities that can be used to quantify "relative simplicial deviance", which is measure of how much of a bivariate distribution can be explained by the interaction part; in the multivariate case the relative simplicial deviance can be generalised to the "information composition".

== See also == Compositional data analysis Functional data analysis Copulas Information geometry

== References ==