kb/Competitive_regret-0.md at 1a02be5a8ee81d6c337ba8a4de8d6212f4b2e292

turtle89431 1a02be5a8e Scrape wikipedia-science: 16831 new, 4190 updated, 21574 total (kb-cron)

2026-05-05 07:38:32 -07:00

8.9 KiB

Raw Blame History

title	chunk	source	category	tags	date_saved	instance
Competitive regret	1/1	https://en.wikipedia.org/wiki/Competitive_regret	reference	science, encyclopedia	2026-05-05T14:37:35.023483+00:00	kb-cron

In decision theory and machine learning, competitive regret refers to a performance measure that evaluates an algorithm's regret relative to an oracle or benchmark strategy. Unlike traditional regret, which compares against the best fixed decision in hindsight, competitive regret compares against decision-makers with different capabilities—either with greater computational resources or access to additional information. The formal definition of competitive regret typically involves a ratio or difference between the regret of an algorithm and the regret of a reference oracle. An algorithm is considered to have "good" competitive regret if this ratio remains bounded even as the problem size increases. This framework has applications in various domains including online optimization, reinforcement learning, portfolio selection, and multi-armed bandit problems. Competitive regret analysis provides researchers with a more nuanced evaluation metric than standard regret, helping them develop algorithms that can achieve near-optimal performance even under practical constraints and uncertainty.

== Competitive regret to the oracle with full power == Consider estimating a discrete probability distribution

    p
  

{\displaystyle p}

on a discrete set

        X
      
    
  

{\displaystyle {\mathcal {X}}}

based on data

    X
  

{\displaystyle X}

, the regret of an estimator

    q
  

{\displaystyle q}

is defined as

      max
      
        p
        ∈
        
          
            P
          
        
      
    
    
      r
      
        n
      
    
    (
    q
    ,
    p
    )
    .
  

{\displaystyle \max _{p\in {\mathcal {P}}}r_{n}(q,p).}

where

        P
      
    
  

{\displaystyle {\mathcal {P}}}

is the set of all possible probability distribution, and

      r
      
        n
      
    
    (
    q
    ,
    p
    )
    =
    
      E
    
    (
    D
    (
    p
    
      |
    
    
      |
    
    q
    (
    X
    )
    )
    )
    .
  

{\displaystyle r_{n}(q,p)=\mathbb {E} (D(p||q(X))).}

where

    D
    (
    p
    
      |
    
    
      |
    
    q
    )
  

{\displaystyle D(p||q)}

is the Kullback–Leibler divergence between

    p
  

{\displaystyle p}

and

    q
  

{\displaystyle q}

== Competitive regret to the oracle with limited power ==

=== Oracle with partial information === The oracle is restricted to have access to partial information of the true distribution

    p
  

{\displaystyle p}

by knowing the location of

    p
  

{\displaystyle p}

in the parameter space up to a partition. Given a partition

      P
    
  

{\displaystyle \mathbb {P} }

of the parameter space, and suppose the oracle knows the subset

    P
  

{\displaystyle P}

where the true

    p
    ∈
    P
  

{\displaystyle p\in P}

. The oracle will have regret as

      r
      
        n
      
    
    (
    P
    )
    =
    
      min
      
        q
      
    
    
      max
      
        p
        ∈
        P
      
    
    
      r
      
        n
      
    
    (
    q
    ,
    p
    )
    .
  

{\displaystyle r_{n}(P)=\min _{q}\max _{p\in P}r_{n}(q,p).}

The competitive regret to the oracle will be

      r
      
        n
      
      
        
          P
        
      
    
    (
    q
    ,
    
      
        P
      
    
    )
    =
    
      max
      
        P
        ∈
        
          P
        
      
    
    (
    
      r
      
        n
      
    
    (
    q
    ,
    P
    )
    −
    
      r
      
        n
      
    
    (
    P
    )
    )
    .
  

{\displaystyle r_{n}^{\mathbb {P} }(q,{\mathcal {P}})=\max _{P\in \mathbb {P} }(r_{n}(q,P)-r_{n}(P)).}

=== Oracle with partial information === The oracle knows exactly

    p
  

{\displaystyle p}

, but can only choose the estimator among natural estimators. A natural estimator assigns equal probability to the symbols which appear the same number of time in the sample. The regret of the oracle is

      r
      
        n
      
      
        n
        a
        t
      
    
    (
    p
    )
    =
    
      min
      
        q
        ∈
        
          
            
              Q
            
          
          
            n
            a
            t
          
        
      
    
    
      r
      
        n
      
    
    (
    q
    ,
    p
    )
    ,
  

{\displaystyle r_{n}^{nat}(p)=\min _{q\in {\mathcal {Q}}_{nat}}r_{n}(q,p),}

and the competitive regret is

      max
      
        p
        ∈
        
          
            P
          
        
      
    
    (
    
      r
      
        n
      
    
    (
    q
    ,
    p
    )
    −
    
      r
      
        n
      
      
        n
        a
        t
      
    
    (
    p
    )
    )
    .
  

{\displaystyle \max _{p\in {\mathcal {P}}}(r_{n}(q,p)-r_{n}^{nat}(p)).}

== Example == For the estimator

    q
  

{\displaystyle q}

proposed in Acharya et al.(2013),

      r
      
        n
      
      
        
          
            P
          
          
            σ
          
        
      
    
    (
    q
    ,
    
      Δ
      
        k
      
    
    )
    ≤
    
      r
      
        n
      
      
        n
        a
        t
      
    
    (
    q
    ,
    
      Δ
      
        k
      
    
    )
    ≤
    
      
        
          
            O
          
          ~
        
      
    
    (
    min
    (
    
      
        1
        
          n
        
      
    
    ,
    
      
        k
        n
      
    
    )
    )
    .
  

{\displaystyle r_{n}^{\mathbb {P} _{\sigma }}(q,\Delta _{k})\leq r_{n}^{nat}(q,\Delta _{k})\leq {\tilde {\mathcal {O}}}(\min({\frac {1}{\sqrt {n}}},{\frac {k}{n}})).}

Here

      Δ
      
        k
      
    
  

{\displaystyle \Delta _{k}}

denotes the k-dimensional unit simplex surface. The partition

        P
      
      
        σ
      
    
  

{\displaystyle \mathbb {P} _{\sigma }}

denotes the permutation class on

      Δ
      
        k
      
    
  

{\displaystyle \Delta _{k}}

, where

    p
  

{\displaystyle p}

and

      p
      ′
    
  

{\displaystyle p'}

are partitioned into the same subset if and only if

      p
      ′
    
  

{\displaystyle p'}

is a permutation of

    p
  

{\displaystyle p}

== References ==

8.9 KiB Raw Blame History Unescape Escape

8.9 KiB

Raw Blame History