Singular statistical models
        singular : Fisher information is degenerate.
- relates to non-identification problems. $\theta \to p(x|\theta)$
 
- cause several problems in statistics and machine learning
 
- ex. Gaussian mixture model, Hidden markov model
 
- differential geometric (like dually flat manifold) cannot be directly applied. –>
 
Legendre duality
z = f’(x0) + f(x0) - f’(x0)
p = f’(x) 
φ(p) = xp - f(x)
=> when f is convex, a point <-> a tangent line
=> when f is non-convex in general, the relationship betweeen hypersurfaces (wavefronts)
- hypersurfaces admitting singularity is larger than wavefronts. (ex. Whitney umbrella)
 
Legendre submanifold L
- Lf := {(x, x’, f(x))} is an example of Legendre submanifold.
 
- generalization of the graph of functions (x,f(x)).
 
- regular modelの定義は, π1とπ'1の写像が両方とも準同型であること.
 
x <- Base space (x,z) <-π- (x,p,z) -π’-> (p,z’) Fiber space -> p
Coherent tangent bandles E, E'
- prior studies from mathematicians
 
Dually flat structure of singular models
Canonical divergence
- The canonical divergence on L is deifined by D_L(p,q) = z(p) + z’(q) - x(p)^T p(q)
- if L is regular, it reduces to Bregman divergence.
 
 
- The generalized Pythagorean theorem on Legendre submanifolds also holds!
- introduce e-curve and m-curve. The orthogonaliy is not defined via metrics but between affine structures. (reduces to metrics version on regular models.) –>