Takimoto et al., 2000 - Google Patents

The last-step minimax algorithm

Takimoto et al., 2000

Document ID: 15710344759469848895
Author: Takimoto E; Warmuth M
Publication year: 2000
Publication venue: Algorithmic Learning Theory: 11th International Conference, ALT 2000 Sydney, Australia, December 11–13, 2000 Proceedings 11

External Links

Cited by

Snippet

We consider on-line density estimation with a parameterized density from an exponential family. In each trial t the learner predicts a parameter θ t. Then it receives an instance xt chosen by the adversary and incurs loss-ln p (χ t| θ t which is the negative log-likelihood of χ …

Continue reading at www.researchgate.net (PDF) (other versions)

238000007476 Maximum Likelihood 0 description 1

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6296—Graphical models, e.g. Bayesian networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F1/00—Details of data-processing equipment not covered by groups G06F3/00 - G06F13/00, e.g. cooling, packaging or power supply specially adapted for computer application
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2207/00—Indexing scheme relating to methods or arrangements for processing data by operating upon the order or content of the data handled

Similar Documents

Publication	Publication Date	Title
Barnes et al.	2020	Lower bounds for learning distributions under communication constraints via fisher information
Takimoto et al.	2000	The last-step minimax algorithm
Schölkopf et al.	2001	A generalized representer theorem
Tong et al.	2001	Active learning for structure in Bayesian networks
Buntine	2009	Estimating likelihoods for topic models
Hsu et al.	2015	Mixing time estimation in reversible markov chains from a single sample path
US20050119829A1 (en)	2005-06-02	Robust bayesian mixture modeling
Haussler et al.	1992	How well do Bayes methods work for on-line prediction of f+ 1; 1g values
Morvai et al.	2021	On universal algorithms for classifying and predicting stationary processes
De Sa et al.	2018	Minibatch Gibbs sampling on large graphical models
Temlyakov	2008	Approximation in learning theory
Hennig	2002	Fixed point clusters for linear regression: computation and comparison
JP2000010961A (en)	2000-01-14	Device and method for estimating data generation probability and recording medium
Kang et al.	2005	Fast simulation for multifactor portfolio credit risk in the t-copula model
Ben-David et al.	2001	Agnostic boosting
Chen et al.	2021	Corruption robust active learning
Takimoto½ et al.	0	The Last-Ëtep Minimax Algorithm
Han et al.	2021	Optimal prediction of markov chains with and without spectral gap
Warmuth et al.	2005	Leaving the span
Huang et al.	2024	Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality
Herbster et al.	2020	Online multitask learning with long-term memory
Chen et al.	2020	Active online domain adaptation
Milman et al.	2004	Geometric parameters in learning theory
Harremoës et al.	2007	Convergence of Markov chains in information divergence
Puolamäki et al.	2009	Bayesian solutions to the label switching problem