Universal Complexity Bounds Based on Value Iteration and Application to Entropy Games

Item request has been placed!

Item request cannot be made.

Processing Request

اقرأ أكثر حفظ في قائمتي

المؤلفون: Allamigeon, Xavier; Gaubert, Stéphane; Katz, Ricardo, David; Skomra, Mateusz
المصدر:
49th International Colloquium on Automata, Languages, and Programming (ICALP 2022)
https://laas.hal.science/hal-03698207
49th International Colloquium on Automata, Languages, and Programming (ICALP 2022), Jul 2022, Paris, France
الموضوع:
Mean-payoff games; entropy games; value iteration; Perron root; separation bounds; parameterized complexity; [INFO.INFO-GT]Computer Science [cs]/Computer Science and Game Theory [cs.GT]
نوع التسجيلة:
conference object
اللغة:
English

معلومة اضافية
- Contributors:
  TROPICAL (TROPICAL); Centre de Mathématiques Appliquées - Ecole Polytechnique (CMAP); École polytechnique (X)-Centre National de la Recherche Scientifique (CNRS)-École polytechnique (X)-Centre National de la Recherche Scientifique (CNRS)-Inria Saclay - Ile de France; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria); Équipe Méthodes et Algorithmes en Commande (LAAS-MAC); Laboratoire d'analyse et d'architecture des systèmes (LAAS); Université Toulouse Capitole (UT Capitole); Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse); Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J); Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3); Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP); Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole); Université de Toulouse (UT)
- بيانات النشر:
  HAL CCSD
- الموضوع:
  2022
- Collection:
  Université Toulouse 2 - Jean Jaurès: HAL
- الموضوع:
  Paris; France
- نبذة مختصرة :
  International audience ; We develop value iteration-based algorithms to solve in a unified manner different classes of combinatorial zero-sum games with mean-payoff type rewards. These algorithms rely on an oracle, evaluating the dynamic programming operator up to a given precision. We show that the number of calls to the oracle needed to determine exact optimal (positional) strategies is, up to a factor polynomial in the dimension, of order R/sep, where the "separation" sep is defined as the minimal difference between distinct values arising from strategies, and R is a metric estimate, involving the norm of approximate sub and super-eigenvectors of the dynamic programming operator. We illustrate this method by two applications. The first one is a new proof, leading to improved complexity estimates, of a theorem of Boros, Elbassioni, Gurvich and Makino, showing that turn-based mean-payoff games with a fixed number of random positions can be solved in pseudo-polynomial time. The second one concerns entropy games, a model introduced by Asarin, Cervelle, Degorre, Dima, Horn and Kozyakin. The rank of an entropy game is defined as the maximal rank among all the ambiguity matrices determined by strategies of the two players. We show that entropy games with a fixed rank, in their original formulation, can be solved in polynomial time, and that an extension of entropy games incorporating weights can be solved in pseudo-polynomial time under the same fixed rank condition.
- Relation:
  hal-03698207; https://laas.hal.science/hal-03698207; https://laas.hal.science/hal-03698207/document; https://laas.hal.science/hal-03698207/file/main.pdf
- Rights:
  info:eu-repo/semantics/OpenAccess
- الرقم المعرف:
  edsbas.B358A334

تعليقات

No Comments.

Universal Complexity Bounds Based on Value Iteration and Application to Entropy Games

اتصل بنا

اتبع