# Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition

> Research article (DSpace@MIT (Massachusetts Institute of Technology), 2020) · cited 28× · AI/ML

**Wikidata**: [openalex:W3035759338](https://www.wikidata.org/wiki/openalex:W3035759338)  
**Source**: https://4ort.xyz/entity/learning-adversarial-markov-decision-processes-with-bandit-feedback-and-unknown-transition
