2017
Deep Multi-agent Reinforcement Learning in Sequential Social Dillemas
16th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Joel Leibo, Vinicius Zambaldi, Marc Lanctot and Thore Greapel)
2015
Deep Reinforcement Learning in Games Research
Reinforcement Learning workshop at NIPS’15
(with Gerry Tesauro, Joe Bigus, Ban Kawas and Kamil Rocki)
2013
Solution Methods for Constrained Markov Decision Process with Continuous Probability Modulation
29th Conference on Uncertainty in Artificial Intelligence
(with Marek Petrik and Dharmashankar Subramanian)
Mycotoxin Testing in Food-Stock Lots
IBM Research Technical Report
(with Ramesh Natarajan, Mary Helander and Bonnie Ray)
2012
Playing Repeated Stackelberg Games with Unknown Opponents
11th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Gerry Tesauro and Richard Segal)
Delayed Observation Planning in Partially Observable Domains
11th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Pradeep Varakantham)
2011
Approximation Methods for Infinite Bayesian Stackelberg Games:
Modeling Distributional PAyoff Uncertainty
10th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Christopher Kiekintveld and Milind Tambe)
GUARDS and PROTECT: Next Generation Applications of Security Games.
ACM SIGecom Exchanges
(with Bo An, James Pita, Eric Shieh, Milind Tambe and Christopher Kiekintveld)
Multiagent Communication Security in Adversarial Settings
IEEE/WIC/ACM International Conference on Intelligent Agent Technology
(with Steven Okamoto, Praveen Paruchuri, Yonghong Wang, Katia Sycara and Mudhakar Srivatsa)
2010
A Decision Theoretic Approach to Data Leakage Prevention
2nd IEEE International Conference on Privacy, Security, Risk and Trust
(with Mudhakar Srivatsa and Pradeep Varakantham)
Methods and Algorithms for Infinite Bayesian Stackelberg Security Games
Conference on Decision and Game Theory for Security
(with Christopher Kiekintvelt and Milind Tambe)
ALARMS: Alerting and Reasoning Management System for Next Generation Aircraft Hazards
26th Conference on Uncertainty in Artificial Intelligence
(with Nathan Schurr and Alan Carlin)
Risk Sensitive Planning in Partially Observable Environments
9th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Pradeep Varakantham)
Robust Bayesian Methods for Stackelberg Security Games
9th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Christopher Kiekintvelt and Milind Tambe)
Function Allocation for NextGen Airspace via Agents
9th International Joint Conference on Autonomous Agents and Multi-agent Systems (Industry Track)
(with Nathan Schurr and Paul Picciano)
Introducing Communication to Dis-POMDPs with Locality of Interaction
Web Intelligence and Agent Systems: An International Journal
(with Makoto Tasaki, Yuichi Yabu, Yuki Iwanuri, Makoto Yokoo, Pradeep Varakantham and Milind Tambe)
2009
Exploiting Coordination Locales in Distributed POMDPs via Social Model Shaping
19th International Conference on Automated Planning and Scheduling
(with Pradeep Varakantham, Jun-young Kwak, Matthew Taylor, Paul Scerri and Milind Tambe)
Allocation of Continuous Resources in Agent Teams: Towards Conquering Uncertainty
Book Published by VMD Verlag
Exploiting Coordination Locales in Distributed POMDPs via Social Model Shaping
Workshop on Multiagent Sequential Decision Making held at 8th International Joint Conference on Autonomous Agents and Multi-agent Systems (extended version of the ICAPS’09 paper)
(with Jun-young Kwak, Pradeep Varakantham, Matthew Taylor, Paul Scerri and Milind Tambe)
Coordinating Randomized Policies for Increasing Security of Agent Systems
Journal of Information Technology and Management (ITM)
(with Praveen Paruchuri, Jonathan Pearce, Fernando Ordonez, Sarit Kraus and Milind Tambe)
Planning with Continuous Resources for Agent Teams
8th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Milind Tambe)
Improving Adjustable Autonomy Strategies for Time-Critical Domains
8th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Nathan Schurr and Milind Tambe)
2008
Towards Faster Planning with Continuous Resources in Stochastic Domains
23rd AAAI Conference on Artificial Intelligence
(with Milind Tambe)
Introducing Communication to Dis-POMDPs with Locality of Interaction
IEEE/WIC International Joint Conference on Web Intelligence and Intelligent Agent Technology
(with Makoto Tasaki, Yuichi Yabu, Yuki Iwanuri, Makoto Yokoo, Milind Tambe, Pradeep Varakantham)
Efficient Algorithms to Solve Bayesian Stackelberg Games for Security Applications
Nectar Papers Track of the 23rd AAAI Conference on Artificial Intelligence
(with Praveen Paruchuri, Jonathan P. Pearce, Milind Tambe, Fernando Ordonez and Sarit Kraus)
Not All Agents Are Equal: Scaling up Distributed POMDPs for Agent Networks
7th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Tapana Gupta, Pradeep Varakantham, Milind Tambe and Makoto Yokoo)
The Application of a Game Theoretic Model for Security at the Los Angeles International Airport
Industry Track of the 7th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with James Pita, Manish Jain, Fernando Ordóñez, Christopher Portway, Craig Western, Praveen Paruchuri, Milind Tambe and Sarit Kraus)
RIAACT: A Robust Approach to Adjustable Autonomy for Human-Multiagent Teams
7th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Nathan Schurr and Milind Tambe)
Playing Games for Security: An Efficient Exact Algorithm for Solving Bayesian Stackelberg Games
7th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Praveen Paruchuri, Jonathan Pearce, Janusz Marecki, Milind Tambe, Fernando Ordóñez, Sarit Kraus)
2007
On Opportunistic Techniques for Solving Decentralized MDPs with Temporal Constraints
6th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Milind Tambe)
Letting Loose a SPIDER on a Network of POMDPs: Generating Quality Guaranteed Policies
6th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Pradeep Varakantham, Milind Tambe and Makoto Yokoo)
A Fast Analytical Algorithm for Solving Markov Decision Processes with Real-Valued Resources
20th International Joint Conference on Artificial Intelligence
(with Sven Koenig and Milind Tambe)
SPIDER Attack on a Network of POMDPs: Towards Quality Bounded Solutions
AAAI Spring Symposium on Game Theoretic and Decision Theoretic Agents
(with Pradeep Varakantham, Milind Tambe and Makoto Yokoo)
2006
Dangers in Multiagent Rescue using DEFACTO
Lecture Notes in Computer Science - An International Journal, Springer Academic Press
(with Nathan Schurr, Milind Tambe and Paul Scerri)
A Fast Analytical Algorithm for Markov Decision Process with Continuous State Spaces
8th Workshop on Game Theoretic and Decision Theoretic Agents held at the
5th International Joint Conference on Autonomous Agents and Multiagent Systems
(with Zvi Topol and Milind Tambe)
2005
The DEFACTO System: Training Tool for Incident Commanders
17th Conference on Innovative Applications of Artificial Intelligence
(with Nathan Schurr, Paul Scerri, J. P. Lewis and Milind Tambe)
The DEFACTO System for Human Omnipresence to Coordinate Agent Teams:
The Future of Disaster Response.
4th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Nathan Schurr, Nikhil Kasinadhuni, Milind Tambe J. P. Lewis and Paul Scerri)
Towards Flexible Coordination of Human-Agent Teams
Multiagent and Grid Systems - An International Journal, IOS Press
(with Nathan Schurr, Milind Tambe and Paul Scerri)
Conflicts in Teamwork: Hybrids to the Rescue
4th International Joint Conference on Autonomous Agents and Multi-agent Systems
(with Milind Tambe, Emma Bowring, Hyuckchul Jung, Gal Kaminka, Rajiv Maheswaran, Jay Modi,
Ranjit Nair, Jonathan Pearce, Praveen Paruchuri, David Pynadath, Paul Scerri, Nathan Schurr and Pradeep Varakantham)
Dangers in Multiagent Rescue using DEFACTO
2nd International Workshop on Safety and Security in Multiagent Systems held at the
4th International Joint Conference on Autonomous Agents and Multiagent Systems
(with Nathan Schurr, Milind Tambe and Paul Scerri)
The Future of Disaster Response: Humans Working with Multiagent Teams using DEFACTO
AAAI Spring Symposium on “Artificial Intelligence Technologies for Homeland Security”
(with Nathan Schurr, Milind Tambe and J.P. Lewis and Nikhil Kasinadhuni)
Agent-based Simulations for Disaster Rescue Using the DEFACTO Coordination System
Book Chapter in Emergent Information Technologies and Enabling Policies for Counter Terrorism,
Wiley-IEEE Press
(with Nathan Schurr and Milind Tambe)
The DEFACTO System: Coordinating Human-Agent Teams for the Future of Disaster Response
Book Chapter in Programming Multiagent Systems, Springer Academic Press
(with Nathan Schurr, Milind Tambe, Paul Scerri and J. P. Lewis)
2004
Automata, Formal Languages and Algorithms
Book Published by Jacek Skalmierski Computer Studio
2003
Planning in Semantic Network using Node Nested Grammars
Artificial Intelligence Journal of the Ukrainian Academy of Sciences
Representing Time Intervals in Semantic Networks
International Conference on Problems of Decision Making under Uncertainties
Semantic Networks and Intelligent Agents
Textbook Published by Jacek Skalmierski Computer Studio
2002
LHCb level-1 Trigger – Real Time Sorter
Workshop on LHCb Level 1 trigger, European Laboratory for Nuclear Research
Textbook Published by Jacek Skalmierski Computer Studio
2001
Methods of Artificial Intelligence
Textbook Published by Jacek Skalmierski Computer Studio
2000
Textbook Published by Jacek Skalmierski Computer Studio
Below is a list of my selected publications. For a complete list click here.