2017


Deep Multi-agent Reinforcement Learning in Sequential Social Dilemmas

16th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Joel Leibo, Vinicius Zambaldi, Marc Lanctot and Thore Graepel)


2015


Deep Reinforcement Learning in Games Research

Reinforcement Learning workshop at NIPS’15

(with Gerry Tesauro, Joe Bigus, Ban Kawas and Kamil Rocki)


2013


Solution Methods for Constrained Markov Decision Process with Continuous Probability Modulation

29th Conference on Uncertainty in Artificial Intelligence

(with Marek Petrik and Dharmashankar Subramanian)



Mycotoxin Testing in Food-Stock Lots

IBM Research Technical Report

(with Ramesh Natarajan, Mary Helander and Bonnie Ray)


2012


Playing Repeated Stackelberg Games with Unknown Opponents

11th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Gerry Tesauro and Richard Segal)



Delayed Observation Planning in Partially Observable Domains

11th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Pradeep Varakantham)


2011


Approximation Methods for Infinite Bayesian Stackelberg Games:

Modeling Distributional Payoff Uncertainty

10th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Christopher Kiekintveld and Milind Tambe)


GUARDS and PROTECT: Next Generation Applications of Security Games.

ACM SIGecom Exchanges

(with Bo An, James Pita, Eric Shieh, Milind Tambe and Christopher Kiekintveld)


Multiagent Communication Security in Adversarial Settings

IEEE/WIC/ACM International Conference on Intelligent Agent Technology

(with Steven Okamoto, Praveen Paruchuri, Yonghong Wang, Katia Sycara and Mudhakar Srivatsa)


2010


A Decision Theoretic Approach to Data Leakage Prevention

2nd IEEE International Conference on Privacy, Security, Risk and Trust

(with Mudhakar Srivatsa and Pradeep Varakantham)


Methods and Algorithms for Infinite Bayesian Stackelberg Security Games

Conference on Decision and Game Theory for Security

(with Christopher Kiekintveld and Milind Tambe)


ALARMS: Alerting and Reasoning Management System for Next Generation Aircraft Hazards

26th Conference on Uncertainty in Artificial Intelligence

(with Nathan Schurr and Alan Carlin)


Risk Sensitive Planning in Partially Observable Environments

9th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Pradeep Varakantham)


Robust Bayesian Methods for Stackelberg Security Games

9th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Christopher Kiekintveld and Milind Tambe)


Function Allocation for NextGen Airspace via Agents

9th International Joint Conference on Autonomous Agents and Multi-agent Systems (Industry Track)

(with Nathan Schurr and Paul Picciano)


Introducing Communication to Dis-POMDPs with Locality of Interaction

Web Intelligence and Agent Systems: An International Journal

(with Makoto Tasaki, Yuichi Yabu, Yuki Iwanari, Makoto Yokoo, Pradeep Varakantham and Milind Tambe)


2009


Exploiting Coordination Locales in Distributed POMDPs via Social Model Shaping

19th International Conference on Automated Planning and Scheduling

(with Pradeep Varakantham, Jun-young Kwak, Matthew Taylor, Paul Scerri and Milind Tambe)


Allocation of Continuous Resources in Agent Teams: Towards Conquering Uncertainty

Book Published by VDM Verlag


Exploiting Coordination Locales in Distributed POMDPs via Social Model Shaping

Workshop on Multiagent Sequential Decision Making held at 8th International Joint Conference on Autonomous Agents and Multi-agent Systems (extended version of the ICAPS’09 paper)

(with Jun-young Kwak, Pradeep Varakantham, Matthew Taylor, Paul Scerri and Milind Tambe)


Coordinating Randomized Policies for Increasing Security of Agent Systems

Journal of Information Technology and Management (ITM)

(with Praveen Paruchuri, Jonathan Pearce, Fernando Ordóñez, Sarit Kraus and Milind Tambe)


Planning with Continuous Resources for Agent Teams

8th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Milind Tambe)


Improving Adjustable Autonomy Strategies for Time-Critical Domains

8th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Nathan Schurr and Milind Tambe)


2008


Towards Faster Planning with Continuous Resources in Stochastic Domains

23rd AAAI Conference on Artificial Intelligence

(with Milind Tambe)


Introducing Communication to Dis-POMDPs with Locality of Interaction

IEEE/WIC International Joint Conference on Web Intelligence and Intelligent Agent Technology

(with Makoto Tasaki, Yuichi Yabu, Yuki Iwanari, Makoto Yokoo, Milind Tambe and Pradeep Varakantham)


Efficient Algorithms to Solve Bayesian Stackelberg Games for Security Applications

Nectar Papers Track of the 23rd AAAI Conference on Artificial Intelligence

(with Praveen Paruchuri, Jonathan P. Pearce, Milind Tambe, Fernando Ordóñez and Sarit Kraus)


Not All Agents Are Equal: Scaling up Distributed POMDPs for Agent Networks

7th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Tapana Gupta, Pradeep Varakantham, Milind Tambe and Makoto Yokoo)


Deployed ARMOR Protection:

The Application of a Game Theoretic Model for Security at the Los Angeles International Airport

Industry Track of the 7th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with James Pita, Manish Jain, Fernando Ordóñez, Christopher Portway, Craig Western, Praveen Paruchuri, Milind Tambe and Sarit Kraus)


RIAACT: A Robust Approach to Adjustable Autonomy for Human-Multiagent Teams

7th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Nathan Schurr and Milind Tambe)


Playing Games for Security: An Efficient Exact Algorithm for Solving Bayesian Stackelberg Games

7th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Praveen Paruchuri, Jonathan Pearce, Janusz Marecki, Milind Tambe, Fernando Ordóñez, Sarit Kraus)


2007


On Opportunistic Techniques for Solving Decentralized MDPs with Temporal Constraints

6th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Milind Tambe)


Letting Loose a SPIDER on a Network of POMDPs: Generating Quality Guaranteed Policies

6th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Pradeep Varakantham, Milind Tambe and Makoto Yokoo)


A Fast Analytical Algorithm for Solving Markov Decision Processes with Real-Valued Resources

20th International Joint Conference on Artificial Intelligence

(with Sven Koenig and Milind Tambe)


SPIDER Attack on a Network of POMDPs: Towards Quality Bounded Solutions

AAAI Spring Symposium on Game Theoretic and Decision Theoretic Agents

(with Pradeep Varakantham, Milind Tambe and Makoto Yokoo)


2006


Dangers in Multiagent Rescue using DEFACTO

Lecture Notes in Computer Science - An International Journal, Springer Academic Press

(with Nathan Schurr, Milind Tambe and Paul Scerri)


A Fast Analytical Algorithm for Markov Decision Process with Continuous State Spaces

8th Workshop on Game Theoretic and Decision Theoretic Agents held at the 5th International Joint Conference on Autonomous Agents and Multiagent Systems

(with Zvi Topol and Milind Tambe)


2005


The DEFACTO System: Training Tool for Incident Commanders

17th Conference on Innovative Applications of Artificial Intelligence

(with Nathan Schurr, Paul Scerri, J. P. Lewis and Milind Tambe)


The DEFACTO System for Human Omnipresence to Coordinate Agent Teams:

The Future of Disaster Response

4th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Nathan Schurr, Nikhil Kasinadhuni, Milind Tambe, J. P. Lewis and Paul Scerri)


Towards Flexible Coordination of Human-Agent Teams

Multiagent and Grid Systems - An International Journal, IOS Press

(with Nathan Schurr, Milind Tambe and Paul Scerri)


Conflicts in Teamwork: Hybrids to the Rescue

4th International Joint Conference on Autonomous Agents and Multi-agent Systems

(with Milind Tambe, Emma Bowring, Hyuckchul Jung, Gal Kaminka, Rajiv Maheswaran, Jay Modi, Ranjit Nair, Jonathan Pearce, Praveen Paruchuri, David Pynadath, Paul Scerri, Nathan Schurr and Pradeep Varakantham)


Dangers in Multiagent Rescue using DEFACTO

2nd International Workshop on Safety and Security in Multiagent Systems held at the 4th International Joint Conference on Autonomous Agents and Multiagent Systems

(with Nathan Schurr, Milind Tambe and Paul Scerri)


The Future of Disaster Response: Humans Working with Multiagent Teams using DEFACTO

AAAI Spring Symposium on “Artificial Intelligence Technologies for Homeland Security”

(with Nathan Schurr, Milind Tambe, J. P. Lewis and Nikhil Kasinadhuni)


Agent-based Simulations for Disaster Rescue Using the DEFACTO Coordination System

Book Chapter in Emergent Information Technologies and Enabling Policies for Counter Terrorism, Wiley-IEEE Press

(with Nathan Schurr and Milind Tambe)


The DEFACTO System: Coordinating Human-Agent Teams for the Future of Disaster Response

Book Chapter in Programming Multiagent Systems, Springer Academic Press

(with Nathan Schurr, Milind Tambe, Paul Scerri and J. P. Lewis)


2004


Automata, Formal Languages and Algorithms

Book Published by Jacek Skalmierski Computer Studio


2003


Planning in Semantic Network using Node Nested Grammars

Artificial Intelligence Journal of the Ukrainian Academy of Sciences


Representing Time Intervals in Semantic Networks

International Conference on Problems of Decision Making under Uncertainties


Semantic Networks and Intelligent Agents

Textbook Published by Jacek Skalmierski Computer Studio


2002


LHCb level-1 Trigger – Real Time Sorter

Workshop on LHCb Level 1 trigger, European Laboratory for Nuclear Research


Graphs and Recursions

Textbook Published by Jacek Skalmierski Computer Studio


2001


Methods of Artificial Intelligence

Textbook Published by Jacek Skalmierski Computer Studio


2000


Data Structures

Textbook Published by Jacek Skalmierski Computer Studio

The list above contains my selected publications.