Accepted Extended Abstracts

List of accepted Extended Abstracts
Paper ID Authors Title
885 Chaya Levinger, Noam Hazon, Amos Azaria Fair Sharing: The Shapley Value for Ride-Sharing and Routing Games
895 Haris Aziz, Edward Lee The Temporary Exchange Problem
898 Kaisheng Wu, Yong Qiao, Kaidong Chen, Fei Rong, Liangda Fang, Liping Xiong, Zhao-Rong Lai, Qian Dong Automatic Synthesis of Generalized Winning Strategy of Impartial Combinatorial Games
899 Gang Chen A New Framework for Multi-Agent Reinforcement Learning – Centralized Training and Exploration with Decentralized Execution via Policy Distillation
906 Sourav Medya, Tianyi Ma, Arlei Silva, Ambuj Singh A Game Theoretic Approach For Core Resilience
918 Murat Sensoy, Maryam Saleki, Simon Julier, Reyhan Aydogan, John Reid Not all Mistakes are Equal
927 Cristobal Pais Vulcano: Operational Fire Suppression Management Using Deep Reinforcement Learning
939 Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Weixun Wang, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhaodong Wang, Jiajie Peng Efficient Deep Reinforcement Learning through Policy Transfer
940 Yang Chen, He Zhao, Jiamou Liu, Hongyi Su Social Structure Emergence: A Multi-agent Reinforcement Learning Framework for Relationship Building
947 Nanda Kishore Sreenivas, Shrisha Rao Analyzing the Effects of Memory Biases and Mood Disorders on Social Performance
954 Zeyuan Cui, Shijun Liu, Li Pan, Qiang He Translating Embedding with Local Connection for Knowledge Graph Completion
957 Shivika Narang, Yadati Narahari A Study of Incentive Compatibility and Stability Issues in Fractional Matchings
966 Hitoshi Shimizu, Tatsushi Matsubayashi, Akinori Fujino, Hiroshi Sawada Theme Park Simulation based on Questionnaires for Maximizing Visitor Surplus
967 Yuyu Xu, David Jeong, Pedro Sequeira, Jonathan Gratch, Javed Aslam, Stacy Marsella A Supervised Topic Model Approach to Learning Effective Styles within Human-Agent Negotiation
988 Shuangfeng Zhang, Yuan Liu, Xingren Chen, Xin Zhou A POMDP-based Method for Analyzing Blockchain System Security Against Long Delay Attack
1015 Joseph Singleton, Richard Booth An Axiomatic Approach to Truth Discovery
1026 Geoff Nitschke Coordinated Control of Autonomous Vehicle Traffic on Roads of the Future
1031 Alessandro Farinelli, Antonello Contini, Davide Zorzi Decentralized task assignment for multi-item pickup and delivery in logistic scenarios
1032 Jaelle Scheuerman, Jason L. Harman, Nicholas Mattei, K. Brent Venable Heuristic Strategies in Uncertain Approval Voting Environments
1035 Pankaj Kumar Deep Reinforcement Learning for Market Making
1041 Shubham Pateria, Budhitama Subagdja, Ah Hwee Tan Hierarchical Reinforcement Learning with Integrated Count-based Discovery of Salient Subgoals
1042 Sarit Kraus, Abhijin Adiga, Sekharipuram Ravi Boolean Games: Inferring Agents’ Goals Using Taxation Queries
1044 Jan Karwowski, Jacek Mańdziuk, Adam Żychowski Anchoring Theory in Sequential Stackelberg Games
1047 Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai Sequential Advertising Agent with Interpretable User Hidden Intents
1048 Chao Yu, Yinzhao Dong, Paul Weng, Ahmed Moustafa, Hui Cheng, Hongwei Ge Structure-Motivated Interactive LEarning (SMILE): Towards Efficient and Interpretable Deep Reinforcement Learning in Robotic Control
1061 Weiwei Chen Aggregation of Support-Relations of Bipolar Argumentation Frameworks
1067 Marin Lujak, Alberto Fernandez, Eva Onaindia A Decentralized Multi-Agent Coordination Method for Dynamic and Constrained Production Planning
1073 Anirban Santara, Rishabh Madan, Balaraman Ravindran, Pabitra Mitra ExTra: Transfer-guided Exploration
1075 Rafael H. Bordini, Rem Collier, Jomi F. Hübner, Alessandro Ricci Encapsulating Reactive Behaviour in Goal-based Plans for Programming BDI Agents
1076 Raphael Koster, Joel Leibo, Dylan Hadfield-Menell, Gillian Hadfield Learning compliance and enforcement behaviors: silly rules and the microfoundations of normativity
1078 Gabor Erdelyi, Yongjie Yang Microbribery in Group Identification
1083 Jiaoyang Li, Andrew Tinka, Scott Kiesel, Joseph W. Durham, T. K. Satish Kumar, Sven Koenig Large-Scale Lifelong Multi-Agent Path Finding in Warehouses
1086 Ganesh Ghalme, Swapnil Dhamal, Shweta Jain, Sujit Gujar, Yadati Narahari Ballooning Multi-Armed Bandits
1089 Haris Aziz, Serge Gaspers, Zhaohong Sun Mechanism Design for School Choice with Soft Diversity Constraints
1090 Haris Aziz, Serge Gaspers, Zhaohong Sun, Makoto Yokoo Multiple Levels of Importance in Matching with Distributional Constraints
1098 Hiroaki Yamada, Naoyuki Kamiyama Optimal Control of Pedestrian Flows by Congestion Forecasts Satisfying User Equilibrium Conditions
1103 Harel Yedidsion, Shani Alkoby, Peter Stone Sequential Online Chore Division for Autonomous Vehicle Convoy Formation
1111 Chao Yu, Tianpei Yang, Wenxuan Zhu, Yinzhao Dong, Guangliang Li Adaptively Shaping Interactive Reinforcement Learning through Online Human Demonstrations
1113 Xiaobai Ma, Jayesh Gupta, Mykel Kochenderfer Normalizing Flow Policies for Multiagent Systems
1120 Shufan Wang, Jian Li Online Algorithms for Multi-shop Ski Rental with Machine Learned Predictions
1121 Marina Knittel, MohammadTaghi Hajiaghayi Matching Affinity Clustering: Improved Hierarchical Clustering at Scale with Guarantees
1130 Yusuke Nakata, Sachiyo Arai Mini-batch Bayesian Inverse Reinforcement Learning for Multiple Dynamics
1151 Timothy Verstraeten, Eugenio Bargiacchi, Pieter Libin, Diederik Roijers, Ann Nowé Thompson Sampling for Factored Multi-Agent Bandits
1160 Hangtian Jia, Chunxu Ren, Yujing Hu, Yingfeng Chen, Tangjie Lv, Changjie Fan, Hongyao Tang, Jianye Hao Mastering Basketball with Deep Reinforcement Learning: An Integrated Curriculum Training Approach
1167 Enrico Marchesini, Alessandro Farinelli Genetic Deep Reinforcement Learning for Mapless Navigation
1174 Yi Zhang, Yu Qian, Yichen Yao, Haoyuan Hu, Yinghui Xu Learning to cooperate: Application of deep reinforcement learning for online AGV path finding
1184 Cristina Cornelio, Andrea Loreggia, Michele Donini, Maria Silvia Pini, Francesca Rossi Voting with Random Classifiers (VORACE)
1188 Hans L. Bodlaender, Tesshu Hanaka, Lars Jaffke, Hirotaka Ono, Yota Otachi, Tom C. van der Zanden Hedonic Seat Arrangement Problems
1190 Dengji Zhao, Yiqing Huang, Liat Cohen, Tal Grinshpoun Coalitional Games with Stochastic Characteristic Functions Defined by Private Types
1217 Omer Shiran Svarzbard, Ron Lavi Competition Among Contests: a Safety Level Analysis
1220 Anagha Kulkarni, Siddharth Srivastava, Subbarao Kambhampati Signaling Friends and Head-Faking Enemies Simultaneously: Balancing Goal Obfuscation and Goal Legibility
1222 Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu, Doina Precup Option-Critic in Cooperative Multi-agent Systems
1237 Yijie Zhang, Roxana Radulescu, Patrick Mannion, Diederik Roijers, Ann Nowé Opponent Modelling for Reinforcement Learning in Multi-Objective Normal Form Games
1244 Zhi Zhang, Jiachen Yang, Hongyuan Zha Integrating independent and centralized multi-agent reinforcement learning for traffic signal network optimization
1249 Rose Wang, Sarah Wu, James Evans, Joshua Tenenbaum, David C. Parkes, Max Kleiman-Weiner Too many cooks: Coordinating multi-agent collaboration through inverse planning
1251 Prasanth Murali, Ha Trinh, Lazlo Ring, Reza Asadi, Timothy Bickmore A Friendly Face in the Crowd: Calming Agents in the Audience to Help Overcome Public Speaking Anxiety
1254 Van Nguyen, Stylianos Vasileiou, Tran Cao Son, William Yeoh Conditional Updates of Answer Set Programming and Its Application in Explainable Planning
1255 Alvaro Velasquez, Daniel Melcer Verification-Guided Tree Search
1274 Michael Amir, Alfred Bruckstein Fast Dispersion of a Swarm of Crash-prone Robots
1283 Eoin O’Neill, David Lillis, Gregory O’Hare, Rem Collier Explicit Modelling of Resources for Multi-Agent MicroServices using the CArtAgO framework
1285 Ruben Rodriguez Torrado, Ahmed Khalifa, Michael Green, Niels Justesen, Sebastian Risi, Julian Togelius Bootstrapping Conditional GANs for Video Game Level Generation
1289 Shuwa Miura, Shlomo Zilberstein Maximizing Plan Legibility in Stochastic Environments
1296 Dhaval Adjodah, Dan Calacci, Abhimanyu Dubey, Anirudh Goyal, P. M. Krafft, Esteban Moro, Alex Pentland Leveraging Communication Topologies Between Learning Agents in Deep Reinforcement Learning
1306 Joseph Suarez, Phillip Isola, Igor Mordatch, Yilun Du Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks
1315 Ferdinando Fioretto, Lesia Mitridati, Pascal Van Hentenryck Privacy-Preserving Optimization of Integrated Gas and Energy Markets
1323 Mona Alshehri, Napoleon Reyes, Andre Barczak Evolving Meta-Level Reasoning with Reinforcement Learning and A* for Coordinated Multi-Agent Path-planning
1337 Atena M. Tabakhi, Yuanming Xiao, William Yeoh Embedding Preference Elicitation Within the Search for DCOP Solutions
1339 Jose Cadena, Achla Marathe, Anil Vullikanti Finding Spatial Clusters Susceptible to Epidemic Outbreaks due to Undervaccination
1340 Prasanth Murali, Ameneh Shamekhi, Dhaval Parmar, Timothy Bickmore Argumentation is More Important than Appearance for Designing Culturally Tailored Virtual Agents
1345 Rohit Murali, Suravi Patnaik, Stephen Cranefield Mining international norms from the GDELT database
1356 Nate Gruver, Jiaming Song, Mykel Kochenderfer, Stefano Ermon Multi-agent Adversarial Inverse Reinforcement Learning with Latent Variables
1376 Amit Sarker, Abdullahil Baki Arif, Moumita Choudhury, Md. Mosaddek Khan C-CoCoA: A Continuous Cooperative Constraint Approximation Algorithm to Solve Functional DCOPs
1397 Erinc Merdivan, Sten Hanke, Matthieu Geist Modified Actor-Critics
1405 Zoi Terzopoulou, Alexander Karpov, Svetlana Obraztsova Restricted Domains of Dichotomous Preferences with Possibly Incomplete Information
1437 Kumar Abhishek, Shweta Jain, Sujit Gujar Designing Truthful Contextual Multi-Armed Bandits based Sponsored Search Auctions
1448 Adam Żychowski, Jacek Mańdziuk A Generic Metaheuristic Approach to Sequential Security Games
1450 Shoeb Siddiqui, Sujit Gujar, Ganesh Vanahalli BitcoinF: Achieving Fairness For Bitcoin In Transaction Fee Only Model
1455 Nutchanon Yongsatianchot, Stacy Marsella A Computational Model Of Human Hurricane Evacuation Decision
1461 Xiang Liu, Weiwei Wu, Minming Li, Wanyuan Wang Two-sided Auctions with Budgets: Fairness, Incentives and Efficiency
1467 Wolfram Barfuss Reinforcement learning dynamics in the infinite memory limit
1470 Shubham Gupta, Rishi Hazra, Ambedkar Dukkipati Networked Multi-Agent Reinforcement Learning with Emergent Communication
1471 Shubham Gupta, Ambedkar Dukkipati Winning an Election: On Emergent Strategic Communication in Multi-Agent Networks
1473 Thomas Spooner, Rahul Savani Robust Market Making via Adversarial Reinforcement Learning
1496 Guy Zaks, Gilad Katz CoMet: A Meta Learning-Based Approach for Cross-Dataset Labeling Using Co-Training
1497 Michele Flammini, Bojana Kodric, Giovanna Varricchio, Martin Olsen Distance Hedonic Games
1503 Wishnu Prasetya, Mehdi Dastani Alice: Agents for Testing Games
1506 Jinmingwu Jiang, Kaigui Wu Cooperative Pathfinding based on memory-efficient Multi-agent RRT*
1510 Vaibhav Bajaj, Sachit Rao Autonomous shape formation and morphing in a dynamic environment by a swarm of robots
1513 Giovanni Ciatto, Davide Calvaresi, Michael Schumacher, Andrea Omicini An Abstract Framework for Agent-Based Explanations in AI
1518 Zhiwei Zeng, Chunyan Miao, Cyril Leung, Zhiqi Shen, Jing Jih Chin Explainable and Contextual Preferences based Decision Making with Assumption-based Argumentation for Diagnostics and Prognostics of Alzheimer’s Disease
1520 Theodor Cimpeanu, The Anh Han Fear of Punishment Promotes the Emergence of Cooperation and Enhanced Social Welfare in Social Dilemmas
1530 Sai Ganesh Nagarajan, David Balduzzi, Georgios Piliouras Robust Self-organization in Games: Symmetries, Conservation Laws and Dimensionality Reduction
1544 Dorothea Baumeister, Tobias Hogrebe Complexity of Election Evaluation and Probabilistic Robustness
1572 Arthur Casals, Assia Belbachir, Amal El Fallah Seghrouchni Adaptive and collaborative agent-based traffic regulation Using Behavior Trees
1580 Vahid Yazdanpanah, Mehdi Dastani, Shaheen Fatima, Nicholas R. Jennings, Devrim M. Yazan, W. Henk M. Zijm TasCore: Dynamic Task Coordination Using Multiagent Strategic Reasoning
1584 Leandro Soriano Marcolino, Elnaz Shafipour Yourdshahi, Matheus Aparecido do Carmo Alves, Plamen Angelov Decentralised Task Allocation in the Fog: An On-line Genetic Estimator for Ad-hoc Teamwork
1585 Stefan Olafsson, Teresa O’Leary, Timothy Bickmore Laughter is the best therapy: Promoting behavior change using humorous conversational agents
1591 Nagat Drawel, Jamal Bentahar, Hongyang Qu Computationally Grounded Quantitative Trust with Time
1615 Olga Petrova, Karel Durkota, Galina Alperovich, Karel Horak, Michal Najman, Branislav Bosansky, Viliam Lisy Discovering Imperfectly Observable Adversarial Actions using Anomaly Detection
1618 Mahak Goindani, Jennifer Neville Cluster-Based Social Reinforcement Learning
1629 Sandhya Saisubramanian, Ece Kamar, Shlomo Zilberstein Mitigating the Negative Side Effects of Reasoning with Imperfect Models: A Multi-Objective Approach
1640 Niclas Boehmer, Edith Elkind Stable Roommate Problem With Diversity Preferences
1664 Guillermo Romero Moreno, Markus Brede, Long Tran-Thanh Continuous Influence Maximisation for the Voter Dynamics: Is Targeting High-Degree Nodes a Good Strategy?
1669 Yuzhong Huang, Andres Abeliuk, Fred Morstatter, Pavel Atanasov, Aram Galstyan Deep Aggregation of Hybrid Crowd Forecasts
1670 Matteo D’Auria, Eric Scott, Rajdeep Singh Lather, Javier Hilty, Sean Luke Distributed, Automated Calibration of Agent-based Model Parameters and Agent Behaviors
1677 Carlos Azevedo, Bruno Lacerda, Nick Hawes, Pedro U. Lima Long-run multi-robot planning with uncertain task durations
1683 Itay Shtechman, Rica Gonen, Erel Segal-Halevi Fair Cake-Cutting Algorithms with Real Land-Value Data
1698 Meghan Chandarana, Michael Lewis, Katia Sycara, Sebastian Scherer Human-in-the-loop Planning and Monitoring of Swarm Search and Service Missions
1701 Eliahu Khalastchi, Meir Kalech Efficient Hybrid Fault Detection for Autonomous Robots
1711 Francesco Riccio, Roberto Capobianco, Daniele Nardi GUESs: Generative modeling of Unknown Environments and Spatial Abstraction for Robots
1713 Dorothea Baumeister, Linus Boes, Tessa Seeger Irresolute Approval-based Budgeting
1716 Joel Schlosser, Aaron Keech, Phillip Odom, Zsolt Kira Unsupervised Danger Avoidance Through Reward Shaping
1726 Shih-Yun Lo, Elaine Short, Andrea Thomaz Robust Planning with Emergent Human-like Behavior for Agents Traveling in Groups
1729 Marina Moreira, Brian Coltin, Rodrigo Ventura Cooperative Real-Time Inertial Parameter Estimation
1733 Yifang Chen, Alex Cuellar, Haipeng Luo, Jignesh Modi, Heramb Nemlekar, Stefanos Nikolaidis Fair Contextual Multi-Armed Bandits: Theory and Experiments
1767 Andrea Baisero, Chris Amato Learning Complementary Representations of the Past using Auxiliary Tasks in Partially Observable Reinforcement Learning
1769 Yu Wang, Hongxia Jin An Interpretable Multimodal Visual Question Answering System using Attention-based Weighted Contextual Features
1785 Guohui Ding, Joewie Koh, Kelly Merckaert, Bram Vanderborght, Marco Nicotra, Christoffer Heckman, Alessandro Roncone, Lijun Chen Distributed Reinforcement Learning for Cooperative Multi-Robot Object Manipulation
1788 Bingyu Liu, Shangyu Xie, Yuan Hong Privacy-Aware Double Auction for Divisible Resources without a Mediator
1798 Amirarsalan Rajabi, Chathika Gunaratne, Alexander Mantzaris, Ivan Garibay Modeling disinformation and the effort to counter it: a cautionary tale of when the treatment can be worse than the disease
1800 Akshat Agarwal, Sumit Kumar, Katia Sycara Learning Transferable Cooperative Behavior in Multi-Agent Teams
1809 Hau Chan, David C. Parkes, Karim Lakhani The Price of Anarchy of Self-Selection in Tullock Contests
1817 Keting Lu, Shiqi Zhang, Xiaoping Chen AutoEG: Automated Experience Grafting for Off-Policy Deep Reinforcement Learning
1830 Gilad Asharov, Tucker Hybinette Balch, Antigoni Polychroniadou, Manuela Veloso PPDPs: Privacy-Preserving Dark Pools
1885 Rupert Mitchell, Jenny Fletcher, Jacopo Panerati, Amanda Prorok Multi-Vehicle Mixed-Reality Reinforcement Learning for Autonomous Multi-Lane Driving
1886 Fatema Tuj Johora, Hao Cheng, Jörg P. Müller, Monika Sester An Agent-Based Model for Trajectory Modelling in Shared Spaces: A Combination of Expert-Based and Deep Learning Approaches
1888 Qingbiao Li, Fernando Gama, Alejandro Ribeiro, Amanda Prorok Graph Neural Networks for Decentralized Multi-Robot Path Planning
1897 Vincent Koeman, Koen Hindriks Tool Use of Average and Active Cognitive Agent Developers
1899 Francesca Mosca, Jose Such, Peter McBurney ELVIRA: an Explainable Agent for Value and Utility-driven Multiuser Privacy
1900 Yukun Cheng, Xiaotie Deng, Yuhao Li Limiting the deviation incentives in resource sharing networks
Paper ID Authors Title
1 Hanna Klaudel*, Johan Arcile, Raymond Devillers VerifCar: a framework for modeling and model checking communicating autonomous vehicles (abstract)
2 Felipe Leno da Silva*, Garrett Warnell, Anna Helena Reali Costa, Peter Stone Agents Teaching Agents: A Survey on Inter-agent Transfer Learning
3 Haris Aziz* Strategyproof Multi-Item Exchange Under Single-Minded Dichotomous Preferences
4 Dave de Jonge*, Dongmo Zhang Strategic Negotiations for Extensive-Form Games
5 Jieting Luo*, John-Jules Meyer, Max Knobbout A Formal Framework for Reasoning about Opportunistic Propensity in Multi-agent Systems
6 Pablo Hernandez-Leal*, Bilal Kartal, Matthew E. Taylor A Very Condensed Survey and Critique of Multiagent Deep Reinforcement Learning
7 John Doucette*, Alan Tsang, Hadi Hosseini, Kate Larson, Robin Cohen Inferring True Voting Outcomes in Homophilic Social Networks
8 Noam Hazon*, Mira Gonen Probabilistic Physical Search on General Graphs: Approximations and Heuristics
9 Cristina Cornelio*, Maria Silvia Pini, Francesca Rossi, K. Brent Venable Sequential voting in multi-agent soft constraint aggregation
10 Roxana Radulescu*, Patrick Mannion, Diederik Roijers, Ann Nowé Multi-Objective Multi-Agent Decision Making: A Utility-based Analysis and Survey
11 Chengwei Zhang*, Jianye Hao SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes
12 Avi Rosenfeld*, Ariella Richardson Why, Who, What, When and How about Explainability in Human-Agent Systems
13 Rica Gonen*, Ozi Egri COMBIMA: Truthful, Budget Maintaining, Dynamic Combinatorial Market
14 Andreasa Morris-Martin*, Marina De Vos, Julian Padget Norm Emergence in Multiagent Systems: A Viewpoint Paper
15 Olabambo Ifeoluwa Oluwasuji, Obaid Malik, Jie Zhang, Sarvapali Dyanand Ramchurn Solutions to the Fair Electric Load Shedding Problem in Developing Countries