BIBLIOS

  Ciências References Management System


Publication details

Document type
Conference papers

Document subtype
Full paper

Title
One Arm to Rule Them All: Online Learning with Multi-armed Bandits for Low-Resource Conversational Agents

Participants in the publication
Vânia Mendonça (Author)
Luísa Coheur (Author)
INSTITUTO SUPERIOR TÉCNICO
Alberto Sardinha (Author)
INSTITUTO SUPERIOR TÉCNICO

Summary
In a low-resource scenario, the lack of annotated data can be an obstacle not only to training a robust system, but also to evaluating and comparing different approaches before deploying the best one for a given setting. We propose to dynamically find the best approach for a given setting by taking advantage of feedback naturally present in the scenario at hand (when it exists). To this end, we present a novel application of online learning algorithms, in which we frame the choice of the best approach as a multi-armed bandit problem. Our proof of concept is a retrieval-based conversational agent, in which the answer selection criteria available to the agent are the competing approaches (arms). In our experiment, an adversarial multi-armed bandit approach converges to the performance of the best criterion after just three interaction turns, which suggests that our approach is appropriate for a low-resource conversational agent.
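
The framing described in the summary can be illustrated with a minimal sketch. It assumes Exp3, a standard adversarial bandit algorithm; this record does not say which algorithm the authors used. The criterion names, feedback rates, and the `gamma` value are hypothetical placeholders, not values from the paper. Each arm stands for one answer selection criterion, and the reward stands for user feedback on the selected answer.

```python
import math
import random


class Exp3:
    """Exp3 adversarial bandit over a fixed set of arms (rewards in [0, 1])."""

    def __init__(self, n_arms, gamma=0.1, seed=None):
        self.n_arms = n_arms
        self.gamma = gamma  # exploration rate (hypothetical default)
        self.weights = [1.0] * n_arms
        self.rng = random.Random(seed)

    def probabilities(self):
        # Mix the weight-proportional distribution with uniform exploration.
        total = sum(self.weights)
        return [
            (1 - self.gamma) * w / total + self.gamma / self.n_arms
            for w in self.weights
        ]

    def select_arm(self):
        probs = self.probabilities()
        return self.rng.choices(range(self.n_arms), weights=probs, k=1)[0]

    def update(self, arm, reward):
        # Importance-weighted reward estimate for the arm that was played.
        probs = self.probabilities()
        estimate = reward / probs[arm]
        self.weights[arm] *= math.exp(self.gamma * estimate / self.n_arms)
        # Rescale so weights stay bounded; probabilities are unchanged.
        top = max(self.weights)
        self.weights = [w / top for w in self.weights]


def simulate(turns=500, seed=0):
    # Hypothetical criteria and feedback rates, for illustration only.
    criteria = ["criterion_a", "criterion_b", "criterion_c"]
    success_rate = [0.2, 0.8, 0.4]
    bandit = Exp3(len(criteria), gamma=0.1, seed=seed)
    feedback_rng = random.Random(seed + 1)
    for _ in range(turns):
        arm = bandit.select_arm()
        # Simulated binary user feedback on the answer chosen by this criterion.
        reward = 1.0 if feedback_rng.random() < success_rate[arm] else 0.0
        bandit.update(arm, reward)
    return dict(zip(criteria, bandit.probabilities()))


if __name__ == "__main__":
    for name, prob in simulate().items():
        print(f"{name}: {prob:.3f}")
```

In this sketch the agent never needs annotated data up front: it only observes feedback for the criterion it actually used in each turn, and the selection probabilities shift toward criteria that receive better feedback.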

Date of Publication
2021

Event
EPIA Conference on Artificial Intelligence

Publication Identifiers
ISSN - 0302-9743

Publisher
Springer International Publishing

Number of pages
10
Starting page
625
Last page
634

Document Identifiers
DOI - https://doi.org/10.1007/978-3-030-86230-5_49
URL - http://dx.doi.org/10.1007/978-3-030-86230-5_49

Keywords
Online learning; Multi-armed bandits; Conversational agents


Export

APA
Mendonça, V., Coheur, L., & Sardinha, A. (2021). One Arm to Rule Them All: Online Learning with Multi-armed Bandits for Low-Resource Conversational Agents. EPIA Conference on Artificial Intelligence, 625-634. https://doi.org/10.1007/978-3-030-86230-5_49

IEEE
V. Mendonça, L. Coheur, and A. Sardinha, "One Arm to Rule Them All: Online Learning with Multi-armed Bandits for Low-Resource Conversational Agents," in EPIA Conference on Artificial Intelligence, 2021, pp. 625-634, doi: 10.1007/978-3-030-86230-5_49.

BIBTEX
@InProceedings{57544,
  author    = {Vânia Mendonça and Luísa Coheur and Alberto Sardinha},
  title     = {One Arm to Rule Them All: Online Learning with Multi-armed Bandits for Low-Resource Conversational Agents},
  booktitle = {EPIA Conference on Artificial Intelligence},
  year      = 2021,
  pages     = {625--634},
  address   = {},
  publisher = {Springer International Publishing}
}