Item title:
Reinforcement learning for on-line sequence transformation
In simultaneous machine translation (SMT), an output sequence should be produced as soon as possible, without reading the whole input sequence. This requirement creates a trade-off between translation delay and quality, because less context may be available during translation. In most SMT methods, this trade-off is controlled by parameters whose values need to be tuned. In this paper, we introduce an SMT system that is trained with reinforcement learning and is able to find the optimal delay during training. We conduct experiments on the Tatoeba and IWSLT2014 datasets against state-of-the-art translation architectures. Our method achieves comparable results on the former dataset, with better results on long sentences, and worse, though still comparable, results on the latter.
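The abstract describes an agent that decides, token by token, whether to read more of the source or to emit output, trading translation quality against delay. Below is a minimal sketch of such a READ/WRITE decision loop, not the authors' implementation; the names (SimultaneousAgent, rollout, the lam coefficient) and the toy quality and delay terms are illustrative assumptions only.

```python
# Minimal sketch of a READ/WRITE loop for simultaneous translation.
# All names and the reward shaping are assumptions, not the paper's method.
import random

READ, WRITE = 0, 1


class SimultaneousAgent:
    """Toy stochastic policy: picks READ or WRITE from the current state."""

    def __init__(self, write_bias=0.5):
        self.write_bias = write_bias  # assumed tunable parameter of the toy policy

    def act(self, n_read, n_written):
        if n_read == 0:               # nothing consumed yet: must read first
            return READ
        return WRITE if random.random() < self.write_bias else READ


def rollout(agent, source, translate_token):
    """Run one episode, interleaving reads of source tokens and writes of output."""
    n_read, output, delays = 0, [], []
    while len(output) < len(source):          # toy stopping rule: equal lengths
        if n_read < len(source) and agent.act(n_read, len(output)) == READ:
            n_read += 1                        # consume one more source token
        else:
            output.append(translate_token(source[:n_read], len(output)))
            delays.append(n_read)              # context size seen by this output token
    return output, delays


def reward(output, reference, delays, lam=0.1):
    # Quality term (toy token overlap) minus a delay penalty: the trade-off
    # described in the abstract; lam is an assumed weighting coefficient.
    quality = sum(o == r for o, r in zip(output, reference)) / max(len(reference), 1)
    avg_delay = sum(delays) / max(len(delays), 1)
    return quality - lam * avg_delay


if __name__ == "__main__":
    src = ["das", "ist", "ein", "Test"]
    ref = ["this", "is", "a", "test"]
    # Toy "translator": emits the reference token once enough context was read.
    translate = lambda prefix, i: ref[i] if len(prefix) > i else "<wait>"
    agent = SimultaneousAgent(write_bias=0.6)
    out, d = rollout(agent, src, translate)
    print(out, "reward:", round(reward(out, ref, d), 3))
```

In a reinforcement-learning setup the scalar returned by a reward of this kind would drive policy updates, so the read/write schedule (and hence the delay) is learned rather than fixed by hand-tuned parameters.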
1. The project was funded by POB Research Centre for Artificial Intelligence and Robotics of Warsaw University of Technology within the Excellence Initiative Program – Research University (ID-UB).
2. Track 1: 17th International Symposium on Advanced Artificial Intelligence in Applications
3. The record was prepared with funds from MEiN under agreement no. SONP/SP/546092/2022, within the "Społeczna odpowiedzialność nauki" (Social Responsibility of Science) programme - module: Popularization of science and promotion of sport (2022-2023).