Using Semantic Similarity as Reward for Reinforcement Learning in ...

Our experiments show that reinforcement learning with semantic similarity reward improves the BLEU scores from the baseline LSTM NMT model. Anthology ID: ...

The solution in related work is to train the model to convergence using CCE, then add a fine-tuning step with semantic similarity loss. ... Note that BLEU ...

Abstractive summarization with deep reinforcement learning using ...

We then introduce a deep reinforcement learning algorithm that uses the proposed semantic similarity measures as rewards, together with a mixed ...

Reinforcement Learning-powered Semantic Communication via ...

... similarity are given a positive reward, while those with a low semantic similarity are therefore punished (also see Fig. 2(b)). In a ...

Semantic Similarity; A novel metric to measure relatedness between two different agents in Deep Reinforcement Learning scenarios with ...

Query focused Summarization using Deep Reinforcement Learning

To aid the RL training, we propose a better semantic similarity reward, enabled by a novel Passage Embedding scheme developed using Cluster ...

Combining Semantic Guidance and Deep Reinforcement Learning ...

The reward function for the agent is usually learnt using a generative adversarial network (GAN) [9], which provides a measure of similarity between the final.

Reinforced Abstractive Text Summarization With Semantic Added ...

Recent studies have sought to overcome the bias that cross-entropy-based learning methods can have through reinforcement learning (RL)-based ...

Rewarding Semantic Similarity under Optimized Alignments for AMR ...

(2019) separately train paraphrastic sentence embeddings that provide semantic similarity rewards to a neu- ... Deep reinforcement learning with dis-.

Exploring Reinforcement Learning Rewards for Summarisation

30 References · Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization · Multi-Reward Reinforced Summarization with ...

Dialogue Generation using Reinforcement Learning and Neural ...

We study the effects of reward functions such as semantic coherence, information flow and ease of answering during simulated agent to agent conversation (the ...

Evaluating BERT-based Rewards for Question Generation with ...

Prior works have identified a range of shortcomings (including semantic drift and exposure bias) and thus have turned to the reinforcement ...

Reinforcement Learning-powered Semantic Communication via ...

Reinforcement Learning-powered Semantic Communication via Semantic Similarity. We introduce a new semantic communication mechanism, ...

A Deep Reinforced Model for Zero-Shot Cross-Lingual ...

The model uses reinforcement learning to directly optimize a bilingual semantic similarity metric between the summaries generated in a target language and ...

Survey on reinforcement learning for language processing

A validity reward by checking the output of the primary model at the surface and at semantic levels is used. This reward function requires prior ...

I don't understand the difference between asymmetric retrieval ...

... machine learning” or “AI”). Its descriptions of retrieval and ... using sentence similarity of the search text with chunks (sentences?)

Reinforcement Learning for Abstractive Question Summarization ...

Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards · S. Yadav, D. Gupta, +1 author. Dina Demner- ...

Weakly Supervised Deep Reinforcement Learning for Video ...

A new semantic reward Rsem is proposed to measure the similarity between these two video representations, where sim(·, ·) is a similarity function (here we use ...

Evaluating BERT-based Rewards for Question Generation with ...

Information systems → Question answering. Keywords: question generation, reinforcement learning, reward functions. ACM Reference Format: Peide ...

Longitudinal Data and a Semantic Similarity Reward for Chest X-Ray...

Our reward for reinforcement learning leverages CXR-BERT -- which captures the clinical semantic similarity between reports. Training with this reward forces ...