In this paper, Sourav Panda covers an interesting work combining reinforcement learning and natural language processing by providing a reward function that incorporates a variety of objectives (e.g., information, non-repetition, etc.). The paper is called Deep Reinforcement Learning for Dialogue Generation, by Li et al.
Presentation Link: https://psu.mediaspace.kaltura.com/media/Group+Meeting/1_sf195lgy