Presented a poster at the Natural Language Generation, Evaluation, and Metrics (GEM) workshop at ACL-IJCNLP 2021. [paper]