MSc Thesis Presentation - Inna Ivanova

Date

Name:  Inna Ivanova
Date:   27 May 2025
Time:   11:30am Vancouver Time
Location:   Zoom https://zoom.us/j/95231499214?pwd=vOqajGc2u2s0n2HTU6EKnMp5ptqkt3.1
Supervisor(s):  Giuseppe Carenini, Leonid Sigal

Title: Discourse-guided Text-generation from Knowledge Graphs and Image Scene Graphs 

Abstract: This thesis introduces a discourse-guided approach for generating text from semi-structured data -- knowledge graphs and image scene graphs. We provide a novel architecture that integrates discourse planning as an intermediary structuring step, with the objective of enhancing coherence, readability, and the overall quality of generated content. The proposed method reorders input graph nodes into a coherent discourse sequence prior to decoding, utilizing both Pointer Networks and Large Language Models (LLMs) to represent discourse structures. Our experiments focus on two distinct datasets — Agenda (scientific abstracts) and Visual Genome (image captioning) — illustrating that explicit discourse planning consistently enhances performance across standard natural language generation metrics and improves output quality as evaluated by both human and LLM-based assessments. This thesis presents a generalizable approach for integrating discourse structure into neural text generation systems and emphasizes the potential of large language models as both planners and evaluators in natural language creation tasks.