Text-Grounded LLM-Assisted Design Rationale Interfaces: Turning Advertising Layout Metadata into Explainable UI/UX Decision Cards

Qiyou  Wu; Siquan  Meng; Jacob  Zhao

doi:10.51903/ijgd.v3i1.3713

Authors

Qiyou Wu Artificial Intelligence, Northeastern University, MA, USA
Siquan Meng Applied Business Analytics, Boston University, MA, USA
Jacob Zhao Applied Analytics, Columbia University, NY, USA

DOI:

https://doi.org/10.51903/ijgd.v3i1.3713

Keywords:

UI/UX design, explainable AI, advertising layouts, OpenCOLE, LLM critique

Abstract

Automatic graphic design generation has progressed from template retrieval to layered generative pipelines, but designers still need explainable review support that links generated layouts to practical UI/UX criteria. This paper presents a text-grounded, LLM-assisted rationale-card interface for advertising layouts using OpenCOLE annotations, including intention, description, keywords, background caption, object caption, heading, and subheading. The interface converts these fields into structured decision cards covering visual hierarchy, white space, call-to-action (CTA) prominence, brand tone, layout balance, and risk warning. An experimental evaluation was conducted on all 23,419 rows from the OpenCOLE train, validation, and test splits. Because expert UI critique labels are unavailable, the tasks are evaluated as weak-label schema-recovery tasks rather than professional design-quality judgments. A linear SVM achieved 0.888 accuracy and 0.872 macro-F1 for nine-way design-purpose recovery. The best CTA model reached 0.817 accuracy, 0.801 macro-F1, and 0.884 quadratic weighted kappa, while text-based layout-proxy prediction reached 0.734 accuracy and 0.706 macro-F1. Structured cards improved required-dimension coverage from 4.21 to 5.92 out of 6 and field-grounding from 0.681 to 0.918. The findings show that OpenCOLE text fields can support inspectable first-pass design review without replacing expert judgment or direct visual-layout evaluation.

References

Amershi, S., Weld, D., Vorvoreanu, M., Fourney, A., Nushi, B., Collisson, P., Suh, J., Iqbal, S., Bennett, P. N., Inkpen, K., Teevan, J., Kikin-Gil, R., & Horvitz, E. (2019). Guidelines for human-AI interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (pp. 1-13). Association for Computing Machinery. https://doi.org/10.1145/3290605.3300233

Burge, J. E. (2008). Design rationale: Researching under uncertainty. Artificial Intelligence for Engineering Design, Analysis and Manufacturing, 22(4), 311-324. https://doi.org/10.1017/S0890060408000235

Deka, B., Huang, Z., Franzen, C., Hibschman, J., Afergan, D., Li, Y., Nichols, J., & Kumar, R. (2017). Rico: A mobile app dataset for building data-driven design applications. In Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology (pp. 845-854). Association for Computing Machinery. https://doi.org/10.1145/3126594.3126651

Duan, P., Warner, J., Li, Y., & Hartmann, B. (2024). Generating automatic feedback on UI mockups with large language models. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery. https://doi.org/10.1145/3613904.3642782

Gupta, K., Lazarow, J., Achille, A., Davis, L. S., Mahadevan, V., & Shrivastava, A. (2021). LayoutTransformer: Layout generation and completion with self-attention. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 1004-1014).

Inoue, N., Kikuchi, K., Simo-Serra, E., Otani, M., & Yamaguchi, K. (2023). LayoutDM: Discrete diffusion model for controllable layout generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 10167-10176).

Inoue, N., Masui, K., Shimoda, W., & Yamaguchi, K. (2024). OpenCOLE: Towards reproducible automatic graphic design generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (pp. 8131-8135).

Jason Kuhn, Yushan Chen, & Evelyn Chan. (2024). AI-Driven Mobile UI Pattern Recognition and Design Topic Mining on RICO: Semantic Clustering and Screenshot-Based Topic Classification. Journal of Advanced Computing Systems , 4(5), 67-83. https://doi.org/10.69987/JACS.2024.40506

Jia, P., Li, C., Yuan, Y., Liu, Z., Shen, Y., Chen, B., Chen, X., Zheng, Y., Chen, D., Li, J., Xie, X., Zhang, S., & Guo, B. (2024). COLE: A hierarchical generation framework for multi-layered and editable graphic design. arXiv. https://arxiv.org/abs/2311.16974

Kikuchi, K., Simo-Serra, E., Otani, M., & Yamaguchi, K. (2021). Constrained graphic layout generation via latent optimization. In Proceedings of the 29th ACM International Conference on Multimedia (pp. 88-96). Association for Computing Machinery. https://doi.org/10.1145/3474085.3475497

Kong, X., Jiang, L., Chang, H., Zhang, H., Hao, Y., Gong, H., & Essa, I. (2022). BLT: Bidirectional layout transformer for controllable layout generation. In European Conference on Computer Vision (pp. 474-490). Springer. https://doi.org/10.1007/978-3-031-19790-1_29

Kunz, W., & Rittel, H. W. J. (1970). Issues as elements of information systems (Working Paper No. 131). Center for Planning and Development Research, University of California, Berkeley.

Lee, J. (1991). What's in design rationale? Human-Computer Interaction, 6(3-4), 251-280. https://doi.org/10.1207/s15327051hci0603&4_3

Li, J., Yang, J., Hertzmann, A., Zhang, J., & Xu, T. (2019). LayoutGAN: Generating graphic layouts with wireframe discriminators. In International Conference on Learning Representations.

Liao, Q. V., Gruen, D., & Miller, S. (2020). Questioning the AI: Informing design practices for explainable AI user experiences. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (pp. 1-15). Association for Computing Machinery. https://doi.org/10.1145/3313831.3376590

Liao, Q. V., Pribic, M., Han, J., Miller, S., & Sow, D. (2021). Question-driven design process for explainable AI user experiences. arXiv. https://arxiv.org/abs/2104.03483

Moran, T. P., & Carroll, J. M. (Eds.). (1996). Design rationale: Concepts, techniques, and use. Lawrence Erlbaum Associates.

Nielsen, J. (1994). Enhancing the explanatory power of usability heuristics. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (pp. 152-158). Association for Computing Machinery. https://doi.org/10.1145/191666.191729

O'Donovan, P., Agarwala, A., & Hertzmann, A. (2015). DesignScape: Design with interactive layout suggestions. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (pp. 1221-1224). Association for Computing Machinery. https://doi.org/10.1145/2702123.2702149

Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1135-1144). Association for Computing Machinery. https://doi.org/10.1145/2939672.2939778

Shneiderman, B. (2020). Human-centered artificial intelligence: Reliable, safe, and trustworthy. International Journal of Human-Computer Interaction, 36(6), 495-504. https://doi.org/10.1080/10447318.2020.1741118

Wang, D., Yang, Q., Abdul, A., & Lim, B. Y. (2019). Designing theory-driven user-centric explainable AI. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (pp. 1-15). Association for Computing Machinery. https://doi.org/10.1145/3290605.3300831

Yamaguchi, K. (2021). CanvasVAE: Learning to generate vector graphic documents. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 5481-5489).

Yushan Chen, & Evelyn Chan. (2023). Multimodal UI Representation Learning: Ablation of Screenshot, Wireframe, and View-Hierarchy Proxies on an Uploaded 168-Screen Dataset. Journal of Advanced Computing Systems , 3(1), 1-15. https://doi.org/10.69987/JACS.2023.30101

Zhang, H. (2025). LLM-Driven CI Failure Diagnosis and Automated Repair: From GitHub Actions Logs to Patch Recommendation. Journal of Technology Informatics and Engineering, 4(1), 190-214. https://doi.org/10.51903/jtie.v4i1.484

Zheng, X., Qiao, X., Cao, Y., & Lau, R. W. H. (2019). Content-aware generative modeling of graphic design layouts. ACM Transactions on Graphics, 38(4), Article 133. https://doi.org/10.1145/3306346.3322971

Text-Grounded LLM-Assisted Design Rationale Interfaces: Turning Advertising Layout Metadata into Explainable UI/UX Decision Cards

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

Most read articles by the same author(s)

menunew