Visual Brief Cards for Advertising Design: A Structured UI/UX Framework for Turning Creative Intentions into Graphic Design Decisions

Haowei Tu; Siming Zhao; Andrew Zhou

doi:10.51903/ijgd.v3i1.3714

Authors

Haowei Tu, Information Systems, New York University, NY, USA
Siming Zhao, Business Analytics, Columbia University, NY, USA
Andrew Zhou, Human-Computer Interaction, CMU, PA, USA

DOI:

https://doi.org/10.51903/ijgd.v3i1.3714

Keywords:

visual brief cards, advertising design, UI/UX framework, graphic design generation, design risk, typography

Abstract

Automatic graphic design systems increasingly transform short creative intentions into visual assets, yet designers still need intermediate decisions that are easy to inspect, compare, and revise. This paper proposes Visual Brief Cards, a structured UI/UX framework that converts a design intention into a compact card containing headline, sub-heading, visual object, background mood, keywords, call to action, brand tone, design risk, and typography guidance. In response to the need for a stronger empirical basis, the revised evaluation uses OpenCOLE as the primary benchmark. All OpenCOLE splits were loaded, and the main quantitative comparison is reported on the held-out test split of 2,375 rows; GraphicBench test data and DEsignBench-Prompts are used as secondary checks. Three brief formats are compared under the same deterministic implementation: a free-form brief, a conventional JSON brief, and the proposed Visual Brief Card. On OpenCOLE test data, the card achieved the highest field reconstruction F1 (0.259), complete field coverage (1.000), measurable design-risk recovery (0.302), and the lowest computational scan-time proxy (1.061 s). Free-form text retained the highest TF-IDF semantic similarity (0.252) because it preserved the source wording with less compression. These results support a narrower claim: labeled card structure improves the visibility and recoverability of intermediate design decisions, while human-subject work is still required before making claims about designer trust, workload, or usability in practice.
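The card structure and two of the reported metrics can be illustrated with a minimal Python sketch. The nine field names follow the abstract, but the class name `VisualBriefCard`, the helper functions, and the scoring rules are assumptions made for illustration: the abstract does not specify how field coverage or field reconstruction F1 is computed, so a set-based token F1 averaged over reference fields is used here as one plausible reading, not as the authors' implementation.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class VisualBriefCard:
    """Nine labeled fields named in the abstract (identifiers are illustrative)."""
    headline: Optional[str] = None
    sub_heading: Optional[str] = None
    visual_object: Optional[str] = None
    background_mood: Optional[str] = None
    keywords: Optional[str] = None
    call_to_action: Optional[str] = None
    brand_tone: Optional[str] = None
    design_risk: Optional[str] = None
    typography: Optional[str] = None


def field_coverage(card: VisualBriefCard) -> float:
    """Fraction of card fields that hold a non-empty value."""
    values = [getattr(card, f.name) for f in fields(card)]
    filled = sum(1 for v in values if v and v.strip())
    return filled / len(values)


def token_f1(predicted: str, reference: str) -> float:
    """Set-based token F1 between two strings (assumed scoring rule)."""
    pred = set(predicted.lower().split())
    ref = set(reference.lower().split())
    overlap = len(pred & ref)
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def reconstruction_f1(card: VisualBriefCard, reference: VisualBriefCard) -> float:
    """Mean token F1 over fields present in the reference.

    A field missing from the card scores 0 for that field.
    """
    scores = []
    for f in fields(reference):
        ref_value = getattr(reference, f.name)
        if not ref_value:
            continue  # skip fields the reference does not define
        pred_value = getattr(card, f.name) or ""
        scores.append(token_f1(pred_value, ref_value))
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical example data, not drawn from OpenCOLE.
reference = VisualBriefCard(headline="Summer Sale Starts Now",
                            call_to_action="Shop today")
card = VisualBriefCard(headline="Summer Sale",
                       call_to_action="Shop today",
                       brand_tone="playful")

print(round(field_coverage(card), 3))                # 3 of 9 fields filled
print(round(reconstruction_f1(card, reference), 3))  # mean of 2/3 and 1.0
```

Under this reading, a free-form brief would first need a field extractor before it could be scored, whereas a labeled card exposes each field directly, which is the kind of recoverability the narrower claim in the abstract refers to.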

References

Amershi, S., Weld, D., Vorvoreanu, M., Fourney, A., Nushi, B., Collisson, P., Suh, J., Iqbal, S., Bennett, P. N., Inkpen, K., Teevan, J., Kikin-Gil, R., & Horvitz, E. (2019). Guidelines for human-AI interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (pp. 1-13). Association for Computing Machinery. https://doi.org/10.1145/3290605.3300233

Brooke, J. (1996). SUS: A quick and dirty usability scale. In P. W. Jordan, B. Thomas, B. A. Weerdmeester, & I. L. McClelland (Eds.), Usability evaluation in industry (pp. 189-194). Taylor & Francis.

Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D. M., Wu, J., Winter, C., ... Amodei, D. (2020). Language models are few-shot learners. In Advances in Neural Information Processing Systems, 33, 1877-1901.

Buchanan, R. (1992). Wicked problems in design thinking. Design Issues, 8(2), 5-21. https://doi.org/10.2307/1511637

Card, S. K., Moran, T. P., & Newell, A. (1983). The psychology of human-computer interaction. Lawrence Erlbaum Associates.

Creative Graphic Design Lab. (2024). DEsignBench-Prompts [Data set]. Hugging Face. https://huggingface.co/datasets/creative-graphic-design/DEsignBench-Prompts

Hart, S. G., & Staveland, L. E. (1988). Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research. In P. A. Hancock & N. Meshkati (Eds.), Human mental workload (pp. 139-183). North-Holland.

Horvitz, E. (1999). Principles of mixed-initiative user interfaces. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (pp. 159-166). Association for Computing Machinery. https://doi.org/10.1145/302979.303030

Inoue, N., Masui, K., Shimoda, W., & Yamaguchi, K. (2024). OpenCOLE: Towards reproducible automatic graphic design generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.

Kuhn, J., Chen, Y., & Chan, E. (2024). AI-driven mobile UI pattern recognition and design topic mining on RICO: Semantic clustering and screenshot-based topic classification. Journal of Advanced Computing Systems, 4(5), 67-83. https://doi.org/10.69987/JACS.2024.40506

Jia, P., Li, C., Yuan, Y., Liu, Z., Shen, Y., Chen, B., Chen, X., Zheng, Y., Chen, D., Li, J., Xie, X., Zhang, S., & Guo, B. (2023). COLE: A hierarchical generation framework for multi-layered and editable graphic design. arXiv. https://arxiv.org/abs/2311.16974

Ki, D., Zhou, T., Carpuat, M., Wu, G., Mathur, P., & Swaminathan, V. (2025). GraphicBench: A planning benchmark for graphic design with language agents. Preprint.

Kikuchi, K., Simo-Serra, E., Otani, M., & Yamaguchi, K. (2021). Constrained graphic layout generation via latent optimization. In Proceedings of the 29th ACM International Conference on Multimedia (pp. 88-96). Association for Computing Machinery. https://doi.org/10.1145/3474085.3475497

Li, F., Liu, A., Feng, W., Zhu, H., Li, Y., Zhang, Z., Lv, J., Zhu, X., Shen, J., & Lin, Z. (2023). Relation-aware diffusion model for controllable poster layout generation. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (pp. 1249-1258). Association for Computing Machinery. https://doi.org/10.1145/3583780.3615028

Nielsen, J. (1994). Enhancing the explanatory power of usability heuristics. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (pp. 152-158). Association for Computing Machinery. https://doi.org/10.1145/191666.191729

Norman, D. A. (2013). The design of everyday things: Revised and expanded edition. Basic Books.

OpenAI. (2023). GPT-4 technical report. arXiv. https://arxiv.org/abs/2303.08774

Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (pp. 8748-8763). PMLR.

Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. (2022). High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 10684-10695).

Shneiderman, B. (2022). Human-centered AI. Oxford University Press.

Sweller, J. (1988). Cognitive load during problem solving: Effects on learning. Cognitive Science, 12(2), 257-285. https://doi.org/10.1207/s15516709cog1202_4

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. In Advances in Neural Information Processing Systems, 30.

Yamaguchi, K. (2021). CanvasVAE: Learning to generate vector graphic documents. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 5481-5489).

Chen, Y., & Chan, E. (2023). Multimodal UI representation learning: Ablation of screenshot, wireframe, and view-hierarchy proxies on an uploaded 168-screen dataset. Journal of Advanced Computing Systems, 3(1), 1-15. https://doi.org/10.69987/JACS.2023.30101
