Keywords:
Multimodal Large Language Models, Retrieval-Augmented Generation, Natural Language Processing, Computer Vision, Prompt Engineering, Vision-Language Models, AI for Education
Integration of Multimodal Large Language Models (M-LLMs) into Retrieval-Augmented Generation (RAG) systems for technical knowledge management in industrial settings. Multimodal evaluation frameworks for sequential visual content. AI-driven virtual tutoring for programming education.