# Keynotes ## LLMs Meet Computer Vision - Multimodal AI - Vision-Language Models ## Token Economics - Image Processing Costs - Cost Optimization ## Use Cases - Document Analysis - Video Understanding - IoT and Robotics

Keynotes

Technical keynotes and presentations by Dr. Farshid Pirahansiah.


LLMs Meet Computer Vision

Exploring the convergence of Large Language Models and Computer Vision. Covers token economics, multimodal AI, RAG systems, and practical applications in document analysis, video understanding, and IoT/robotics.