Abstract
The growth of intelligent vehicular services, like augmented reality (AR) road simulation, underscores the need for rapid, multi-modal content generation. Generative artificial intelligence (GAI) models, known for their swift production of diverse artificial intelligence-generated content (AIGC), stand out as a prime solution. However, integrating cloud-centric GAI models into vehicular networks is fraught with challenges. Notably, to offer specialized generative edge intelligence (EI) and boost vehicular AIGC, GAI models need to tap into user data and utilize significant computation resources. Moreover, their deployment across vehicular networks is essential for proximity-based distributed inferences. Yet, edge devices are resource-limited, and data sharing can raise safety and privacy concerns. Addressing these challenges, this paper introduces GAI-IoV, an EI-enabled GAI framework facilitated through the cooperation between road-side units (RSUs) and vehicles. Subsequently, we propose the workflow for collaborative fine-tuning and distributed inference. On this basis, two pivotal vehicle-centric problems are then formulated: computation and communication resource allocation for federated fine-tuning (FFT) to optimize time and energy cost, and splitting strategy of shared and local inferences to optimize inference latency and content-generation capability. To solve these optimizations, we introduce a self-adaptive global best harmony search (SGHS) algorithm for resource allocation and a backward induction method for determining inference splitting strategy. Our experiments based on the Stable Diffusion v1-4 model vouch for a superior fine-tuning and inference capabilities of GAI-IoV. Furthermore, simulations underscore its resource utilization and distributed inference efficiency in dynamic vehicular scenarios.
Original language | English (US) |
---|---|
Pages (from-to) | 1 |
Number of pages | 1 |
Journal | IEEE Transactions on Wireless Communications |
DOIs | |
State | Accepted/In press - 2024 |
Externally published | Yes |
All Science Journal Classification (ASJC) codes
- Computer Science Applications
- Electrical and Electronic Engineering
- Applied Mathematics
Keywords
- collaborative inference
- Computational modeling
- Data privacy
- edge intelligence (EI)
- fine-tuning
- Generative artificial intelligence (GAI)
- Inference algorithms
- resource allocation
- Resource management
- Systems architecture
- Task analysis
- Training
- vehicular networks