Efficient Deployment of AI Models on Resource-Constrained Devices: A Novel Multi-Tenant Cascade Scheduler

Sunday 23 February 2025


In recent years, artificial intelligence has made tremendous strides in processing vast amounts of data and performing complex tasks. However, one major challenge that still persists is how to efficiently deploy these AI models on resource-constrained devices such as smartphones or smart home devices. This is because traditional deep learning architectures are designed for powerful computers with ample memory and processing power, making them unsuitable for low-end devices.


To address this issue, researchers have been exploring new techniques for deploying AI models on edge devices. One promising approach is to use a type of architecture called a multi-tenant cascade (MTC), which allows multiple AI models to share resources and optimize performance on resource-constrained devices.


In a recent study, scientists have developed a novel MTC scheduler that can dynamically adapt to changing system conditions and optimize performance for various device tiers. The scheduler, dubbed MultiTASC++, is designed to work with heterogeneous devices, including smartphones, smart home devices, and other edge devices.


The key innovation of MultiTASC++ lies in its ability to continuously monitor the performance of multiple AI models and adjust the scheduling strategy accordingly. This allows the system to dynamically allocate resources between different models, ensuring that each model receives the necessary processing power and memory allocation to perform optimally.


To evaluate the effectiveness of MultiTASC++, researchers conducted a series of experiments using various AI models, including image classification and object detection tasks. The results showed that MultiTASC++ consistently outperformed traditional scheduling strategies in terms of system throughput, accuracy, and satisfaction rate.


One significant advantage of MultiTASC++ is its ability to adapt to changing device conditions, such as changes in device availability or network connectivity. This allows the system to dynamically adjust the scheduling strategy to ensure optimal performance even in the face of uncertainty.


The potential applications of MultiTASC++ are vast. For instance, it could be used to optimize the deployment of AI models on smart home devices, enabling them to perform complex tasks such as image recognition and natural language processing with greater efficiency. Similarly, it could be used to improve the performance of edge devices in industrial settings, such as autonomous vehicles or robots.


As AI continues to evolve and become increasingly ubiquitous in our daily lives, the need for efficient and adaptive deployment strategies will only continue to grow. MultiTASC++ represents a significant step forward in this direction, offering a powerful tool for optimizing the performance of AI models on resource-constrained devices.


Cite this article: “Efficient Deployment of AI Models on Resource-Constrained Devices: A Novel Multi-Tenant Cascade Scheduler”, The Science Archive, 2025.


Artificial Intelligence, Edge Computing, Multi-Tenant Cascade, Scheduling, Optimization, Resource-Constrained Devices, Smartphones, Smart Home Devices, Deep Learning, Heterogeneous Devices.


Reference: Sokratis Nikolaidis, Stylianos I. Venieris, Iakovos S. Venieris, “MultiTASC++: A Continuously Adaptive Scheduler for Edge-Based Multi-Device Cascade Inference” (2024).


Leave a Reply