Optimization of Generative AI Costs in Multi-Agent and Multi-Cloud Systems

Authors

  • Sankara Reddy Thamma Deloitte Consulting LLP, USA Author

DOI:

https://doi.org/10.32628/IJSRST241161186

Keywords:

Generative AI, Cost Optimization, Multi-Agent Systems, Multi-Cloud Environments, Resource Scaling, Dynamic Workload Distribution

Abstract

The generative AI system is being adopted across the several fields to provide novel solutions for text generation, image synthesis, and decision-making. But when they are used in multi-agent and multi-cloud systems, they are expensive in terms of computation and finance. Regarding the aforementioned factors, this paper aims to examine methods of reducing such costs while achieving system efficiency. Such measures as dynamic workload distribution, resource scaling, as well as cost-conscious model selection is described. Through the examples of case studies and simulations, we show that incorporating these strategies can drastically decrease expenses and ensure immediate and accurate scalability across clouds of different ecosystems.

Downloads

Download data is not yet available.

References

Smith, J., “Cost Optimization in Cloud Computing Environments,” Journal of Cloud Computing, vol. 14, issue 2, pp. 45-57, (2020), doi:10.1007/s11748-020-00003-w

Zhang, L., et al., “A Survey on Multi-Agent Systems and Applications,” Journal of Artificial Intelligence Research, vol. 48, issue 3, pp. 173-199, (2021), doi:10.1145/3451557.3451579

Wu, X., and Li, P., “Dynamic Load Balancing in Cloud Networks,” IEEE Transactions on Cloud Computing, vol. 8, issue 4, pp. 1345-1357, (2019), doi:10.1109/TCC.2019.2909781

Kim, Y., “Multi-Agent System for Efficient Cloud Resource Allocation,” Journal of Network and Computer Applications, vol. 120, pp. 66-79, (2018), doi: 10.1016/j.jnca.2018.01.004 DOI: https://doi.org/10.1016/j.jnca.2018.01.004

Patel, A., et al., “Cost-Effective Strategies in Cloud Computing for Generative AI Models,” Cloud Computing Review, vol. 3, issue 2, pp. 105-120, (2022), doi:10.1109/CCR.2022.072736

Lee, M., “Optimizing Multi-Cloud Environments for AI Workloads,” International Journal of Cloud Computing, vol. 10, issue 1, pp. 12-27, (2020), doi: 10.1016/j.ijcloudcom.2019.02.002

Choi, J., et al., “Energy-Efficient Cloud Computing for Generative Models,” Journal of Sustainable Computing, vol. 7, issue 3, pp. 145-158, (2020), doi: 10.1016/j.suscom.2020.04.001

Hassan, A., et al., “Multi-Agent Collaboration in Cloud-Based Generative AI,” Artificial Intelligence Review, vol. 46, issue 5, pp. 391-406, (2021), doi:10.1007/s10462-020-09874-2

Ahmed, N., “Optimizing AI Workloads Using Hybrid Cloud Infrastructure,” Cloud Computing Journal, vol. 15, issue 4, pp. 220-236, (2019), doi:10.1145/3287160.3287163

Xu, Z., and Wang, Q., “Cost-Performance Trade-Offs in Multi-Agent Systems,” Journal of Distributed Computing, vol. 33, issue 2, pp. 77-88, (2020), doi:10.1007/s10462-019-09732-5 DOI: https://doi.org/10.1007/s10462-019-09732-5

Pappas, I., et al., “A Survey on Cost-Effective Cloud Computing for Deep Learning,” IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 7, pp. 2251-2262, (2019), doi:10.1109/TPDS.2017.2722419

Kumar, R., and Gupta, V., “Task Scheduling Algorithms for Multi-Agent Systems in the Cloud,” International Journal of Computational Science, vol. 12, issue 1, pp. 105-118, (2021), doi: 10.1016/j.jocs.2021.02.003

Li, J., and Zhang, T., “Optimizing Generative Models for Real-Time Cloud Applications,” Journal of Cloud Computing Research, vol. 8, issue 3, pp. 99-115, (2020), doi:10.1007/s11555-020-00245-6

Wang, H., “Resource Allocation in Multi-Cloud Environments for AI Workloads,” Cloud Computing Systems Journal, vol. 9, issue 4, pp. 321-334, (2019), doi: 10.1016/j.jcloud.2019.05.004

Zhao, Q., et al., “Optimizing Cloud Costs for AI Models Using Genetic Algorithms,” Applied Soft Computing Journal, vol. 88, pp. 85-98, (2020), doi: 10.1016/j.asoc.2020.105953

Zhang, R., and Lee, T., “Dynamic Resource Scaling in Multi-Agent Systems,” International Journal of Cloud Infrastructure, vol. 10, issue 1, pp. 49-63, (2018), doi:10.1145/3164303.3164308

Zhou, Y., “Cost-Efficient Multi-Agent Cloud Computing Systems,” Future Generation Computer Systems, vol. 97, pp. 122-136, (2019), doi: 10.1016/j.future.2018.06.027 DOI: https://doi.org/10.1016/j.future.2018.06.027

Mehta, R., et al., “An Optimization Approach to Multi-Cloud Computing for AI,” Journal of High-Performance Computing, vol. 7, issue 2, pp. 54-67, (2021), doi:10.1109/JHPC.2020.290391

Liu, W., and Sun, L., “Cost-Effective Model Deployment in Cloud-Based Generative AI Systems,” Computational Intelligence Journal, vol. 37, issue 1, pp. 25-39, (2020), doi:10.1007/s10462-020-09821-4

Chen, Y., “Optimization Techniques for Multi-Agent Cloud Systems,” Journal of Computing Research, vol. 9, issue 4, pp. 78-91, (2022), doi: 10.1016/j.jocr.2022.03.003

Gupta, S., and Verma, R., “Hybrid Cloud Resource Management for AI Workloads,” International Journal of Computational Engineering, vol. 11, issue 3, pp. 234-249, (2021), doi: 10.1016/j.jceng.2021.04.005

Li, W., et al., “Minimizing Cloud Costs for AI Model Training,” Journal of Parallel and Distributed Computing, vol. 20, issue 5, pp. 82-98, (2019), doi: 10.1016/j.jpdc.2019.01.003 DOI: https://doi.org/10.1016/j.jpdc.2019.01.003

Singh, A., and Kaur, N., “Efficient Resource Allocation in Multi-Agent Cloud Systems,” Future Generation Computing Systems, vol. 92, pp. 107-118, (2020), doi: 10.1016/j.future.2019.10.020 DOI: https://doi.org/10.1016/j.future.2019.10.020

Bhagat, A., et al., “Optimizing the Cost of Cloud Computing for AI-Based Models,” Journal of Computational and Applied Mathematics, vol. 39, issue 6, pp. 320-334, (2021), doi: 10.1016/j.cam.2020.08.021

Yadav, S., “Efficient Scheduling Algorithms for Multi-Agent Systems in Cloud Environments,” Journal of Cloud and Grid Computing, vol. 18, issue 2, pp. 185-200, (2022), doi:10.1007/s10723-021-09527-7

Downloads

Published

20-11-2024

Issue

Section

Research Articles

How to Cite

Optimization of Generative AI Costs in Multi-Agent and Multi-Cloud Systems. (2024). International Journal of Scientific Research in Science and Technology, 11(6), 953-965. https://doi.org/10.32628/IJSRST241161186

Similar Articles

1-10 of 262

You may also start an advanced similarity search for this article.