SCALABLE AI SEARCH FOR E-COMMERCE: HYBRID RAG ARCHITECTURE AND VECTOR DATABASES

Authors

DOI:

https://doi.org/10.30888/2663-5712.2025-34-03-010

Keywords:

artificial intelligence, hybrid retrieval, retrieval-augmented generation (RAG), vector databases, e-commerce search, embeddings, approximate nearest neighbor (ANN), multi-tenancy, FinOps optimization, latency management, scalable infrastructure

Abstract

The rapid expansion of multi-tenant e-commerce ecosystems with millions of SKUs demands a new generation of scalable, latency-aware, and cost-efficient AI search architectures. Maintaining stable interactive performance (p95/p99 latency) under fluctuating

References

Ramachandran, Anand. (2025). Advancing Retrieval-Augmented Generation (RAG) Innovations, Challenges, and the Future of AI Reasoning.

Zou, Jiaru & Fu, Dongqi & Chen, Sirui & He, Xinrui & Li, Zihao & Zhu, Yada & Han, Jiawei & He, Jingrui. (2025). GTR: Graph-Table-RAG for Cross-Table Question Answering. 10.48550/arXiv.2504.01346.

Youvan, Douglas. (2025). Retrieval-Augmented Generation (RAG): Advancing AI with Dynamic Knowledge Integration. 10.13140/RG.2.2.30888.89606.

Meduri, Karthik & Nadella, Geeta & Gonaygunta, Hari & Maturi, Mohan & Fatima, Farheen. (2024). Efficient RAG Framework for Large-Scale Knowledge Bases.

Joshi, Satyadhar. (2025). Introduction to Vector Databases for Generative AI: Applications, Performance, Future Projections, and Cost Considerations. IARJSET. 12. 79-93. 10.17148/IARJSET.2025.12210.

Gupta, Shailja & Ranjan, Rajesh & Singh, Surya. (2024). A Comprehensive Survey of Retrieval-Augmented Generation (RAG): Evolution, Current Landscape and Future Directions. 10.48550/arXiv.2410.12837.

Wehnert, Sabine & Chovatta, Bhavya & De Luca, Ernesto. (2025). LLMs, Knowledge Graphs, and Hybrid Search: Task-Specific Approaches to Legal AI in COLIEE.

Yang, Qimin & Zuo, Huan & Su, Runqi & Su, Hanyinghong & Zeng, Tangyi & Zhou, Huimei & Wang, Rongsheng & Chen, Jiexin & Lin, Yijun & Chen, Zhiyi & Tan, Tao. (2025). Dual retrieving and ranking medical large language model with retrieval augmented generation. Scientific Reports. 15. 10.1038/s41598-025-00724-w.

Athamakuri, Swamy & Thiruveedula, Jagadeesh. (2025). Microservices Architecture in E-commerce: A Comparative Analysis of Performance, Scalability, and Maintainability. International Journal for Research Publication and Seminar. 16. 110-124. 10.36676/jrps.v16.i2.56.

Vu, Dinh-Dai & Trần, Minh-Ngọc & Kim, Younghan. (2022). Predictive Hybrid Autoscaling for Containerized Applications. IEEE Access. 10. 1-1. 10.1109/ACCESS.2022.3214985.

Rowley, Jennifer. (2000). Product search in e-shopping: A review and research propositions. Journal of Consumer Marketing. 17. 10.1108/07363760010309528.

Taipalus, Toni. (2024). Vector database management systems: Fundamental concepts, use-cases, and current challenges. Cognitive Systems Research. 85. 101216. 10.1016/j.cogsys.2024.101216.

Wang, Mengzhao & Xu, Xiaoliang & Yue, Qiang & Wang, Yuxiang. (2021). A comprehensive survey and experimental comparison of graph-based approximate nearest neighbor search. PVLDB. 14. 1964-1978. 10.14778/3476249.3476255.

Deshmukh, Gaurav. (2023). Vector Databases.

Published

2025-11-30

How to Cite

Цимбал, А. (2025). SCALABLE AI SEARCH FOR E-COMMERCE: HYBRID RAG ARCHITECTURE AND VECTOR DATABASES. SWorldJournal, 3(34-03), 3–15. https://doi.org/10.30888/2663-5712.2025-34-03-010

Issue

Section

Articles