Efficient Storage and Retrieval Models for Large-Scale Unstructured Data Analytics

Authors

DOI:

https://doi.org/10.62802/pdw29m66

Keywords:

unstructured data analytics, data storage models, information retrieval, big data systems, distributed databases, semantic indexing

Abstract

The exponential growth of digital information has led to an unprecedented surge in unstructured data originating from sources such as social media, multimedia platforms, sensor networks, and enterprise systems. Traditional relational databases and structured storage frameworks are increasingly inadequate for handling the scale, heterogeneity, and velocity of such data. This paper examines efficient storage and retrieval models for large-scale unstructured data analytics, focusing on distributed architectures, indexing strategies, and intelligent retrieval mechanisms. By synthesizing advances in cloud storage systems, NoSQL databases, vector search techniques, and machine learning–assisted data organization, the study evaluates how modern data infrastructures can optimize performance, scalability, and accessibility. The findings highlight the importance of hybrid storage paradigms and semantic retrieval frameworks in enabling rapid, accurate analysis of massive unstructured datasets, thereby supporting data-driven decision-making across industries.

References

Ahmad, H., & Sarwar, M. A. (2025). ILTAF, Waheed Zaman Khan. Unified Intelligence: A Comprehensive Review of the Synergy Between Data Science. Artificial Intelligence, and Machine Learning in the Age of Big Data. Sch J Eng Tech, 8, 585-617.

Cheikh, I., Roy, S., Sabir, E., & Aouami, R. (2026). Energy, scalability, data and security in massive IoT: Current landscape and future directions. IEEE Internet of Things Journal.

Dritsas, E., & Trigka, M. (2025). A Survey on Database Systems in the Big Data Era: Architectures, Performance, and Open Challenges. IEEE Access.

Ghali, M. K., Farrag, A., Won, D., & Jin, Y. (2025). Enhancing knowledge retrieval with in-context learning and semantic search through generative AI. Knowledge-Based Systems, 311, 113047.

Khemka, A., & Raj, G. (2025). Unstructured Data Ingestion: Best Practices for Acquiring, Storing, and Processing Data from 200+ External Sources.

Koukaras, P. (2025). Data Integration and Storage Strategies in Heterogeneous Analytical Systems: Architectures, Methods, and Interoperability Challenges. Information, 16(11).

Salman, M. (2025). Towards Knowledge Graph Construction From Unstructured Text with LLMs, Triple Identification and Alignment to Wikidata.

Schoder, D. (2025). Introduction to the Internet of Things. Internet of things A to Z: technologies and applications, 1-40.

Shermy, R. P., & Saranya, N. (2025). Cloud‐Based Big Data Architecture and Infrastructure. Resilient Community Microgrids, 131-188.

Vahdat, A., Badard, T., & Pouliot, J. (2025). A Semantic Collaborative Filtering-Based Recommendation System to Enhance Geospatial Data Discovery in Geoportals. ISPRS International Journal of Geo-Information, 14(12), 495.

Yuan, Q., & Lai, Y. (2025). Towards Efficient Information Retrieval in Internet of Things Environments Via Machine Learning Approaches. Journal of The Institution of Engineers (India): Series B, 106(1), 363-386.

frontpage

Downloads

Published

2026-02-09