IPFS keyword search based on double-layer index
4 May 2022
Ling Cao, Yue Li
Proceedings Volume 12172, International Conference on Electronic Information Engineering and Computer Communication (EIECC 2021); 1217209 (2022) https://doi.org/10.1117/12.2634670
Event: International Conference on Electronic Information Engineering and Computer Communication (EIECC 2021), 2021, Nanchang, China
Abstract
Distributed storage plays an increasingly important role in the context of big data. The InterPlanetary File System (IPFS) is a distributed file system that can connect heterogeneous devices into a single network in a uniform way. Unlike the traditional HTTP protocol, which addresses resources by physical location, the IPFS distributed network is based on content addressing and retrieves files by their hash. However, this exact-match lookup cannot retrieve a file whose content hash is unknown, which greatly reduces file utilization and circulation. Therefore, this paper proposes a two-layer index scheme. After receiving an uploaded file, a node parses the file and builds the index. Index operations are replicated across nodes using a conflict-free replicated data type (CRDT), a data structure based on optimistic replication. IPFS pub-sub is used to deliver CRDT messages between nodes. The first-layer index is the inverted index file corresponding to each keyword. The second-layer index is the CID of the inverted index file for each keyword. Each node maintains the full index, rather than storing a dispersed index across a distributed hash table, which ensures complete search results while greatly reducing search response time. Inverted index files are stored in the IPFS network to reduce storage space and to facilitate state-based replication for newly joined nodes or nodes that have been offline for a long time. Finally, analysis of experimental data shows that the scheme can greatly reduce search response time while occupying an acceptable amount of storage space.
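The two-layer structure described in the abstract can be illustrated with a minimal in-memory sketch. It is not the authors' implementation and rests on several assumptions: the `BLOCKSTORE` dictionary stands in for the IPFS network, a SHA-256 hex digest stands in for a real CID, each keyword's posting list is modeled as a grow-only set merged by union (the abstract does not say which CRDT is used), and pub-sub delivery is replaced by a direct call to a hypothetical `merge` method. All names are illustrative.

```python
import hashlib
import json

# Assumption: a dict stands in for the IPFS network (a content-addressed store).
BLOCKSTORE = {}


def put(data: bytes) -> str:
    """Store bytes under their content hash; the hex digest is a placeholder for a CID."""
    cid = hashlib.sha256(data).hexdigest()
    BLOCKSTORE[cid] = data
    return cid


def get(cid: str) -> bytes:
    """Fetch bytes by content hash."""
    return BLOCKSTORE[cid]


def tokenize(text: str) -> set:
    """Very simple keyword extraction: lowercase alphanumeric words."""
    return {w for w in text.lower().split() if w.isalnum()}


class Node:
    def __init__(self):
        # Second-layer index: keyword -> CID of that keyword's inverted index file.
        self.index = {}

    def _postings(self, keyword: str) -> set:
        """Load the first-layer inverted index file for a keyword as a set of file CIDs."""
        cid = self.index.get(keyword)
        return set(json.loads(get(cid))) if cid else set()

    def _store_postings(self, keyword: str, postings: set) -> None:
        # Sorted serialization makes equal sets produce equal CIDs on every node.
        self.index[keyword] = put(json.dumps(sorted(postings)).encode())

    def add_file(self, content: str) -> str:
        """Store a file, then add its CID to the inverted index file of each keyword."""
        file_cid = put(content.encode())
        for kw in tokenize(content):
            self._store_postings(kw, self._postings(kw) | {file_cid})
        return file_cid

    def merge(self, other_index: dict) -> None:
        """State-based merge of another node's second-layer index (set union per keyword)."""
        for kw, cid in other_index.items():
            if self.index.get(kw) == cid:
                continue  # identical inverted index file, nothing to do
            remote = set(json.loads(get(cid)))
            self._store_postings(kw, self._postings(kw) | remote)

    def search(self, keyword: str) -> list:
        """Return the CIDs of all files indexed under the keyword."""
        return sorted(self._postings(keyword.lower()))
```

In this sketch, two nodes that each call `add_file` and then `merge` with the other's `index` end up with identical second-layer indexes, because union is commutative and the inverted index files are serialized deterministically. A node that has been offline only needs the small keyword-to-CID map to catch up, since the inverted index files themselves are fetched from the shared store on demand.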
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Ling Cao and Yue Li "IPFS keyword search based on double-layer index", Proc. SPIE 12172, International Conference on Electronic Information Engineering and Computer Communication (EIECC 2021), 1217209 (4 May 2022); https://doi.org/10.1117/12.2634670
KEYWORDS
Data storage
Distributed computing
Networks
Associative arrays
Internet
Software engineering