Link Prediction in Agent-based Graph Database System

Loading...
Thumbnail Image

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

This work presents a scalable and interpretable link prediction framework embedded nativelywithin the Multi-Agent Spatial Simulation (MASS) library. By extending MASS’s distributed graph infrastructure and property-aware computation model, we implement both classical topological heuristics and embedding-based approaches—most notably Fast Random Projection (FastRP) combined with k-Nearest Neighbors (kNN)—to infer potential connections in graph-structured data. Topological algorithms such as Adamic-Adar and Resource Allocation, implementedas distributed primitives, demonstrate parity with Neo4j in accuracy and outperform it in execution time on large-scale query workloads. FastRP embeddings are generated via an agent-driven propagation pipeline that mirrors adjacency-based diffusion, enabling fullgraph vector generation in distributed environments. Though the current FastRP + kNN pipeline in MASS exhibits higher latency due to agent overhead and synchronization, it achieves competitive recall, especially at higher K values, validating its utility for applications that prioritize coverage over ranking precision. Experimental results on the Cora citation network show that MASS supports interactiveand batch link prediction tasks at scale, offering a memory-local alternative to centralized systems like Neo4j. This project transforms MASS from a simulation-only platform into a programmable, graph-native AI engine—capable of powering graph reasoning tasks for knowledge graphs, recommendations, and retrieval-augmented generation.

Description

Thesis (Master's)--University of Washington, 2025

Citation

DOI