Collaborative inference at the network edge enables low-latency and privacy-sensitive artificial intelligence (AI) services without relying on remote cloud infrastructure. Distributing increasingly complex neural network models across nearby edge devices enables the pooling of their compute and memory resources to perform inference beyond the capability of any single device. Existing collaborative inference approaches rely on centralized or static control under assumptions of stable network connectivity, leading to inefficient resource use and degraded inference performance when network or system conditions change. This project reconceptualizes the wireless network not merely as a communication medium, but as a coordination substrate to orchestrate model execution and resource allocation for efficient, scalable, and resilient collaborative inference. The resulting advances will support emerging applications such as mobile health, distributed robotics, and intelligent transportation systems. The project also integrates research into teaching through new courses with hands-on learning modules on edge intelligence and distributed AI. It further strengthens an existing mentoring pipeline spanning K-12 outreach, undergraduate research participation, and graduate training, preparing students for future careers across AI, systems, and networking. The project advances the scientific foundations of network-aware collaborative inference at the intersection of networking, distributed