Network Digital Twins for High-Performance Computing

This talk outlines a holistic framework for network digital twins (NDTs) in high-performance computing, covering construction, optimization, and application. The approach integrates federated learning for privacy-preserving model updates and reinforcement learning for closed-loop control, enabling real-time adaptation to traffic and system dynamics. Topics Covered HPC network digital twin construction: topology and telemetry ingestion, calibration Distributed and regional twin orchestration Federated learning for privacy-preserving NDT updates Reinforcement learning for closed-loop HPC network control Case studies: workload forecasting, congestion-aware routing, anomaly detection Portability to edge and 6G-class environments Links Workshop Program

November 2025 · Z. Zhang

Digital Network Twins for Next-Generation Wireless: Creation, Optimization, and Challenges

Download Paper arXiv Abstract Digital network twins (DNTs), by representing a physical network using a virtual model, offer significant benefits such as streamlined network development, enhanced productivity, and cost reduction for next-generation (nextG) communication infrastructure. Existing works mainly describe the deployment of DNT technologies in various service sections. The full life cycle of DNTs for telecommunication has not yet been comprehensively studied, particularly in the aspects of fine-grained creation, real-time adaptation, resource-efficient deployment, and security protection. This article presents an in-depth overview of DNTs, exploring their concrete integration into networks and communication, covering the fundamental designs, the emergent applications, and critical challenges in multiple dimensions. We also include two detailed case studies to illustrate how DNTs can be applied in real-world scenarios such as wireless traffic forecasting and edge caching. Additionally, a forward-looking vision of the research opportunities in tackling the challenges of DNTs is provided, aiming to fully maximize the benefits of DNTs in nextG networks. ...

October 2025 · Z. Zhang, Z. Peng, H. Yu, M. Chen, Y. Liu

Digital Network Twins for Advanced Networks

Digital Network Twins (DNTs) are virtual representations of physical networks and offer promising solutions for managing complex network environments, especially when combined with machine learning. This talk discusses how DNTs address pressing challenges in advanced networks such as 6G, covering federated learning for privacy-preserving distributed model training, reinforcement learning for autonomous real-time decision-making, and practical applications including edge caching and secure vehicle networks. Topics Covered Network digital twin architecture and synchronization Federated learning for decentralized model training in DNTs Reinforcement learning for closed-loop network control Security: defending against poisoning attacks in distributed twin systems Applications: edge caching optimization, secure vehicular networks

January 2025 · Z. Zhang

Poisoning Attacks and Defenses to Federated Unlearning

Download Paper arXiv Abstract Federated learning allows multiple clients to collaboratively train a global model with the assistance of a server. However, its distributed nature makes it susceptible to poisoning attacks, where malicious clients can compromise the global model by sending harmful local model updates to the server. To unlearn an accurate global model from a poisoned one after identifying malicious clients, federated unlearning has been introduced. Yet, current research on federated unlearning has primarily concentrated on its effectiveness and efficiency, overlooking the security challenges it presents. In this work, we bridge the gap via proposing BadUnlearn, the first poisoning attacks targeting federated unlearning. In BadUnlearn, malicious clients send specifically designed local model updates to the server during the unlearning process, aiming to ensure that the resulting unlearned model remains poisoned. To mitigate these threats, we propose UnlearnGuard, a robust federated unlearning framework that is provably robust against both existing poisoning attacks and our BadUnlearn. The core concept of UnlearnGuard is for the server to estimate the clients’ local model updates during the unlearning process and employ a filtering strategy to verify the accuracy of these estimations. Theoretically, we prove that the model unlearned through UnlearnGuard closely resembles one obtained by train-from-scratch. Empirically, we show that BadUnlearn can effectively corrupt existing federated unlearning methods, while UnlearnGuard remains secure against poisoning attacks. ...

January 2025 · W. Wang, Q. Ma, Z. Zhang, Y. Liu, Z. Liu, M. Fang

Toward Byzantine-Robust Decentralized Federated Learning

Download Paper arXiv Abstract Federated learning (FL) enables multiple clients to collaboratively train machine learning models without revealing their private training data. In conventional FL, the system follows the server-assisted architecture (server-assisted FL), where the training process is coordinated by a central server. However, the server-assisted FL framework suffers from poor scalability due to a communication bottleneck at the server, and trust dependency issues. To address challenges, decentralized federated learning (DFL) architecture has been proposed to allow clients to train models collaboratively in a serverless and peer-to-peer manner. However, due to its fully decentralized nature, DFL is highly vulnerable to poisoning attacks, where malicious clients could manipulate the system by sending carefully-crafted local models to their neighboring clients. To date, only a limited number of Byzantine-robust DFL methods have been proposed, most of which are either communication-inefficient or remain vulnerable to advanced poisoning attacks. In this paper, we propose a new algorithm called BALANCE (Byzantine-robust averaging through local similarity in decentralization) to defend against poisoning attacks in DFL. In BALANCE, each client leverages its own local model as a similarity reference to determine if the received model is malicious or benign. We establish the theoretical convergence guarantee for BALANCE under poisoning attacks in both strongly convex and non-convex settings. Furthermore, the convergence rate of BALANCE under poisoning attacks matches those of the state-of-the-art counterparts in Byzantine-free settings. Extensive experiments also demonstrate that BALANCE outperforms existing DFL methods and effectively defends against poisoning attacks. ...

October 2024 · M. Fang, Z. Zhang, Hairi, P. Khanduri, J. Liu, S. Lu, Y. Liu, Z. Gong

Securing Distributed Network Digital Twin Systems Against Model Poisoning Attacks

Download Paper arXiv Abstract In the era of 5G and beyond, the increasing complexity of wireless networks necessitates innovative frameworks for efficient management and deployment. Digital twins (DTs), embodying real-time monitoring, predictive configurations, and enhanced decision-making capabilities, stand out as a promising solution in this context. Within a time-series data-driven framework that effectively maps wireless networks into digital counterparts, encapsulated by integrated vertical and horizontal twinning phases, this study investigates the security challenges in distributed network DT systems, which potentially undermine the reliability of subsequent network applications such as wireless traffic forecasting. Specifically, we consider a minimal-knowledge scenario for all attackers, in that they do not have access to network data and other specialized knowledge, yet can interact with previous iterations of server-level models. In this context, we spotlight a novel fake traffic injection attack designed to compromise a distributed network DT system for wireless traffic prediction. In response, we then propose a defense mechanism, termed global-local inconsistency detection (GLID), to counteract various model poisoning threats. GLID strategically removes abnormal model parameters that deviate beyond a particular percentile range, thereby fortifying the security of network twinning process. Through extensive experiments on real-world wireless traffic datasets, our experimental evaluations show that both our attack and defense strategies significantly outperform existing baselines, highlighting the importance of security measures in the design and implementation of DTs for 5G and beyond network systems. ...

July 2024 · Z. Zhang, M. Fang, M. Chen, G. Li, X. Lin, Y. Liu

Mapping Wireless Networks into Digital Reality through Joint Vertical and Horizontal Learning

Download Paper arXiv Abstract In recent years, the complexity of 5G and beyond wireless networks has escalated, prompting a need for innovative frameworks to facilitate flexible management and efficient deployment. The concept of digital twins (DTs) has emerged as a solution to enable real-time monitoring, predictive configurations, and decision-making processes. While existing works primarily focus on leveraging DTs to optimize wireless networks, a detailed mapping methodology for creating virtual representations of network infrastructure and properties is still lacking. In this context, we introduce VH-Twin, a novel time-series data-driven framework that effectively maps wireless networks into digital reality. VH-Twin distinguishes itself through complementary vertical twinning (V-twinning) and horizontal twinning (H-twinning) stages, followed by a periodic clustering mechanism used to virtualize network regions based on their distinct geological and wireless characteristics. Specifically, V-twinning exploits distributed learning techniques to initialize a global twin model collaboratively from virtualized network clusters. H-twinning, on the other hand, is implemented with an asynchronous mapping scheme that dynamically updates twin models in response to network or environmental changes. Leveraging real-world wireless traffic data within a cellular wireless network, comprehensive experiments are conducted to verify that VH-Twin can effectively construct, deploy, and maintain network DTs. Parametric analysis also offers insights into how to strike a balance between twinning efficiency and model accuracy at scale. ...

June 2024 · Z. Zhang, M. Chen, Z. Yang, Y. Liu

Poisoning Attacks on Federated Learning-based Wireless Traffic Prediction

Download Paper arXiv Abstract Federated Learning (FL) offers a distributed framework to train a global control model across multiple base stations without compromising the privacy of their local network data. This makes it ideal for applications like wireless traffic prediction (WTP), which plays a crucial role in optimizing network resources, enabling proactive traffic flow management, and enhancing the reliability of downstream communication-aided applications, such as IoT devices, autonomous vehicles, and industrial automation systems. Despite its promise, the security aspects of FL-based distributed wireless systems, particularly in regression-based WTP problems, remain inadequately investigated. In this paper, we introduce a novel fake traffic injection (FTI) attack, designed to undermine the FL-based WTP system by injecting fabricated traffic distributions with minimal knowledge. We further propose a defense mechanism, termed global-local inconsistency detection (GLID), which strategically removes abnormal model parameters that deviate beyond a specific percentile range estimated through statistical methods in each dimension. Extensive experimental evaluations, performed on real-world wireless traffic datasets, demonstrate that both our attack and defense strategies significantly outperform existing baselines. ...

June 2024 · Z. Zhang, M. Fang, J. Huang, Y. Liu

Research Assistant

Department of Computer Science, NC State University Duration: Summer 2024 – Present Research Focus: Network digital twin construction, synchronization, and security for HPC and wireless systems Federated learning robustness against poisoning and Byzantine attacks Reinforcement learning for closed-loop network control Collaboration with Oak Ridge National Laboratory (ExaDIGIT project, 2025 R&D 100 Award)

May 2024 · Z. Zhang

Communication Efficiency and Security for Multi-Agent Reinforcement Learning

Download Thesis (OhioLINK) Abstract This thesis investigates the dual challenges of communication efficiency and Byzantine robustness in decentralized multi-agent reinforcement learning. We develop algorithms that reduce inter-agent communication overhead while maintaining convergence guarantees even in the presence of adversarial (Byzantine) agents, bridging theoretical foundations with practical protocol design for large-scale distributed systems.

May 2023 · Z. Zhang