Federated Multi-Agent RL for Task Offloading in THz-Enabled Space–Terrestrial Integrated Networks
Federated Multi-Agent RL for Task Offloading in THz-Enabled Space–Terrestrial Integrated Networks
张静静
11th International Conference on Computer and Communication Systems (ICCCS 2026
摘要
The increasing demand for computational resources poses substantial challenges for task offloading in terahertz (THz)-enabled Space–Terrestrial Integrated Networks (STINs), particularly in multi-user to multi-satellite scenarios. Achieving efficient task allocation and offloading is crucial for optimizing overall system performance, reducing latency, and improving overall network throughput. However, traditional centralized approaches suffer from excessive communication overhead and heightened privacy risks in STINs. To overcome these limitations, this paper proposes a novel framework based on Federated Learning-Assisted Multi-Agent Deep Deterministic Policy Gradient (FL-MADDPG). The THz-connected user–satellite network is modeled as a multi-agent system, enabling distributed decisionmaking while avoiding raw data sharing. Within this framework, federated learning preserves user privacy, and multi-agent reinforcement learning allows the system to dynamically adapt to complex THz channel variations and fluctuating resource availability. Experimental results demonstrate the effectiveness of FL-MADDPG in improving the overall performance of the network.
|
11th International Conference on Computer and Communication Systems (ICCCS 2026
|
张静静
|
2026
|
Multi-Agent Deep Reinforcement Learning for Task Offloading in LEO Satellite Networks
Multi-Agent Deep Reinforcement Learning for Task Offloading in LEO Satellite Networks
张静静
2025 10th International Conference on Computer and Communication System (ICCCS)
摘要
In recent years, the rapid development of Low Earth Orbit (LEO) satellite networks has highlighted the need for enhanced data processing capabilities. However, the limited computational capacity and inherent heterogeneity of LEO satellites pose significant challenges for efficient task computation, such as minimizing latency and energy consumption. To address these challenges, we propose QMIX-sat, a novel multi-agent reinforcement learning algorithm. We transform the optimization problem into a decentralized partially observable Markov decision process (Dec-POMDP) and solve it using an enhanced version of the QMIX algorithm. In the satellite scenario under study, we introduce attention modules and residual connections to generate accurate task segmentation and allocation decisions, effectively mitigating the issues caused by high-dimensional observation spaces, such as inter-satellite and satellite-ground link states. Additionally, given that QMIX is commonly used in discrete action spaces, this work extends QMIX to continuous action spaces, ensuring precise task segmentation and allocation. We construct a LEO satellite network model for experimental verification, and the results demonstrate that QMIX-sat significantly improves task allocation efficiency and overall performance within LEO satellite networks compared to existing algorithms.
|
2025 10th International Conference on Computer and Communication System (ICCCS)
|
张静静
|
2025
|
Federated Learning With Adjustable Learning Rates for Resource-Constrained Wireless Networks
Federated Learning With Adjustable Learning Rates for Resource-Constrained Wireless Networks
张静静
2025 10th International Conference on Computer and Communication System (ICCCS)
摘要
Wireless federated learning (WFL) suffers from heterogeneity prevailing in data distributions, computing powers, and channel conditions of participating devices. This paper presents a new Federated Learning with Adjusted leaRning ratE (FLARE) framework to mitigate the impact of the heterogeneity. The key idea is to allow the participating devices to adjust their individual learning rates and local training iterations, adapting to their instantaneous computing powers. The convergence upper bound of FLARE is established rigorously under a generic setting with non-convex models, non-i.i.d. datasets and imbalanced computing powers. By minimizing the upper bound, we further optimize the scheduling strategy of FLARE to exploit the channel heterogeneity. A nested problem structure is uncovered to facilitate iterative bandwidth allocation with binary search and device selection with a new greedy method. Experiments demonstrate that FLARE consistently outperforms the baselines in test accuracy, and converges much faster with the proposed scheduling policy.
|
2025 10th International Conference on Computer and Communication System (ICCCS)
|
张静静
|
2025
|
Uplink-Aware Federated Learning Based on Model Pruning in Satellite Networks
Uplink-Aware Federated Learning Based on Model Pruning in Satellite Networks
张静静
ICML 2025 Workshop on Machine Learning for Wireless Communication and Networks (ML4Wireless)
摘要
Satellite federated learning (SFL) allows satellites to collaboratively train models without sharing raw data, enhancing privacy and reducing communication costs. Traditional SFL requires a ground station (GS) to upload models to satellites, under the premise of adequate ground-satellite uplink (GSUL) resources. However, this assumption does not hold in dense LEO constellations, where frequent command interaction or parameter delivery make the bandwidth-constrained uplink a bottleneck. This work proposes satellite federated learning with uplink scheduling and model pruning (FedLSMP). The key idea behind this is jointly optimizing the GSUL bandwidth allocation plan and model compression ratio to maximize the approximated loss reduction, adhering to bandwidth constraints. Finally, numerical results demonstrate that FedLSMP improves convergence rates while reducing GSUL bandwidth usage, achieving higher overall effectiveness compared with conventional SFL approaches.
|
ICML 2025 Workshop on Machine Learning for Wireless Communication and Networks (ML4Wireless)
|
张静静
|
2025
|
RIS-Enabled Integrated Sensing and Communication With Clutter Suppression for LEO Satellite Systems
RIS-Enabled Integrated Sensing and Communication With Clutter Suppression for LEO Satellite Systems
Shili Zhou/Yu Zhu
2025 IEEE/CIC International Conference on Communications in China (ICCC)
|
2025 IEEE/CIC International Conference on Communications in China (ICCC)
|
Shili Zhou/Yu Zhu
|
2025
|
Joint Localization of UEs, Scatterers, and Obstacles with Spatial Non-Stationary ELAA
Joint Localization of UEs, Scatterers, and Obstacles with Spatial Non-Stationary ELAA
Ziqi Ke/Yu Zhu
2025 IEEE/CIC International Conference on Communications in China (ICCC)
|
2025 IEEE/CIC International Conference on Communications in China (ICCC)
|
Ziqi Ke/Yu Zhu
|
2025
|
Gaussian Belief Propagation-Enhanced CRB-KF Framework for Cooperative Localization
Gaussian Belief Propagation-Enhanced CRB-KF Framework for Cooperative Localization
Xin Liang/Yu Zhu
2025 IEEE/CIC International Conference on Communications in China (ICCC)
|
2025 IEEE/CIC International Conference on Communications in China (ICCC)
|
Xin Liang/Yu Zhu
|
2025
|
A High-Resolution Range Profile Estimation Method Based on OFDM Communication Signals
A High-Resolution Range Profile Estimation Method Based on OFDM Communication Signals
Kejiao Li/Yu Zhu
2025 IEEE 101st Vehicular Technology Conference
|
2025 IEEE 101st Vehicular Technology Conference
|
Kejiao Li/Yu Zhu
|
2025
|
Onboard Attitude and Hybrid Beamforming Joint Optimization for LEO Satellite MIMO Systems
Onboard Attitude and Hybrid Beamforming Joint Optimization for LEO Satellite MIMO Systems
Mingyuan Hui/Yu Zhu
2025 IEEE 101st Vehicular Technology Conference
|
2025 IEEE 101st Vehicular Technology Conference
|
Mingyuan Hui/Yu Zhu
|
2025
|
Circle Track Antenna-Assisted Beamforming for Multiuser MIMO Communication Systems
Circle Track Antenna-Assisted Beamforming for Multiuser MIMO Communication Systems
Mengya Liu/Yu Zhu
2025 IEEE 101st Vehicular Technology Conference
|
2025 IEEE 101st Vehicular Technology Conference
|
Mengya Liu/Yu Zhu
|
2025
|
Adaptive Congestion Control Strategies for LEO Satellite Networks
Adaptive Congestion Control Strategies for LEO Satellite Networks
张静静
2024 IEEE 24th International Conference on Communication Technology (ICCT)
摘要
LEO satellite networks are essential for providing global communication coverage, especially in areas where traditional terrestrial networks are unavailable. However, LEO satellite networks face several challenges compared to terrestrial networks, including rapid bandwidth fluctuation, long Round-Trip Time (RTT), and high Bit Error Rate (BER), which can significantly impact communication quality and reliability. To address these issues and optimize network performance, we propose the OnlineAEPC algorithm. This algorithm adaptively adjusts the congestion window size based on real-time environmental conditions, thereby reducing transmission time and packet loss. Extensive simulations validate the performance of our algorithm under various challenging conditions characterized by rapid bandwidth fluctuation, long RTT, and high BER. Our comparisons with baseline algorithms demonstrate that OnlineAEPC offers significant improvements in transmission time and packet loss reduction.
|
2024 IEEE 24th International Conference on Communication Technology (ICCT)
|
张静静
|
2024
|
Accelerating Handover in Mobile Satellite Network
Accelerating Handover in Mobile Satellite Network
张静静
IEEE INFOCOM 2024 - IEEE Conference on Computer Communications
摘要
The construction of Low Earth Orbit (LEO) satellite constellations has recently spurred tremendous attention from academia and industry. 5G and 6G standards have specified LEO satellite network as a key component of 5G and 6G networks. However, ground terminals experience frequent, high-latency handover incurred by satellites’ fast travelling speed, which deteriorates the performance of latency-sensitive applications. To address this challenge, we propose a novel handover flowchart for mobile satellite networks, which can considerably reduce the handover latency. The innovation behind this scheme is to mitigate the interaction between the access and core networks that occupy the majority of time overhead by leveraging the predictable travelling trajectory and spatial distribution inherent in mobile satellite networks. Specifically, we design a fine-grained synchronized algorithm to address the synchronization problem due to the lack of control signalling delivery between the access and core networks. Moreover, we minimize the computational complexity of the core network using information such as the satellite access strategy and unique spatial distribution, which is caused by frequent prediction operations. We have built a prototype for a mobile satellite network using modified Open5GS and UERANSIM, which is driven by actual LEO satellite constellations such as Starlink and Kuiper. We have conducted extensive experiments, and the results demonstrate that our proposed handover scheme can considerably reduce the handover latency compared to the 3GPP Non-terrestrial Networks (NTN) and two other existing handover schemes.
|
IEEE INFOCOM 2024 - IEEE Conference on Computer Communications
|
张静静
|
2024
|
Adaptive Multi-Armed Bandit Learning for Task Offloading in Edge Computing
Adaptive Multi-Armed Bandit Learning for Task Offloading in Edge Computing
张静静
2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)
摘要
The widespread adoption of edge computing has emerged as a prominent trend for alleviating task processing delays and reducing energy consumption. However, the dynamic nature of network conditions and the varying computation capacities of edge servers (ESs) can introduce disparities between computation loads and available computing resources in edge computing networks, potentially leading to inadequate service quality. To address this challenge, this paper investigates a practical scenario characterized by dynamic task offloading. Initially, we examine traditional multi-armed bandit (MAB) algorithms, namely the ε-greedy algorithm and the UCB1-based algorithm. However, both algorithms exhibit certain weaknesses in effectively addressing the tidal data traffic patterns. Consequently, based on MAB, we propose an adaptive task offloading algorithm (ATOA) that overcomes these limitations. By conducting extensive simulations, we demonstrate the superiority of our ATOA solution in reducing task processing latency compared to conventional MAB methods. This substantiates the effectiveness of our approach in enhancing the performance of edge computing networks and improving overall service quality.
|
2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)
|
张静静
|
2024
|
ix-CL: Semi-Supervised Continual Learning for Network Intrusion Detection
ix-CL: Semi-Supervised Continual Learning for Network Intrusion Detection
任久春
2024 6th International Conference on Communications, Information System and Computer Engineering (CISCE)
|
2024 6th International Conference on Communications, Information System and Computer Engineering (CISCE)
|
任久春
|
2024
|
Intelligent Pose Recognition and Evaluation System for Rowing Sports
Intelligent Pose Recognition and Evaluation System for Rowing Sports
任久春
2024 5th International Conference on Electronic Communication and Artificial Intelligence (ICECAI)
|
2024 5th International Conference on Electronic Communication and Artificial Intelligence (ICECAI)
|
任久春
|
2024
|
Surface-Constrained Progressive Feature Preserving Point Cloud Compression
Surface-Constrained Progressive Feature Preserving Point Cloud Compression
胡蝶
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
|
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
|
胡蝶
|
2024
|
Joint beamforming and power allocation optimization for double-IRS-aided systems with phase shift-dependent power consumption
Joint beamforming and power allocation optimization for double-IRS-aided systems with phase shift-dependent power consumption
Qiucen Wu/Yu Zhu
|
IEEE Globecom
|
Qiucen Wu/Yu Zhu
|
2024
|
LayerFED: Speeding Up Federated Learning with Model Split
LayerFED: Speeding Up Federated Learning with Model Split
张静静
2023 IEEE International Conference on Satellite Computing (Satellite)
摘要
Machine learning is increasingly used in edge devices with limited computational resources for tasks such as face recognition, object detection, and voice recognition. Federated Learning (FL) is a promising approach to train models on multiple edge devices without requiring clients to upload their original data to the server. However, challenges such as redundant local parameters during synchronous aggregation and system heterogeneity can significantly impact training performance. To address these challenges, we propose LayerFED, a novel strategy that leverages model splitting and pipelined communication-computation mode. LayerFED enables partial and full updates by splitting the model, and mitigates communication channel congestion during server aggregation by selectively updating parameters during computation. This reduces the amount of information that needs to be communicated between edge devices and the server. We demonstrate through experiments on benchmark datasets that LayerFED improves training time efficiency and accuracy while maintaining model performance.
|
2023 IEEE International Conference on Satellite Computing (Satellite)
|
张静静
|
2023
|
FedGSM: Efficient Federated Learning for LEO Constellations with Gradient Staleness Mitigation
FedGSM: Efficient Federated Learning for LEO Constellations with Gradient Staleness Mitigation
张静静
2023 IEEE 24th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC)
摘要
Recent advancements in space technology have equipped low Earth Orbit (LEO) satellites with the capability to perform complex functions and run AI applications. Federated Learning (FL) on LEO satellites enables collaborative training of a global ML model without the need for sharing large datasets. However, intermittent connectivity between satellites and ground stations can lead to stale gradients and unstable learning, thereby limiting learning performance. In this paper, we propose FedGSM, a novel asynchronous FL algorithm that introduces a compensation mechanism to mitigate gradient staleness. FedGSM leverages the deterministic and time-varying topology of the orbits to offset the negative effects of staleness. Our simulation results demonstrate that FedGSM outperforms state-of-the-art algorithms for both IID and non-IID datasets, underscoring its effectiveness and advantages. We also investigate the effect of system parameters.
|
2023 IEEE 24th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC)
|
张静静
|
2023
|
Distributed Computation for Centralized MMSE Combining in Cell-Free Massive MIMO Systems
Distributed Computation for Centralized MMSE Combining in Cell-Free Massive MIMO Systems
张静静
2023 IEEE 23rd International Conference on Communication Technology (ICCT)
摘要
Cell-Free (CF) massive multiple-input multiple-output (MIMO) technology is poised to revolutionize future wireless networks by significantly improving network capacity. However, the conventional centralized implementation of CF networks brings about computational complexities in the central processing unit (CPU) and imposes a heavy burden on fronthaul links due to extensive channel state information (CSI) exchange. In this paper, we propose a distributed computing architecture that employs the preconditioned conjugate gradient algorithm to achieve the performance level of centralized minimum mean square error (MMSE) combining. The proposed algorithm offers two key advantages: firstly, it offloads a significant portion of computations from the CPU to distributed access points, thereby alleviating the computational burden; secondly, it significantly reduces the transmission of instantaneous and historical data through fronthaul links. These benefits contribute to the overall efficiency and scalability of CF massive MIMO systems, as demonstrated in the simulation results.
|
2023 IEEE 23rd International Conference on Communication Technology (ICCT)
|
张静静
|
2023
|
Optimizing Wireless Signal Strength On Rowing Boat: A 3D Simulation Approach To Overcoming Human Occlusion
Optimizing Wireless Signal Strength On Rowing Boat: A 3D Simulation Approach To Overcoming Human Occlusion
任久春
Proceedings of the 2023 9th International Conference on Communication and Information Processing
|
Proceedings of the 2023 9th International Conference on Communication and Information Processing
|
任久春
|
2023
|
Spatial beamforming design for ISAC systems under per antenna power constraint
Spatial beamforming design for ISAC systems under per antenna power constraint
Kaixin Li/Yu Zhu
|
IEEE SPAWC
|
Kaixin Li/Yu Zhu
|
2023
|
Hybrid beamforming design for multi-user and multi-target ISAC systems
Hybrid beamforming design for multi-user and multi-target ISAC systems
Shili Zhou/Yu Zhu
|
WCSP
|
Shili Zhou/Yu Zhu
|
2023
|
Green beamforming design for IRS-aided systems under phase shift-related power consumption
Green beamforming design for IRS-aided systems under phase shift-related power consumption
Qiucen Wu/Yu Zhu
|
IEEE Globecom
|
Qiucen Wu/Yu Zhu
|
2023
|