Abstract: Policy-gradient-based actor-critic algorithms are amongst the most popular algorithms in the reinforcement learning framework. Their advantage of being able to search for optimal policies ...
Distributed Adaptive Gradient Algorithm With Gradient Tracking for Stochastic Nonconvex Optimization
Abstract: This article considers a distributed stochastic nonconvex optimization problem, where the nodes in a network cooperatively minimize a sum of $L$-smooth ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results