This paper proposes an online actor critic learning-based optimal tracking control method for output-feedback servo positioning systems under unknown external disturbances. The servo system is reformulated into a control-affine form, where the uncertain dynamics are compactly represented as a lumped unknown function. An online identifying filter is introduced to estimate these dynamics, while an actor-critic neural network structure is used to approximate the value function and optimal control input. The proposed method yields an approximate solution to the HamiltonāJacobiāBellman equation, with adaptive update laws ensuring asymptotic convergence of the Bellman residual error. Lyapunov-based analysis guarantees the stability of the closed-loop system. Simulation results confirm the effectiveness of the proposed method in achieving robust tracking under time-varying disturbances.