Question
How can I fix transient, high NTP jitter?
Background information
I have an NTP server on my private network. My servers synchronize from this clock, and usually all is well. An example set of output:
ntpq> pe
remote refid st t when poll reach delay offset jitter
==============================================================================
*10.10.10.249 10.10.100.20 3 u 367 1024 377 0.096 0.145 0.142
ntpq> as
ind assid status conf reach auth condition last_event cnt
===========================================================
1 2378 962a yes yes none sys.peer sys_peer 2
ntpq> rv 2378
associd=2378 status=962a conf, reach, sel_sys.peer, 2 events, sys_peer,
srcadr=10.10.10.249, srcport=123, dstadr=10.10.200.1, dstport=123,
leap=00, stratum=3, precision=-18, rootdelay=1.190, rootdisp=37.155,
refid=10.10.100.20,
reftime=df134714.c026b762 Mon, Aug 6 2018 22:15:48.750,
rec=df134a04.507b5ad6 Mon, Aug 6 2018 22:28:20.314, reach=377,
unreach=0, hmode=3, pmode=4, hpoll=10, ppoll=10, headway=0, flash=00 ok,
keyid=0, offset=0.145, delay=0.096, dispersion=15.187, jitter=0.142,
xleave=0.052,
filtdelay= 0.10 0.10 0.05 0.08 0.09 0.11 0.11 0.11,
filtoffset= 0.14 0.16 0.19 0.12 0.02 -0.02 -0.04 -0.10,
filtdisp= 0.00 15.57 31.37 47.42 63.65 79.41 95.27 110.72
However, every once in a while a system's jitter jumps to a much larger value. Digging into it when it happens, we see a single sample in the clock filter with a large jump in both delay and offset. Example:
filtdelay= 0.06 0.11 250.20 0.07 0.04 0.10 0.07 0.09,
filtoffset= 0.05 -0.01 124.95 -0.05 -0.05 -0.07 -0.05 -0.03,
Note that in this case the reported offset (usually, but not always) stays within ±0.5 ms:
# ntpq -pn
remote refid st t when poll reach delay offset jitter
==============================================================================
*10.10.10.249 10.10.100.20 3 u 711 1024 377 0.112 -0.006 47.230
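To illustrate why one bad sample is enough to produce a jitter value like this, here is a minimal sketch of how peer jitter can be computed from the filter registers, assuming (as I understand the clock filter described in RFC 5905) that jitter is the RMS of the offset differences against the selected sample. The function name `peer_jitter` is made up for this post, and picking the reference purely by lowest delay is a simplification: ntpd also takes the age of each sample into account.

```python
import math

def peer_jitter(filtdelay, filtoffset):
    """RMS of offset differences against the lowest-delay sample (simplified)."""
    n = len(filtoffset)
    # Simplifying assumption: the reference is the sample with the lowest delay.
    ref = min(range(n), key=lambda i: filtdelay[i])
    squares = sum((filtoffset[i] - filtoffset[ref]) ** 2
                  for i in range(n) if i != ref)
    return math.sqrt(squares / (n - 1))

# Filter registers from the spike example above (values in milliseconds).
spike_delay = [0.06, 0.11, 250.20, 0.07, 0.04, 0.10, 0.07, 0.09]
spike_offset = [0.05, -0.01, 124.95, -0.05, -0.05, -0.07, -0.05, -0.03]

print(round(peer_jitter(spike_delay, spike_offset), 3))
```

With the spike registers this comes out at roughly 47.2 ms, close to the 47.230 that ntpq reports, so the single 124.95 ms offset sample accounts for essentially all of the jitter. That offset is also about half of the 250.20 ms delay, which is what an extra delay in only one direction of the exchange would produce.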
Sometimes the high jitter value persists, mostly unchanged, for a few hours. Its size varies from 1 ms to over 100 ms. Eventually it drops back below 1 ms.
Addendum: We are seeing a correlation between system load and NTP jitter. As a first guess, NTP packets might be competing with NFS traffic.
EDIT: It's not a GPS clock source.
EDIT: It's definitely a problem. The jitter we see roughly correlates with high offset values.
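To check the load correlation mentioned in the addendum, something like the following could record the 1-minute load average next to each peer's offset and jitter. This is only a sketch: the helper names (`parse_peer_line`, `sample`, `log_samples`) are invented for this post, and the parser assumes the ten-column `ntpq -pn` layout shown above.

```python
import os
import subprocess
import time

FIELDS = ("remote", "refid", "st", "t", "when", "poll",
          "reach", "delay", "offset", "jitter")

def parse_peer_line(line):
    """Parse one peer row of `ntpq -pn` output; return None for other lines."""
    parts = line.split()
    if len(parts) != len(FIELDS):
        return None  # separator line or unexpected layout
    row = dict(zip(FIELDS, parts))
    try:
        for key in ("delay", "offset", "jitter"):
            row[key] = float(row[key])
    except ValueError:
        return None  # header row: the column names are not numbers
    return row

def sample():
    """Return (timestamp, load1, remote, offset, jitter) for each peer."""
    out = subprocess.run(["ntpq", "-pn"], capture_output=True,
                         text=True, check=True).stdout
    load1 = os.getloadavg()[0]
    now = time.time()
    rows = (parse_peer_line(line) for line in out.splitlines())
    return [(now, load1, r["remote"], r["offset"], r["jitter"])
            for r in rows if r is not None]

def log_samples(count, interval=60):
    """Print `count` rounds of samples, `interval` seconds apart."""
    for _ in range(count):
        for record in sample():
            print(*record)
        time.sleep(interval)
```

Running `log_samples(1440)` would cover a day at one sample per minute. The remote address keeps its tally character (for example `*`), so a change of selected peer is visible in the log as well.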