"synchronisation lost" errors for Solaris NTP server


Login or Register for Dates, Times and to Reply

 
Thread Tools Search this Thread
Operating Systems Solaris "synchronisation lost" errors for Solaris NTP server
# 1  
"synchronisation lost" errors for Solaris NTP server

Hi,

This is Solaris 9, which is service as NTP server for many unix clients. At backend, it it synching time with three GPS clocks. From past few days, I am noticing time reset to 1 second. Is this a problem ?
I was assuming that if it is a network issue or GPS clock connectivity issue, it should lost sync only with one device. But I see, it is saying "synchronisation lost" for all three devices.

Code:
ntp-serv10 # ntpq -p
     remote           refid      st t when poll reach   delay   offset    disp
==============================================================================
*sea-gps-clock1. .GPS.            1 u  358 1024  377     1.54   -0.904    0.64
 172.28.42.204   .GPS.            1 u   30  512  270    40.76    0.163 16000.0
+172.28.34.204   .GPS.            1 u 1315 1024  376    77.07   -0.799    7.93
ntp-serv10 # cat /var/adm/messages | grep ntp | tail -10
Dec  8 20:45:46 ntp-serv10 xntpd[15247]: [ID 774427 daemon.notice] time reset (step) -1.005217 s
Dec  8 20:45:46 ntp-serv10 xntpd[15247]: [ID 204180 daemon.info] synchronisation lost
Dec  8 20:49:24 ntp-serv10 snmptrapd[15131]: [ID 702911 daemon.warning] localhost [UDP: [127.0.0.1]:-31114]: Trap , DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (893346033) 103 days, 9:31:00.33, SNMPv2-MIB::snmpTrapOID.0 = OID: DISMAN-EVENT-MIB::mteTriggerFired, DISMAN-EVENT-MIB::mteHotTrigger = STRING: status exec ntp, DISMAN-EVENT-MIB::mteHotTargetName = STRING: , DISMAN-EVENT-MIB::mteHotContextName = STRING: , DISMAN-EVENT-MIB::mteHotOID = OID: UCD-SNMP-MIB::extResult.5, DISMAN-EVENT-MIB::mteHotValue = INTEGER: 1, UCD-SNMP-MIB::extNames.5 = STRING: ntpcheck, UCD-SNMP-MIB::extOutput.5 = STRING: PROBLEM: NTP is not synchronized to peer
Dec  8 20:50:06 ntp-serv10 xntpd[15247]: [ID 854739 daemon.info] synchronized to 192.168.70.16, stratum=1
Dec  9 13:08:02 ntp-serv10 xntpd[15247]: [ID 774427 daemon.notice] time reset (step) 0.998904 s
Dec  9 13:08:02 ntp-serv10 xntpd[15247]: [ID 204180 daemon.info] synchronisation lost
Dec  9 13:13:15 ntp-serv10 xntpd[15247]: [ID 854739 daemon.info] synchronized to 172.28.34.204, stratum=1
Dec  9 13:13:14 ntp-serv10 xntpd[15247]: [ID 774427 daemon.notice] time reset (step) -1.003379 s
Dec  9 13:13:14 ntp-serv10 xntpd[15247]: [ID 204180 daemon.info] synchronisation lost
Dec  9 13:17:34 ntp-serv10 xntpd[15247]: [ID 854739 daemon.info] synchronized to 192.168.70.16, stratum=1
ntp-serv10 #

Please advice.

Thanks
# 2  
I don't think it is a big deal.

But, if I were you, I would install chrony . (just takes a minute or two) and compare the results.

Quote:
Chrony is supported on Linux, FreeBSD, NetBSD, macOS, and Solaris. Closely related systems may work too. Any other system will likely require a porting exercise.
# 3  
Are these remote devices GNSS? Why you need 3 evades me. Some specialized clocks will drop connections that act to hog resources.

And why do you need to ping those NTP servers so often? That may be why you are getting dropped.... assuming it did work previously.
# 4  
These are GPS clocks, some kind of harware device. Not sure if those are GNSS, it is maintained by some other team. But I can get more information about it.

So far, I didn't notice any issue on any server or our internal devices, neither any app or DB team reported.
Those messages, stated in my first post are from /var/adm/messages. It is monitoring team, who has set up to create a ticket based on these kinds of alerts. So it is little noise from management, why synchronization is lost to GPS devices and why time is drifted back around 1 second. I am just trying to find these answers.

Another thing I noticed is, poll says 1024. That means 1024 seconds without guidance means slow sync, slow adjustments etc. Am I understanding it correctly? If yes, should "minpoll 4 maxpoll 8" entry in ntp.conf for all 3 GPS devices help?
# 5  
No, poll is the poll interval.
When there is a good and reliable peer for a long time then ntpd will double the poll interval.

So a small poll interval means there is high dispersion(=jitter).

Your bad device is the 172.28.42.204 that is still at the initial 16000 dispersion.
Please test connectivity with
Code:
ping -s 172.28.42.204 1400 100

The default values in ntp.conf are okay.
You should be more worried about security, and add a restriction like
Code:
restrict default notrap nomodify nopeer noquery
restrict 127.0.0.1

(As a quick alternative to a replacement of ntpd with chronyd.)
# 6  
Thanks for explanation.

Quote:
ntp-serv10 # ntpq -p
remote refid st t when poll reach delay offset disp
==============================================================================
*sea-gps-clock1. .GPS. 1 u 416 1024 377 1.54 -1.297 1.82
172.28.42.204 .GPS. 1 u 744 1024 0 44.45 2.298 16000.0
+172.28.34.204 .GPS. 1 u 814 1024 277 77.00 -1.162 1.37
ntp-serv10 #
ntp-serv10 # ping -s 172.28.42.204 1400 100
PING 172.28.42.204: 1400 data bytes
1408 bytes from 172.28.42.204: icmp_seq=0. time=56. ms
1408 bytes from 172.28.42.204: icmp_seq=1. time=44. ms
1408 bytes from 172.28.42.204: icmp_seq=2. time=45. ms
1408 bytes from 172.28.42.204: icmp_seq=3. time=44. ms
1408 bytes from 172.28.42.204: icmp_seq=4. time=44. ms
1408 bytes from 172.28.42.204: icmp_seq=5. time=45. ms
^C
----172.28.42.204 PING Statistics----
6 packets transmitted, 6 packets received, 0% packet loss
round-trip (ms) min/avg/max = 44/46/56
ntp-serv10 #
Here is my current conf file :
Quote:
ntp-serv10 # cat /etc/inet/ntp.conf | grep -v "#"
server 192.168.70.16 prefer
server 172.28.42.204
server 172.28.34.204


driftfile /var/ntp/ntp.drift
statsdir /var/ntp/ntpstats/
filegen peerstats file peerstats type day enable
filegen loopstats file loopstats type day enable
filegen clockstats file clockstats type day enable
How will I explain, if synchronisation is being lost to all these devices? Is it dragging back by (approx) 1 second? I see these messages for today morning too
Quote:
ntp-serv10 # cat /var/adm/messages | grep ntp | tail -10
Dec 9 13:13:15 ntp-serv10 xntpd[15247]: [ID 854739 daemon.info] synchronized to 172.28.34.204, stratum=1
Dec 9 13:13:14 ntp-serv10 xntpd[15247]: [ID 774427 daemon.notice] time reset (step) -1.003379 s
Dec 9 13:13:14 ntp-serv10 xntpd[15247]: [ID 204180 daemon.info] synchronisation lost
Dec 9 13:17:34 ntp-serv10 xntpd[15247]: [ID 854739 daemon.info] synchronized to 192.168.70.16, stratum=1
Dec 11 06:19:58 ntp-serv10 xntpd[15247]: [ID 774427 daemon.notice] time reset (step) 0.999029 s
Dec 11 06:19:58 ntp-serv10 xntpd[15247]: [ID 204180 daemon.info] synchronisation lost
Dec 11 06:24:58 ntp-serv10 xntpd[15247]: [ID 854739 daemon.info] synchronized to 192.168.70.16, stratum=1
Dec 11 06:24:57 ntp-serv10 xntpd[15247]: [ID 774427 daemon.notice] time reset (step) -1.003025 s
Dec 11 06:24:57 ntp-serv10 xntpd[15247]: [ID 204180 daemon.info] synchronisation lost
Dec 11 06:30:18 ntp-serv10 xntpd[15247]: [ID 854739 daemon.info] synchronized to 192.168.70.16, stratum=1
ntp-serv10 #
# 7  
I am not sure how it behaves.
I would disable the bad peer in ntp.conf

Code:
server 192.168.70.16 prefer
#bad#server 172.28.42.204
server 172.28.34.204

driftfile /var/ntp/ntp.drift
statsdir /var/ntp/ntpstats/
filegen peerstats file peerstats type day enable
filegen loopstats file loopstats type day enable
filegen clockstats file clockstats type day enable

restrict default notrap nomodify nopeer noquery
restrict 127.0.0.1

Login or Register for Dates, Times and to Reply

Previous Thread | Next Thread
Thread Tools Search this Thread
Search this Thread:
Advanced Search

Test Your Knowledge in Computers #770
Difficulty: Medium
IBM POWER4 (2001) had off-chip L3 caches of 32 MB per processor, shared among several processors.
True or False?

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Bash script - Print an ascii file using specific font "Latin Modern Mono 12" "regular" "9"

Hello. System : opensuse leap 42.3 I have a bash script that build a text file. I would like the last command doing : print_cmd -o page-left=43 -o page-right=22 -o page-top=28 -o page-bottom=43 -o font=LatinModernMono12:regular:9 some_file.txt where : print_cmd ::= some printing... (1 Reply)
Discussion started by: jcdole
1 Replies

2. UNIX for Dummies Questions & Answers

"Help with bash script" - "License Server and Patch Updates"

Hi All, I'm completely new to bash scripting and still learning my way through albeit vey slowly. I need to know where to insert my server names', my ip address numbers through out the script alas to no avail. I'm also searching on how to save .sh (bash shell) script properly.... (25 Replies)
Discussion started by: profileuser
25 Replies

3. Solaris

Printer configuration Migration from Solaris 10 "LP" to Solaris 11 "CUPS"

Need to find a way to import an LP printers.conf file to CUPS. I have some new Solaris 11.1 boxes that need to have 300 printers added. (0 Replies)
Discussion started by: os2mac
0 Replies

4. Shell Programming and Scripting

awk command to replace ";" with "|" and ""|" at diferent places in line of file

Hi, I have line in input file as below: 3G_CENTRAL;INDONESIA_(M)_TELKOMSEL;SPECIAL_WORLD_GRP_7_FA_2_TELKOMSEL My expected output for line in the file must be : "1-Radon1-cMOC_deg"|"LDIndex"|"3G_CENTRAL|INDONESIA_(M)_TELKOMSEL"|LAST|"SPECIAL_WORLD_GRP_7_FA_2_TELKOMSEL" Can someone... (7 Replies)
Discussion started by: shis100
7 Replies

5. AIX

"too big" and "not enough memory" errors in shell script

Hi, This is odd, however here goes. There are several shell scripts that run in our production environment AIX 595 LPAR m/c, which has sufficient memory 14GB (physical memory) and horsepower 5CPUs. However from time to time we get the following errors in these shell scripts. The time when these... (11 Replies)
Discussion started by: jerardfjay
11 Replies

6. UNIX for Advanced & Expert Users

All alias in .profile lost when "script" command is called

Hi, I was trying to call "script <an ip add>" command from .profile file to log everything whenever anyone logs in to this user. I did the following at the end of .profile. 1) Extracted the IP address who logged in 2) Called script < ip add> . The problem I am facing is all, aliases etc. written... (3 Replies)
Discussion started by: amicon007
3 Replies

7. Linux

NTP treshold "synchronisation lost"

does anyone know how to change the treshold of 128ms in NTP. in order to ignore these alarms: Oct 27 14:44:15 rt1 ntpd: synchronisation lost Oct 27 15:08:25 rt1 ntpd: time reset 0.688591 s Oct 27 15:08:25 rt1 ntpd: synchronisation lost Oct 27 15:28:45 rt1 ntpd: time reset 0.462257 s (0 Replies)
Discussion started by: modcan
0 Replies

8. UNIX for Dummies Questions & Answers

Can you force local NTP server to be accepted as "suitable"?

Is there some way to force the NTP server on a brand-new install to be "suitable" to sync other servers from? (I'm more concerned with synchronization between machines, and less concerned with what the actual time they sync to is) For example, whenever I install fresh from the Fedora DVDs and... (0 Replies)
Discussion started by: jjinno
0 Replies

9. Filesystems, Disks and Memory

Restoring back files from "lost+found" directory

Hi Friends, How can I Restore the Files present under "lost+found" Directory of a FileSystem (in Solaris & Tru64 OS) to their original Locations. Now-a-days I am loosing lots of files in 2 of my Machines, One running Solaris8 and other Tru64(Digital) Unix. Thanx in... (1 Reply)
Discussion started by: dhasarath
1 Replies

Featured Tech Videos