Clock synchronization in a network with asymmetric delays

Question

Assume a computer has a precise clock which is not initialized. That is, the time on the computer's clock is the real time plus some constant offset. The computer has a network connection and we want to use that connection to determine the constant offset $B$.

The simple method is that the computer sends a query to a time server, noting the local time $B + C_1$. The time server receives the query at a time $T$ and sends a reply containing $T$ back to the client, which receives it at a time $B + C_2$. Then $B + C_1 \le T \le B + C_2$, i.e. $T - C_2 \le B \le T - C_1$.

If the network transmission time and the server processing time are symmetric, then $B = T - \dfrac{C_1 + C_2}{2}$. As far as I know, NTP, the time synchronization protocol used in the wild, operates on this assumption.

How can the precision be improved if the delays are not symmetric? Is there a way to measure this asymmetry in a typical Internet infrastructure?

@Raphael Thanks. More friendly Google link to US patent 7602873, official USPTO link. — Gilles 'SO- stop being evil', Mar 08 '12 at 09:58
First thoughts: It's probably impossible with 2 entities. Using ${n\choose2}$ pairs of $n$ entities a better synchronization is likely possible. Then the clocks can be used to measure one-trip times. — rgrig, Mar 08 '12 at 11:13
can you clarify the application/context or is this mainly a theoretical question? — vzn, Oct 01 '12 at 16:55
@vzn From me, it's idle curiosity. There is a practical application, though: clock synchronization (e.g. NTP) in asymmetric networks (e.g. satellite downlink, landline uplink). — Gilles 'SO- stop being evil', Oct 01 '12 at 17:25
there is really at least two questions here it seems (a) measuring network delay accurately and (b) setting time accurately via a time protocol, & while there is strong overlap they are not nec the same thing.... can that be narrowed down or not? — vzn, Oct 01 '12 at 17:32
@vzn The question is about measuring the asymmetry in the network latency. Going from that to clock synchronization is a solved problem for the purpose of this question. — Gilles 'SO- stop being evil', Oct 01 '12 at 18:26
Practical or not, it is a fun question and an interesting problem. Thanks Gilles! — Bingo, Feb 11 '13 at 13:36

Craig Gidney · Answer 1 · 2012-03-21T13:47:17.490

12

Inability to measure asymmetry

No, you can't measure the asymmetry. Consider these two communication diagrams, the first with a negative clock offset and equal delays and the second with no clock offset and entirely asymmetric delays (but the same round trip time).

communication diagram

The important thing to notice is that, from the perspective of both the PC and the server, the two interactions are exactly identical. They receive messages at the same time. They send messages at the same time.

You can create more cases by 'grabbing' the PC timeline and 'sliding' it, holding the message send/receive points fixed relative to their respective timelines. The asymmetries you cause are exactly negated by the clock offset. In fact, you can even make messages go BACKWARDS IN TIME one way (as long as the round trip time is still the same) and the server/client STILL can't tell!

Therefore it is impossible to measure latency asymmetries. In the worst case, where you have no information other than that one way latencies are positive and sum to the round trip time, the accuracy of clock synchronization is limited to the round trip time.

Can intermediate infrastructure help?

Whether or not the intermediate infrastructure can help will be heavily dependent upon your theoretical model of the situation.

If the asymmetry is constant and the intermediate infrastructure is the routers on the communication path between you and the server, then no. Even if each router synchronized their clock with the adjacent router, the errors would compound in the same way as if you had synchronized with the server via communication across the routers.

In the real world you can rely on delays being somewhat symmetric for architectural reasons, repeated synchronizations to reduce asymmetry due to queuing delays (etc), and multiple communication paths to reduce other sorts of asymmetry.

If you put your model's assumptions somewhere in between (because it's interesting to explore model space, of course) I expect the result should also be somewhere in between.

edited Mar 21 '12 at 13:47

answered Mar 21 '12 at 13:30

Craig Gidney

5,842
22
47

1

This should be an answer to your question. Here I'm asking about a more concrete setting, where we may get help from the underlying infrastructure. – Gilles 'SO- stop being evil' Mar 21 '12 at 13:34
I added more content for you. – Craig Gidney Mar 21 '12 at 13:47
this appears to me to be wrong & it can be seen by noting that while the PC send and receive times are the same (upper timeline events coincide in the two cases), the server times are different (lower line in the two cases) & therefore the formula computed by the NTP client is different in the two cases. this can be understood better by labelling the NTP values for $t_1, t_2, t_3, t_4$ in each case (where $t_2, t_3$ are values recorded in server time and sent back to client). as in my answer, the NTP time protocol can indeed measure and adjust for $(t_1 - t_0) \neq (t_3 - t_2)$ – vzn Oct 01 '12 at 21:18
@vzn The server times with respect t messages are the same in both examples. The server's timeline moving to the left represents the starting clock drift being different. The effects of initial clock drift and latency asymmetry happen to be equivalent, so adjusting them both in opposite directions allows the resulting behavior to be equivalent. – Craig Gidney Oct 02 '12 at 19:27
further study, the client/server can tell when their clocks are far out of sync at least outside the round trip time. more info in the polycos et al ref I cite below where they measure different "unidirectional latencies" larger than the NTP uncertainty (which seems to be less than round trip time to NTP servers— ie ~10ms) – vzn Oct 02 '12 at 20:11
@vzn Maybe an example to get you thinking along my train of thought with respect to latency asymmetry. Suppose, 50 years from now, time travel is discovered. I then create an NTP server, but time-shift its responses 50 years into the past from 2062 to 2012 and of course time-shift requests from 2012 to 2062. The round trip time is typical, but the latency asymmetry is 50 years! Do other computers in 2012 running NTP detect this subterfuge? – Craig Gidney Oct 02 '12 at 21:32
think its some good insight into the problem but its not a proof because of various [somewhat hidden] assumptions. try the polycos et al ref for a paper describing real measurements of what gilles calls asymmetric delays above – vzn Oct 02 '12 at 22:14
@vzn The polycos et al paper doesn't measure the asymmetry. Their thesis is about fluctuations of unidirectional latencies, not determining the initial magnitude. – Craig Gidney Oct 02 '12 at 22:52
?? every single table & figure in the paper shows "difference in directional delays" – vzn Oct 02 '12 at 23:22
@vzn They mention assuming synchronization via NTP beforehand, then say that's fine because they only really care about fluctuations. – Craig Gidney Oct 03 '12 at 04:45
they measured difference in directional delays based on NTP. the NTP uncertainty was estimated at about ~10ms. the difference in directional delays was about an order of magnitude higher in real networks. so measuring directional delays less than the NTP uncertainty is still probably impossible (close to what you call the "round trip time"), but in real networks, the directional delays are measurable and substantial and much more than the NTP uncertainty. – vzn Oct 03 '12 at 14:41
If I had a network adapter that processes outgoing messages within 1ms and incoming messages within 100ms I can’t see how you could fix that. – gnasher729 Dec 24 '23 at 15:11

score 2 · Answer 2 · answered Feb 11 '13 at 13:29

Consider a network of time servers known to be synchronous, $\theta = \{A, B, C\}$, and a client machine $P$.

Let $T_{XY}$ be the one way time of flight from machine $X$ to machine $Y$, with the possibility that $T_{XY}\neq T_{YX}$.

Let $\Delta_{XY} = |T_{XY} - T_{YX}|$ be the measure of the asymmetry between machine $X$ and $Y$.

Now, consider that the asymmetry between two synchronous machines can be measured by having the synchronous machines agree to send a one way message to each other at the same time. The difference in the arrival times is $\Delta$ between those machines, i.e.:

$\Delta_{AB} = |T_{AB} - T_{BA}|$

$\Delta_{BC} = |T_{BC} - T_{CB}|$

$\Delta_{CA} = |T_{CA} - T_{AC}|$

can be measured.

Now consider the time of flight of circuits:

$P \rightarrow A \rightarrow B \rightarrow P$, denoted by $C_{AB}$,

$P \rightarrow B \rightarrow A \rightarrow P$, denoted by $C_{BA}$.

$C_{AB} = T_{PA} + T_{AB} + T_{BP}$

$C_{BA} = T_{PB} + T_{BA} + T_{AP}$

Consider the client machine $P$ to initiate both of these circuits simultaneously, and measures the difference in arrival times, $x$:

$x = C_{AB} - C_{BA} = \Delta_{PA} + \Delta_{AB} + \Delta_{BP}$

Both $x$ and $\Delta_{AB}$ are known by previously mentioned measurements, so moving the unknowns to the left hand side:

$x - \Delta_{AB} = \Delta_{PA} + \Delta_{BP}$

Similarly, for $\{C_{AC}, C_{CA}\}$ and $\{C_{BC}, C_{CB}\}$ it can be shown that:

$y - \Delta_{BC} = \Delta_{PB} + \Delta_{CP}$

$z - \Delta_{CA} = \Delta_{PC} + \Delta_{AP}$

Inspecting carefully, we note that $\Delta_{XY} \equiv \Delta_{YX}$. The left sides contain values known from measurements, the right sides contain 3 unknowns in 3 equations.

Solving simultaneously,

$\Delta_{AP} = \frac{r + s - t}{2}$

$\Delta_{BP} = \frac{r - s + t}{2}$

$\Delta_{CP} = \frac{t - r + s}{2}$

where,

$r = x - \Delta_{AB}$

$s = y - \Delta_{BC}$

$t = z - \Delta_{CA}$

How does this circumvent the problem my answer and others have? — Raphael, Feb 11 '13 at 17:28
Well, for one l am using 3 timing serves, not one. And it requires something like 12 messages to be sent - 6 to find the asymmetry between the time servers and 6 to find the asymmetry between the client and the servers. It is not a 1-dimensional solution space because the comprison is between 3 servers and not one. And it does not assume time can go backward. — Bingo, Feb 11 '13 at 21:15
It does rely heavily on 3 perfectly synchronised time servers, the synchronisation of which is left as an exercise for the reader. ^^ — Bingo, Feb 11 '13 at 21:18
@Raphael l think l understand your comment now. Time shifting doesn't work because it is more constrained. eg. Time shifting $A$ w.r.t. $P$ doesn't just affect the time between $A$ and $P$ but also the circuts $PACP, PABP, PBAP, PCAP$, the differences of which are measured and factored into the calculation. Maybe I am still wrong? Not sure :P — Bingo, Feb 11 '13 at 22:26
The issue with this answer is that $C_{AB} - C_{BA}$ cannot be equated to $\Delta_{PA} + \Delta_{AB} + \Delta_{BP}$. — Victor, Jul 31 '23 at 22:12

score 0 · Answer 3 · answered Jul 27 '17 at 05:30

If you only control the endpoints. You can't. See Craig's answer.

Even if you add more machines and a more complex set of computers, like in Bingo's answer, you can reduce to just to machines making the synchronized ones have instantaneous access to the others (delay $T_{XY}$ = 0).

Notice that if you do $T_{AB} = T_{BC} = T_{CA} = 0$, you get $\Delta_{AP} = \Delta_{BP} = \Delta_{CP} = 0$.

So what's wrong? $x = C_{AB}-C_{BA} = \Delta_{PA}+\Delta_{AB}+\Delta_{BP}$

$\Delta_{PA} = |T_{PA} - T_{AP}|$, not $\Delta_{PA} = T_{PA} - T_{AP}$

And if you use the second, then you can not use the assumption $\Delta_{XY} \equiv \Delta_{YX}$ (and if you don't use this, your final equations cancel each other).

So, what can you do? Send a really good clock through mail. ;)

Or, if you have control over all nodes between them, you can check the time to process each packet and calculate the delay between each consecutive pair, that should be symmetrical, if they use the same physical medium both ways.

You may need to account for general relativity, and remember that simultaneity doesn't exist.

“You may need to account for general relativity” No, I don't. I'm perfectly fine with a solution that only works if all the clocks involved are in a fixed frame. There is relativity in a distributed system, but it comes from network latency, not from physics. Its mathematics are completely different. — Gilles 'SO- stop being evil', Jul 27 '17 at 07:57

vzn · Answer 4 · 2012-10-01T18:23:29.123

-1

NTP actually uses 4 time measurements $t_0, t_1, t_2, t_3$ to compute the "offset". they are the "time points" in the round trip of the packet from client to server back to client but can be regarded as time offsets. its assumed that the time offset can be off between client and server but that both can count local elapsed time offsets accurately.

the client after receiving the return packet has all 4 values and computes the actual offset. once the relative offset is calculated between client and server, the "absolute time" offset can be synchronized ie the client can accurately estimate the servers exact offset measured wrt its local time offset, ie the "delta".

$t_0$ = time [offset] sent at client
$t_1$ = time [offset] recd at server
$t_2$ = time [offset] sent at server
$t_3$ = time [offset] recd at client

the actual formula is $\theta = {(t_1 - t_0) + (t_2 - t_3) \over 2}$

note this formula can handle the case where the time from client to server $t_1 - t_0$ is not the same as from server to client $t_3 - t_2$ (either shorter or longer).

on networks, delay time is due to two major factors, mainly latency and bandwidth.

latency is the brief delay in routers on sending new [small] packets and is roughly a different constant at each router. it can be measured with the traceroute utility.
bandwidth is the rate at which large amts of data can be sent eg "upload vs download time" and can also be measured by remote "bandwidth measuring" web sites.

in many modern home/business internet connections the upload rate is much smaller than download rates & this would probably affect the $t_1 - t_0$ vs $t_3 - t_2$ difference whereas latency may be small or somewhat similar between client-to-server and server-to-client.

a basic algorithm to improve accuracy of calculating the offset used in NTP (and can correct for some degree of random network latency) is to repeat the process multiple times and use the "apex of the wedge scattergram". this can be seen on the "clock filter algorithm" on slide 10 of this PPT on NTP by David Mills. see also clock filter algorithm by Mills. (note it can still be employed between a single server and client although the general code is written to allow multiple servers.) this is part of the "mitigation algorithms" described in NTP architecture & algorithms.

edited Oct 01 '12 at 18:23

answered Oct 01 '12 at 18:17

vzn

11,034
1
27
50

2

The question is specifically about the case where the latency is not symmetric. Taking multiple measures won't tell you anything about the constant component in the asymmetry. – Gilles 'SO- stop being evil' Oct 01 '12 at 18:28
the question does not actually contain the word "latency". if you want to sketch out what case you actually have in mind in math form instead of words esp wrt the real NTP formulas, it would certainly help. the formulas & algorithms can indeed measure/handle/cover various cases of "latency" and "asymmetry". – vzn Oct 01 '12 at 18:43
1

$C_1$ and $C_2$ are the latency values (I called them “delay”, the two words are synonymous in this context). – Gilles 'SO- stop being evil' Oct 01 '12 at 18:49
$C_1$ and $C_2$ are influenced by latency but are not really equal to it. in a sense the scattergram actually plots latency.... – vzn Oct 01 '12 at 18:54
note (maybe its not totally obvious in above acct) the $t_1, t_2$ server values are sent in the return packet from server to client. – vzn Oct 01 '12 at 21:41
a paper on the subj, measurement considerations for assessing unidirectional latencies by polyzos et al – vzn Oct 02 '12 at 15:51
traceroute requires (approximately) sychronous clocks, so it can not be used here. You can see in the comments to my answer that the two measurable differences are not enough to figure out the needed quantity (clock offset). – Raphael Oct 02 '12 at 17:18
its a subtle question. in the general case it is possible for the clocks to synchronize adaptively prior to attempting to measure the "unidirectional latency" (more details in the paper). it says that actual unidirectional latencies they measured are about an order of magnitude greater than NTP clock uncertainty around ~10ms (but that might have been due to large distances for latency measurements ie intercontinental). agreed if clock offsets are arbitarily different, its apparently not possible, but the client/server can detect that case based on passing clock values back and forth. – vzn Oct 02 '12 at 17:35
The formula does not work when the delay is asymmetric. Suppose both A and B have the same clock, time from A to B is 1 and B to A is 3. A sends 0 to B at time 0. B receives it at 1, waits 1, and sends (0, 1, 2). A receives it at 5. Then ((1-0)+(2-5))/2 is -1, but we want 0. – Victor Jul 31 '23 at 22:18

score -2 · Answer 5 · answered Mar 21 '12 at 02:36

-2

If only we could send packets back in time

enter image description here

$$B = T_f + \frac{T_b - T_f}{2} - \frac{C_1 + C_2}{2}$$

Assumptions:

$(B+C_2) - T_b = T_f - (B+C_1)$

$T_f - (B + C_2) = (B + C_1) - T_b$

answered Mar 21 '12 at 02:36

Pratik Deoghare

1,731
2
14
21

1

This clever solution is ruled out by the assumption of a “typical Internet infrastructure”. – Gilles 'SO- stop being evil' Mar 21 '12 at 10:10
1

@Gilles I know. :D – Pratik Deoghare Mar 21 '12 at 10:50

score -4 · Answer 6 · answered Mar 12 '12 at 19:25

Here is an idea that sounds absolutely convincing to me and might therefore be absolutely wrong in a dumb way.

Consider the following scenario. We have two nodes $N_1$ and $N_2$ with clocks $C_1$ and $C_2$, respectively. For sake of simplicity, let us assume that the clocks run with the same speed; we denote their difference by $\delta = C_1 - C_2$ which is constant for our purposes. Let us furthermore assume that transmission delays $d_{1 \to 2}$ and $d_{2 \to 1}$ are constant¹.

Have $N_1$ send a message timestamped with $T_1^m$ to $N_2$ and let $T_2^r$ the current time on $C_2$ upon receiving it (do the same for the other direction). In addition, measure round trip time (on either node) $D$ by sending a message back and forth. Now set up this equation system:

$\qquad\begin{align*} T_2^r - T_1^m &= d_{1 \to 2} + \delta \\ T_1^r - T_2^m &= d_{2 \to 1} - \delta \\ D &= d_{1 \to 2} + d_{2 \to 1} \end{align*}$

As this system consists of three equations, has three unknowns and we know there is a solution, it can be solved. Of course the nodes have to exchange their measurements so that they can both compute the same value for $\delta$ (if necessary).

1] I think the assumptions are natural and necessary. They can be justified with the hope that the respective quantities do not change too much for the duration of our synchronisation attempt.

$(d_{1\to2}+\delta) + (d_{2\to1}-\delta) - (d_{1\to2}+d_{2\to1}) = 0$. There are three equations and three unknowns, but the family of solutions is 1-dimensional. It's the same problem we had with Ran's now-deleted answer and briefly discussed in chat: time is relative; we can't distinguish between delays in transmission and skewed clocks. — Gilles 'SO- stop being evil', Mar 12 '12 at 19:33
@Gilles Too bad. We should probably leave one instance of the fallacy for all to see? — Raphael, Mar 12 '12 at 19:41
I can restore the wrong answer I wrote. It might be useful due to the comments made by Gilles. — Ran G., Mar 14 '12 at 03:53

Clock synchronization in a network with asymmetric delays

6 Answers6

Linked