2

I am proposing here a multi-d version of the question I proposed here and from which I have really learnt a lot.

Problem. Given three points "on the Earth", i.e. three couples of longitude/latitude $p=(\alpha_1, \beta_1), q=(\alpha_2, \beta_2)$ and $s=(\alpha_3, \beta_2)$, consider the regular tetrahedron $T$ "spanned" by these three points (I am normalizing the radius of the Earth to be equal to 1).

Consider now a probability measure $\mu$ and suppose the points $p,q,s$ (or, better to say: their coordinates $(\alpha_i, \beta_i)$) are i.i.d. random variables w.r.t. to this probability measure.

Q. Which is the distribution/probability measure $\mu$ on $[0,2\pi] \times [0,\pi]$ which maximizes the mean volume (without sign) of the tetrahedron, i.e. $$ \int_{\mathbb ([0,2\pi] \times [0,\pi])^3} \vert \text{Vol }(T) \vert d\mu(\alpha_1, \beta_1)d\mu(\alpha_2,\beta_2)d\mu(\alpha_3,\beta_3)? $$

Alternatively, we can directly consider probability distributions on the unit sphere $S^2 \subset \mathbb R^3$ and in this case the functional becomes $$ \int_{\mathbb (S^2)^3} \vert \text{Vol }(T) \vert d\mu(p)d\mu(q) d\mu(s) $$ and I honestly do not know which one is easier (if it is).

Romeo
  • 6,087
  • It actually could make quite a difference whether you have three iid random variables describing the locations of the points, or six iid random variables giving the locations in spherical coordinates. For example, if the coordinates are iid then it is impossible for the distribution of the point $p$ to be uniform over the surface area of the sphere. (And there is the fact that in conventional spherical coordinates, the difference of maximum and minimum values is $\pi$ for one coordinate and $2\pi$ for the other.) – David K Feb 19 '19 at 14:49
  • @DavidK Uh, interesting, I did not notice it. Well, I believe the interesting case is when the random variables are describing the locations of points on the sphere. How can I formulate this precisely? Should we consider three iid random variables on the unit sphere and maximize the expectation w.r.t. their common distribution? Is this the correct way of stating the problem? I apologize but I am not so confident with language of probability theory... Thanks. – Romeo Feb 19 '19 at 15:22
  • The question would be more natural with $\mu$ as a probability measure on $[0,2\pi]\times[0,\pi]$, so that the integrand is $|\text{Vol}(T)|, d\mu(\alpha_1,\beta_1), d\mu(\alpha_2,\beta_2), d\mu(\alpha_3,\beta_3)$. –  Feb 19 '19 at 15:37
  • @MattF. Nice to hear from you again :) Well, is a probability measure $\mu$ on $[0,2\pi] \times [0,\pi]$ the same as a probability measure on $S^2$? If that's true then probably it is the best way of asking the question. Do you agree? – Romeo Feb 19 '19 at 15:49
  • That’s good too –  Feb 19 '19 at 16:13
  • So you now have three iid distributions over a surface, which I think is a good idea. Like @MattF., my intuition says the best distribution is uniform, but I don't have a proof yet. – David K Feb 19 '19 at 16:51

1 Answers1

2

This is not an answer, but an amusing observation. If I didn't screw up my calculus, for the uniform distribution on the spherethe integral has a value of $\frac{\pi}{48} \approx 0.06545...$. On the other hand, since the volume is maximized when the points are orthogonal, let us try a measure concentrated at the vertices of an orthonormal basis, i.e. $\mu = \frac13 \big( \delta_{e_1}+\delta_{e_2}+ \delta_{e_3} \big)$. The expected value of the volume for this distribution will be $\frac1{27} =0.037...$, since $6$ out of $27$ choices yield a tetrahedron of volume $\frac16$, and the rest give volume $0$. We see that the uniform distribution wins.

Notice that in 2 dimensions (for the area of triangle Apex angle of a triangle as a random variable) this difference is: $\frac1{\pi} \approx 0.3183...$ vs. $\frac14= 0.25$.

I suspect that for the analogous problem in $d$ dimensions ($d$-dimensional volume of a simplex generated by random points on $S^{d-1}$) the difference would go to zero as $d\rightarrow \infty$ (due to the phenomenon of concentration of measure), but the uniform measure would be the maximizer.

1) Here is the computation of the integral. I'll work with the expected volume of parallelepiped vs tetrahedron, so that I don't have to carry $\frac16$ around. We need to compute \begin{equation} \int\limits_{S^{d-1}}\int\limits_{S^{d-1}}\int\limits_{S^{d-1}} | \det ( x\, y\, z) | d\sigma(x)\, d\sigma(y) \, d\sigma(z) = \int\limits_{S^{d-1}} \int\limits_{S^{d-1}} | \det ( x\, y\, z) | d\sigma(x)\, d\sigma(y), \end{equation} where by rotational invariance we can fix one of the variables (e.g. $z$), thus reducing to a double integral above. Since this integral is independent of the value of $z$, let us choose $z= e_3 = (0,0,1)$. Then $$| \det ( x\, y\, z) | = |x_1 y_2 - x_2 y_1 | = | \sin (\phi_x - \phi_y) | \cdot \sin \theta_y \cdot \sin \theta_y,$$ where we switched to polar coordinates. Recalling that $$ d\sigma = \frac1{4\pi} \sin \theta d\theta d\phi,$$ (we divide by $4\pi$ since $\sigma $ is a normalized measure) we reduce our integral to: \begin{align*} \int\limits_{S^{d-1}} \int\limits_{S^{d-1}} | \det ( x\, y\, e_3) | d\sigma(x)\, d\sigma(y)& = \frac1{16\pi^2} \bigg(\int_0^\pi \sin^2 \theta \, d\theta \bigg)^2 \cdot \bigg( \int_0^{2\pi}\int_0^{2\pi} | \sin (\phi_x - \phi_y ) | \, d\phi_x d\phi_y \bigg)\\ & = \frac1{16\pi^2} \cdot \bigg( \frac{\pi}2 \bigg)^2 \cdot 8\pi = \frac{\pi}{8}. \end{align*} (The integrals above are straightforward to compute.) Therefore, the expected area of the tetrahedron is $\frac{1}{6} \cdot \frac{\pi}{8} = \frac{\pi}{48} \approx 0.06545...$.

2) Here is the derivation of the recursive formula for the expected volume in $d$ dimensions. Let $E_d$ be the expected $d$-dimensional volume of the parallelepiped, spanned by $d$ independent uniformly distributed random points on the sphere $S^{d-1}$. Then the volume of the tetrahedron is simply $V_d = \frac{1}{d!} E_d$.

For any $x\in S^{d-1}$ let us write $x= (\sqrt{1-t^2} u, t)$, where $t\in [-1,1]$ is its $d^{th}$ coordinate, and $u\in \mathbb S^{d-2}$. It is a classical fact that under this change of variables $$ d\sigma (x) = \frac{\omega_{d-2}}{\omega_{d-1}} (1-t^2)^{\frac{d-3}2} dt \, d\sigma_{d-2} (u) , $$ where $\omega_{d-1} = \frac{2\pi^{d/2}}{\Gamma (d/2)}$ is the Lebesgue surface measure of the sphere $S^{d-1}$, and $\sigma_{d-2}$ is the normalized surface measure on $S^{d-2}$. Notice that this implies, in particular, that $$\int_{-1}^1 (1-t^2)^{\frac{d-3}2} dt = \frac{\omega_{d-1}}{\omega_{d-2}}. $$ We then use the same trick as above: fix $x_d = e_d = (0,0,\dots,0,1)$ to reduce the number of variables: \begin{align*} \mathbb E_{x_1,\dots, x_d \in S^{d-1}} | \det (x_1 \, \dots \, x_d ) | & = \mathbb E_{x_1,\dots, x_{d-1} \in S^{d-1}} | \det (x_1 \, \dots \, e_d ) |\\ & = E_{x_1,\dots, x_{d-1} \in S^{d-1}} \prod_{i=1}^{d-1} (1-t_i^2)^{1/2} \cdot \det (u_1 \, \dots \, u_{d-1} ) |\\ & = E_{t_1,\dots, t_{d-1} \in [-1,1] } \prod_{i=1}^{d-1} (1-t_i^2)^{1/2} \cdot E_{u_1,\dots, u_{d-1} \in S^{d-2}} | \det (u_1 \, \dots \, u_{d-1} ) |\\ & = \bigg( \frac{\omega_{d-2}}{\omega_{d-1}} \int_{-1}^1 (1-t^2)^{\frac{d-2}2} dt \bigg)^{d-1} \cdot E_{d-1}\\ & = \bigg( \frac{\omega_{d-2}\omega_{d}}{\omega_{d-1}^2} \bigg)^{d-1} E_{d-1} . \end{align*} Therefore $$ E_d = \bigg( \frac{\omega_{d-2}\omega_{d}}{\omega_{d-1}^2} \bigg)^{d-1} E_{d-1} = \left( \frac{\big( \Gamma \big(\frac{d}{2}\big)\big)^2 }{\Gamma \big(\frac{d-1}{2}\big) \Gamma \big(\frac{d+1}{2}\big)} \right)^{d-1} E_{d-1}.$$

Let us check the case $d=3$. We have $$ E_{d-1}= E_2= \frac{1}{4\pi^2} \int_0^{2\pi}\int_0^{2\pi} |sin (\phi_1 - \phi_2) | d\phi_1 d\phi_2 = \frac{2}{\pi} ,$$ also $\omega_{d-1} = \omega_2 = 4\pi$, $\omega_{d-2} = \omega_1 = 2\pi$, $\omega_d = \omega 3 = 2\pi^2$. Therefore, $$ E_3 = \bigg( \frac{\omega_{1}\omega_{3}}{\omega_{2}^2} \bigg)^2 E_{2} = \bigg( \frac{2\pi \cdot 2\pi^2}{(4\pi)^2} \bigg)^2 \cdot \frac{2}{\pi} = \bigg( \frac{\pi}{4} \bigg)^2 \cdot \frac{2}{\pi} = \frac{\pi}{8}, $$ as we have obtained above.

  • This is also interesting, and I want to ask for confirmation what happens in the $d$-dimensional case: it seems to me we have $d^d$ choices (i.e. the entrances of each vector of the canonical base) and of these $d(d-1)$ (?) will yield a volume of $1/(d+1)!$ and the others zero. So the expected volume for the distribution $1/d (\delta_1 + \ldots + \delta_d)$ is $1/d^d \cdot 1/((d+1)(d-2)!)$. Am I right? Thanks. – Romeo Feb 26 '19 at 17:51
  • I think you are slightly off here: we are indeed, choosing $d$ vectors out of the $d$ basis elements (with repetitions), so $d^d$ choices. We are interested in the choices where none of them repeat and all elements of the basis are taken (otherwise the volume is zero): there are $d!$ such choices, and in each case the volume of the tetrahedron spanned by the basis is $1/{d!}$, hence the expected value will just be $1/{d^d}$. – Dmitriy Bilyk Feb 27 '19 at 13:02
  • That was easy, you are right. Sorry for messing things up :-( and thanks for clarification. Do you know if there is a close formula for the integral wrt the uniform distribution in $\mathbb R^d$ (so to compare the asymptotic behavior with $d^{-d}$)? Thanks. – Romeo Feb 27 '19 at 18:28
  • I don't know one, but I'll try to compute it at some point. – Dmitriy Bilyk Feb 27 '19 at 18:45
  • How did you set up the calculus to get .0398 for the uniform distribution? I got .12 for that by the code in my answer. –  Mar 02 '19 at 03:25
  • @MattF. I was going to ask the same question too. Actually "my" code gives $\pi/8 = .39$ (for the parallelepiped: we need to divide by 6 to get the tetrahedron, hence that would be $\pi/48 = .65$). Gosh, which is the right answer for the uniform distribution? Thanks. – Romeo Mar 02 '19 at 09:06
  • @MattF.: you guys are right -- sorry, I did for get to divide by 6, and I lost a $\pi^2$ along the way. It is true: it is $\pi/8$ for the parallelepiped, and $\pi/48$ for the tetrahedron. I also found a recursive formula for the volume of the tetrahedron in dimension $d$. I'll update the original answer with corrections and computations. – Dmitriy Bilyk Mar 02 '19 at 18:39
  • @Romeo, your code looks wrong — in the uniform distribution on the sphere, it is less common to have $0<\theta<\pi/2$ than to have $\pi/2<\theta<\pi$. My code shows how to set up the uniform distribution on the sphere properly. –  Mar 02 '19 at 19:26
  • @Romeo: I have updated the computations above -- indeed, for the parallelepiped the expected volume seems to be $\pi/8$. I computed it two ways: integral in spherical coordinates and with the recursive formula, using the 2d answer (see above). Just for reference, in my spherical coordinates the latitude $\theta \in [0,\pi]$ (so that $\sin \theta \ge 0$), and the longitude $\phi \in [0,2\pi]$. – Dmitriy Bilyk Mar 02 '19 at 19:44
  • @MattF.: in your code you are generating the uniform distribution on the sphere absolutely correctly, but I do not see, why you are attaching a $4^{th}$ coordinate equal to $1$ to the vectors and then computing $4\times 4$ determinants. A parallelepiped (or tetrahedron) is spanned by $3$ vectors in 3D, and its volume is the absolute value of the determinant of the $3\times 3$ matrix whose rows are those 3D vectors (divided by 6 for tetrahedron). Am I missing something? – Dmitriy Bilyk Mar 03 '19 at 00:12
  • @MattF.: another issue is that your trials are not completely independent -- roughly speaking, you generate a 1000 random points on the sphere. Then in trial 1 you take points 1,2,3, and in trial 2 you take points 2,3,4. So, these two trials are not independent (e.g., if points 2 and 3 happened to be very close, then in both cases the volume will be small). – Dmitriy Bilyk Mar 03 '19 at 00:16
  • See MathWorld.wolfram.com/tetrahedron.html for the volume of a tetrahedron determined by four points. 2) Indeed my trials are not independent, but that is irrelevant for taking expectations.
  • –  Mar 03 '19 at 16:40
  • @MattF. We seem to have a different reading of the problem: in Romeo’s original post, one chooses 3 random points on the sphere, so my understanding is that one takes the tetrahedron spanned by the 3 vectors, i.e. with vertices at the origin and those 3 points. Your version (taking 4 random points on the sphere and looking at the tetrahedron with those vertices) is also a very interesting problem, but I don’t think that was the original question. Maybe Romeo can correct us – Dmitriy Bilyk Mar 03 '19 at 17:48
  • @MattF. I agree with Dmitriy Bilyk, my version of the problem is the one he states in his comment: I take three random unit vectors and I consider the tetrahedron "spanned" by them. – Romeo Mar 03 '19 at 18:45
  • @Romeo, if that's the case then I wasn't reading your question carefully enough and you weren't reading my answer carefully enough. So I've deleted my answer. –  Mar 04 '19 at 16:00