Summarizing the clarifications above as an answer:
There are multiple ways to generate tangent spaces for a mesh, and not all of them agree on the result.
This is a common source of rendering errors in games: the normal-map baking tool generates the texture with respect to one tangent space, while the 3D modelling software or the game's mesh importer decides on a different one, leading to mismatches and artifacts, as shown in these examples from the Handplane documentation:

So, we need to pick a standard tangent space to use.
A popular choice is "MikkTSpace", a method for generating tangent spaces that Morten S. Mikkelsen developed as part of his master's thesis. He specifically designed the algorithm to be robust for use by multiple tools in an asset pipeline, so they can independently generate the same tangent basis regardless of quirks like vertex welding choices or the order of the vertices & faces.
Code for the MikkTSpace algorithm is freely available online, and I'm not an expert in all of its workings, so I won't describe a complete implementation here. Instead I'll address the specific questions about it raised above.
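For context, the core per-triangle step shared by most tangent generators (MikkTSpace included) solves for the texture-space partial derivatives from a triangle's edge vectors and UV deltas. Here's a minimal NumPy sketch of that step; the names are mine, and this is an illustration of the idea rather than the reference code:

```python
import numpy as np

def triangle_tangent(p0, p1, p2, uv0, uv1, uv2):
    """Solve the two edge equations e1 = du1*T + dv1*B and
    e2 = du2*T + dv2*B for T (= dP/du) and B (= dP/dv)."""
    e1, e2 = p1 - p0, p2 - p0
    du1, dv1 = uv1 - uv0
    du2, dv2 = uv2 - uv0
    det = du1 * dv2 - du2 * dv1   # sign of det(T): negative => UVs are mirrored
    r = 1.0 / det                 # assumes a non-degenerate UV mapping
    tangent   = (e1 * dv2 - e2 * dv1) * r
    bitangent = (e2 * du1 - e1 * du2) * r
    return tangent, bitangent
```

The full algorithm then averages and orthonormalizes these per-triangle results across grouped faces, which is where the subtleties discussed below come in.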
**Why do these algorithms disregard smoothing groups?**
"Smoothing groups" don't really exist on GPUs or in most game engines - they're a concept used in 3D modelling tools to make manipulating normals more intuitive.
By the time a mesh is pushed down the graphics pipeline, it's just a raw stream of vertices combining position, texture coordinates, normals, etc.
Wherever two entries in this stream coincide at the same position but have different normals, you'll get a hard crease or lighting seam along any edges they share (or a point discontinuity, like the tip of a cone, if they don't share any edges).
Wherever you have an edge where all vertices at the start of the edge agree on their normal, and all vertices at the end of the edge agree on their (possibly different) normal, you'll get a smooth join with no discontinuity.
Smoothing groups exist to tell the 3D package where it should force vertices along a shared edge to share a normal, versus where the normal can be independently chosen for each vertex.
By the time the mesh gets to the tangent space baking step, these smoothing groups have typically already been converted to vertex splits, so the tangent space algorithm doesn't need to be aware of a particular tool's smoothing conventions - it can just work with the literal vertex data.
No artist-authored smoothing information is lost this way; it has just already been translated into the lower-level form these algorithms understand.
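To illustrate what that translation amounts to, here's a hypothetical Python sketch (not any particular engine's importer): vertices are shared only when both position and normal agree, so a smoothing-group boundary becomes duplicated vertices along the hard edge:

```python
def split_vertices(faces):
    """faces: list of triangles, each a list of (position, normal) tuples.
    Builds a vertex buffer and index buffer where two corners share a
    vertex only if BOTH position and normal match, so hard edges become
    vertex splits."""
    verts, index_of, indices = [], {}, []
    for tri in faces:
        for pos, nrm in tri:
            key = (pos, nrm)
            if key not in index_of:
                index_of[key] = len(verts)
                verts.append(key)
            indices.append(index_of[key])
    return verts, indices
```

Two triangles sharing an edge with matching normals produce four vertices; give each triangle its own face normal and the shared edge splits into six.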
**Why does the MikkTSpace algorithm group only faces that share two vertices, two texture coordinates, AND two normals?**
"Suppose you are approximating a sphere by something like an
icosahedron. In this case there will be no vertex normals shared by
any two vertices, so each individual triangle is going to have its own
tangent space. This seems counterintuitive."
This looks like a misunderstanding of the algorithm, as though it said:
"Two triangles can be grouped only if the vertices at both ends of the shared edge agree on a single shared vertex normal."
But it actually says:
- two vertices (i.e. vertex positions) are shared.
- vertex normals at the two vertices are shared.
- texture coordinates at the two vertices are shared.
- the triangles must have the same sign of det(T) (i.e. either neither, or both, are mirrored in texture space, not one of each).
Point 2 means that:
- triangle A and triangle B's vertices at the START of the shared edge must share one vertex normal
- triangle A and triangle B's vertices at the END of the shared edge must share a (possibly distinct) second vertex normal
...for a total of two shared normals along the common edge.
So you can see this is the same condition we described above with regard to smoothing groups. If this condition is not met - the vertices at at least one end of the edge disagree about their normal direction - then we'll have a sharp crease along this edge. Since the normal experiences a sharp discontinuity along this edge, the tangent basis (which includes the normal) will also be discontinuous along this edge.
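To make the condition concrete, here's a hypothetical sketch of the grouping test in Python (illustrative only, not the actual MikkTSpace source): two triangles can be grouped when exactly two corners match in position, UV, and normal, and the triangles have the same UV orientation:

```python
def uv_det_sign(uv0, uv1, uv2):
    """Sign of det(T): negative means the triangle is mirrored in texture space."""
    du1, dv1 = uv1[0] - uv0[0], uv1[1] - uv0[1]
    du2, dv2 = uv2[0] - uv0[0], uv2[1] - uv0[1]
    return 1 if du1 * dv2 - du2 * dv1 >= 0 else -1

def can_group(tri_a, tri_b):
    """tri_*: three corners, each a (position, uv, normal) tuple.
    Corners are 'shared' only if ALL THREE attributes match exactly."""
    shared = [c for c in tri_a if c in tri_b]
    if len(shared) != 2:          # need both ends of a common edge to agree
        return False
    # same sign of det(T): neither or both mirrored, not one of each
    return uv_det_sign(*(c[1] for c in tri_a)) == uv_det_sign(*(c[1] for c in tri_b))
```

Note how a disagreement in the normal at just one end of the edge, or a mirrored UV mapping on one side, is enough to put the triangles in separate groups.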
**Why does the algorithm accumulate magnitudes of vectors?**
Checking through the code, you'll find the magnitudes of the partial derivatives are not used for the "basic" version of the tangent space.
For the more advanced version, here is Mikkelsen's original comment:
// This function is used to return tangent space results to the application.
// fvTangent and fvBiTangent are unit length vectors and fMagS and fMagT are their
// true magnitudes which can be used for relief mapping effects.
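In other words, the returned tangent and bitangent are unit length, and the lengths the partial derivatives had before normalization are reported separately. A sketch of that split (illustrative Python, not Mikkelsen's actual code):

```python
import numpy as np

def tangent_with_magnitude(dP_du, dP_dv):
    """Split the raw texture-space partial derivatives into unit
    directions plus their true magnitudes, mirroring the
    fvTangent/fMagS and fvBiTangent/fMagT split described above."""
    mag_s = np.linalg.norm(dP_du)
    mag_t = np.linalg.norm(dP_dv)
    return dP_du / mag_s, mag_s, dP_dv / mag_t, mag_t
```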
Relief Mapping is an effect where we approximate parallax and self-occlusion of surface structure by ray-marching through a height field texture from our initial surface sample point. We imagine the surface has some depth to it, and that our view ray can continue some distance from where it hit the bounding volume of the polygon geometry, before it actually hits the displaced surface underneath.
To make it work, we need to transform our view vector from eye/world/object space into texture space:

(Diagram of Relief Mapping from GPU Gems)
To do that precisely, we need to know more than just how the texture is oriented with regard to the 3D geometry (which is what we get from the tangent and normal directions); we also need to know how it's scaled. If we ignore this, a ray entering a compressed part of the texture will behave like it's refracted in water, covering less world-space distance parallel to the surface for each unit of travel perpendicular to it, distorting the effect. We can use the magnitude information provided by the algorithm to compensate, ensuring our texture-space ray matches the direction of our view ray in the world.
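As an illustrative sketch (the conventions and names here are my assumptions, not taken from any specific relief-mapping implementation), the transform plus scale compensation might look like:

```python
import numpy as np

def view_to_texture_space(view_ws, tangent, bitangent, normal, mag_s, mag_t):
    """Project a world-space view vector onto the (unit) tangent frame,
    then divide the u/v components by the texture's world-space scale
    (mag_s, mag_t) so one unit of travel in texture space corresponds
    to the correct world-space distance."""
    v = np.array([np.dot(view_ws, tangent),
                  np.dot(view_ws, bitangent),
                  np.dot(view_ws, normal)])
    v[0] /= mag_s   # compensate for texture stretching/compression along u
    v[1] /= mag_t   # ...and along v
    return v
```

Where the texture is compressed (small mag_s), the ray covers more texels per unit of world-space travel, which is exactly the "refraction" distortion the division corrects for.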
There are other effects that might benefit from this kind of scale information about the texture mapping (perhaps some forms of tessellation using a control texture?), but if you're just using the tangent space for standard normal mapping and only care about directions, you can safely ignore the magnitude tracking the MikkTSpace algorithm does and use the basic version instead.