What is an "encoding" of a TM?

Question

I'm currently working on a reduction from $A_{TM}$ to another language, and have been reading through some example proofs. I've come across the situation where, for example, we have $L = \{ \langle M,w \rangle | \text{ ...etc} \}$, where obviously this would normally stand for $M$ being a TM and $w$ being a string. However, later in the proofs, we replace the $w$ (a string) with an "encoding of a turing machine". Sometimes it's even "an encoding of the TM, $M$".

I'm rather lost on this idea. How do we pass an "encoding of a TM" into a parameter for a string? How do we run that on a TM? Maybe I'm misunderstanding the definition of an "encoding of a TM", which I assume to be the TM itself somehow converted into a string format.

Would anyone mind explaining this to me? I'm sure truly understanding this concept would immensely help me in writing further reductions.

I feel your doubt is related to difference in syntax and semantics of a string. When we say encoding of a Turing machine , we mean we can write it form of the string, such that given the method of encoding and the string, you can understand the physical Turing machine it represents. — advocateofnone, Nov 18 '15 at 00:20

score 7 · Accepted Answer · answered Nov 18 '15 at 00:19

7

A Turing machine $M$ can be described as a 7-tuple $(Q,F,q_0,\Sigma,\Gamma,\delta, blank)$. This means that if someone gives you this 7-tuple, then the TM is well-defined, and you can precisely define how it behaves, etc.

The encoding of a TM, usually denoted as $\langle M \rangle$ is a string that encompasses all the information of the 7-tuple describing $M$. You can think of it as "writing the 7-tuple as a binary string" (but this is a simplification). So the encoding of M, is just a string that describes how the TM works.

The last observation is that if you know the encoding - you know everything about the TM; specifically, if $\langle M \rangle$ is given as an input (to a machine $M'$), the TM $M'$ can "run" or "simulate" what $M$ would have done on any given input -- the machine $M'$ knows the states $Q$ of $M$ and the transition $\delta$ of $M$, so it can imitate its actions, step by step.

answered Nov 18 '15 at 00:19

Ran G.

20,684
3
60
115

Your note that we can think of it as "writing the 7-tuple as a binary string" was very helpful. So In most cases where we have a language, say, 'P' and it is written as P{ <, M'> | ... } where M' is the encoding of M, and we run M on M', what would the result be? We are literally running a TM on another TM (itself). How would that even work? – user3472798 Nov 18 '15 at 00:32
1

Yes! when you run $M$ on a string $w$ you don't care what $w$ "means". It is just a string... only in our mind this string has a "meaning" of being the encoding of some machine... – Ran G. Nov 18 '15 at 00:42
Huh, gotcha. Thank you! Just one last question, how would it yield a useful result, running M on M'? If the encoding of a machine is just a binary number, how would running this on it's twin yield something of use? What is an accept/reject telling us in this case? – user3472798 Nov 18 '15 at 00:47
1

This depends on the specific $M$ and the context. See this answer for some examples of reductions that get other machine's encodings as input, and the "meaning" it makes. – Ran G. Nov 18 '15 at 01:00
1

You should probably add that in the context of computability, we need that $\langle \cdot \rangle$ and its (partial) inverse are effectively computable. This corresponds to the set of all TMs (resp. their encodings) being recursively enumerable, a non-trivial insight. The universal Turing machine is another affected concept. – Raphael Nov 18 '15 at 12:09

score 5 · Answer 2 · answered Nov 18 '15 at 01:00

Part of your confusion seems to be about the nature of $\langle\,M\,\rangle$. That's just a string like any other, except that it is a complete description of $M$. For example, a managable project for an introductory programming course might be to write a program which, when given an input describing a TM, shows the action of that TM on some input. What would such a program need as a description of a TM? We might decide to do something like this:

A list of the states, [q0, q1, ... , qn].
A description of which states are start, accept, and reject states, s, a, r, where each of these is one of q0 ... qn
A list of move rules, of the form (p, a, r, b, d) corresponding to the move $\delta(p, a) = (r, b, d)$, i.e., in state $p$ with input $a$, change to state $r$, write $b$ on the tape, and move in direction $d$ (left or right).

With that information, the program to simulate the TM wouldn't be too hard to write. The point here is that the string

[q1, q2], q1, q1, q2, [(q1, 0, q1, 1, R), (q1, 1, q2, 1, L), ... ]

would be a complete description of a TM, suitable for use as the basis of the simulation by our assigned program. This description is what we call $\langle\,M\,\rangle$. Now we might further encode this in a more limited alphabet, but that's immaterial. The point is that it can be done: any TM can be described by a suitable string.

The same comment I posted at Ran G.'s answer applies here: it's very important that this encoding is computable, as is it's inverse. — Raphael, Nov 18 '15 at 12:10

What is an "encoding" of a TM?

2 Answers2

Linked