Show that the class of regular languages is closed under gapping

Question

"Let $\Sigma=\{a, b\} . $ For every word $ w=a_{1} \ldots a_{n} \in \Sigma^{*} $ with $ a_{i} \in \Sigma $ and $ 1 \leq k \leq n $ let $ w_{k}^{-}:=a_{1} \ldots a_{k-1} \overline{a_{k}} a_{k+1} \ldots a_{n} $, while $ \overline{a_{k}} $ is the unique character out of $ \Sigma \backslash\left\{a_{k}\right\} $. For every language $ L $ over $ \Sigma $ let gapping $ L^{-} $ be defined by $$L^{-}:= \left \{u \in \Sigma^{*} \mid u=w_{k}^{-}, k \in \mathbb{N} \text { und } w \in L \backslash\{\varepsilon\}\right\}$$ Show that the class of regular languages is closed under gapping. Meaning for a given finite automaton $ \mathcal{A} $ construct a new automaton that accepts $ (L(\mathcal{A}))^{-} $."

I don't understand how I could solve this problem without knowing a specific place in a word where one character is supposed to be left out and which character that is. I have thought about using $ε$- transitions, but that doesn't guarantee that a specific character has been left out. Also this might be a dumb question but in a word like $ w_{k}^{-}:=a_{1} \ldots a_{k-1} \overline{a_{k}} a_{k+1} \ldots a_{n} $ doesn't $a_{k+1}$ just become the new $a_{k}$ when that is left out?

Any help would be greatly appreciated! (Sorry if the wording is weird, the original language of the problem wasn't English.)

Problem in original language:

"Sei $ \Sigma=\{a, b\} . $ Für jedes Wort $ w=a_{1} \ldots a_{n} \in \Sigma^{*} $ mit $ a_{i} \in \Sigma $ und $ 1 \leq k \leq n $ sei $ w_{k}^{-}:=a_{1} \ldots a_{k-1} \overline{a_{k}} a_{k+1} \ldots a_{n} $, wobei $ \overline{a_{k}} $ das eindeutige Zeichen aus $ \Sigma \backslash\left\{a_{k}\right\} $ ist. Für jede Sprache $ L $ über $ \Sigma $ sei die Lückenbildung $ L^{-} $ definiert durch $$L^{-}:=\left\{u \in \Sigma^{*} \mid u=w_{k}^{-}, k \in \mathbb{N} \text { und } w \in L \backslash\{\varepsilon\}\right\}$$ Zeigen Sie, dass die Klasse der FA-erkennbaren Sprachen unter Lückenbildung abgeschlossen ist. D.h. konstruieren Sie für einen gegebenen endlichen Automaten $ \mathcal{A} $ einen neuen Automaten, der $ (L(\mathcal{A}))^{-} $ erkennt."

ansatz:

Let $ L $ be accepted by the $ N F A \quad A=\left(Q, \Sigma, \Delta, q_{0}, F\right) $ , $ L(A)=L . $

We define $ \varepsilon-N F A \quad A^{\prime}=\left(Q^{\prime}, \Sigma^{\prime}, \Delta^{\prime}, q_{0}^{\prime}, F^{\prime}\right) $ $$ \begin{aligned} Q^{\prime} &=Q \\ \Sigma^{\prime} &=\Sigma \backslash\left\{a_{k}\right\} \\ \Delta^{\prime} &=\left\{(q, \varepsilon, p) \mid\left(q, a_{k}, p\right) \in \Delta\right\} \end{aligned} $$ $$ \begin{aligned} F^{\prime}=F \end{aligned} $$ $$ \begin{aligned} L\left(A^{\prime}\right)=L^{-} \end{aligned} $$ Proof: $$ "\subseteq " $$ Let $ w=a_{1} \ldots a_{n} $ be a word over $L\left(A^{\prime}\right) $. Let $ \left(q_{0}, a_{1}, \ldots a_{k-1}, r_{k-1}, \varepsilon, r_{k}, a_{k+1}, \ldots, a_{n}, q_{n}\right) $ be an accepting run of $ w $. Then $ q_{n} \in F^{\prime} $ and $ \left(r_{i-1}, a_{i}, r_{i}\right) \in \Delta^{\prime} $ for every $ i \in[1, n] \backslash k \mid 1 \leqslant k \leqslant n $

Can you clarify what you mean by $\overline{a_k}$ is the unique character out of $\Sigma \setminus {a_k}$? In the meantime, let me suggest using the Myhill-Nerode Theorem. — TomKern, May 16 '21 at 21:08
It's just a complicated way of saying a_k isn't in the word. so ak with the line above is element of sigma without a_k. we haven't even been taught the Myhill-Nerode Theorem yet, but I will check it out. — iina, May 16 '21 at 21:29
@iina: What is the original language? The statement really isn’t clear. Gapping makes it sound like $a_k$ is simply being omitted, so that $\bar L$ is simply the language of all words over $\Sigma$ that can be obtained by omitting one symbol from a word of $L$, but the description of $\overline{a_k}$ makes it sound like we replace $a_k$ by $b$ if $a_k=a$, and by $a$ if $a_k=b$. — Brian M. Scott, May 16 '21 at 22:27
@BrianM.Scott the original language is German and I think what you said first is what they mean. $L^{-}$ is the language of all words over $\Sigma$ that are words of $L$ but one symbol is being skipped/cut out. Unfortunately the wording is weird even in German, so I don't know how to word it clearer than they did in the original task. I think that sentence is just supposed to be a description of what $\overline{a_{k}}$ means. So that the overline means that symbol isn't in the new word. — iina, May 16 '21 at 22:51
@iina: I strongly suspect that you’re right about what’s intended, but I do read German, so just to play safe, could you quote the original problem, if only to satisfy my curiosity? — Brian M. Scott, May 16 '21 at 22:53
@iina: Thank you. I’m going to revise my guess: I now think that despite the use of the term Lückenbildung, we really are looking at the words that can be obtained by switching one letter of a word in $L$ from $a$ to $b$ or vice versa. That description of $\overline{a_k}$ is pretty unambiguous. — Brian M. Scott, May 16 '21 at 23:03
@iina: It appears that the class of regular languages is closed under both operations. Start with two copies of a DFA for $L$, say $M_1$ and $M_2$; we’ll combine them to get an NFA $M$ for $\bar L$. The initial state of $M_1$ is the initial state of $M$, and the final states of $M_2$ are the final states of $M$. At each state of $M_1$ we’ll add transitions to $M_2$. If we’re really omitting a letter, these are $\epsilon$-transitions simulating reading an $a$ or a $b$; if we’re switching one letter to the other one, they simulate reading the letter that we didn’t read. — Brian M. Scott, May 16 '21 at 23:13
@BrianM.Scott we have to construct an NFA formally, I don't really know how to translate what you said into formal language. I added an ansatz of me trying to follow the solution of a similar exemplary problem I was provided with, but it doesn't look correct to me. And I'm also stuck, because I don't know how to use the definition now to proof my claim. — iina, May 17 '21 at 10:20
@iina: I’ve done the construction for one interpretation of the question. — Brian M. Scott, May 17 '21 at 18:39

score 1 · Answer 1 · answered May 17 '21 at 18:38

Let $M=\langle Q,\Sigma,\delta,q_0,F\rangle$ be a DFA that recognizes $L$. We will construct an NFA $M'=\langle Q\times\{0,1\},\Sigma,\Delta,\langle q_0,0\rangle,F\times\{1\}\rangle$ that recognizes $\overline L$ for the interpretation in which $\overline{a_k}$ is the unique element of $\Sigma\setminus\{a_k\}$. For convenience let $\bar a=b$ and $\bar b=a$.

We can think of $M'$ as consisting of two copies of $M$, one with state set $Q\times\{0\}$, the other with state set $Q\times\{1\}$. $M'$ will start at $\langle q_0,0\rangle$ in the first copy, and at some point it will move to the second copy, which will behave exactly like $M$. Thus, we want

$$\Delta(\langle q,1\rangle,x)=\{\langle\delta(q,x),1\rangle\}$$

for each $q\in Q$ and $x\in\Sigma$.

The first copy will also behave like $M$, except that when it reads a symbol $x$, it has the option treating it as if it were $\bar x$ and moving to the appropriate state in the second copy of $M$. Thus, we want

$$\Delta(\langle q,0\rangle,x)=\{\langle\delta(q,x),0\rangle,\langle\delta(q,\bar x),1\rangle\}\,.$$

I’ll leave it to you to check that $M'$ really does recognize $\overline L$.

Show that the class of regular languages is closed under gapping

1 Answers1