Several months ago, I was writing for class. I claimed an environment was non-Markovian because it would take several states to de-alias some positions in the grid world. I was corrected that it was n-step Markovian. Something that didn't occur to me until the class had ended was:
If there's an n, then n could be infinity. This would imply an MDP where the next state is decidable with infinite history. This seems like a reasonable thing to call non-Markovian as it's not decidable within a finite time but is it? Where does non-Markovian start?