In solving a first order linear differential equation $(1-D)y=x^2$ where $D\equiv \frac{d}{dx}$ the way I learnt was that we proceed as $y=\frac{1}{1-D}x^2=(1-D)^{-1}x^2=(1+D+D^2+D^3+\cdots)x^2=x^2+2x+2+0+0+\cdots=x^2+2x+2.$
Now the question that comes to mind is that what justifies our saying that $$(1-D)^{-1}=1+D+D^2+D^3+\cdots$$
When I asked my teacher about this he said that it is not the case that the inverse operator of $1-D$ is $1+D+D^2+D^3+\cdots$ (whose meaning is unclear anyway). What is true instead is that $$(1-D)(1+D+D^2+\cdots+D^m)x^m=x^m$$ and that we are actually using this fact.
Although I understand this, but I am not entirely satisfied for two reasons. Firstly the resemblance with the expression for the geometric series must be there for a reason which I want to know. Secondly can an appropriate norm be given to an appropriate function space in which we can actually prove this geometric series to be true? (The answer to the second question also covers the first).
Thanks