tl;dr There are two issues with your geometric reasoning. First, drawing $D(2)$ as a tiny circle around the origin is not really accurate. Second, your argument about squares and circles is not the sort that works in synthetic differential geometry, because it cannot handle degenerate cases. There are other ways of picturing infinitesimals, such as the one below.
I. Not visualizing the smooth real line
Many things go wrong when one tries to visualize the infinitesimal neighborhood $D=\{d \in \mathcal{R}\mid d^2=0 \}$ as a scaled-down copy of the smooth real line $\mathcal{R}$. The picture is not just intuitively misleading: its formalized versions are false. For example, a scaled-down copy $\varepsilon \mathcal{R}$ remains closed under addition, while $D$ being closed under addition would contradict the Kock-Lawvere axiom.
Things get worse once we add order/integration axioms into the mix! For arbitrary $d_1, d_2 \in D$, both $d_1 \leq d_2$ and $d_2 \leq d_1$ necessarily hold: we can't really speak of an infinitesimal lying to the right/left of another.
It follows that all geometric pictures of infinitesimal geometry will be quite misleading. E.g. when you draw an infinitesimal rectangle around the origin, you don't really know its orientation, and you can't even rule out its degeneracy (i.e. that one or more vertices coincide) in any way.
We don't even need to invoke 2-dimensional geometry to see this. When we write "consider an infinitesimal closed interval of length $d \in D$", for some arguments we might want to rule out the empty interval $\{0\}$. But we can't just assume $d \neq 0$, as there is (provably) no element $d \in D$ that satisfies $d \neq 0$.
Things might look extraordinarily bad at this point: one of the purported benefits of SDG is that it's supposed to make certain geometric arguments in Lie theory perfectly rigorous. But now it seems that even ordinary geometric reasoning breaks down!
You can read more about this phenomenon (and why it's not as bad as it looks) in my answer to another question. While you can prove that the infinitesimal neighborhood $D$ of $0$ does not coincide with $\{0\}$, you can also prove that no point $d \in D$ is really distinct from $0$ in the sense that $\neg d \neq 0$ holds for all $d \in D$. This all works out thanks to intuitionistic logic. It allows $D$ to behave like an interval in certain aspects (thus the Kock-Lawvere axiom can be consistently adopted). But $D$ will still behave very much like a single point in other aspects. The same goes for the geometric objects built using $D$: in some of their aspects they'll behave like ordinary rectangles or circles, but in many other aspects they'll act as if they were single points.
Thus, most geometric arguments about such shapes break down in the infinitesimal world. Only certain arguments keep working, ones that can handle potentially degenerate objects gracefully.
II. What is wrong with your geometric picture above?
There are two issues. The first issue is that drawing $D(2)$ as a disk (and to a lesser extend $D^2$ as a filled square) is at best a visual aid to remember the inclusion $D(2) \subseteq D^2$, and not really an accurate picture. As discussed above, these sets are not infinitesimally scaled versions of disks and rectangles at all.
The second issue is that not all geometric arguments remain valid in the infinitesimal world. Only those arguments keep working that can handle degenerate objects gracefully. But your argument ("from the geometric picture the diagonal of the square should not be contained in the circle") does not handle degenerate objects gracefully! In particular, the degenerate diagonal of a degenerate square (of side length $0$, the point $\{0\}^2 \subseteq \mathcal{R}^2$) is contained inside the degenerate circle (of radius $0$) around the origin.
III. Visualizing the smooth real line
Long story short, I recommend not conceptualizing the object $D$ as a tiny version of the line $\mathcal{R}$. This leads us to your second question: is there a better way to picture infinitesimals?
As I explained in an answer to a similar question I usually think of an infinitely long chain built from tiny metallic links. From afar, you see a line. From up close, you see chain links labeled by ordinary numbers: but the links themselves have richer, more exotic geometric structure. The first-order infinitesimal neighborhoods of numbers lie on these links.
When we apply a function to the real line, we twist and turn this chain into a seemingly arbitrarily complicated curve. However, when we focus on a single link, we see that the transformation behaves linearly. The transformation preserves the integrity of the links and the overall shape of each link remains the same, even though its orientation may have changed. This is the Kock-Lawvere axiom.

The picture somewhat extends to higher dimensions: you can visualize the plane $\mathcal{R}^2$ as a sort of chain mail, where neighborhoods of the origin such as $D(2)$ and $D^2$ live inside a single ring piece, which has more complicated structure than the large-scale structure of the plane. But in the end, this visualization is still more of a "useful reminder" than something deep that could be made rigorous or formalized.