Suppose that we have a probability space $(\Omega, \mathcal{F}, P)$. Let $X,Y$ be real-valued random variables defined on this space, and let $\mathcal{H} \subset \mathcal{F}$ be a sub-sigma-algebra.
Suppose $X$ is $\mathcal{H}$-measurable (i.e, $X^{-1}(B) \in \mathcal{H}$ for all Borel $B \subset \mathbb{R}$).
Also suppose that $Y$ is independent of $\mathcal{H}$ (which implies that $X$ and $Y$ are independent).
Then is it true that for any Borel-measurable function $g: \mathbb{R}^2 \to \mathbb{R}$, we have that $\mathbb{E}(g(X,Y)|\mathcal{H})=\mathbb{E}(g(X,Y)|X)$?
Observations: It seems to be true for functions of the form $g(x,y)=f(x)h(y)$, because $\mathbb{E}(f(X)h(Y)|\mathcal{H}) = \mathbb{E}(h(Y)) \cdot f(X)=\mathbb{E}(f(X)h(Y)|X)$ by independence of $Y$ to $X$ and $\mathcal{H}$. But I can't seem to prove it for the general case. Maybe we can approximate arbitrary $g$ by functions of this form from below, and then use MCT?
Is it possible to show that any measurable $g: \mathbb{R}^2 \to \mathbb{R}$ can be written as the (upward) limit of linear combinations of functions of the form $\chi_{A \times B}$, for Borel $A,B \subset \mathbb{R}$? Because then we can just apply MCT and use the preceding comment, and we're done. (The $\chi_E$ denotes characteristic function of $E$.)