Strong duality result for non-convex problem with two quadratic functions is a related question. However, I am trying to understand how the dual form problem comes about.
This dual form representation is presented in Appending B, Sec B.1. of Boyd & Vandenberghe's Convex Optimization, but the authors don't explain how it comes about. This appears to be an important result in optimization, so I would really like to understand it.
The primal problem is: \begin{equation}\label{primal} \begin{aligned} & \underset{x}{\text{minimize}} & & x^TA_0x+2b_0^Tx+c_0 \\ & \text{subject to} & & x^TA_1x+2b_1^Tx+c_1 \leq 0 \\ \end{aligned} \end{equation}
where $A_i \in S^n$, the set of symmetric $n \times n$ matrices, $b_i \in \mathbb{R}^n$, $c_i \in \mathbb{R}^n$. No positive semidefinite assumptions on $A_i$ exist. (Note that this is not a QCQP, because $A_i$ is not assumed to be positive semi-definite).
The dual function is clearly
\begin{equation} g(\lambda)=c_0 + \lambda c_1-(b_0 + \lambda b_1)^T(A_0+\lambda A_1)^{\dagger}(b_0 + \lambda b_1) - (1) \end{equation} where $A_0+\lambda A_1\succeq 0$, $b_0 + \lambda b_1 \in C(A_0+\lambda A_1)$ ($C(.)$ denotes the column space), and $\dagger$ denotes the psuedo inverse. Else $g(\lambda)=-\infty$.
The text says that using a Schur complement, a dual problem can be expressed as: \begin{equation}\label{dual} \begin{aligned} & \underset{\gamma, \lambda}{\text{maximize}} & & \gamma \\ & \text{subject to} & & \lambda \geq 0 \\ & & & \begin{bmatrix}A_0+\lambda A_1 & b_0 + \lambda b_1 \\ (b_0 + \lambda b_1)^T & c_0 + \lambda c_1 - \gamma\end{bmatrix} \succeq 0 \end{aligned} \end{equation} which is an SDP (semidefinite program) with variables $\gamma, \lambda \in \mathbb{R}$.
I understand that \begin{equation} \begin{aligned} \\ & & & \begin{bmatrix}A_0+\lambda A_1 & b_0 + \lambda b_1 \\ (b_0 + \lambda b_1)^T & c_0 + \lambda c_1\end{bmatrix} \succeq 0 \end{aligned} \end{equation} is the matrix associated with the Schur complement of (1) above, but I don't see how to go any further.
How does this dual problem come about, and why does it make sense? Where did $\gamma$ come from, and why are we maximising simply one variable here? And then $\gamma$ appears in the matrix in the (2, 2) position?