Edit: I have changed this answer significantly because my original answer which was accepted was for a slightly different question and, not to mention, incorrect. I have (hopefully) fixed all errors.
Unfortunately, a negative binomial distribution is unlikely to be helpful here because it doesn’t account for the number of times “in a row” that an event occurs, only the total amount of times that it does. To take that into account, we treat this experiment as an absorbing Markov chain with transition matrix,
$$\begin{pmatrix}
5/6 & 1/6 & 0 & 0 & 0 \\
5/6 & 0 & 1/6 & 0 & 0 \\
5/6 & 0 & 0 & 1/6 & 0 \\
5/6 & 0 & 0 & 0 & 1/6 \\
0 & 0 & 0 & 0 & 1
\end{pmatrix}.$$
The $(i,j)$-entry of this matrix tells us the probability that the next trial will put us in state $q_{j - 1}$ given that the current trial is in state $q_{i - 1}$ where
$$q_k = \begin{cases}
\text{“the current streak of $1$’s is $k$”}, & 0 \leq k \leq 3 \\
\text{“the current streak of $1$’s is $4$, or the experiment has ended”}, & k = 4
\end{cases}.$$
Now, the thing about state $q_4$ is that the experiment stops when we hit a streak of four $1$’s in a row; it’s a way of keeping track that we’ve hit the end of the experiment, i.e., $q_4$ is our absorbing state. The other entry probabilities are intuitive: at any step we could roll a $1$ with probability $1/6$, and that increases our streak by $1$, bringing us to next state; or we could not roll a $1$ with probability $5/6$, and that takes our streak back down to $0$. However, once we reach $q_4$ for the first time, the experiment is over and will remain in state $q_4$ with probability $1$.
It’s a standard result in probability theory that $\operatorname{E}(X) = \sum_{n\geq 0} P(X > n)$, and $\operatorname{E}(X^2) = \sum_{n\geq 0}(2n + 1) P(X > n)$ for any random variable, $X$, whose range is nonnegative integers.
One fact about absorbing Markov chains is that
$$
P(X > n) = \mathbf{p} A^n \mathbf{1}
$$
where $X$ is the number of steps until an absorbing state is reached, $\mathbf{p}$ is the probability row vector of the initial state with absorbing states removed, $A$ is the transition matrix with the rows and columns of any absorbing state removed, and $\mathbf{1}$ is the vector of appropriate length whose entries are all $1$. In our case, $X$ is the number of trials until we hit four consecutive $1$’s (which is $q_4$), $\mathbf{p} = (1, 0, 0, 0)$ because our intitial state is $q_0$ (there is a streak of $0$ if the first trial has yet to happen), and $A$ is our transition matrix without the last row and column.
Hence,
$$\begin{align*}
\operatorname{E}(X) &= \mathbf{p} \sum_{n\geq 0} A^n \mathbf{1} \\
&= \mathbf{p} (I - A)^{-1} \mathbf{1} \\
&= \mathbf{p} \begin{pmatrix}6^4 & 6^3 & 6^2 & 6\\
6^4 - 6 & 6^3 & 6^2 & 6\\
6^4 - 6^2 & 6^3 - 6 & 6^2 & 6\\
6^4 - 6^3 & 6^3 - 6^2 & 6^2 - 6 & 6
\end{pmatrix} \mathbf{1}\\
&= \begin{pmatrix} 6^4 & 6^3 & 6^2 & 6\end{pmatrix}\mathbf{1}\\
&= 6^4 + 6^3 + 6^2 + 6 \\
&= 1554,
\end{align*}$$
and
$$\begin{align*}
\operatorname{Var} X &= \operatorname{E}(X^2) - \operatorname{E}(X)^2\\
&= \mathbf{p} \sum_{n\geq 0} (2n + 1) A^n \mathbf{1} - 1554^2 \\
&= \mathbf{p} \sum_{n\geq 0} ((2n + 2)A^n - A^n)\mathbf{1} - 1554^2 \\
&= 2\mathbf{p} \sum_{n\geq 0} (n + 1)A^n \mathbf{1} - \underbrace{\mathbf{p} \sum_{n\geq 0} A^n \mathbf{1}}_{=\operatorname{E}(X)} - 1554^2\\
&= 2\mathbf{p}(I - A)^{-2}\mathbf{1} - 1554 - 1554^2\\
&= (2 \cdot 6^8 + 4 \cdot 6^7 + 6 \cdot 6^6 + 8 \cdot 6^5) - 1554 - 1554^2\\
&= 2404650.
\end{align*}
$$
I have left out the calculation for the entries of $(I - A)^{-2}$ because it is quite tedious and doesn’t look as nice as $(I - A)^{-1}$ does, but any program (or human) that can do matrix operations will confirm the result. Note that I’ve used the fact that $\sum_{n\geq 0}A^n = (I - A)^{-1}$ and $\sum_{n\geq 0} (n + 1)A^n = (I - A)^{-2}$ for any matrix, $A$, whose spectral radius is less than $1$.