The theorem to be proven is: $\lim_{x\to \infty} (f(x)+f'(x))=0\implies \lim_{x\to \infty} f(x)=0 \land \lim_{x\to \infty} f'(x)=0$.
Here's one Lemma that'll be used in the proof of the above theorem: If $\lim f(x)$ and $\lim f'(x)$ both exist then $\lim f'(x)=0\tag 1$ (here and henceforth in the post, $\lim$ would mean $\lim_{x\to \infty}$)
The outline of GH Hardy's proof is as follows:
Case 1: $f'$ takes positive values for all large $x$.
It means that there is some $M$ post which if $x$ lies then $f'(x)>0$. It follows that $f$ is strictly increasing on $(M,\infty)$. So there are only two possibilities for $\lim f(x)$ -either $\lim f(x)$ exists finitely or $\lim f(x)=+\infty$. Assuming the latter, it must be the case that $\lim f'(x)=-\infty$ (else the given hypothesis won't hold). But this is impossible in this case.
So $\lim f(x)$ must exist finitely, say the limit is $L$. Then to satisfy the given hypothesis $\lim f'(x)=-L$. By Lemma stated in $(1)$, this is possible only if $L=0$. So the theorem holds in this case.
Case 2: $f'$ takes negative values for all large $x$.
This case can be handled similar to the way Case 1 was handled above. The final conclusion from this case is "the theorem holds in this case too."
Now comes the last case that confuses me: The exact wording of this case can be found here in the accepted answer. But I believe that the case is equivalent to the following case:
Case 3: $f'$ oscillates between positive and negative values when $x$ is large.
Now, here's what GH Hardy says (please treat $\phi$ as $f$ here):
"If $x$ has a large value corresponding to a maximum or a minimum of $\phi(x)$, then $\phi(x)+\phi'(x)$ is small and $\phi'(x)=0$, so that $\phi(x)$ is small. A fortiori the other values of $\phi(x)$ are small when $x$ is large." $\tag 2$
I understand the spirit of what's being said (minimum and maximum values of $f(x)$ tend to $0$ as $x\to \infty$ so $f(x)$ should also tend to $0$ as $x\to \infty$) but I don't understand why that's true. To be more specific, I have the following confusion(s):
In the event that some large $x$ corresponds to a maximum or minimum, then it is true that $f'(x)=0$ there but if this maximum/minimum is local then I don't see why "A fortiori" in $(2)$ is valid. The case when $(2)$ indeed works would be a very optimistic one -there are infinitely many large $x$ which correspond to global maximum/minimum so that "a fortiori all other values are small" works.
What happens if f doesn't have any extremum when $x$ is large?
In this case, it can be said (by Darboux's theorem) that $f'$ is $0$ at infinitely many large $x$. But such $x$ will correspond necessarily to an extremum, can not be said. This is not surprising as $h:x\mapsto x^3$ defined on $(-2,2)$ has no local extremum at $0$ even though $f'(0)=0$. So how does $(2)$ take care of this case (confusion 2)?
Please help me understand what I am missing. Thanks.