
I have read the answer found here, which considers the size of integers when doing comparisons and how that affects the basic cost of comparison.

I am trying to understand why each basic operation is assumed to take $O(1)$ time. For example, branches are generally considered to run in $O(1)$ time when in practice they usually don't.

In computer architecture, branches are considered among the most time-consuming and problematic parts of the code; this is one of the reasons a compiler unrolls loops, in order to reduce the number of branches the processor needs to execute.

For example, a simple branch (let's say if a > b) needs to do at least four things (a code sketch follows the list):

  1. Get a from memory (whether it is in cache, RAM, or on disk does not matter; cache misses and hits are a different field of study).
  2. Get b from memory.
  3. Compare the two values and select the proper outcome.
  4. Fetch a new value for the program counter from instruction memory, depending on the result of the comparison.
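
To make the four steps concrete, here is a minimal C sketch of the if a > b case (the function name and the swap body are just illustrative, not taken from anywhere in particular):

```c
#include <stdio.h>

/* Illustrative only: swap two ints if the first is larger. */
static void swap_if_greater(int *a, int *b) {
    if (*a > *b) {        /* steps 1-3: load a, load b, compare            */
        int tmp = *a;     /* step 4: the taken branch redirects execution  */
        *a = *b;          /*         into this block instead of past it    */
        *b = tmp;
    }
}

int main(void) {
    int x = 5, y = 3;
    swap_if_greater(&x, &y);
    printf("%d %d\n", x, y);   /* prints "3 5" */
    return 0;
}
```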

Now, getting the variables from RAM is something done everywhere, so let's not count that. But all modern processors load more than one instruction at a time from instruction memory (let's say, for our case, 100) so as to reduce instruction-memory accesses, and a branch usually leads to a different place in the instruction stream (let's say, if a > b, call a method that swaps them). That would mean branches take a lot more time than any other basic operation.

Also, on all modern processors multiplications and divisions take significantly more time than additions and subtractions, which is why, when multiplying or dividing by numbers that are powers of two, we should use logical bit shifting.
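
A minimal sketch of that rewrite, for unsigned values (with signed negative numbers, division and right-shifting round differently, and modern compilers usually perform this transformation themselves):

```c
#include <assert.h>

int main(void) {
    unsigned int n = 40;

    assert((n * 8) == (n << 3));   /* n * 2^3  ==  n shifted left by 3  */
    assert((n / 4) == (n >> 2));   /* n / 2^2  ==  n shifted right by 2 */
    return 0;
}
```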

I know that in CS theory, when evaluating an algorithm, we do not care. But in practice, when implementing an algorithm, the operations we use really affect the efficiency and complexity of our code.

So let's say we have three algorithms:

  1. $n$ additions
  2. $n$ multiplications
  3. $n$ branches

In theory, all three of these run in time $O(n)$, but in practice 1 is faster than 2, which is faster than 3.
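
A rough C sketch of the three variants (function names are mine; note that the loop control itself adds a compare-and-branch per iteration in every variant, so this is not a clean micro-benchmark):

```c
/* 1: n additions */
unsigned long run_additions(unsigned long n) {
    unsigned long acc = 0;
    for (unsigned long i = 0; i < n; i++)
        acc += i;
    return acc;
}

/* 2: n multiplications (wraps around on overflow, which is well-defined for unsigned) */
unsigned long run_multiplications(unsigned long n) {
    unsigned long acc = 1;
    for (unsigned long i = 1; i <= n; i++)
        acc *= i;
    return acc;
}

/* 3: n data-dependent branches */
unsigned long run_branches(const unsigned long *data, unsigned long n) {
    unsigned long count = 0;
    for (unsigned long i = 0; i < n; i++)
        if (data[i] & 1)
            count++;
    return count;
}
```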

John Demetriou
  • Branch mispredictions may be more expensive than most instructions (and disk/network IO is much more expensive yet), but they still take constant time. Note that $2^{100}! \in O(1)$. That said, I think your main confusion is about what asymptotic bounds on runtime functions say and what they (deliberately) don't say. I think this question can help (and may even be a duplicate). – Raphael Nov 05 '14 at 15:12

3 Answers


All complexity theory is relative to some model of computation. Typically, we consider models of computation where all basic operations take unit time because that's usually precise enough and, frankly, it's already difficult enough to do that kind of analysis.

Typically, running time analyses are only quoted asymptotically anyway. As long as there's a constant bound on the number of clock cycles required for any operation, saying that an algorithm takes $O(f(n))$ operations is exactly the same thing as saying that it takes $O(f(n))$ clock cycles: the difference is just in the constant hidden in the $O()$ notation.
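
To spell that out: if every basic operation completes within at most $c$ clock cycles for some constant $c$, then an algorithm that performs at most $f(n)$ operations takes

$$T_{\text{cycles}}(n) \le c \cdot f(n) = O(f(n))$$

clock cycles; the constant $c$ is simply absorbed into the $O(\cdot)$.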

Having said that, models of computation do vary in what operations they consider as being unit-cost. For example, ordinary Turing machines count moving the head one step as unit-cost; random access Turing machines can move the head farther at unit cost. People who study the complexity of matrix multiplication are often interested in the number of times you need to multiply matrix elements, rather than the total number of arithmetic operations, precisely because of your observation that multiplies are significantly more expensive than additions on a real CPU.
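
As a small illustration of that cost measure, here is an instrumented C sketch (the names and counters are mine): the naive cubic algorithm performs exactly $N^3$ element multiplications, and the same number of additions.

```c
#include <stdio.h>

#define N 3

/* Naive matrix multiplication, counting element multiplications and additions. */
void matmul(const double a[N][N], const double b[N][N], double c[N][N],
            long *mults, long *adds) {
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++) {
            double sum = 0.0;
            for (int k = 0; k < N; k++) {
                sum += a[i][k] * b[k][j];
                (*mults)++;
                (*adds)++;
            }
            c[i][j] = sum;
        }
}

int main(void) {
    double a[N][N] = {{1, 2, 3}, {4, 5, 6}, {7, 8, 9}};
    double b[N][N] = {{9, 8, 7}, {6, 5, 4}, {3, 2, 1}};
    double c[N][N];
    long mults = 0, adds = 0;
    matmul(a, b, c, &mults, &adds);
    printf("multiplications: %ld, additions: %ld\n", mults, adds);  /* 27 and 27 */
    return 0;
}
```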

But, again, many things are swept under the carpet by asymptotic analysis. Suppose you did produce a model in which branches are much more expensive than other operations. Fine but, if you have a loop that needs to be iterated $n$ times, where $n$ is an input, your compiler can only unroll a constant number of iterations of the loop. So, instead of doing $n$ branches, you do, say, $n/8$ branches: that's still $O(n)$.
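
A sketch of what such 8-way unrolling might look like in C (hypothetical function; in practice the compiler does this itself): the loop-control branch fires roughly $n/8$ times, yet the total work is still $O(n)$.

```c
/* Sum an array with the loop unrolled by a factor of 8. */
long sum_unrolled(const long *a, long n) {
    long s = 0;
    long i = 0;
    for (; i + 8 <= n; i += 8)            /* about n/8 branch decisions */
        s += a[i]     + a[i + 1] + a[i + 2] + a[i + 3]
           + a[i + 4] + a[i + 5] + a[i + 6] + a[i + 7];
    for (; i < n; i++)                    /* at most 7 leftover elements */
        s += a[i];
    return s;
}
```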

David Richerby
  • "that's usually precise enough" -- I think it's agreed upon that this is not precise at all, but also that that's okay. – Raphael Nov 05 '14 at 15:12

Actually, complexity theory has many "timing" and "cost" models, but these tend to be more advanced and are not typically outlined at earlier/introductory levels (where just the difficulty of asymptotic complexity analysis can be nontrivial or even difficult at times). Some take into account the complexity of "low-level" operations. For example:

  • Arbitrary-precision arithmetic. See, e.g., analyses of the complexity of the RSA algorithm or of factoring large numbers. Here arithmetic operations scale roughly as $O(n)$ or $O(n \log n)$ in the size of the numbers instead of having "unit cost" (a sketch appears at the end of this answer).

  • The RAM model of computation, whose complexity is not exactly equivalent to that of other standard models, e.g. Turing machines, for operations such as array access.

  • Cache-oblivious algorithms are designed to scale well with caches of arbitrary size.

  • Even circuits can be considered a basic model of computation, and they are not exactly identical in complexity to Turing machines and other models.

There are many others! So the answer is yes: CS is a vast field now and has many theoretical models for the complexity of computation, which are used according to context/relevance.
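
Regarding the first bullet, a rough C sketch of why arbitrary-precision arithmetic is not unit-cost: schoolbook addition of two $n$-digit numbers (stored here as equal-length base-10 digit arrays, least significant digit first, an assumption made for simplicity) already costs $O(n)$ digit operations.

```c
#include <stddef.h>

/* Add two n-digit numbers digit by digit; out must have room for n + 1 digits.
   Returns the number of digits in the result.                                  */
size_t bignum_add(const int *a, const int *b, size_t n, int *out) {
    int carry = 0;
    for (size_t i = 0; i < n; i++) {   /* O(n) digit additions, not O(1) */
        int d = a[i] + b[i] + carry;
        out[i] = d % 10;
        carry  = d / 10;
    }
    out[n] = carry;
    return carry ? n + 1 : n;
}
```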

vzn

What if someone came up with a new architecture (maybe something not based on transistor technology at all) which has different performance characteristics for its operations? For example: multiplication is now as fast as addition, and so on. Then you would need to create a new complexity theory for that particular architecture and update all the results based on it, which doesn't sound like theory anymore :)

Ankur
  • I am aware of that, yes (and for branches it is highly improbable), but how are we calculating true time complexity if we are not specific and the estimates differ from each other? (As per my last example with three different algorithms.) – John Demetriou Nov 05 '14 at 12:18
  • @JohnDemetriou Bear in mind that the phrase "time complexity" is misleading: it's not about time but about number of operations. – David Richerby Nov 05 '14 at 12:25
  • @DavidRicherby Yes, but it could evaluate two different algorithms as having the same time complexity while in fact they are completely different – John Demetriou Nov 08 '14 at 20:47
  • @JohnDemetriou Can you give an example of that? – David Richerby Nov 09 '14 at 09:01
  • @DavidRicherby I have one in my question: $n$ additions, $n$ multiplications, and $n$ branches. All three are $O(n)$ but each is faster than the next – John Demetriou Nov 09 '14 at 16:42
  • @JohnDemetriou But those are examples with the same complexity: $O(n)$ operations in each case and, unless those operations take non-constant time (not just that multiplication takes a different time from addition but that it takes a non-constant amount of time), they all have running time $O(n)$ clock cycles, too. – David Richerby Nov 09 '14 at 19:41
  • @DavidRicherby Yes, they are all $O(n)$ but run at completely different speeds. This is what I am trying to say – John Demetriou Nov 09 '14 at 19:55