Stories
Slash Boxes
Comments

SoylentNews

SoylentNews is people

Sections

SoylentNews

A Journey Through the CPU Pipeline

posted by martyb on Saturday October 31 2015, @03:33AM

Phoenix666 writes:

It is good for programmers to understand what goes on inside a processor. The CPU is at the heart of our career.
What goes on inside the CPU? How long does it take for one instruction to run? What does it mean when a new CPU has a 12-stage pipeline, or 18-stage pipeline, or even a "deep" 31-stage pipeline?
Programs generally treat the CPU as a black box. Instructions go into the box in order, instructions come out of the box in order, and some processing magic happens inside.
As a programmer, it is useful to learn what happens inside the box. This is especially true if you will be working on tasks like program optimization. If you don't know what is going on inside the CPU, how can you optimize for it?

A primer for those with a less formal background.

Original Submission

This discussion has been archived. No new comments can be posted.

A Journey Through the CPU Pipeline | Log In/Create an Account | Top | 29 comments | Search Discussion

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.

Re:A classic argument.... Re:A classic argument.... (Score: 2) by martyb on Saturday October 31 2015, @02:38PM

by martyb (76) on Saturday October 31 2015, @02:38PM (#256893) Journal

Still, a background in assembly never hurt anyone intending to start a career in programming.
Most certainly agree. How is data stored in memory? Knowing something about the underlying hardware, I can, for example, understand how a C struct is laid out in memory and how pointers are implemented.
You're not going to do any big projects in assembler, nobody does that these days, and truth be known nobody ever did. (emphasis added)
I can only imagine your concept of a big project is different than mine. Unless you consider IBM's VM operating system to be a small project? It was written entirely in assembler, or at least the parts of it that I worked on — system config, scheduler, dispatcher, free storage manager, etc. Wikipedia has some background on IBM's BAL — Basic Assembly Language. [wikipedia.org] Though I do not have first-hand information, I have a strong suspicion that IBM's COBOL and FORTRAN compilers were also written in assembler.
Yeah you are going to develop in C or something a lot more high-level. But it is worth while to understand what is really going on. What you think is optimized code, might look like a lot different when it gets to the machine level.
I, too, have witnessed this. Sometimes, for whatever reason, the compiler just gets it wrong. A compiler is a program, and is subject to mistakes, as well. I have found and reported bugs in a couple of them. Still, for the vast majority of the time, compilers do a good enough job and much more quickly than hand-crafted assembler would often be. So, I'm entirely with you on this point.
Using a compiler that shall remain nameless, we use to look for optimization opportunities by having the the compiler generate the assembly code. Then we would measure the amount of assembly instructions of each high-level operation..... With a Ruler. By laying the listing out on the floor, and using a ruler to measure how many feet of assembly were generated, we could see hopelessly inefficient compilation. Just changing a little code, usually from complex computation statements to a series of discrete elementary operations made a huge difference in speed.

I appreciate the anecdote. And, yes, compilers occasionally make what seem to be poor choices in particular cases.
That said, it bears mentioning for those who might not be aware that there are space and time tradeoffs, e.g.: loop unrolling. To illustrate, here is some pseudo code where we assume that x is defined as an array of unsigned bytes:
for (i=0; i<=7; i++) x[i] = 0;

could be unrolled as:
x[0] = 0; x[1] = 0; x[2] = 0; x[3] = 0; x[4] = 0; x[5] = 0; x[6] = 0; x[7] = 0;

By removing increment and conditional branch instructions, one ultimately has fewer instructions to perform — it would run more quickly. So, the assembly listing would be longer, but the code would generally run more quickly.
On the other hand, one could define, say, an array of long ints (32 bits) defined to be located at the same address as x[] and one would need only two instructions to zero out those locations:
a[0] = 0L; a[4] = 0L;

Note, however, that if taken to extremes, one gets to the point that the additional size of the code exceeds what can fit in cache and costs a cache miss which could eliminate the time savings. As with most things, there are tradeoffs.

--
Wit is intellect, dancing.

Parent

Starting Score: 1 point

Karma-Bonus Modifier +1

Total Score: 2
Re:A classic argument.... (Score: 2) by frojack on Saturday October 31 2015, @07:36PM

by frojack (1554) on Saturday October 31 2015, @07:36PM (#256970) Journal

I can only imagine your concept of a big project is different than mine. Unless you consider IBM's VM operating system to be a small project? It was written entirely in assembler, or at least the parts of it that I worked on — system config, scheduler, dispatcher, free storage manager, etc. Wikipedia has some background on IBM's BAL — Basic Assembly Language. [wikipedia.org] Though I do not have first-hand information, I have a strong suspicion that IBM's COBOL and FORTRAN compilers were also written in assembler.
Having some exposure to compiler writing from a long ago contract, I would doubt much of (if any) a compiler was written in assembler. You usually start with some high level language (even something as *cough* "high level" as basic, on some existing platform, to evaluate your compiler source code and produce assembler output. You start small, read something in, say, look at the generated assembler code till it works good enough, "can" that code, and then start working on Writing something out. Building up a collection of library routines.
Before long, you are writing the compiler for your new language IN ITSELF. Very often, compiler developers would get away from any other language, say C, and end up writing their new FORTRAN or Algol compilers in FORTRAN or Algol, or whatever.
Often, these Compiler-Compilers are the best way to get from one platform to another. You just change the assembly instructions to those of the new processor.

--
No, you are mistaken. I've always had this sig.

Parent

Moderator Help

Two is not equal to three, even for large values of two.

Starting Score:	1		point
Karma-Bonus Modifier		+1

Total Score:		2

SoylentNews

SoylentNews is people

Navigation

Sections

SoylentNews

A Journey Through the CPU Pipeline

Re:A classic argument.... Re:A classic argument.... (Score: 2) by martyb on Saturday October 31 2015, @02:38PM

Re:A classic argument.... (Score: 2) by frojack on Saturday October 31 2015, @07:36PM