Need some Mathematicians, maybe Programmers

The Hero · March 2007

I need help finishing up a function.

Lets say I have some positive numbers, some 0s, and some negative numbers as different inputs. Language is C++.

If a number input is positive, I want to return a 1. If a number is negative, I want to return a -1, and if a number is 0, I want to return a 0. I want to do this mathematically, no conditionals, because I'm convinced it could probably be faster that way (for our purposes). Am I technically correct in this line of thinking?

Unfortunately, when looking at a graph of this function, it looks kind of like something of a piece wise function. So I have no idea if a clean function can be created to return these outputs.

Some relevant info, the inputs will always be greater than or equal 3 or less than or equal to -3, unless it's 0.

One of my ideas was doing something with binary representations.

Hlubocky · March 2007

If you want to do this mathematically, why can't you just divide the input by the absolute value of the input and return that? This will normalize the input to the desired range while preserving the original sign.

Deswa · March 2007

You would run into trouble when you divide by zero though.

Edit:

Assembly is awesome. I'd know how to deal with everything but zero with that.

iconduck · March 2007

Why write it in C++? If you want SUPER DUPER UNBEATABLE PERFORMANCE!!1! you should be writing in assembly like a real man.

Seriously though, don't try to be cute. Compilers are very good at optimizing this sort of thing these days, so all you'll really be doing is making your code unreadable and possibly buggy. Your binary tricks will have to take into account stuff like the size of an integer on your machine, and that can get tricky in a hurry and result in more code than if you had just done it the simple way in the first place.

Hlubocky · March 2007

Deswa wrote: »

You would run into trouble when you divide by zero though.

Heh, that is an embarrassing oversight! I forget how negative numbers work with C++ and bitwise operations, but what if you took the input and did a bitwise AND with the number 1?

Deswa · March 2007

Hlubocky wrote: »

Deswa wrote: »

You would run into trouble when you divide by zero though.

Heh, that is an embarrassing oversight! I forget how negative numbers work with C++ and bitwise operations, but what if you took the input and did a bitwise AND with the number 1?

Edit: I dont think the following would work.

What about a bitwise & 10...01 for everything. The only downside is you do get two results for your negatives and they wont be 1,0,-1.

Jimmy King · March 2007

Hlubocky wrote: »

Deswa wrote: »

You would run into trouble when you divide by zero though.

Heh, that is an embarrassing oversight! I forget how negative numbers work with C++ and bitwise operations, but what if you took the input and did a bitwise AND with the number 1?

Should work. Signed int (required to have negative numbers) uses the most significant bit to determine if the number is positive or negative.

well... not sure you'd AND it against number 1, you'd probably AND it against the max signed int value after checking the most significant bit to see if it's 0 or 1. Then you'd just AND the rest of it and if you get all 0's then the original number was 0 otherwise the original number was anything else.

I'm really not sure this is going to end up any faster or better than just using you're standard conditionals and < and > checks, though. There's still going to be conditionals in this method to determine if it was 0 or otherwise, if negative or otherwise, etc. In other words, technically possible but otherwise pointless unless there's a lot more to this app and this is just a very dumbed down and incomplete example of what is being done.

Sir Red of the Manti · March 2007

iconduck wrote: »

Why write it in C++? If you want SUPER DUPER UNBEATABLE PERFORMANCE!!1! you should be writing in assembly like a real man.

Seriously though, don't try to be cute. Compilers are very good at optimizing this sort of thing these days, so all you'll really be doing is making your code unreadable and possibly buggy. Your binary tricks will have to take into account stuff like the size of an integer on your machine, and that can get tricky in a hurry and result in more code than if you had just done it the simple way in the first place.

Unless he's throwing the program on a microcontroller and intends to process several thousand requests per second, he should be content with C++. That, and it looks like he has to use the language for his task.
As for compiling, would dividing by 0 produce an undefined result, or would you just crash your program at that point?

Jimmy King · March 2007

Sir Red of the Manti wrote: »

iconduck wrote: »

Why write it in C++? If you want SUPER DUPER UNBEATABLE PERFORMANCE!!1! you should be writing in assembly like a real man.

Seriously though, don't try to be cute. Compilers are very good at optimizing this sort of thing these days, so all you'll really be doing is making your code unreadable and possibly buggy. Your binary tricks will have to take into account stuff like the size of an integer on your machine, and that can get tricky in a hurry and result in more code than if you had just done it the simple way in the first place.

Unless he's throwing the program on a microcontroller and intends to process several thousand requests per second, he should be content with C++. That, and it looks like he has to use the language for his task.
As for compiling, would dividing by 0 produce an undefined result, or would you just crash your program at that point?

I've only ever done it in C and it crashes the app with an error along the lines of "Divide by zero error". Might could stick it in a try{} block and catch{} it, though, I've never attempted that to see if it throws a proper exception or if it just craps out.

ecco the dolphin · March 2007

Assuming integers, assembly would be great : bit shift left one bit, then (in this order):

1.) Carry flag set -> Negative
2.) Zero flag set -> Zero
3.) Positive

None of this fancy mathematical stuff. Just one bit shift, and the information's there.

Even floating point numbers have their sign bits, if in IEEE representation.

But anyway, back to the problem at hand:

Are you sure you want to do this mathematically? Because if you're returning 1, 0 and -1 from a function it sounds to me that you're going to be using the results in a conditional manner anyway. What you might want to do is make your function inline, turn on optimisations, and let the optimiser handle it.

Spackle · March 2007

well, if you're ok with 1 conditional, just have an if statement that checks of the input is equal to 0 and if so, return 0.

If not, move onto your +/- check logic which could be mathematical based.

The Hero · March 2007

Anybody know if theres any programs where you can create a graph and have the computer output a function for that graph?

The graph of a function could look something like a flat line at y =1 when x > 3, a flat line at y=-1 when x < -3, and in the area between 3 and -3 there is simply a line that can do anything but it must cross 0.

There MUST be an equation for that, it passes the definition of a function.

Voro · March 2007

I don't know about programs to provide the function, but what you're describing sounds like tanh, with a bit of rounding. Always returns 0 when 0 is passed to it, has a limit of 1 when passed a positive number, and has a limit of -1 when passed a negative number.

ecchi · March 2007

Trig functions are slow. Making it an inline conditional is really the way to go -- if absolutely necessary, it can be optimized later if you can determine that this function is what causes the slowdown in the first place.

The Hero · March 2007

meh.

Can't believe this.

I guess I'll have to use a conditional. If input is 0 return 0 else, return Input divided by absolute value of input.

But that blows

dsplaisted · March 2007

The Hero wrote: »

There MUST be an equation for that, it passes the definition of a function.

What makes you think that all functions can be expressed as equations?

dsplaisted · March 2007

The Hero wrote: »

meh.

Can't believe this.

I guess I'll have to use a conditional. If input is 0 return 0 else, return Input divided by absolute value of input.

But that blows

Are you familiar with the conditional operator in C++? You could write it as

return (val == 0) ? 0 : ((val > 0) ? 1 : -1);

I'm not convinced that this will compile to code that will be faster than simple if statements. But I have read that it generally won't use branches when compiled.

Why is it so important that this code be as fast as possible? I have a feeling it won't matter as much as you think.

Bama · March 2007

A < or > comparison should be as fast as an add, and possibly faster than a divide. Definitely faster than a divide and an absolute value call. You shouldn't be worried about the comparisons for performance reasons. The branches can cost you performance on pipelined processors but it's probably not affecting you that much.

Spackle · March 2007

I don't know if the conditional is faster but it sure makes for some succinct single line code that is quite functional.

The above code states: if val equals 0, return 0 otherwise if val is greater than 0 return 1 else return -1

The Hero · March 2007

I'll probably have to end up using a conditional in there.

Edit: Actually I forgot probability of a 0 case happening is less than a 1 or -1 case, so really its:

return ( (n < 0) ? -1 : ((n > 0) ? 1 : 0) ) ;

But now I'm gonna have to store n in memory...

But if I don't store n in memory the final complete function can look something like

int8 s(char x){

return ( (((x-4)*(x-8)) < 0) ? -1 : ((((x-4)*(x-8)) > 0) ? 1 : 0) );

}

Speed is important because it will be run by small machines with low bandwidth across the internet, tens of thousands of times, possibly per second.

Smasher · March 2007

What sort of machines exactly are you talking about? What is your program in general supposed to do? Some more details would clarify things a bit.

I get the feeling that you're missing the forest for the trees here. Your program must be doing something other than this transformation, whether it's writing the results to a file or sending them over the internet or using them internally for some other purpose. Whatever's happening, those other things are going to be using far more processor time than a simple function like this. There's an axiom that 90% of the time is spent in 10% of your code, and thus if you optimize at all (often it's best not to, as you end up spending too much time for too little gain) you should focus on that 10%. I feel pretty confident in saying that this function is not what you should be focusing on.

ecco the dolphin · March 2007

Tell you what - at the end of the day, a lot of it comes down to variables that we don't know.

Why don't you benchmark to see which is faster? See if it makes any difference at all?

Do, say, one million iterations with one method (e.g. divide by absolute value), another million with another (e.g. conditional) and so on, and measure it.

We can talk all day about it, but talk is cheap. Measurements mean a bit more.

Philodox · March 2007

I'd pay attention to what Smasher has to say. "Premature optimization is the root of all evil." That statement so very true. It's almost certain that there are many other parts of your program that will perform much much worse than a conditional expression to do what you want. If you benchmark your code and you find out that this particular segment is performing wayyyyy too slowly then you can look at ways of improving it. As it stands stick with what's simple and easy to read.

fforde · March 2007

For the love of God don't use the conditional operator (Or turnary operator? That's how I learned it). The compiler treats it exactly the same way as an if then else block and it does nothing but muddy up your code. It gets even worse when you throw it inside an if or a for clause. Fewer lines of code does not always mean better code. Your basically sacrificing readability so that you can impress someone with your knowledge of superfluous language features. It wont perform better.

As far as the whole idea of optimizing this function, you should really make sure this is where your slow down is. Regardless of how you optimize this function, beyond a traditional if then else you are talking about saving a handful of nanoseconds per execution. You wouldn't see any speed increase unless you were executing this function hundreds of thousands or maybe even millions of times, and then you are talking about saving only a handful of seconds, which is probably a very small fraction of your total run time.

Senjutsu · March 2007

Philodox wrote: »

I'd pay attention to what Smasher has to say. "Premature optimization is the root of all evil." That statement so very true. It's almost certain that there are many other parts of your program that will perform much much worse than a conditional expression to do what you want. If you benchmark your code and you find out that this particular segment is performing wayyyyy too slowly then you can look at ways of improving it. As it stands stick with what's simple and easy to read.

Yeah, I was going to mention more or less exactly the same thing. The chances of any of these other techniques being more than marginally faster than an if statement aren't very high (you're saving maybe 1 or 2 cycles, tops), so unless you've already got the program written and profiled, know that this function is your absolute limiter and infinitesimal gains are worth it, this whole thing is just an exercise code obfuscation. Write, then profile, and only then consider spot optimization if it proves necessary. The first rule of optimization is Don't.

Senjutsu · March 2007

The Hero wrote: »

I'll probably have to end up using a conditional in there.

Edit: Actually I forgot probability of a 0 case happening is less than a 1 or -1 case, so really its:

return ( (n < 0) ? -1 : ((n > 0) ? 1 : 0) ) ;

But now I'm gonna have to store n in memory...

But if I don't store n in memory the final complete function can look something like

int8 s(char x){

return ( (((x-4)*(x-8)) < 0) ? -1 : ((((x-4)*(x-8)) > 0) ? 1 : 0) );

}

Speed is important because it will be run by small machines with low bandwidth across the internet, tens of thousands of times, possibly per second.

You realize terse code has no effect on speed right? Ternary if is treated more or less the same as an if()else; statement by the compiler. It's likely the compiler may create a temporary variable to cache your calculation in the later statement anyways, so the the memory "gain" is meaningless.

Honestly, with the result caching, the two subtractions and the mult in the latter example, I'm not convinced it won't run slower than the former. straight n < 0? is a single cycle instruction on many architectures.

Kambing · March 2007

Echoing what other people have said: profile, then spot-optimize if necessary. Since it sounds like wire transmission is the long pole in your app, I recommend focusing your optimization efforts around packet size and transmission rate in order to maximize perf.

fforde wrote:

For the love of God don't use the conditional operator (Or turnary operator? That's how I learned it). The compiler treats it exactly the same way as an if then else block and it does nothing but muddy up your code. It gets even worse when you throw it inside an if or a for clause. Fewer lines of code does not always mean better code. Your basically sacrificing readability so that you can impress someone with your knowledge of superfluous language features. It wont perform better.

In an unoptimized setting, the conditional block will produce different code than the ternary operator. Consider the snippet:

#include <math.h>

int f1(int x)
{
    if (x > 0) {
        return 1;
    } else if (x < 0) {
        return -1;
    } else /* x == 0 */ {
        return x;
    }
}

int f2(int x)
{
    return (x > 0) ? 1 : ((x < 0) ? -1 : x);
}

int f3(int x)
{
    if (x == 0) { return x; } else { return abs(x) / x; }
}

Compiling this with VC++ 2005 (/Zi) gives you the following assembly:

?f1@@YAHH@Z:
  10001030: 55                 push        ebp
  10001031: 8B EC              mov         ebp,esp
  10001033: 83 7D 08 00        cmp         dword ptr [ebp+8],0
  10001037: 7E 09              jle         10001042
  10001039: B8 01 00 00 00     mov         eax,1
  1000103E: EB 12              jmp         10001052
  10001040: EB 10              jmp         10001052
  10001042: 83 7D 08 00        cmp         dword ptr [ebp+8],0
  10001046: 7D 07              jge         1000104F
  10001048: 83 C8 FF           or          eax,0FFFFFFFFh
  1000104B: EB 05              jmp         10001052
  1000104D: EB 03              jmp         10001052
  1000104F: 8B 45 08           mov         eax,dword ptr [ebp+8]
  10001052: 5D                 pop         ebp
  10001053: C3                 ret

?f2@@YAHH@Z:
  10001060: 55                 push        ebp
  10001061: 8B EC              mov         ebp,esp
  10001063: 83 EC 08           sub         esp,8
  10001066: 83 7D 08 00        cmp         dword ptr [ebp+8],0
  1000106A: 7E 09              jle         10001075
  1000106C: C7 45 FC 01 00 00  mov         dword ptr [ebp-4],1
            00
  10001073: EB 1B              jmp         10001090
  10001075: 83 7D 08 00        cmp         dword ptr [ebp+8],0
  10001079: 7D 09              jge         10001084
  1000107B: C7 45 F8 FF FF FF  mov         dword ptr [ebp-8],0FFFFFFFFh
            FF
  10001082: EB 06              jmp         1000108A
  10001084: 8B 45 08           mov         eax,dword ptr [ebp+8]
  10001087: 89 45 F8           mov         dword ptr [ebp-8],eax
  1000108A: 8B 4D F8           mov         ecx,dword ptr [ebp-8]
  1000108D: 89 4D FC           mov         dword ptr [ebp-4],ecx
  10001090: 8B 45 FC           mov         eax,dword ptr [ebp-4]
  10001093: 8B E5              mov         esp,ebp
  10001095: 5D                 pop         ebp
  10001096: C3                 ret

?f3@@YAHH@Z:
  100010A0: 55                 push        ebp
  100010A1: 8B EC              mov         ebp,esp
  100010A3: 83 7D 08 00        cmp         dword ptr [ebp+8],0
  100010A7: 75 07              jne         100010B0
  100010A9: 8B 45 08           mov         eax,dword ptr [ebp+8]
  100010AC: EB 0F              jmp         100010BD
  100010AE: EB 0D              jmp         100010BD
  100010B0: 33 C0              xor         eax,eax
  100010B2: 83 7D 08 00        cmp         dword ptr [ebp+8],0
  100010B6: 0F 9F C0           setg        al
  100010B9: 8D 44 00 FF        lea         eax,[eax+eax-1]
  100010BD: 5D                 pop         ebp
  100010BE: C3                 ret

However, throwing in optimizations (/O2) yields:

?f1@@YAHH@Z:
  10001030: 8B 44 24 04        mov         eax,dword ptr [esp+4]
  10001034: 85 C0              test        eax,eax
  10001036: 7E 06              jle         1000103E
  10001038: B8 01 00 00 00     mov         eax,1
  1000103D: C3                 ret
  1000103E: 7D 03              jge         10001043
  10001040: 83 C8 FF           or          eax,0FFFFFFFFh
  10001043: C3                 ret

?f2@@YAHH@Z:
  10001050: 8B 44 24 04        mov         eax,dword ptr [esp+4]
  10001054: 85 C0              test        eax,eax
  10001056: 7E 06              jle         1000105E
  10001058: B8 01 00 00 00     mov         eax,1
  1000105D: C3                 ret
  1000105E: 7D 03              jge         10001063
  10001060: 83 C8 FF           or          eax,0FFFFFFFFh
  10001063: C3                 ret

?f3@@YAHH@Z:
  10001070: 8B 4C 24 04        mov         ecx,dword ptr [esp+4]
  10001074: 85 C9              test        ecx,ecx
  10001076: 75 03              jne         1000107B
  10001078: 33 C0              xor         eax,eax
  1000107A: C3                 ret
  1000107B: 8B C1              mov         eax,ecx
  1000107D: 99                 cdq
  1000107E: 33 C2              xor         eax,edx
  10001080: 2B C2              sub         eax,edx
  10001082: 99                 cdq
  10001083: F7 F9              idiv        eax,ecx
  10001085: C3                 ret

Note that, when optimized, f1 and f2 are identical. Abs() is inlined in f3 for performance's sake, but it still suffers from a single jump to cover the x == 0 case and a nasty division to make the magical calculation. In any case, we're talking about a handful of instructions here between each of the possible solutions. Hopefully there are bigger fish to fry in terms of perf in your code.

As far as the readability of the ternary operator, you'll find that (as with most stylistic things) its a matter of taste. My take with this particular example is that since we're conditioning on the return value, the ternary operator says succinctly (always a good thing) what we're trying to express than the tri-way if-else block. Depending on the context, of course, it may or may not be the "best" way to express your intent.

The Hero · March 2007

Senjutsu wrote: »

You realize terse code has no effect on speed right? Ternary if is treated more or less the same as an if()else; statement by the compiler. It's likely the compiler may create a temporary variable to cache your calculation in the later statement anyways, so the the memory "gain" is meaningless.

Honestly, with the result caching, the two subtractions and the mult in the latter example, I'm not convinced it won't run slower than the former. straight n < 0? is a single cycle instruction on many architectures.

I know that, that's why I wanted to avoid having a conditional completely. Would have been nice to have a straight up math equation. But I guess it just wasn't possible. For my tastes, a ternary block is fine.

Assembly optimization will only be looked at after the whole project has been finished in C++.

As it currently stands, I don't know how the entire program works so I can't tell you if there is more important optimizations to be made elsewhere, and if there are then they are not my responsibility. I am only responsible for my parts, and I am the only one who knows how my area truly works. In fact no one man in the project probably knows how the whole thing works, everyone simply knows bits and pieces which get put together into a big picture.

Bama · March 2007

But still, you have to be working on something larger than this - right? I mean, did they just give you one hilariously simple function to write and now you're using the rest of your time to wring a couple of cycles out of it?

Will the compiler make the optimization of storing the product of those differences in a register instead of computing it twice? I've never actually looked at the assembly that gets produced when I write something like C++.

Also, it'd be nice to know your target architecture since the discussion is performance and it seems to be a specialized piece of software.

Penny Arcade

Quick Links

Need some Mathematicians, maybe Programmers

Posts