Stories
Slash Boxes
Comments

SoylentNews is people

posted by martyb on Monday September 16 2019, @01:48PM   Printer-friendly
from the COBOL-is-often-fractionally-better dept.

https://medium.com/@bellmar/is-cobol-holding-you-hostage-with-math-5498c0eb428b

Face it: nobody likes fractions, not even computers.

When we talk about COBOL the first question on everyone's mind is always Why are we still using it in so many critical places? Banks are still running COBOL, close to 7% of the GDP is dependent on COBOL in the form of payments from the Centers for Medicare & Medicaid Services, The IRS famously still uses COBOL, airlines still use COBOL (Adam Fletcher dropped my favorite fun fact on this topic in his Systems We Love talk: the reservation number on your ticket used to be just a pointer), lots of critical infrastructure both in the private and public sector still runs on COBOL.

Why?

The traditional answer is deeply cynical. Organizations are lazy, incompetent, stupid. They are cheap: unwilling to invest the money needed upfront to rewrite the whole system in something modern. Overall we assume that the reason so much of civil society runs on COBOL is a combination of inertia and shortsightedness. And certainly there is a little truth there. Rewriting a mass of spaghetti code is no small task. It is expensive. It is difficult. And if the existing software seems to be working fine there might be little incentive to invest in the project.

But back when I was working with the IRS the old COBOL developers used to tell me: "We tried to rewrite the code in Java and Java couldn't do the calculations right."

[Ed note: The referenced article is extremely readable and clearly explains the differences between floating-point and fixed-point math, as well as providing an example and explanation that clearly shows the tradeoffs.]


Original Submission

 
This discussion has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 0) by Anonymous Coward on Monday September 16 2019, @05:46PM (7 children)

    by Anonymous Coward on Monday September 16 2019, @05:46PM (#894720)

    The problem with using floating point versus decimal is worse than that, even.
    Did you know that floating point cannot exactly represent a value as common as 0.10 ?
    *This* is the major problem: A particular number base has many fractions that it CANNOT represent exactly as digits after a radix point.
    Floating point is base 2. It can't exactly represent 0.10 or 0.01, etc.
    Decimal (base 10) can't exactly represent 1/3, for example.
    The problem is that currencies are decimal numbers, and arithmetic on currencies must be exact, not "pretty close."
    This is why there is no alternative to using decimal numbers for financial calculations.

  • (Score: 2) by theluggage on Monday September 16 2019, @09:14PM (6 children)

    by theluggage (1797) on Monday September 16 2019, @09:14PM (#894820)

    Did you know that floating point cannot exactly represent a value as common as 0.10 ?

    Did you know that either floating point or integer can represent $0.10 with absolute accuracy as "10 cents" or "10,000,000 microcents"? Which is really all the "fixed point" arithmetic in COBOL does. Or, as anybody worth their salt doing scientific/technical programming knows, you try and avoid things like repeatedly adding small fractional numbers to larger ones, think about whether the order of operations is going to produce intermediate results that might exceed the range/precision of the number format and never, ever test floating point values for exact equality (...if you don't know what an appropriate margin of error is in the context, your results are invalid anyway). Or, just use a fixed point/arbitrary precision library (available for most languages, and built in to some - e.g. Python if that fills your hovercraft with eels).

    The problem is that currencies are decimal numbers, and arithmetic on currencies must be exact, not "pretty close."

    The problem is that some people* doing arithmetic on currencies have no concept of errors or error propagation [utoronto.ca] meaning that, scientifically speaking, most of their results are completely meaningless, because they don't cite errors.

    When I fill in my tax form (at least in the UK), the rule is that you round numbers up or down to the nearest £1, so every figure has a potential error of +/- £0.50. Lets say your taxable income is the result of adding or subtracting, say, 50 such figures - the combined margin of error on that sum is the square root of the sum of the squares of the errors - sqrt(50 * 0.50^2) - so, about £3.50. (There are other rules if figures get multiplied, divided or raised to powers). So if the total income based on my tax form comes to £12345 what it really means is that my actual income was £12345 +/- £3.50, so probably somewhere between £12341.50 and £12347.50 (...and could be outside that range if lady luck hiccups and my figures include a preponderance of £0.99s).

    So, on top of that, if a decimal/binary conversion error makes the total £12344.9999998 , it is completely irrelevant because that second 4 and anything to the right of it are garbage anyway. More practically, if you scale everything up by, say, 100 (or 128 if you're a smartarse) you'll get completely accurate results (and if you know how much to scale up by, they might also be correct rather than just accurate measurements of noise)

    But it's tax so the right answer is right by definition? Nope - several points on my return where I have to do my own calculations and no clear guidance on at which stage to do the rounding, and virtually all of the figures involved have been rounded to the nearest penny somewhere along the lines. There's no "right to the nearest 0.01" answer to the bottom line.

    Know that (possibly urban mythical) scam where the disgruntled bank worker diverts all the rounded-off 0.5 cents to their account and claims its a victimless crime? They may have a point because their "fortune" would be less than the margin of error in the bank's bottom line and (scientifically speaking) insignificant.

    NB: please be lenient if I've invoked Muphrey's Law (sic) here - its 30 years or so since I last looked a partial differential in the face...

    * I'm not saying that this is totally unknown by serious economists doing serious economics, and maybe it is taken into account by the accounting hard core, but it is Physics/Engineering 101 that is (or should be) drummed into anybody taking a numerate science beyond high school level.

    • (Score: 0) by Anonymous Coward on Monday September 16 2019, @11:01PM (5 children)

      by Anonymous Coward on Monday September 16 2019, @11:01PM (#894866)

      Your scaled integer solution (storing everything as an integer penny amount, for example) works when you are doing simple sums, differences, and integer multiplications. It fails everywhere everywhere else.

      integer + integer -> integer
      integer - integer -> integer
      integer * integer -> integer
      integer / integer -> real

      As a consequence of the above rules,
      it is also true that:

      integer * (fraction) -> real

      So your scaled integer system is not in any way a general solution.
      Sure, you could add routines to handle the operations I noted above that result in real numbers with more code, but at that point you have just re-invented decimal arithmetic.
      I guess you could buy yourself some "time" to keep using scaled integers by scaling to a large decimal number, you still run into rounding issues which eventually will lead to accumulated
      inaccuracies, plus it's a pain and errorprone to manage the decimal point "outside of the code."

      • (Score: 1) by jurov on Tuesday September 17 2019, @12:01AM (2 children)

        by jurov (6250) on Tuesday September 17 2019, @12:01AM (#894897)

        We are not talking about general solutions for everything.

        We are talking about money amounts and explicit rounding. With scaled integers, you can sit down with the formula and determine the required precision beforehand, only basic arithmetic understanding is required.

        Say you need to apply some % discount. Well then, with scaled integers: multiply the price by (100 - discount), then if you want rounding up, add 1 and lastly, divide by 100 to get discounted price (assuming integer division rounds down).

        With floating point numbers it is altogether unclear if we can rely on this approach. That's the point of this article and whole discussion.

        • (Score: 0) by Anonymous Coward on Tuesday September 17 2019, @01:23AM (1 child)

          by Anonymous Coward on Tuesday September 17 2019, @01:23AM (#894935)

          Or you just use a language with full support for decimal arithmetic and happily and quickly get on with writing the REST of your program, secure in the knowledge that basic arithmetic will be correct. There is no excuse to do it any other way unless you are writing embedded systems code for an 8 bit processor or Wall Street high frequency transaction systems. Stop writing brittle code!

          • (Score: 2) by theluggage on Tuesday September 17 2019, @10:40AM

            by theluggage (1797) on Tuesday September 17 2019, @10:40AM (#895093)

            Or you just use a language with full support for decimal arithmetic and happily and quickly get on with writing the REST of your program, secure in the knowledge that basic arithmetic will be correct.

            Well, that includes just about every serious language in use today - if decimal math isn't built into the language (Java, Python), there will be a library (C/C++). So it is certainly not an excuse for sticking with COBOL.

            Stop writing brittle code!

            It is code that ignores appropriate precision that is brittle. If your algorithm would - in binary - magnify the difference between 0.1 and 0.09999999999999012964 into a significant bottom line error, then it may well do the same - in decimal - with the different between 1/6 and 0.166666666666666667, let alone the +/- 0.005 margin of error in every input value rounded to the nearest cent.

      • (Score: 2) by theluggage on Tuesday September 17 2019, @09:31AM (1 child)

        by theluggage (1797) on Tuesday September 17 2019, @09:31AM (#895085)

        Your scaled integer solution (storing everything as an integer penny amount, for example) works when you are doing simple sums, differences, and integer multiplications. It fails everywhere everywhere else.

        So does decimal, which can no more represent every real number than binary can. E.g. 1/6 = 0.166666...

        • (Score: 0) by Anonymous Coward on Tuesday September 17 2019, @11:59AM

          by Anonymous Coward on Tuesday September 17 2019, @11:59AM (#895099)

          Decimal numbers are used to represent currency.
          Currency is not arbitrary fractions, but short, terminating decimal fractions.
          For example, we talk of dollars and cents, and those cents are integers in themselves.
          It is this domain where you need to use decimal arithmetic.
          The fact that 1/6 doesn't have a terminating decimal representation (your example) is not limiting because in business calculations typically you use either short, terminating fractions in decimal -or- if you do something like divide by 6, you still want the math done correctly for your decimal numbers and rounded correctly as well.
          There are multiple rounding algorithms used in finance, and each depends on a decimal representation in the fraction, so rounding ALONE requires decimal.
          You can't use a scaled integer representation and get correct rounding.