Stories
Slash Boxes
Comments

SoylentNews is people

posted by janrinok on Tuesday November 25, @10:57PM   Printer-friendly

Is Matrix Multiplication Ugly?

A few weeks ago I was minding my own business, peacefully reading a well-written and informative article about artificial intelligence, when I was ambushed by a passage in the article that aroused my pique. That's one of the pitfalls of knowing too much about a topic a journalist is discussing; journalists often make mistakes that most readers wouldn't notice but that raise the hackles or at least the blood pressure of those in the know.

The article in question appeared in The New Yorker. The author, Stephen Witt, was writing about the way that your typical Large Language Model, starting from a blank slate, or rather a slate full of random scribbles, is able to learn about the world, or rather the virtual world called the internet. Throughout the training process, billions of numbers called weights get repeatedly updated so as to steadily improve the model's performance. Picture a tiny chip with electrons racing around in etched channels, and slowly zoom out: there are many such chips in each server node and many such nodes in each rack, with racks organized in rows, many rows per hall, many halls per building, many buildings per campus. It's a sort of computer-age version of Borges' Library of Babel. And the weight-update process that all these countless circuits are carrying out depends heavily on an operation known as matrix multiplication.

Witt explained this clearly and accurately, right up to the point where his essay took a very odd turn.

Here's what Witt went on to say about matrix multiplication:

"'Beauty is the first test: there is no permanent place in the world for ugly mathematics,' the mathematician G. H. Hardy wrote, in 1940. But matrix multiplication, to which our civilization is now devoting so many of its marginal resources, has all the elegance of a man hammering a nail into a board. It is possessed of neither beauty nor symmetry: in fact, in matrix multiplication, a times b is not the same as b times a."

The last sentence struck me as a bizarre non sequitur, somewhat akin to saying "Number addition has neither beauty nor symmetry, because when you write two numbers backwards, their new sum isn't just their original sum written backwards; for instance, 17 plus 34 is 51, but 71 plus 43 isn't 15."

The next day I sent the following letter to the magazine:

"I appreciate Stephen Witt shining a spotlight on matrices, which deserve more attention today than ever before: they play important roles in ecology, economics, physics, and now artificial intelligence ("Information Overload", November 3). But Witt errs in bringing Hardy's famous quote ("there is no permanent place in the world for ugly mathematics") into his story. Matrix algebra is the language of symmetry and transformation, and the fact that a followed by b differs from b followed by a is no surprise; to expect the two transformations to coincide is to seek symmetry in the wrong place — like judging a dog's beauty by whether its tail resembles its head. With its two-thousand-year-old roots in China, matrix algebra has secured a permanent place in mathematics, and it passes the beauty test with flying colors. In fact, matrices are commonplace in number theory, the branch of pure mathematics Hardy loved most."

[...] I'm guessing that part of Witt's confusion arises from the fact that actually multiplying matrices of numbers to get a matrix of bigger numbers can be very tedious, and tedium is psychologically adjacent to distaste and a perception of ugliness. But the tedium of matrix multiplication is tied up with its symmetry (whose existence Witt mistakenly denies). When you multiply two n-by-n matrices A and B in the straightforward way, you have to compute n2 numbers in the same unvarying fashion, and each of those n2 numbers is the sum of n terms, and each of those n terms is the product of an element of A and an element of B in a simple way. It's only human to get bored and inattentive and then make mistakes because the process is so repetitive. We tend to think of symmetry and beauty as synonyms, but sometimes excessive symmetry breeds ennui; repetition in excess can be repellent. Picture the Library of Babel and the existential dread the image summons.

G. H. Hardy, whose famous remark Witt quotes, was in the business of proving theorems, and he favored conceptual proofs over calculational ones. If you showed him a proof of a theorem in which the linchpin of your argument was a 5-page verification that a certain matrix product had a particular value, he'd say you didn't really understand your own theorem; he'd assert that you should find a more conceptual argument and then consign your brute-force proof to the trash. But Hardy's aversion to brute force was specific to the domain of mathematical proof, which is far removed from math that calculates optimal pricing for annuities or computes the wind-shear on an airplane wing or fine-tunes the weights used by an AI. Furthermore, Hardy's objection to your proof would focus on the length of the calculation, and not on whether the calculation involved matrices. If you showed him a proof that used 5 turgid pages of pre-19th-century calculation that never mentioned matrices once, he'd still say "Your proof is a piece of temporary mathematics; it convinces the reader that your theorem is true without truly explaining why the theorem is true."

If you forced me at gunpoint to multiply two 5-by-5 matrices together, I'd be extremely unhappy, and not just because you were threatening my life; the task would be inherently unpleasant. But the same would be true if you asked me to add together a hundred random two-digit numbers. It's not that matrix-multiplication or number-addition is ugly; it's that such repetitive tasks are the diametrical opposite of the kind of conceptual thinking that Hardy loved and I love too. Any kind of mathematical content can be made stultifying when it's stripped of its meaning and reduced to mindless toil. But that casts no shade on the underlying concepts. When we outsource number-addition or matrix-multiplication to a computer, we rightfully delegate the soul-crushing part of our labor to circuitry that has no soul. If we could peer into the innards of the circuits doing all those matrix multiplications, we would indeed see a nightmarish, Borgesian landscape, with billions of nails being hammered into billions of boards, over and over again. But please don't confuse that labor with mathematics.


Original Submission

 
This discussion was created by janrinok (52) for logged-in users only, but now has been archived. No new comments can be posted.
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • (Score: 5, Informative) by owl on Wednesday November 26, @04:21AM (2 children)

    by owl (15206) on Wednesday November 26, @04:21AM (#1425210)

    That's one of the pitfalls of knowing too much about a topic a journalist is discussing; journalists often make mistakes that most readers wouldn't notice but that raise the hackles or at least the blood pressure of those in the know.

    Gell-Mann Amnesia [epsilontheory.com]

    Starting Score:    1  point
    Moderation   +3  
       Informative=3, Total=3
    Extra 'Informative' Modifier   0  
    Karma-Bonus Modifier   +1  

    Total Score:   5  
  • (Score: 0) by Anonymous Coward on Wednesday November 26, @12:38PM (1 child)

    by Anonymous Coward on Wednesday November 26, @12:38PM (#1425228)
    I believe newspaper subscriptions etc are down. So maybe more and more people have realized that level of journalism is not worth paying for. Lots of newspapers have become propaganda arms too, which is even less worth paying for.

    You can get about as crap journalism for free elsewhere AND it might even have less propaganda bundled in.

    e.g. some writer in the BBC/NYT/etc is more likely to have an agenda/propaganda direction when writing on certain topics than some random person who has allegedly just captured raw footage on their phone.

     
    • (Score: 2) by aafcac on Wednesday November 26, @06:38PM

      by aafcac (17646) on Wednesday November 26, @06:38PM (#1425280)

      Essentially. A large part of the problem is that media ownership rules were relaxed so there's a good chunk of the papers printing the same stories. If I want news, I tend to go for local TV news as that seems to be the least corrupted form of news we've got at the moment. A bunch of the radio news is syndicated and whereas I used to work in a building with an active radio booth where they'd report on things like the weather and they could literally look out the window and see if it was significantly off, often times that sort of stuff is done out of a single studio in in Georgia.

      It's also part of why there's so many issues related to "fake news" there's a lot of legitimately fake news like a bunch of the coverage of Israel, but there'd be less of an issue if there were more outlets having to compete with each other the way they used to. Doing a large scale investigative report on things can take months to complete and that's after years of establishing sources. There's also the expense of having somebody at city hall for the press conferences in case something happens that day and the like. All of that costs money, but with there being so few news organizations left, there's not as much of an incentive to do so as the same outfit may very well own several different types of outlets in a given market.

      I'd subscribe to a paper if it wasn't so thin on actual substance and stuff that isn't available for free in a bunch of places.