• 9 Posts
  • 148 Comments
Joined 2 years ago
cake
Cake day: July 19th, 2023

help-circle
  • I went into this with negative expectations; I recall being offended in high school that The Flashbulb was artificially sped up, unlike my heroes of neoclassical guitar and progressive-rock keyboards, and I’ve felt that their recent thoughts on newer music-making technology have been hypocritical. That said, this was a great video and I’m glad you shared it.

    Ears and eyes are different. We deconvolve visual data in the brain, but our ears actually perform a Fourier decomposition with physical hardware. As a result, psychoacoustics is a real and non-trivial science, used e.g. in MP3, which limits what an adversary can do to frustrate classification or learning, because the result still has to sound like music in order to get any playtime among humans. Meanwhile I’m always worried that these adversarial groups are going to accidentally propagate something like McCollough stripes, a genuine cognitohazard that causes edges to become color-coded in the visual cortex for (up to) months after a few minutes of exposure; it’s a kind of possible harm that fundamentally defies automatic classification by definition.

    HarmonyCloak seems like a fairly boring adversarial tool for protecting the music industry from the music industry. Their code is incomplete and likely never going to get properly published; again we’re seeing an industry-capture research group taking and not giving back to the Free Software community. I think all of the demos shown here are genuine, but he fully admits that this is a compute-intensive process which I estimate is going to slide back out of affordability by the end of 2026. This is going to stop being effective as soon as we get back into AI winter, but I’m not going to cry for Nashville.

    I really like the two attacks shown near the end, starting around 22:00. The first attack, if genuinely not audible to humans, is likely a Mosquito-style frequency that is above hearing range and physically vibrates the components of the microphone. Hofstadter and the Tortoise would be proud, although I’m concerned about the potential long-term effects on humans. The second attack is again adversarial but specific to models on home-assistant devices which are trained to ignore some loud sounds; I can’t tell spectrographically whether that’s also done above hearing range or not. I’m reluctant to call for attacks on home assistants, but they’re great targets.

    Fundamentally this is a video that doesn’t want to talk about how musicians actually rip each other off. The “tones and rhythms” that he keeps showing with nice visualizations have been machine-learnable for decades, ranging from beat-finders to frequency-analyzers to chord-spellers to track-isolators built into our music editors. He doubles down on copyright despite building businesses that profit from Free Software. And, most gratingly, he talks about the Pareto principle while ignoring that the typical musician is never able to make a career out of their art.



  • In practice, the behaviors that the chatbots learn in post-training are FUD and weasel-wording; they appear to not unlearn facts, but to learn so much additional nuance as to bury the facts. The bots perform worse on various standardized tests about the natural world after post-training; there are quantitative downsides to forcing them to adopt any particular etiquette, including speaking like a chud.

    The problem is mostly that the uninformed public will think that the chatbot is knowledgeable and well-spoken because it rattles off the same weak-worded hedges as right-wing pundits, and it’s addressed by the same improvements in education required to counter those pundits.

    Answering your question directly: no, slop machines can’t be countered with more slop machines without drowning us all in slop. A more direct approach will be required.


  • Yes, but the article’s not actually about that. It’s about Microsoft returning to the same datacenter-building schedule from a decade ago. Datacenters have a lag of about 3-5yrs depending on what’s inside them and where they’re located, so what we’re actually seeing is Microsoft projecting a relative reduction in overall usage. Note that among all the cancellations of notes and prospective claims, Microsoft isn’t walking back their two-decade nuclear-power deal with Westinghouse; they’re not destroying or reducing any existing capacity, just planning to build less. At risk of quoting Bloomberg:

    After a frantic expansion to support OpenAI and other artificial intelligence projects, [Microsoft] expects spending to shift from new construction to fitting out data centers with servers and other equipment.

    To the extent that the bubble is popping, Microsoft and other datacenter owners have to guess half a decade in advance when the bubble will pop, and if you take them at their word — that is, if we assume that they canceled these contracts with perfect foresight — then the bubble must have already popped in 2023-2024, and the market is experiencing coyote time because…? More likely, this is fallout from their ongoing breakup with OpenAI, who almost certainly begged Microsoft for so much compute (and definitely begged for too many nVidia GPUs!) that Microsoft had to adjust their datacenter plans. The bubble’s not done until OpenAI has exhausted all possible funding, say in late 2025 or early 2026 when Softbank and the Saudis realize that they’ve made a hilarious mistake.

    We’ve discussed this previously on awful.systems, both the value of nuclear-energy contracts and Microsoft’s retraction of intents.



  • As the classic film Network points out, the Saudi money is the end of the road; there aren’t any richer or more gullible large wealth funds who will provide further cash. So OpenAI could be genuinely out of greater fools financing after another year of wasting Somebody Else’s Money. This crash has removed “large” from the front of any other wealth fund that might have considered bailing them out. The Stargate gamble could still work out, but so far I think ti’s only transferred bag-holding responsibilities from Microsoft to Oracle.

    Another path is to deflate nVidia’s cap. At first blush, this seems impossible to me; nVidia’s business behavior is so much worse than that of competitors Intel or Imagination yet they have generally never lost faith from their core gaming laity, and as long as nVidia holds 20-30% of the gaming GPU market they will always have a boutique niche with cap at least comparable to e.g. their competitor AMD. But GPUs have been treated as currency among datacenter owners, and a market crash could devalue the piles of nVidia GPUs which some datacenter owners have been using as collateral for purchasing land, warehouses, machines, more GPUs, etc. nVidia isn’t the only bag-holder here, though, and since they don’t really want to play loan-shark and repossess a datacenter for dereliction, odds are good that they’ll survive even if they’re no longer king of the hill. The gold rush didn’t work out? Too bad, no returns allowed on shovels or snow gear.

    Side note: If folks just wanted to know whether tech in general is hurt by this, then yes, look at Tesla’s valuation. Tesla is such a cross-cutting big-ticket component of so many ETFs that basically every retirement scheme took a hit from Tesla taking a hit. The same thing will happen with nVidia and frankly retirement-fund managers should feel bad for purchasing so much of what any long-term investor would consider to be meme stocks. (I don’t hold either TSLA or NVDA stocks.)

    I hope this makes sense. I don’t post with this candor when I’m well-rested and sober.



  • Today on the orange site, an AI bro is trying to reason through why people think he’s weird for not disclosing his politics to people he’s trying to be friendly with. Previously, he published a short guide on how to talk about politics, which — again, very weird, no possible explanation for this — nobody has adopted. Don’t worry, he’s well-read:

    So far I’ve only read Harry Potter and The Methods of Rationality, but can say it is an excellent place to start.

    The thread is mostly centered around one or two pearl-clutching conservatives who don’t want their beliefs examined:

    I find it astonishing that anyone would ask, [“who did you vote for?”] … In my social circle, anyway, the taboo on this question is very strong.

    To which the top reply is my choice sneer:

    In my friend group it’s clear as day: either you voted to kill and deport other people in the friend group or you didn’t. Pretty obvious the group would like to know if you’re secretly interested in their demise.



  • Strange is a trooper and her sneer is worth transcribing. From about 22:00:

    So let’s go! Upon saturating my brain with as much background information as I could, there was really nothing left to do but fucking read this thing, all six hundred thousand words of HPMOR, really the road of enlightenment that they promised it to be. After reading a few chapters, a realization that I found funny was, “Oh. Oh, this is definitely fanfiction. Everyone said [laughing and stuttering] everybody that said that this is basically a real novel is lying.” People lie on the Internet? No fucking way. It is telling that even the most charitable reviews, the most glowing worshipping reviews of this fanfiction call it “unfinished,” call it “a first draft.”

    A shorter sneer for the back of the hardcover edition of HPMOR at 26:30 or so:

    It’s extremely tiring. I was surprised by how soul-sucking it was. It was unpleasant to force myself beyond the first fifty thousand words. It was physically painful to force myself to read beyond the first hundred thousand words of this – let me remind you – six-hundred-thousand-word epic, and I will admit that at that point I did succumb to skimming.

    Her analysis is familiar. She recognized that Harry is a self-insert, that the out-loud game theory reads like Death Note parody, that chapters are only really related to each other in the sense that they were written sequentially, that HPMOR is more concerned with sounding smart than being smart, that HPMOR is yet another entry in a long line of monarchist apologies explaining why this new Napoleon won’t fool us again, and finally that it’s a bad read. 31:30 or so:

    It’s absolutely no fucking fun. It’s just absolutely dry and joyless. It tastes like sand! I mean, maybe it’s Yudkowsky’s idea of fun; he spent five years writing the thing after all. But it just [struggles for words] reading this thing, it feels like chewing sand.


  • Anecdote: I gave up on COBOL as a career after beginning to learn it. The breaking point was learning that not only does most legacy COBOL code use go-to statements but that there is a dedicated verb which rewrites go-to statements at runtime and is still supported on e.g. the IBM Enterprise COBOL for z/OS platform that SSA is likely using: ALTER.

    When I last looked into this a decade ago, there was a small personal website last updated in the 1990s that had advice about how to rewrite COBOL to remove GOTO and ALTER verbs; if anybody has a link, I’d appreciate it, as I can no longer find it. It turns out that the best ways of removing these spaghetti constructions involve multiple rounds of incremental changes which are each unlikely to alter the code’s behavior. Translations to a new language are doomed to failure; even Java is far too structured to directly encode COBOL control flow, and the time would be better spent on abstract specification of the system so that it can be rebuilt from that specification instead. This is also why IBM makes bank selling COBOL emulators.






  • Here’s some food for thought; ha ha, only serious. What if none of this is new?

    If this is a dealbreaker today, then it should have been a dealbreaker over a decade ago, when Google first rolled out Knowledge panels, which were also often inaccurate and unhelpful.

    If this isn’t acceptable from Google, then it shouldn’t be acceptable from DuckDuckGo, which has the same page-one results including an AI summary and panels, nor any other search engines. If summaries are unacceptable from Gemini, which has handily topped the leaderboards for weeks, then it’s not acceptable using models from any other vendor, including Alibaba, High-Flyer, Meta, Microsoft, or Twitter.

    If fake, hallucinated, confabulated, or synthetic search results are ruining the Web today, then they were ruining the Web over two decades ago and have not lessened since. The economic incentives and actors have shifted slightly, but the overall goal of fraudulent clicks still underlies the presentation.

    If machine learning isn’t acceptable in collating search results today, then search engines would not exist. The issue is sheer data; ever since about 1991, before the Web existed, there has been too much data available on the Internet to search exhaustively and quickly. The problem is recursive: when a user queries a popular search engine, their results are populated by multiple different searchers using different techniques to learn what is relevant, because no one search strategy works at scale for most users asking most things.

    I’m not saying this to defend Google but to steer y’all away from uncanny-valley reactionism. The search-engine business model was always odious, but we were willing to tolerate it because it was very inaccurate and easy to game, like a silly automaton which obeys simple rules. Now we are approaching the ability to conduct automated reference interviews and suddenly we have an “oops, all AI!” moment as if it weren’t always generative AI from the beginning.




  • Why is Microsoft canceling a Gigawatt of data center capacity while telling everybody that it didn’t have enough data centers to handle demand for its AI products? I suppose there’s one way of looking at it: that Microsoft may currently have a capacity issue, but soon won’t, meaning that further expansion is unnecessary.

    This is precisely it. Internally, Microsoft’s SREs perform multiple levels of capacity planning, so that a product might individually be growing and requiring more resources over the next few months, but a department might be overall shrinking and using less capacity over the next few years. A datacenter requires at least 4yrs of construction before its capacity is available (usually more like 5yrs) which is too long of a horizon for any individual product…unless, of course, your product is ChatGPT and it requires a datacenter’s worth of resources. Even if OpenAI were siloed from Microsoft or Azure, they would still know that OpenAI is among their neediest customers and include them in planning.

    Source: Scuttlebutt from other SREs, mostly. An analogous situation happened with Google’s App Engine product: App Engine’s biggest users impacted App Engine’s internal capacity planning at the product level, which impacted datacenter planning because App Engine was mostly built from one big footprint in one little Oklahoma datacenter.

    Conclusion: Microsoft’s going to drop OpenAI as a customer. Oracle’s going to pick up the responsibility. Microsoft knows that there’s no money to be made here, and is eager to see how expensive that lesson will be for Oracle; Oracle is fairly new to the business of running a public cloud and likely thinks they can offer a better platform than Azure, especially when fueled by delicious Arabian oil-fund money. Folks may want to close OpenAI accounts if they don’t want Oracle billing them someday.


  • Reading through the docket, he is entitled to a hearing for relief and has a modicum of standing due to the threat of deportation from the USA to China; it’s not unreasonable to go to federal court. The judge was fairly courteous in referring him to the Pro Se Project a week ago. I’m a little jealous of how detached he is from reality; from 36(a) of the Amended Complaint:

    The Plaintiff asserts that completing a Ph.D. in Health Services Research significantly increases earning potential. The average salary for individuals with such a Ph.D. is $120,000 annually, compared to $30,000 annually in China, where Plaintiff’s visa cancellation forces him to seek employment. Over an estimated 30-year working career, this represents a lifetime income loss of $2,700,000.

    He really went up to the judge and said, “your honor, my future career is dependent on how well I prompt ChatGPT, but statistically I should be paid more if I have a second doctorate,” and the judge patted him on his head and gave him a lollipop for being so precocious.


  • Well, how do you feel about robotics?

    On one hand, I fully agree with you. AI is a rebranding of cybernetics, and both fields are fundamentally inseparable from robotics. The goal of robotics is to create artificial slaves who will labor without wages or solidarity. We’re all ethically obliged to question the way that robots affect our lives.

    On the other hand, machine learning (ML) isn’t going anywhere. In my oversimplification of history, ML was originally developed by Markov and Shannon to make chatbots and predict the weather; we still want to predict the weather, so even a complete death of the chatbot industry won’t kill ML. Similarly, some robotics and cybernetics research is still useful even when not applied to replacing humans; robotics is where we learned to apply kinematics, and cybernetics gave us the concept of a massive system that we only partially see and interact with, leading to systems theory.

    Here’s the kicker: at the end of the day, most people will straight-up refuse to grok that robotics is about slavery. They’ll usually refuse to even examine the etymology, let alone the history of dozens of sci-fi authors exploring how robots are slaves or the reality today of robots serving humans in a variety of scenarios. They fundamentally don’t see that humans are aggressively chauvinist and exceptionalist in their conception of work and labor. It’s a painful and slow conversation just to get them to see the word robota.