trying to write a thread about polytopia but my images won't upload >:(. idk what i'm doing wrong, i've tried on both my desktop and my phone
TechTakes
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
it should be fixed… again. for some reason our image cache keeps getting into a state where it either stops accepting uploads or stops accepting requests at all. I plan to upgrade us to the latest version soon, but it’ll unfortunately involve a little bit of downtime: to upgrade pict-rs to a new point release, you have to run the migrate command, but it only works for the previous release. we’re two releases behind, so I have to custom package the in-between release just to get us there.
i see! thanks for all your work <3. I think i'll just write the thread after the upgrade, i got partially done and it started eating my images again so maybe this just isn't the moment
of course! re the images: uggh hell with it, I’m scheduling the maintenance and I’m gonna spend some time in the lead-up isolating a root cause for our breakage just in case the upgrade doesn’t fix it
Starting to think we're about at the point where you could make the best search engine on the market in these three easy steps:
- Search Wikipedia for whatever the user typed and show the top result first.
- Check if dot com, org, and net exist and show them in the order of popularity.
- End of page.
Remember how OAI claimed that O3 had displayed superhuman levels on the mega hard Frontier Math exam written by Fields Medalist? Funny/totally not fishy story haha. Turns out OAI had exclusive access to that test for months and funded its creation and refused to let the creators of test publicly acknowledge this until after OAI did their big stupid magic trick.
From Subbarao Kambhampati via linkedIn:
"𝐎𝐧 𝐭𝐡𝐞 𝐬𝐞𝐞𝐝𝐲 𝐨𝐩𝐭𝐢𝐜𝐬 𝐨𝐟 "𝑩𝒖𝒊𝒍𝒅𝒊𝒏𝒈 𝒂𝒏 𝑨𝑮𝑰 𝑴𝒐𝒂𝒕 𝒃𝒚 𝑪𝒐𝒓𝒓𝒂𝒍𝒍𝒊𝒏𝒈 𝑩𝒆𝒏𝒄𝒉𝒎𝒂𝒓𝒌 𝑪𝒓𝒆𝒂𝒕𝒐𝒓𝒔" hashtag#SundayHarangue. One of the big reasons for the increased volume of "𝐀𝐆𝐈 𝐓𝐨𝐦𝐨𝐫𝐫𝐨𝐰" hype has been o3's performance on the "frontier math" benchmark--something that other models basically had no handle on.
We are now being told (https://lnkd.in/gUaGKuAE) that this benchmark data may have been exclusively available (https://lnkd.in/g5E3tcse) to OpenAI since before o1--and that the benchmark creators were not allowed to disclose this *until after o3 *.
That o3 does well on frontier math held-out set is impressive, no doubt, but the mental picture of "𝒐1/𝒐3 𝒘𝒆𝒓𝒆 𝒋𝒖𝒔𝒕 𝒃𝒆𝒊𝒏𝒈 𝒕𝒓𝒂𝒊𝒏𝒆𝒅 𝒐𝒏 𝒔𝒊𝒎𝒑𝒍𝒆 𝒎𝒂𝒕𝒉, 𝒂𝒏𝒅 𝒕𝒉𝒆𝒚 𝒃𝒐𝒐𝒕𝒔𝒕𝒓𝒂𝒑𝒑𝒆𝒅 𝒕𝒉𝒆𝒎𝒔𝒆𝒍𝒗𝒆𝒔 𝒕𝒐 𝒇𝒓𝒐𝒏𝒕𝒊𝒆𝒓 𝒎𝒂𝒕𝒉"--that the AGI tomorrow crowd seem to have--that 𝘖𝘱𝘦𝘯𝘈𝘐 𝘸𝘩𝘪𝘭𝘦 𝘯𝘰𝘵 𝘦𝘹𝘱𝘭𝘪𝘤𝘪𝘵𝘭𝘺 𝘤𝘭𝘢𝘪𝘮𝘪𝘯𝘨, 𝘤𝘦𝘳𝘵𝘢𝘪𝘯𝘭𝘺 𝘥𝘪𝘥𝘯'𝘵 𝘥𝘪𝘳𝘦𝘤𝘵𝘭𝘺 𝘤𝘰𝘯𝘵𝘳𝘢𝘥𝘪𝘤𝘵--is shattered by this. (I have, in fact, been grumbling to my students since o3 announcement that I don't completely believe that OpenAI didn't have access to the Olympiad/Frontier Math data before hand.. )
I do think o1/o3 are impressive technical achievements (see https://lnkd.in/gvVqmTG9 )
𝑫𝒐𝒊𝒏𝒈 𝒘𝒆𝒍𝒍 𝒐𝒏 𝒉𝒂𝒓𝒅 𝒃𝒆𝒏𝒄𝒉𝒎𝒂𝒓𝒌𝒔 𝒕𝒉𝒂𝒕 𝒚𝒐𝒖 𝒉𝒂𝒅 𝒑𝒓𝒊𝒐𝒓 𝒂𝒄𝒄𝒆𝒔𝒔 𝒕𝒐 𝒊𝒔 𝒔𝒕𝒊𝒍𝒍 𝒊𝒎𝒑𝒓𝒆𝒔𝒔𝒊𝒗𝒆--𝒃𝒖𝒕 𝒅𝒐𝒆𝒔𝒏'𝒕 𝒒𝒖𝒊𝒕𝒆 𝒔𝒄𝒓𝒆𝒂𝒎 "𝑨𝑮𝑰 𝑻𝒐𝒎𝒐𝒓𝒓𝒐𝒘."
We all know that data contamination is an issue with LLMs and LRMs. We also know that reasoning claims need more careful vetting than "𝘸𝘦 𝘥𝘪𝘥𝘯'𝘵 𝘴𝘦𝘦 𝘵𝘩𝘢𝘵 𝘴𝘱𝘦𝘤𝘪𝘧𝘪𝘤 𝘱𝘳𝘰𝘣𝘭𝘦𝘮 𝘪𝘯𝘴𝘵𝘢𝘯𝘤𝘦 𝘥𝘶𝘳𝘪𝘯𝘨 𝘵𝘳𝘢𝘪𝘯𝘪𝘯𝘨" (see "In vs. Out of Distribution analyses are not that useful for understanding LLM reasoning capabilities" https://lnkd.in/gZ2wBM_F ).
At the very least, this episode further argues for increased vigilance/skepticism on the part of AI research community in how they parse the benchmark claims put out commercial entities."
Big stupid snake oil strikes again.
(oh no it's politics)
Trump's new cryptocurrency scheme is surprisingly forthright about being a pump & dump:
CIC Digital LLC, an affiliate of The Trump Organization, and Fight Fight Fight LLC collectively own 80% of the Trump Cards, subject to a 3-year unlocking schedule. CIC Digital LLC and Celebration Cards LLC, the owners of Fight Fight Fight LLC, will receive trading revenue derived from trading activities of Trump Meme Cards.
Essentially according to their own website, they started by selling 20%* of the tokens to the public, and over the next few years will... sell another 80% of the tokens to the public. To the moon!
* half of that they describe as "liquidity" instead of public distribution -- whatever that means.
My gut says that liquidity in this context means "making sure that there are tokens available to purchase for initial buyers" or in other words listing them on the market instead of distributing them at initial purchase price.
I read about this gross Robo Anne Frank LLM by a company called "School AI": Bluesky post (looks like via an activitypub bridge, but I can't be bothered to find the canonical link), News Article, School AI's website.
Gee it sure is weird how all these digital clones the AI companies keep coming up with all have the exact same (lack of a) personality.
So, the Wikipedia article about "prompt engineering" is pretty terrible. First source: OpenAI. Second: a blog. Third: OpenAI. Fourth: OpenAI's blog. ArXiv, arXiv, arXiv... 43 times. Hop on over to the Talk page, and we find this gem:
It is sometimes necessary to make assumptions to write an article (see WP:MNA).
Spoiler alert: that link doesn't justify anything. It basically advises against going off on tangents: There's no need to rehash the fact that evolution is a fact on every damn biology page. It does not say that Wikipedia should have an article on some creationist fantasy, like baraminology or flood geology, based entirely on creationist screeds that all cite each other.
I have spent the last half-hour in the angry dome
hells yeah it's time for some action - Drew DeVault is organizing a sit-in protest of Jack Dorsey's keynote at FOSDEM 2025.
The people of Muskogee will have their revenge on Jack Dorsey
eh? I don't see Jackie D's keynote in the schedule, did the threat of a sit-in make them delete it? https://fosdem.org/2025/schedule/ edit: oh, it's linked from Drew's post.
there it is, sammy has gone and said people are just prompting the model wrong (I recall we’ve had that bit said here earlier)
but in true sammy grift: you just need to be asking the right questions to trump intelligence. “why do you want to suck, as a human?” sammy asks, not understanding a moment of humanity
how do you even define "raw, intellectual horsepower" and how does it differ from knowing how to formulate questions mother fucker
"Raw, intellectual horsepower" means fucking an intellectual horse without a condom.
Oh, wait, that's rawdogging intellectual horsepower, my mistake.
shot:
Von Neumann arguably had the highest processor-type "horsepower" we know of plus his breadth of intellectual achievements is unparalleled.
chaser:
But imo Grothendieck is a better comparison point for ASI as his intelligence, while being strangely similar to LLMs in some dimensions
I never thought I'd say this but... don't slander category theory like that, compared to LLMs it's downright useful
oh my god
Pretty sure there's gonna be a contract somewhere that defines raw intelligence as "the amount of money you make for OpenAI."
the bit about it that I find subtly glorious (in how remarkably fuckwitted it is) is the baseline idea of “intellectual horsepower”
I’m not surprised that this is a view they (of the company that’s effectively going “just 12 more DCs bro it’ll be enough compute bro I promise bro just watch”) hold and consider in such a simple mechanism-rating scale
but it is funny as fuck
What they promise: intelligence systems, revolutionary agent which can take over tasks!
What we get: https://youtu.be/izazdBpraC8
This is extremely tangential to the areas of sneer interest, but seeing as this is the only technology related community I am in, I’m putting it here.
This song has been making the rounds on the charts/social media and I refuse to believe that it isn’t about the package management tool apt
I hope Toni Basil and the Ting Tings get big royalty cheques.
the writers of hey mickey have a writing credit on it, but the ting tings got shafted.
A shame. Seems like they were clearly inspired by it .... I guess you can't copyright a vibe.
yoooooooooooooo
of course it is one in a long tradition of tech-tangential songs, including this banger about the inevitable collapse of tech bubbles
I remember boten anna as a tech song example