Not just things connected to the internet. Radio-controlled things can also be hacked/hackable. Electronic devices are prone to EM interference. Air-gapped systems have flaws somewhere, even if the flaws come from the human operators (social engineering, humans are hackable). There's no such thing as a non-hackable thing.
dsilverz
I’ve also fooled around with LLMs and in my experience, they don’t perform well on uncommon things. If it’s barely in their dataset, they’ll struggle with the concept and fabricate something.
Me too. They hallucinate. And sometimes I learn things through these hallucinations, when I ask them about an uncommon thing. However, they won't give the uncommon thing, I'm the one who usually feeds the prompt with the uncommon thing for them to hallucinate. Indeed, what I'm seeking is likely the exact opposite of what's expected for LLMs: the extremely uncommon, close to complete hallucination and stochastic behavior.
A better tool would be traditional statistics, going through datasets and counting frequencies and you can find your hapax legomena precisely
I'm used to do it in a laughable "poorman's way" via Node.js + RegEx + JS key-value dictionary object (whose key is the token and the value is a number that increases as this token is found via interaction), downloading some JSON/TXT/CSV dataset, reading and parsing it, then iterating over its tokens. It consumes a lot of memory, time and CPU (yet I try to use a sleep/delay between N iterations in order to free the CPU from high loads). I know there are better ways, and a temperature/param-adjustable LLM seemed for me as a better way, hoping that there's some exception across the many LLMs publicly available that wouldn't discard hapaxes.
And I mean some linguists probably already studied this and you could also read their publications
The things I'm willing to discover and learn weren't/aren't so well studied. I mean, human knowledge is a really vast universe of concepts, names and ideas, some of them got buried by time (sometimes centuries or millennia). Someone has to dig them because they could hold value, knowledge value. One of my purposes with this inquiry over the unknown is to find these really forgotten ideas and concepts, things never studied before, and try and study them, learn about them. That's how things were rediscovered throughout the entire human history: treasures are buried by the passage of time, and a curious person digs them, and humanity gets to know them once again. And a potential source of knowledge lurking in oblivion is the big data, or big datasets.
Thanks, I'll take a look at them. While an LLM would bring autocompleted context (although possibly a hallucinated context, but it would work for me) about the hapax word/concept, I can also discover hidden hapax words in a dataset and then try to figure out their meaning.
Actually, it's training a self-driving humanoid robot that's supposed to climb stairs in order to terminate any potential John Connor that's inside a house upstairs.
Dead Internet Theory all over again.
I use Linux, but let’s be honest here: every operating system requires space to function properly. While Linux is known for taking up less storage space (especially distros such as Alpine), as you install things, it will demand space, and that includes future updates. With the recent additions of AI within Windows, 8GB is compatible with the size of neural weights or datasets (speaking as someone who has dealt with AI in a technical/development capacity). If they're registered as "Temporary files" from "old installation", I guess it has to do with some re-training of Windows AI tools. I'm not discussing the merit of those AI tools (whether they're useful or not to the final user), but AI tools, among all modern features, takes space.
Before opening the article, I was thinking of something really, really sophisticated involving high-pitched sound and microphones (e.g. coil whine modulation through I/O processes), electrical inductance and electromagnetic fields carefully modulated to directly interfere on CPU instructions, Van Eck Phreaking (something like TempestSDR but fancier), precision-grade voltage meters to try and identify ongoing CPU instructions through quick teeny-tiny microvolt fluctuations over the power grid but, no, it's the old fashioned way of malware transportation: portable disks.
For now, Brave. As for search engines, most of the time I've been using DDG (IIRC it's the default search engine on Brave) but sometimes I prepend ":g" to use Google for searching things that DDG cannot (yet). As a plus, sometimes I also use Marginalia (I set Brave to use Marginalia when I prepend a ":mgn") in order to search for (g)old content (such as blogosphere content, BBS List archives and so on). If I need to search something deeply, I use Ahmia.
A naive question of mine, but isn't using a browser/extensions that silently/transparently blocks cookies (such as Brave, but not just it) enough to fearlessly click "Accept All Cookies", since ultimately they would be pointless for the purpose of tracking (due to the browser's own cookie blocking capabilities)?
~~Media doesn't load here (both through my instance's web front, as well as through discuss.tchncs.de's web front)~~
Edit: never mind, I got it to load now, maybe it was a temporary lag. It remembers me these Ethernet testers (in such a way that it's capable of testing individual pins for electrical continuity), but digital and with more features:
It's actually not nice, not for local wildlife, for example. Biomes exist for a reason and if anything changes abruptly, evolution can't keep up with these changes, resulting in extinction of several species. Just like flowers are blooming in Antarctica, a rainy and green Sahara is as beautiful as a rose with thorns under its petals: really beautiful, but ominously dangerous.
Actually, the hotel manager could relocate guests from each Nth room to the (2×N)th (every even-numbered room), as there are infinity rooms. This way, there'll be as infinity free odd-numbered rooms as there are infinity booked guests. Sisyphus can then choose any odd room for himself and another for his boulder.