21
submitted 1 hour ago* (last edited 1 hour ago) by [email protected] to c/[email protected]

Without paywall: https://archive.ph/XIAZy

[-] [email protected] 1 points 1 hour ago* (last edited 1 hour ago)

How does this analogy work at all? LoRA is chosen by the modifier to be low-rank to accommodate some desktop/workstation memory constraint, not because the other weights are “very hard” to modify if you happen to have the necessary compute and I/O. The development of LoRA is also largely directed by storage reduction (hence not too many layers modified) and preservation of generalizability (since training generalizable models is hard). The Kronecker product versions, in particular, were first developed in the context of federated learning, not for desktop/workstation fine-tuning (LoRA is, by the way, fully capable of modifying all weights; it is rather a technique to do so in a correlated fashion to reduce the size of the gradient update). And much development of LoRA happened in the context of otherwise fully open datasets (e.g. LAION), which are just not manageable in desktop/workstation settings.
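For readers who have not looked at the mechanics: the rank is a hyperparameter picked by whoever fine-tunes, and the low-rank factorization is precisely what keeps the gradient update and its storage small. A minimal sketch of the idea, assuming PyTorch (class and variable names here are illustrative, not from any particular library):

```python
# Minimal sketch of a LoRA layer: the frozen base weight W stays as-is, and a
# trainable rank-r update (B @ A) * (alpha / r) is added on top of it.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)        # base weights are frozen, not "hard to modify"
        d, k = base.out_features, base.in_features
        self.A = nn.Parameter(torch.randn(r, k) * 0.01)  # low-rank factors: only r*(d+k)
        self.B = nn.Parameter(torch.zeros(d, r))          # parameters instead of d*k
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # the full d x k update is never materialized; its rank is at most r
        return self.base(x) + (x @ self.A.T @ self.B.T) * self.scale
```

The Kronecker-product variants mentioned above swap the plain B·A factorization for Kronecker-structured factors, but the principle of a small, structured update to otherwise frozen weights is the same.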

This narrow perspective of “source” takes away from the actual usefulness of compute/training here. Datasets from e.g. LAION to Common Crawl have been available for some time, along with training code (sometimes independently reproduced) for the Imagen diffusion model or GPT. It was only with e.g. GPT-J, when somebody invested in the compute (including figuring out how to scale it to their specific cluster), that the result became useful.

[-] [email protected] 2 points 20 hours ago* (last edited 19 hours ago)

This is a very shallow analogy. Fine-tuning is rather the standard technical approach to reduce compute, even if you have access to the code and all training data. Hence there has always been a rich and established ecosystem for fine-tuning, regardless of “source.” Patching closed-source binaries is not the standard approach, since compilation is far less computationally intensive than today’s large-scale training.

Java bytecode is a far-fetched example. The JVM assumes a specific architecture that is particular to the CPU-dominant world in which it was developed, and Java bytecode cannot be trivially executed (efficiently) on a GPU or FPGA, for instance.

And by the way, the issue of weight portability is far more relevant than the forced comparison to (simple) code can capture. Today’s large-scale training code is usually quite unique to a particular cluster (or TPU, WSE), as opposed to the resulting weights. Even if you got hold of somebody’s training code, you often have to reinvent the wheel to scale it to your own particular compute hardware, interconnect, I/O pipeline, etc. This is not commodity open source on your home PC or workstation.

[-] [email protected] 3 points 23 hours ago

The situation is somewhat different and more nuanced. With weights there are tools for fine-tuning (LoRA/LoHa, PEFT, etc.), which presents a different situation than with binaries for programs. You can see that even though e.g. LLaMA is “compiled”, others can build on it significantly to make models that surpass the previous iteration (see e.g. recently WizardLM 2 in relation to LLaMA 2). Weights are also architecturally independent to a much larger degree than binaries (you can usually cross-train/inference on GPU, Google TPU, Cerebras WSE, etc. with the same weights).
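As context for the tooling named above (not part of the original comment): a minimal sketch of what PEFT-style LoRA fine-tuning looks like in practice, assuming the Hugging Face transformers and peft packages; the checkpoint name and hyperparameters are placeholders.

```python
# Minimal sketch: attach LoRA adapters to a frozen pretrained model via PEFT.
# Only the small adapter matrices are trained; the base weights stay untouched.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")  # placeholder checkpoint

config = LoraConfig(
    r=8,                                  # rank of the low-rank update
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # which projection layers get adapters
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, config)
model.print_trainable_parameters()  # typically well under 1% of the base parameters
```

The adapter weights can then be saved and shared separately from the base checkpoint, which is part of why fine-tunes of a released model are cheap to distribute in a way that patching a closed-source binary is not.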

[-] [email protected] 0 points 2 weeks ago

Very unlikely, unless Valve can find or pay a company to do custom packaging of an Nvidia GPU with x86 (like the Intel Kaby Lake-G SoC with its in-package Radeon). The handheld size makes an “out of package” discrete GPU very difficult.

And making Nvidia themselves warm up to x86 is just unrealistic at this point. Even if e.g. Nintendo demanded it, the entire gaming market (see AMD’s anemic 2024Q1 gaming results vs. data center and AI) is unlikely to be compelling enough for Nvidia to take up x86 development, vs. continuing with their ARM-based Grace “superchip.”

[-] [email protected] 16 points 3 weeks ago

There is even a sentence in README.md that makes it explicit:

The source files in this repo are for historical reference and will be kept static, so please don’t send Pull Requests suggesting any modifications to the source files […]

[-] [email protected] 0 points 3 weeks ago

Probably from the FAQ pane on the Kickstarter page:

What about Steamdeck support?
Will be 100% supported
Last updated: Tue, April 23 2024 10:55 AM PDT

[-] [email protected] 28 points 1 month ago* (last edited 1 month ago)

He was also criticized because the girls were not in danger of becoming infected. See e.g. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6724388/ :

The Chinese episode has also generated other issues. Several notes demonstrate that this was an experiment and not a therapeutic intervention (even He Jiankui called it a 'clinical trial'). The babies were not at risk of being born with HIV, given that sperm washing had been used so that only non-infected genetic material was used. Further, even though one of the parents (or both) was infected, it did not mean the children were more prone to becoming infected. The risk of becoming infected by the parents' virus was very low (Cowgill et al., 2008). In sum, there was no curative purpose, nor even the intention to prevent a pressing risk. Finally, the interventions were different for each twin. In one case, the two copies of CCR5 were modified, whereas in the other only one copy was modified. This meant that one twin could still become infected, although the evolution of the disease would probably be slower. The purpose of the scientific team was apparently to monitor the evolution of both babies and the differences in how they reacted to their different genetic modifications. This note also raised the issue of parents' informed consent regarding human experimentation, which follows a much stricter regimen than consent for therapeutic procedures.

Other critical articles (e.g. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8524470/) have also cited in particular https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4779710/, which states in its results section:

No HIV transmission occurred in 11,585 cycles of assisted reproduction using washed semen among 3,994 women (95% confidence interval [CI] = 0–0.0001). Among the subset of HIV-infected men without plasma viral suppression at the time of semen washing, no HIV seroconversions occurred among 1,023 women following 2,863 cycles of assisted reproduction using washed semen (95%CI= 0–0.0006). Studies that measured HIV transmission to infants reported no cases of vertical transmission (0/1,026, 95% CI= 0–0.0029). Overall, 56.3% (2,357/4,184, 95%CI=54.8%–57.8%) of couples achieved a clinical pregnancy using washed semen.

[-] [email protected] 5 points 1 month ago

GIMP is a special case. GIMP is getting outdeveloped by Krita these days. E.g.:

https://gitlab.gnome.org/GNOME/gimp/-/issues/9284

Or compare with:

https://www.phoronix.com/news/Krita-2024-GPUs-AI

GIMP had its share of self-inflicted wounds, starting with a toxic mailing list that drove away people from professional VFX and the surrounding FilmGimp/CinePaint effort. When the GIMP people subsequently took over GEGL development from Rhythm & Hues, it took literally 15 years until it barely worked.

Now we are past the era of simple GPU processing and into diffusion models/“generative AI”, yet GIMP is barely keeping up with simple GPU processing (like resizing, see above).

[-] [email protected] 6 points 1 month ago* (last edited 1 month ago)

Have people actually checked the versions there before making the suggestion?

F-Droid: version 3.5.4 (13050408) suggested, added on Feb 23, 2023
Google Play: updated on Aug 27, 2023

https://f-droid.org/en/packages/org.videolan.vlc/
https://play.google.com/store/apps/details?id=org.videolan.vlc

The problem seems to be squarely with VLC themselves.

[-] [email protected] 0 points 1 month ago* (last edited 1 month ago)

I can’t see this “news” reported anywhere else

Clearly not trying very hard:

https://www.bbc.com/news/live/world-68642036 (8:40, “Russian TV airs fake video blaming Ukraine for Moscow attack”)

https://www.motherjones.com/politics/2024/03/deepfake-video-terror-moscow-putin-ukraine-claims/

https://www.tf1info.fr/international/entre-les-speculations-de-simonyan-les-commentaires-de-soloviev-et-le-deepfake-sur-les-antennes-russes-comment-les-relais-pro-russes-alimentent-la-piste-ukrainienne-apres-la-fusillade-pres-de-moscou-2290401.html

https://www.cdt.ch/news/mondo/attentato-al-crocus-city-hall-la-tv-russa-incolpa-kiev-ma-con-un-filmato-fake-346781

and The Sun is notorious for making stuff up

And this article cites — with direct link — a BBC reporter’s X post. Well, who is actually “making stuff up” here…

Yet, BBC Verify journalist Shayan Sardarizadeh stated that an AI-generated audio was dubbed over a mismatch of two recent Ukrainian TV interviews.

Exposing the sham, he wrote on X: "The deepfake video has in fact been created as a composite of two recent interviews published in the last few days with Danilov and Ukraine's military intelligence chief Kyrylo Budanov.

[-] [email protected] 3 points 1 month ago

Going by my own statistics of how many articles I feel are worth posting/linking on Lemmy, the most direct alternative to Kotaku is Eurogamer. PCGamer, PCGamesN and Rock Paper Shotgun are occasionally OK, but you have to cut through a lot of spam and clickbait (i.e. exactly this “50 guides per week” type of corporate guidance). Not sure if this is also the state that Kotaku will end up in. The Verge sometimes also has good articles, but the flood of gadget consumerism articles there is obnoxious.

