this post was submitted on 03 Feb 2025
796 points (98.3% liked)

Technology

61632 readers
3425 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
(page 3) 40 comments
sorted by: hot top controversial new old
[–] [email protected] 5 points 2 days ago

Come and get me.

[–] [email protected] 3 points 2 days ago (1 children)

And they wonder why China is eating the US's lunch on the world stage in terms of influence and technology.

load more comments (1 replies)
[–] [email protected] 3 points 2 days ago

Are you tired of winning yet?

[–] [email protected] 6 points 2 days ago (1 children)

You wouldn't download a LLM

load more comments (1 replies)
[–] [email protected] 8 points 2 days ago (3 children)

Shouldn't Google and Apple go to jail for distribution?

load more comments (3 replies)
[–] [email protected] 6 points 2 days ago* (last edited 2 days ago)

I'd get LM studio or Ollama, and download R1 your system can handle quick. If you're on Linux, Alpaca is on Flathub, you can get it and it'll download models and run them for you, including Deepseek R1.

[–] [email protected] 69 points 2 days ago (1 children)

I'm gonna download it even harder.

[–] [email protected] 10 points 2 days ago

See you hell evildoer!

[–] [email protected] 32 points 2 days ago* (last edited 2 days ago)

Print the code in a book and mail it.

Surely, they cannot ban books... right? Right?

Edit: Wait wait wait... the Comstock Act says mail cannot be used for anything that can be used for abortion. And a AI can theoretically be used to get instructions for abortion. BOOM, it's banned! 👀

[–] [email protected] 23 points 2 days ago (1 children)

More protectionist bullshit that won't help anyone.

load more comments (1 replies)
[–] [email protected] 47 points 2 days ago* (last edited 2 days ago) (1 children)

I wasn't thinking of downloading an AI onto my low tier computer until now.

[–] [email protected] 19 points 2 days ago (2 children)

I've got a laptop kicking around from 2010 that's about to get deepseek just because they're proposing this dumb ass shit. I don't even use Gen AI.

[–] [email protected] 83 points 2 days ago* (last edited 2 days ago) (2 children)

Hawley’s statement called DeepSeek “a data-harvesting, low-cost AI model that sparked international concern and sent American technology stocks plummeting.”

data-harvesting

???

It runs offline... using open-source software that provably does not collect or transmit any data...

It is low-cost and out-competes American technology, though, true

[–] [email protected] 19 points 2 days ago (1 children)

sent American technology stocks plummeting

Oh yeah, thats what did it, totally

load more comments (1 replies)
[–] [email protected] 52 points 2 days ago* (last edited 2 days ago)
[–] [email protected] 4 points 2 days ago

rip Pat Gelsinger

[–] [email protected] 10 points 2 days ago (1 children)
load more comments (1 replies)
[–] [email protected] 34 points 2 days ago (3 children)

So I guess it's free speech as long as you agree with the goverment's speech. If not, then it's a crime.

[–] [email protected] 13 points 2 days ago

Elon Musk was just posting a factory of prisoners all working for cents on the dollar saying that America needs more of that.

[–] [email protected] 5 points 2 days ago (1 children)

Always have been, and this is a bipartisan value, heck, it's common to all political parties of the world.

load more comments (1 replies)
load more comments (1 replies)
[–] [email protected] 84 points 2 days ago (2 children)
[–] [email protected] 55 points 2 days ago (1 children)

this is deepseek-v3. deepseek-r1 is the model that got all the media hype: https://huggingface.co/deepseek-ai/DeepSeek-R1

load more comments (1 replies)
[–] [email protected] 10 points 2 days ago (3 children)

Can you elaborate on the differences?

[–] [email protected] 20 points 2 days ago* (last edited 2 days ago) (6 children)

Base models are general purpose language models, mainly useful for AI researchers and people who want to build on top of them.

Instruct or chat models are chatbots. They are made by fine-tuning base models.

The V3 models linked by OP are Deepseek's non-reasoning models, similar to Claude or ChatGPT4o. These are the "normal" chatbots that reply with whatever comes to their mind. Deepseek also has a reasoning model, R1. Such models take time to "think" before supplying their final answer; they tend to give better performance for stuff like math problems, at the cost of being slower to get the answer.

It should be mentioned that you probably won't be able to run these models yourself unless you have a data center style rig with 4-5 GPUs. The Deepseek V3 and R1 models are chonky beasts. There are smaller "distilled" forms of R1 that are possible to run locally, though.

load more comments (6 replies)
[–] [email protected] -2 points 2 days ago (1 children)

r1 is lightweight and optimized for local environments on a home PC. It's supposed to be pretty good at programming and logic and kinda awkward at conversation.

v3 is powerful and meant to run on cloud servers. It's supposed to make for some pretty convincing conversations.

[–] [email protected] 5 points 2 days ago (2 children)

R1 isn’t really runnable with a home rig. You might be able to run a distilled version of the model though!

[–] [email protected] 5 points 2 days ago (1 children)

Tell that to my home rig currently running the 671b model...

[–] [email protected] 6 points 2 days ago (1 children)

That likely is one of the distilled versions I’m talking about. R1 is 720 GB, and wouldn’t even fit into memory on a normal computer. Heck, even the 1.58-bit quant is 131GB, which is outside the range of a normal desktop PC.

But I’m sure you know what version you’re running better than I do, so I’m not going to bother guessing.

[–] [email protected] 4 points 2 days ago* (last edited 2 days ago) (1 children)

It's not. I can run the 2.51bit quant

load more comments (1 replies)
load more comments (1 replies)
[–] [email protected] 2 points 2 days ago (1 children)

https://www.deepseekv3.com/en/download

I was assuming one was pre-trained and one wasn't but don't think that's correct and don't care enough to investigate further.

[–] [email protected] 17 points 2 days ago

Is that website legit? I've only ever seen https://www.deepseek.com/

And I would personally recommend downloading from HuggingFace or Ollama

[–] [email protected] 28 points 2 days ago (1 children)

It's only a proposed bill (thankfully), but definitely one to keep an eye on.

[–] [email protected] 17 points 2 days ago (2 children)

Eh, it's just virtue signaling nonsense. It's not going anywhere.

[–] [email protected] 18 points 2 days ago (1 children)

That's what everyone thought about Trump becoming president... Both times.

[–] [email protected] 4 points 2 days ago (1 children)

At this point someone could say Hitler is coming back from the dead riding a dinosaur, and my reaction would be "Yeah, sure. That may as well happen. Nothing has made sense the last 10 years. That's just as plausible as anything else we've seen."

load more comments (1 replies)
[–] [email protected] 36 points 2 days ago* (last edited 2 days ago) (1 children)

that's what they said about vaguely gestures at everything

[–] [email protected] 222 points 2 days ago* (last edited 2 days ago) (4 children)

Wow, bold choice to ban the import of technology and knowledge. Usually governments are worried about export, so it doesn't fall into the wrong hands.

Btw, how is the Nvidia stock price doing?

[–] [email protected] 56 points 2 days ago

Right? Like, seriously, we all know somebody is just butthurt because their stock options tanked.

Oh, wait, I'm sorry! That was very unpatriotic of me, wasn't it? I mean, we all know that winning an election guarantees being heavily rewarded with insider trading, right? It's not like they're there to represent constituents or anything; I mean, doesn't everyone know we're a republic, not a democracy?!

Sigh...

load more comments (3 replies)
load more comments
view more: ‹ prev next ›