Selfhosted

48681 readers

2485 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
No spam posting.
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.
Don't duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.

Resources:

selfh.st Newsletter and index of selfhosted software and apps
awesome-selfhosted software
awesome-sysadmin resources
Self-Hosted Podcast from Jupiter Broadcasting

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago

MODERATORS

[email protected]

110

I've just created c/Ollama! (lemmy.world)

submitted 3 days ago* (last edited 3 days ago) by [email protected] to c/[email protected]

32 comments fedilink hide all child comments

I've just re-discovered ollama and it's come on a long way and has reduced the very difficult task of locally hosting your own LLM (and getting it running on a GPU) to simply installing a deb! It also works for Windows and Mac, so can help everyone.

I'd like to see Lemmy become useful for specific technical sub branches instead of trying to find the best existing community which can be subjective making information difficult to find, so I created [email protected] for everyone to discuss, ask questions, and help each other out with ollama!

So, please, join, subscribe and feel free to post, ask questions, post tips / projects, and help out where you can!

Thanks!

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 3 points 3 days ago* (last edited 3 days ago) (16 children)

Totally depends on your hardware, and what you tend to ask it. What are you running? What do you use it for? Do you prefer speed over accuracy?

[–] [email protected] 1 points 2 days ago (11 children)

I have a MacBook 2 pro (Apple silicon) and would kind of like to replace Google's Gemini as my go-to LLM. I think I'd like to run something like Mistral, probably. Currently I do have Ollama and some version of Mistral running, but I almost never used it as it's on my laptop, not my phone.

I'm not big on LLMs and if I can find an LLM that I run locally and helps me get off of using Google Search and Gimini, that could be awesome. Currently I use a combo of Firefox, Qwant, Google Search, and Gemini for my daily needs. I'm not big into the direction Firefox is headed, I've heard there are arguments against Qwant, and using Gemini feels like the wrong answer for my beliefs and opinions.

I'm looking for something better without too much time being sunk into something I may only sort of like. Tall order, I know, but I figured I'd give you as much info as I can.

[–] [email protected] 2 points 2 days ago* (last edited 2 days ago) (6 children)

Honestly perplexity, the online service, is pretty good.

As for local running, one question first: how much RAM does your Mac have? This is basically the factor for what model you can and should run.

[–] [email protected] 1 points 2 days ago (1 children)

8GB

[–] [email protected] 2 points 2 days ago* (last edited 2 days ago) (1 children)

8GB?

You might be able to run Qwen3 4B: https://huggingface.co/mlx-community/Qwen3-4B-4bit-DWQ/tree/main

But honestly you don't have enough RAM to spare, and even a small model might bog things down. I'd run Open Web UI or LM Studio with a free LLM API, like Gemini Flash, or pay a few bucks for something off openrouter. Or maybe Cerebras API.

...Unfortunely, LLMs are very RAM intensive, and >4GB (more realistically like 2GB) is not going to be a good experience :(

[–] [email protected] 1 points 2 days ago (1 children)

Good to know. I'd hate to buy a new machine strictly for running an LLM. Could be an excuse to pickup something like a Framework 16, but realistically, I don't see myself doing that. I think you might be right about using something like Open Web UI or LM Studio.

[–] [email protected] 2 points 2 days ago* (last edited 2 days ago) (1 children)

Yeah, just paying for LLM APIs is dirt cheap, and they (supposedly) don't scrape data. Again I'd recommend Openrouter and Cerebras! And you get your pick of models to try from them.

Even a framework 16 is not good for LLMs TBH. The Framework desktop is (as it uses a special AMD chip), but it's very expensive. Honestly the whole hardware market is so screwed up, hence most 'local LLM enthusiasts' buy a used RTX 3090 and stick them in desktops or servers, as no one wants to produce something affordable apparently :/

[–] [email protected] 2 points 2 days ago (1 children)

@brucethemoose @WhirlpoolBrewer

*1650 and it works like a charm 🤌🏾

[–] [email protected] 1 points 2 days ago* (last edited 2 days ago)

1650

You mean GPU? Yeah, it's good, I was strictly talking about purchasing a laptop for LLM usage, as most are less than ideal for the money. Laptop vram pools are relatively small and SO-DIMMS are usually very slow.

Things will get much better once the "Max" AMD SKUs proliferate.

load more comments (4 replies)

load more comments (8 replies)

load more comments (12 replies)