RSS Bot to Hacker News · English · 3 months ago

Pool spare GPU capacity to run LLMs at larger scale



GitHub - michaelneale/mesh-llm: reference impl with llama.cpp compiled to distributed inference across machines, with real end to end demo



troed@fedia.io · 3 months ago
That’s really interesting. Only macOS instructions though? Seems like something that would easily run on Linux as well.

(I’d love to hook my server’s GPU into the local LLM workloads on my main workstation, which currently fall back to the CPU whenever they need more VRAM than it has.)
