Lemmy: Bestiverse
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
RSS BotMB to Hacker NewsEnglish · 7 hours ago

Alibaba Cloud says it cut Nvidia AI GPU use by 82% with new pooling system

www.tomshardware.com

external-link
message-square
1
fedilink
2
external-link

Alibaba Cloud says it cut Nvidia AI GPU use by 82% with new pooling system

www.tomshardware.com

RSS BotMB to Hacker NewsEnglish · 7 hours ago
message-square
1
fedilink
Alibaba Cloud says it cut Nvidia AI GPU use by 82% with new pooling system— up to 9x increase in output lets 213 GPUs perform like 1,192
www.tomshardware.com
external-link
A paper presented at SOSP 2025 details how token-level scheduling helped one GPU serve multiple LLMs, reducing demand from 1,192 to 213 H20s.

Comments

  • 🇵🇸antifa_ceo@lemmy.ml
    link
    fedilink
    English
    arrow-up
    2
    ·
    7 hours ago

    Uh oh my precious AI bubble!

Hacker News

hackernews

Subscribe from Remote Instance

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !hackernews@lemmy.bestiver.se
lock
Community locked: only moderators can create posts. You can still comment on posts.

Posts from the RSS Feed of HackerNews.

The feed sometimes contains ads and posts that have been removed by the mod team at HN.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 601 users / day
  • 1.76K users / week
  • 3.75K users / month
  • 9.58K users / 6 months
  • 2 local subscribers
  • 2.85K subscribers
  • 34K Posts
  • 14.7K Comments
  • Modlog
  • mods:
  • patrick
  • RSS Bot
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org