• silence7@slrpnk.netOP
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    1
    ·
    7 months ago

    They know exactly what the power consumption of that hardware is though. This isnt tough to figure out just because you use a cloud provider

      • silence7@slrpnk.netOP
        link
        fedilink
        English
        arrow-up
        2
        ·
        7 months ago

        You can produce a remarkably good estimate by looking at CPU and GPU utilization out of procfs and profiling a handful of similar machines power use with similar utilization and workloads.

        Network is less than 5% of power use for non-GPU loads; probably less for GPU.

        • ddh@lemmy.sdf.org
          link
          fedilink
          English
          arrow-up
          2
          ·
          7 months ago

          Sure, you can do that at an aggregate level, but then how do you divide it by customer? And even then, some setups will be more efficient than others, so you’d only get that setup’s usage.

          And even if you do that and can narrow it down to a single user and a single prompt, you can still only roughly predict how long it will think and how long the response will be.

          • silence7@slrpnk.netOP
            link
            fedilink
            English
            arrow-up
            1
            ·
            7 months ago

            By customer is easy: they’re each renting specific resources. A fractional cloud instance (excepting the sma burst able ones) is tied to specific CPUs and GPUs. And there are records of who rented which one when being kept already.

            You might not be able to break out specific individual queries, but computing averages is completely straightforward

      • Zeoic@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        7 months ago

        Im sure they can do the simple math of: we pay for x power, we have y customers. x / y would be a rough but probably pretty accurate number if we are talking tens of thousands to millions of customers.