• by neximo64 on 12/15/2024, 10:32:08 AM

    the session is tied to a gpu cluster. It would actually be quite inefficient to switch gpu cluster to another one mid session, but its needed in a failure scenario

  • by ansonhw on 12/15/2024, 12:41:26 PM

    good batching and tensor parallelization prob