Faster version of Gryphe/MythoMax-L2-13b running on multiple H100 cards in fp8 precision. Up to 160 tps.
Due to low usage this model has been replaced by Gryphe/MythoMax-L2-13b. Your inference requests are still working but they are redirected. Please update your code to use another model.
Faster version of Gryphe/MythoMax-L2-13b running on multiple H100 cards in fp8 precision. Up to 160 tps.