google/flan-t5-xxl cover image

google/flan-t5-xxl

Flan-PaLM 540B achieves state-of-the-art performance on several benchmarks, such as 75.2% on five-shot MMLU. We also publicly release Flan-T5 checkpoints, which achieve strong few-shot performance even compared to much larger models, such as PaLM 62B. Overall, instruction finetuning is a general method for improving the performance and usability of pretrained language model.

Flan-PaLM 540B achieves state-of-the-art performance on several benchmarks, such as 75.2% on five-shot MMLU. We also publicly release Flan-T5 checkpoints, which achieve strong few-shot performance even compared to much larger models, such as PaLM 62B. Overall, instruction finetuning is a general method for improving the performance and usability of pretrained language model.

Public
$0.0005/sec
demoapi

d2dd2330e76ef048bc6b3cea5b282cde0c3f6fe6

2023-01-18T23:10:50+00:00


© 2023 Deep Infra. All rights reserved.

Discord Logo