Flan-UL2 is an encoder decoder model based on the T5 architecture. It uses the same configuration as the UL2 model released the previous year. It was fine tuned using the "Flan" prompt tuning and dataset collection. The original UL2 model was only trained with receptive field of 512, which made it non-ideal for N-shot prompting where N is large.
Flan-UL2 is an encoder decoder model based on the T5 architecture. It uses the same configuration as the UL2 model released the previous year. It was fine tuned using the "Flan" prompt tuning and dataset collection. The original UL2 model was only trained with receptive field of 512, which made it non-ideal for N-shot prompting where N is large.
a90bd522aaecabafd1be768ba40cf6896d5dd46e
2023-03-08T00:51:10+00:00
5dfdc8c4a1ae47a6c1487a88636ac8a712e8808a
2023-03-04T02:17:26+00:00