CodeGen is a family of autoregressive language models for program synthesis, trained on a Python programming language dataset. The models are capable of extracting features from given natural language and programming language texts, and calculating the likelihood of them. They are intended for and best at program synthesis, that is, generating executable code given English prompts. The evaluation results show that CodeGen achieves state-of-the-art performance on two code generation benchmarks, HumanEval and MTPB.
CodeGen is a family of autoregressive language models for program synthesis, trained on a Python programming language dataset. The models are capable of extracting features from given natural language and programming language texts, and calculating the likelihood of them. They are intended for and best at program synthesis, that is, generating executable code given English prompts. The evaluation results show that CodeGen achieves state-of-the-art performance on two code generation benchmarks, HumanEval and MTPB.
0e475ef9b7e972a4f95661086b73041827297374
2023-05-26T23:44:31+00:00