nl2postcondition

Natural language to program postcondition generation

FSE'24 paper artefacts.

This repository contains the replication materials for the paper,

Can Large Language Models Transform Natural Language Intent into Formal Method Postconditions?

to appear in Foundations of Software Engineering (FSE), 2024

Authors: Madeline Endres (University of Michigan); Sarah Fakhoury (Microsoft Research); Saikat Chakraborty (Microsoft Research); Shuvendu Lahiri (Microsoft Research)

A preprint of the paper is available here: https://arxiv.org/pdf/2310.01831

This repository contains the following:

All LLM prompts and postconditions analyzed for the FSE paper
The set of code mutants produced for the FSE paper
Qualitative analysis spreadsheet
Analysis scripts and a Docker container for running nl2postcondition with EvalPlus

Subfolders of this repository contain their own READMEs with more detailed instructions where needed. The layout of this repository is:

GeneratedPostconditions: All generated postconditions analyzed in the FSE paper, along with their evaluation results and logs. Includes both EvalPlus and Defects4J results.
QualitativeAnalysis: A spreadsheet with the results of our manual analysis of a subset of EvalPlus postconditions.
PromptTemplates: All prompt ablations used for both EvalPlus and Defects4J.
nl2postcondition_source_evalplus: All nl2postcondition code for the EvalPlus benchmark. Includes scripts for postcondition generation, postcondition preprocessing, and postcondition evaluation.

Due to integration with other internal projects, the source code for the Defects4J evaluation is not yet public.
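
For readers new to the task, the following is a minimal sketch of what a generated postcondition and its evaluation against a code mutant might look like. It is an illustration only, not code from this repository: the task, the function names (`reference_impl`, `buggy_mutant`, `postcondition`, `holds_on_all`), the `return_val` parameter name, and the test inputs are all hypothetical. The sketch assumes a postcondition is useful when it holds on a correct implementation and fails on at least one input for a buggy variant.

```python
from typing import Callable, List


def reference_impl(numbers: List[int]) -> List[int]:
    """Hypothetical task: return the sorted unique elements of a list."""
    return sorted(set(numbers))


def buggy_mutant(numbers: List[int]) -> List[int]:
    """Hypothetical mutant of the reference: forgets to remove duplicates."""
    return sorted(numbers)


def postcondition(numbers: List[int], return_val: List[int]) -> bool:
    """Candidate postcondition relating the input to the returned value.

    The output must be strictly increasing (sorted, no duplicates) and
    contain exactly the elements that appear in the input.
    """
    strictly_increasing = all(a < b for a, b in zip(return_val, return_val[1:]))
    return strictly_increasing and set(return_val) == set(numbers)


def holds_on_all(impl: Callable[[List[int]], List[int]],
                 inputs: List[List[int]]) -> bool:
    """Return True if the postcondition holds for impl on every input."""
    return all(postcondition(x, impl(list(x))) for x in inputs)


# Hypothetical test inputs standing in for a benchmark's test suite.
TEST_INPUTS = [[3, 1, 2], [5, 5, 1], [], [2, 2, 2]]

# The postcondition should accept the reference implementation...
accepts_reference = holds_on_all(reference_impl, TEST_INPUTS)
# ...and be violated by the mutant on at least one input.
rejects_mutant = not holds_on_all(buggy_mutant, TEST_INPUTS)

print(accepts_reference, rejects_mutant)
```

A weaker postcondition, such as checking only that the output is sorted, would still hold on the reference implementation but would fail to distinguish this mutant from it, which is why both checks matter.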
