ChatDev/SRDD/README.md
2024-02-02 10:23:23 +08:00

25 lines
1.3 KiB
Markdown

# SRDD (Software Requirement Description Dataset)
<p align="center">
<img src='./cover.png' width=800>
</p>
Welcome to SRDD (Software Requirement Description Dataset), a large prompted dataset tailored for Natural Language to Software (NL2Software) research. This repository contains a rich collection of prompts organized into 5 major categories and further subdivided into 40 subcategories. In total, the dataset comprises 1200 high-quality prompt samples extracted from ChatGPT 3.5, specifically curated to facilitate research in NL2Software.
## Structure
- The generated prompt contains three parts:
- Name of the software
- Description of this software
- Category of this software
- Details
- check.csv # Check Results
- data_ChatDev_format.sh # Data, in the format of executable shell scripts (in ChatDev)
- data_attribute_format.csv # Data, in the format of csv, containing three columns, Name/Description/Category
## Category
<p align="center">
<img src='./SRDD_Category.png' width=800>
</p>
# License
The SRDD dataset is licensed under **CC BY-NC 4.0**. This license explicitly permits non-commercial use of the data. We would like to emphasize that any models trained using these datasets should strictly adhere to the non-commercial usage restriction and should be employed exclusively for research purposes.