add NLDD dataset

This commit is contained in:
thinkwee 2023-12-18 22:16:33 +08:00
parent fa7360a392
commit 1fd4708727
7 changed files with 3622 additions and 1201 deletions

BIN
NLDD/NLDD_Category.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 335 KiB

21
NLDD/README.md Normal file
View File

@ -0,0 +1,21 @@
# NLDD (Natural Language Dataset for Dev)
<p align="center">
<img src='./cover.png' width=800>
</p>
Welcome to NLDD (Natural Language Dataset for Dev), a large prompted dataset tailored for Natural Language to Software (NL2Software) research. This repository contains a rich collection of prompts organized into 5 major categories and further subdivided into 40 subcategories. In total, the dataset comprises 1200 high-quality prompt samples extracted from ChatGPT 3.5, specifically curated to facilitate research in NL2Software.
## Structure
- The generated prompt contains three parts:
- Name of the software
- Description of this software
- Category of this software
- Details
- check.csv # Check Results
- data_ChatDev_format.sh # Data, in the format of executable shell scripts (in ChatDev)
- data_attribute_format.csv # Data, in the format of csv, containing three columns, Name/Description/Category
## Category
<p align="center">
<img src='./NLDD_Category.png' width=800>
</p>

BIN
NLDD/cover.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 1.7 MiB

1200
NLDD/data/check.csv Normal file

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff

File diff suppressed because it is too large Load Diff