Skip to content

wangdeze18/Code-Comment-Assessment-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Code-Comment-Assessment-Dataset

1 Introduction

This repository contains datasets for paper "Deep Code-Comment Understanding and Assessment".

2 Dataset

The public dataset is from this work. The public dataset includes the results of a manual assessment on the coherence between comments and the implementations of 3636 methods, gathered from three open source softwares implemented in Java.

Our labeled dataset is from the Java projects uploaded to GitHub before October 2018.

3 Data Structure

For each method in our labeled dataset, the structure is shown as belows:

  • #No
  • #File (the name of the source file)
  • #Comment
  • #Code

About

code-comment data for model training and evaluation

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published