-
Notifications
You must be signed in to change notification settings - Fork 8
/
Copy pathdataset_browsing.txt
93 lines (62 loc) · 1.67 KB
/
dataset_browsing.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
- Did he fix his lines?
- How many percent of fixed lines belong to him?
# process PyDriller for all branches
dataset has duplicated samples (different commit id => same content but different branches)
email can be changed overtime
Are there any environment condition that force a developer to fix a vulnerabilities, instead of others:
- Others are busy?
- Others developers are not active sometimes
- Only this developer know how to fix these files
{
repository(owner: "apache", name: "struts") {
defaultBranchRef {
target {
... on Commit {
history(path: "core/src/main/java/org/apache/struts2/util/TokenHelper.java") {
nodes {
committer {
user {
login
}
}
committedDate
message
oid
}
}
}
}
}
}
}
{
repository(owner: "spring-projects", name: "spring-framework") {
defaultBranchRef {
target {
... on Commit {
history(path: "spring-web/src/main/java/org/springframework/http/converter/json/MappingJackson2HttpMessageConverter.java") {
nodes {
committer {
user {
login
}
}
committedDate
message
oid
}
}
}
}
}
}
}
----------------------
get branch of all commits in dataset
get file history base on branch
is this first commit on files?
----------------------
Note:
- Feature data missing:
Only 6626/7127 commits has username available
torch.Size([1, 6, 768])