File tree
36 files changed
+194
-1324
lines changed- csrc
- attention
- rocm
- docs/source/quantization
- examples/fp8
- quantizer
- tests
- fp8_kv
- llama2-70b-fp8-kv
- llama2-7b-fp8-kv
- models/decoder_only/language
- vllm
- attention
- backends
- ops
- engine
- model_executor
- layers/quantization
- model_loader
- models
- worker
36 files changed
+194
-1324
lines changedLines changed: 5 additions & 5 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
105 | 105 |
| |
106 | 106 |
| |
107 | 107 |
| |
108 |
| - | |
| 108 | + | |
109 | 109 |
| |
110 | 110 |
| |
111 | 111 |
| |
| |||
285 | 285 |
| |
286 | 286 |
| |
287 | 287 |
| |
288 |
| - | |
| 288 | + | |
289 | 289 |
| |
290 | 290 |
| |
291 | 291 |
| |
| |||
415 | 415 |
| |
416 | 416 |
| |
417 | 417 |
| |
418 |
| - | |
| 418 | + | |
419 | 419 |
| |
420 | 420 |
| |
421 | 421 |
| |
| |||
513 | 513 |
| |
514 | 514 |
| |
515 | 515 |
| |
516 |
| - | |
| 516 | + | |
517 | 517 |
| |
518 | 518 |
| |
519 | 519 |
| |
| |||
549 | 549 |
| |
550 | 550 |
| |
551 | 551 |
| |
552 |
| - | |
| 552 | + | |
553 | 553 |
| |
554 | 554 |
| |
555 | 555 |
| |
|
Lines changed: 10 additions & 7 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
41 | 41 |
| |
42 | 42 |
| |
43 | 43 |
| |
44 |
| - | |
| 44 | + | |
45 | 45 |
| |
46 | 46 |
| |
47 | 47 |
| |
| |||
53 | 53 |
| |
54 | 54 |
| |
55 | 55 |
| |
56 |
| - | |
57 |
| - | |
58 |
| - | |
59 |
| - | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
60 | 60 |
| |
61 | 61 |
| |
62 | 62 |
| |
| |||
80 | 80 |
| |
81 | 81 |
| |
82 | 82 |
| |
| 83 | + | |
| 84 | + | |
83 | 85 |
| |
84 | 86 |
| |
85 | 87 |
| |
| |||
193 | 195 |
| |
194 | 196 |
| |
195 | 197 |
| |
196 |
| - | |
197 |
| - | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
198 | 201 |
| |
199 | 202 |
| |
200 | 203 |
| |
|
Lines changed: 10 additions & 7 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
45 | 45 |
| |
46 | 46 |
| |
47 | 47 |
| |
48 |
| - | |
| 48 | + | |
49 | 49 |
| |
50 | 50 |
| |
51 | 51 |
| |
| |||
62 | 62 |
| |
63 | 63 |
| |
64 | 64 |
| |
65 |
| - | |
66 |
| - | |
67 |
| - | |
68 |
| - | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
69 | 69 |
| |
70 | 70 |
| |
71 | 71 |
| |
| |||
92 | 92 |
| |
93 | 93 |
| |
94 | 94 |
| |
| 95 | + | |
| 96 | + | |
95 | 97 |
| |
96 | 98 |
| |
97 | 99 |
| |
| |||
212 | 214 |
| |
213 | 215 |
| |
214 | 216 |
| |
215 |
| - | |
216 |
| - | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
217 | 220 |
| |
218 | 221 |
| |
219 | 222 |
| |
|
Lines changed: 3 additions & 3 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
18 | 18 |
| |
19 | 19 |
| |
20 | 20 |
| |
21 |
| - | |
22 |
| - | |
| 21 | + | |
| 22 | + | |
23 | 23 |
| |
24 | 24 |
| |
25 | 25 |
| |
26 | 26 |
| |
27 | 27 |
| |
28 | 28 |
| |
29 |
| - | |
| 29 | + | |
30 | 30 |
| |
31 | 31 |
| |
32 | 32 |
| |
|
Lines changed: 17 additions & 13 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
159 | 159 |
| |
160 | 160 |
| |
161 | 161 |
| |
162 |
| - | |
163 |
| - | |
| 162 | + | |
| 163 | + | |
164 | 164 |
| |
165 | 165 |
| |
166 | 166 |
| |
| |||
196 | 196 |
| |
197 | 197 |
| |
198 | 198 |
| |
199 |
| - | |
| 199 | + | |
200 | 200 |
| |
201 |
| - | |
| 201 | + | |
202 | 202 |
| |
203 | 203 |
| |
204 | 204 |
| |
| |||
214 | 214 |
| |
215 | 215 |
| |
216 | 216 |
| |
217 |
| - | |
| 217 | + | |
218 | 218 |
| |
219 | 219 |
| |
220 | 220 |
| |
| |||
239 | 239 |
| |
240 | 240 |
| |
241 | 241 |
| |
242 |
| - | |
| 242 | + | |
243 | 243 |
| |
244 |
| - | |
| 244 | + | |
245 | 245 |
| |
246 | 246 |
| |
247 | 247 |
| |
| |||
258 | 258 |
| |
259 | 259 |
| |
260 | 260 |
| |
261 |
| - | |
| 261 | + | |
| 262 | + | |
| 263 | + | |
262 | 264 |
| |
263 | 265 |
| |
264 | 266 |
| |
| |||
268 | 270 |
| |
269 | 271 |
| |
270 | 272 |
| |
271 |
| - | |
272 |
| - | |
| 273 | + | |
| 274 | + | |
273 | 275 |
| |
274 | 276 |
| |
275 | 277 |
| |
| |||
299 | 301 |
| |
300 | 302 |
| |
301 | 303 |
| |
302 |
| - | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
303 | 307 |
| |
304 | 308 |
| |
305 | 309 |
| |
| |||
308 | 312 |
| |
309 | 313 |
| |
310 | 314 |
| |
311 |
| - | |
312 |
| - | |
| 315 | + | |
| 316 | + | |
313 | 317 |
| |
314 | 318 |
| |
315 | 319 |
| |
|
Lines changed: 6 additions & 4 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
34 | 34 |
| |
35 | 35 |
| |
36 | 36 |
| |
37 |
| - | |
38 |
| - | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
39 | 40 |
| |
40 | 41 |
| |
41 | 42 |
| |
| |||
45 | 46 |
| |
46 | 47 |
| |
47 | 48 |
| |
48 |
| - | |
49 |
| - | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
50 | 52 |
| |
51 | 53 |
| |
52 | 54 |
| |
|
Lines changed: 12 additions & 8 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
236 | 236 |
| |
237 | 237 |
| |
238 | 238 |
| |
239 |
| - | |
| 239 | + | |
240 | 240 |
| |
241 | 241 |
| |
242 | 242 |
| |
| |||
438 | 438 |
| |
439 | 439 |
| |
440 | 440 |
| |
441 |
| - | |
| 441 | + | |
442 | 442 |
| |
443 | 443 |
| |
444 | 444 |
| |
| |||
448 | 448 |
| |
449 | 449 |
| |
450 | 450 |
| |
451 |
| - | |
| 451 | + | |
452 | 452 |
| |
453 | 453 |
| |
454 | 454 |
| |
| |||
995 | 995 |
| |
996 | 996 |
| |
997 | 997 |
| |
998 |
| - | |
| 998 | + | |
999 | 999 |
| |
1000 | 1000 |
| |
1001 | 1001 |
| |
| |||
1026 | 1026 |
| |
1027 | 1027 |
| |
1028 | 1028 |
| |
1029 |
| - | |
| 1029 | + | |
1030 | 1030 |
| |
1031 | 1031 |
| |
1032 | 1032 |
| |
| |||
1043 | 1043 |
| |
1044 | 1044 |
| |
1045 | 1045 |
| |
1046 |
| - | |
| 1046 | + | |
1047 | 1047 |
| |
1048 | 1048 |
| |
1049 | 1049 |
| |
| |||
1068 | 1068 |
| |
1069 | 1069 |
| |
1070 | 1070 |
| |
| 1071 | + | |
| 1072 | + | |
| 1073 | + | |
1071 | 1074 |
| |
1072 | 1075 |
| |
1073 | 1076 |
| |
| |||
1263 | 1266 |
| |
1264 | 1267 |
| |
1265 | 1268 |
| |
1266 |
| - | |
1267 |
| - | |
| 1269 | + | |
| 1270 | + | |
| 1271 | + | |
1268 | 1272 |
| |
1269 | 1273 |
| |
1270 | 1274 |
| |
|
0 commit comments