[Tensor] Int4QTensor with quantized 4-bit integer data type #2895
base: main
Conversation
Glad to have many negative unit-test TCs as well. All good!
nntrainer/tensor/int4_tensor.cpp
Outdated
/// @todo this func should be template function
void Int4QTensor::addValue(unsigned int b, unsigned int c, unsigned int h,
                           unsigned int w, float value, float beta) {
  auto const &idx = getIndex(b, c, h, w);
  float output = getValue(idx);
  output *= beta;
  output += value;

  // if result value is out of range, clamp to max/min value
  int8_t val = std::trunc(std::clamp((int)output, -8, 7));

  // encode result value to int8 data
  ((int8_t *)getData())[idx / 2] =
    (idx % 2 == 0) ? (val << 4) | (((int8_t *)getData())[idx / 2] & 0x0f)
                   : (((int8_t *)getData())[idx / 2] << 4) | (val & 0x0f);
}
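getValue is not shown in this excerpt; as a minimal sketch of the read side, here is how a signed 4-bit value could be decoded under the same layout (even indices in the high nibble, odd indices in the low nibble). This is a hypothetical free function for illustration, not the actual nntrainer API:

```cpp
#include <cstdint>

// Hypothetical helper (not part of Int4QTensor): decode the signed 4-bit
// value stored at logical index idx in a packed int8 buffer.
int8_t decode_int4(const int8_t *data, unsigned int idx) {
  int8_t byte = data[idx / 2];
  // Move the target nibble into the high four bits: even indices are
  // already there; odd indices are shifted up from the low nibble.
  int8_t shifted = (idx % 2 == 0) ? byte : static_cast<int8_t>(byte << 4);
  // Arithmetic right shift sign-extends the nibble back into [-8, 7].
  return static_cast<int8_t>(shifted >> 4);
}
```

The arithmetic right shift is what recovers negative values: a nibble of 0b1000 placed in the high bits sign-extends to -8 rather than staying at 8.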
Quick question: do we just expect the user to account for the scale factor in the input float value and float beta?
I am curious how basic math on an Int4Q tensor works...
Thanks for asking! Currently, no. This is to modify the quantized value directly.
nntrainer/tensor/int4_tensor.cpp
Outdated
// encode result value to int8 data
((int8_t *)getData())[idx / 2] =
  (idx % 2 == 0) ? (val << 4) | (((int8_t *)getData())[idx / 2] & 0x0f)
                 : (((int8_t *)getData())[idx / 2] << 4) | (val & 0x0f);
As I understood, the computation should be:

- : (((int8_t *)getData())[idx / 2] << 4) | (val & 0x0f);
+ : (((int8_t *)getData())[idx / 2] & 0xf0) | (val & 0x0f);

I'm quite confused by it. Please let me know if I'm wrong :)
You're right! Thanks for pointing it out :)
nntrainer/tensor/int4_tensor.cpp
Outdated
(idx % 2 == 0) ? (val << 4) | ((int8_t *)getData())[idx / 2]
               : ((int8_t *)getData())[idx / 2] | (val & 0x0f);
I think we need to clear out the bits where we want to write the new value:

- (idx % 2 == 0) ? (val << 4) | ((int8_t *)getData())[idx / 2]
-                : ((int8_t *)getData())[idx / 2] | (val & 0x0f);
+ (idx % 2 == 0) ? (val << 4) | (((int8_t *)getData())[idx / 2] & 0x0f)
+                : (((int8_t *)getData())[idx / 2] & 0xf0) | (val & 0x0f);
makes sense 👍
This pull request presents the Int4QTensor class, a solution for efficiently storing quantized 4-bit integer data. By packing two 4-bit integers into each 8-bit memory slot, we use memory effectively: the first four bits hold the first 4-bit value and the last four bits hold the second.

1. Build test: [X] Passed [ ] Failed [ ] Skipped
2. Run test: [X] Passed [ ] Failed [ ] Skipped

Signed-off-by: Donghyeon Jeong <dhyeon.jeong@samsung.com>
Force-pushed from d60db7a to f526f3a.