-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Utilize pixman for efficient pixel manipulation #55
Conversation
Consider the performance test program below ( #include <assert.h>
#include <stdio.h>
#include <string.h>
#include <sys/time.h>
#include <time.h>
#include "twin.h"
#define TEST_PIX_WIDTH 1200
#define TEST_PIX_HEIGHT 800
static twin_pixmap_t *src32, *dst32;
static int twidth, theight, titers;
static void test_argb32_source_argb32(void)
{
twin_operand_t srco = {.source_kind = TWIN_PIXMAP, .u.pixmap = src32};
twin_composite(dst32, 0, 0, &srco, 0, 0, NULL, 0, 0, TWIN_SOURCE, twidth,
theight);
}
static void test_argb32_over_argb32(void)
{
twin_operand_t srco = {.source_kind = TWIN_PIXMAP, .u.pixmap = src32};
twin_composite(dst32, 0, 0, &srco, 0, 0, NULL, 0, 0, TWIN_OVER, twidth,
theight);
}
static void do_test(const char *name, void (*test)(void))
{
struct timeval start, end;
unsigned long long sus, eus;
char spc[128];
char *s;
int i;
printf("%s", name);
gettimeofday(&start, NULL);
for (i = 0; i < titers; i++)
test();
gettimeofday(&end, NULL);
sus = (unsigned long long) start.tv_sec * 1000000ull + start.tv_usec;
eus = (unsigned long long) end.tv_sec * 1000000ull + end.tv_usec;
s = spc;
for (i = strlen(name); i < 40; i++)
*(s++) = ' ';
*s = 0;
printf("%s %f sec\n", spc, ((float) (eus - sus)) / 1000000.0);
}
#define DO_TEST(name) do_test(#name, test_##name)
static void do_tests(int width, int height, int iters)
{
twidth = width;
theight = height;
titers = iters;
DO_TEST(argb32_source_argb32);
DO_TEST(argb32_over_argb32);
}
static void do_all_tests(const char *title)
{
printf("[ %s: 10x10x1000000 ]\n", title);
do_tests(10, 10, 1000000);
printf("[ %s: 100x100x20000 ]\n", title);
do_tests(100, 100, 20000);
printf("[ %s: 200x200x10000 ]\n", title);
do_tests(200, 200, 10000);
printf("[ %s: 1200x800x200 ]\n", title);
do_tests(1200, 800, 200);
printf("\n");
}
int main(void)
{
/* Create some test pixmaps */
src32 = twin_pixmap_from_file("assets/tux.png", TWIN_ARGB32);
assert(src32);
dst32 = twin_pixmap_create(TWIN_ARGB32, TEST_PIX_WIDTH, TEST_PIX_HEIGHT);
assert(dst32);
/* fill pixmaps */
twin_fill(dst32, 0x80112233, TWIN_SOURCE, 0, 0, TEST_PIX_WIDTH,
TEST_PIX_HEIGHT);
/* pre-touch data */
test_argb32_source_argb32();
do_all_tests("Pixmap");
return 0;
} Enable it by turning off the existing demo programs. (You might integrate the above into --- a/Makefile
+++ b/Makefile
@@ -109,7 +109,7 @@ endif
ifeq ($(CONFIG_DEMO_APPLICATIONS), y)
target-y += demo-$(BACKEND)
demo-$(BACKEND)_depends-y += $(target.a-y)
-demo-$(BACKEND)_files-y = apps/main.c
+demo-$(BACKEND)_files-y = apps/perf.c
demo-$(BACKEND)_includes-y := include
demo-$(BACKEND)_ldflags-y := \
$(target.a-y) \ Reference test results with the builtin compositing:
|
Promote Pixman option in Kconfig: --- a/configs/Kconfig
+++ b/configs/Kconfig
@@ -16,6 +16,18 @@ config BACKEND_SDL
endchoice
+choice
+ prompt "Renderer Selection"
+ default RENDERER_BUILTIN
+
+config RENDERER_BUILTIN
+ bool "Built-in pixel manipulation"
+
+config RENDERER_PIXMAN
+ bool "Pixman based rendering"
+
+endchoice
+
menu "Features"
config LOGGING
@@ -39,10 +51,6 @@ config CURSOR
bool "Manipulate cursor"
default n
-config PIXMAN
- bool "Pixman to pixel manipulation"
- default y
-
endmenu
menu "Image Loaders" Once the |
Pixman is a low-level library for pixel manipulation, including image compositing and trapezoid rasterization. It serves as the backend for Cairo's in-memory rendering surface and is a binary dependency of Cairo. Mado can be seen as a simplified version of Cairo, and the proposed change introduces an efficient rendering path as an optional feature for faster vector graphics. The original pixel manipulation code should be retained for resource-constrained environments, with the potential for integration into the Linux kernel as an in-kernel vector graphics module. The proposed change adds a configurable option to switch between the built-in (or classical) renderer and the Pixman-based renderer. Applications using Mado should maintain source-level compatibility. The action items for this pull request include:
|
This comment was marked as resolved.
This comment was marked as resolved.
Evaluate the feasibility to replace the primitive operations, such as |
#55 (comment) |
src/pixmap.c
Outdated
@@ -13,6 +13,10 @@ twin_pixmap_t *twin_pixmap_create(twin_format_t format, | |||
twin_coord_t height) | |||
{ | |||
twin_coord_t stride = twin_bytes_per_pixel(format) * width; | |||
/* Padding the stride to a multiple of 4 bytes */ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do you think it is necessary? Explain and consider to submit another pull request for generic pixmap manipulation.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we comment src/pixmap.c#L17,L18, it will happen _pixman_log_error as follow:
*** BUG ***
In create_bits_image_internal: The expression bits == NULL || (rowstride_bytes % sizeof (uint32_t)) == 0 was false
Set a breakpoint on '_pixman_log_error' to debug
[1] 62492 segmentation fault (core dumped) ./demo-sdl
pixman_image_t *msk fails to be allocated in /src/pixman.c#L102. Finally, pixman_image_unref will access a nullptr, and segmentation fault occurs in /src/pixman.c#L107
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do not use screenshots for plain text content, as this is inaccessible to visually impaired users.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we comment it, it will happen _pixman_log_error as follow:
...
pixman_image_t *msk
fails to be allocated. Finally,pixman_image_unref
will access a nullptr, and segmentation fault occurs.
Next, you should check the pointer alignment requirements for Pixman and consider providing the corresponding macros for alignment. See mimalloc for reference.
Show preliminary benchmark results for comparison purposes. |
(Notice that we test in the pixman configuration) |
} | ||
|
||
/* Same function in draw.c */ | ||
static twin_argb32_t _twin_apply_alpha(twin_argb32_t v) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See if the function can be accelerated by Pixman.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Observing the difference between using (left image) and not using (right image) _twin_apply_alpha, we know that _twin_apply_alpha performs the conversion from ABGR to ARGB.
According to the implementation in _twin_apply_alpha,
return alpha << 24 | twin_int_mult(twin_get_8(v, 0), alpha, t1) << 16 |
twin_int_mult(twin_get_8(v, 8), alpha, t2) << 8 |
twin_int_mult(twin_get_8(v, 16), alpha, t3) << 0;
it also multiplies RBG channels with alpha value.
However, the public API in pixman.h doesn't provide the functions about ABGR to ARGB conversion and alpha premultiplication for user.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we know that
_twin_apply_alpha
performs the conversion from ABGR to ARGB.
Check the inner function _convertBGRtoARGB
in file src/image-png.c
which was a workaround for macOS.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I ran mado with gdb. The function _convertBGRtoARGB isn't executed because my operating system is not macOS.
/src/pixman.c#L172 is executed because my computer is a little-endian machine. The PNG data is originally in RGBA format as the following image, but it becomes ABGR format on a little-endian machine. In /src/pixman.c#L172, mado has already performed the ABGR-to-ARGB conversion for the requirement of the pixmap's pixel data.
According to pixman.h, the in or over operator don't exist in public API. Because |
ecd5438
to
54b51b5
Compare
Can you provide preliminary benchmark results for evaluation purpose? |
The following evaluation results come from this test code: # built-in method
[ Pixmap: 10x10x1000000 ]
argb32_source_argb32 0.076773 sec
argb32_over_argb32 0.130076 sec
[ Pixmap: 100x100x20000 ]
argb32_source_argb32 0.065616 sec
argb32_over_argb32 0.176866 sec
[ Pixmap: 200x200x10000 ]
argb32_source_argb32 0.147732 sec
argb32_over_argb32 0.352873 sec
[ Pixmap: 1200x800x200 ]
argb32_source_argb32 0.058365 sec
argb32_over_argb32 0.458145 sec # pixman
[ Pixmap: 10x10x1000000 ]
argb32_source_argb32 0.162651 sec
argb32_over_argb32 0.149407 sec
[ Pixmap: 100x100x20000 ]
argb32_source_argb32 0.020530 sec
argb32_over_argb32 0.053927 sec
[ Pixmap: 200x200x10000 ]
argb32_source_argb32 0.051592 sec
argb32_over_argb32 0.132560 sec
[ Pixmap: 1200x800x200 ]
argb32_source_argb32 0.041629 sec
argb32_over_argb32 0.058690 sec |
Can you explain the test item |
The following results show that it results from the allocation of pixman_image_t for src and dst in 'twin_composite' of pixman.c:
Generally, the results with pixman method are better than the built-in method. |
We should integrate the performance suite before merging the proposed Pixman-based renderer to analyze its performance behavior and identify any regressions. Please submit a separate pull request for that purpose. |
I plan to merge this pull request for further performance evaluation. Meanwhile, I would create new issue(s) to address pixman specific work. Some of them would be as following:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Rebase the latest main
branch, resolve conflicts, and highlight the switch between the built-in renderer and the Pixman-based implementation. For RISC-V platforms without vector or SIMD extensions, the built-in renderer offers simplicity and works well. On modern CPU architectures with rich ISA extensions, Pixman provides faster pixel manipulation, making the switch to the Pixman-based implementation advantageous.
Could I refer to part of your sentence as a reference and include it in the commit message? |
Sure! It is collaboration. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't say "According to issue #6." This pull request was linked to that already.
Don't say "I replace original image compositing with Pixman function." Always write down commit message in third-person point of view.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Show (partial) benchmark results in git commit messages.
Replace original image compositing with Pixman function. Breifly, include: - pixman_image_composite: image compositing - pixman_image_fill_rectangles: fill the image with rectangles Originally, the built-in renderer offers simplicity and works well for RISC-V platforms without vector or SIMD extensions. However, modern CPU architectures have rich ISA extensions, and Pixman provides faster pixel manipulation. Hence, we support Pixman-based implementation to make it advantageous. We just switch to Pixman based rendering in Mado system configuration. The following benchmark results as: width, height, iteration, operation | built-in (sec) | pixman (sec) | --------------------------------------------------------------------- 100 , 100 , 20000 , source | 0.065616 | 0.020530 | , over | 0.176866 | 0.053927 | --------------------------------------------------------------------- 200 , 200 , 10000 , source | 0.147732 | 0.051592 | , over | 0.352873 | 0.132560 | --------------------------------------------------------------------- 1200 , 800 , 200 , source | 0.058365 | 0.041629 | , over | 0.458145 | 0.058690 | --------------------------------------------------------------------- Note: the table shows that Pixman method is better than built-in method for image compositing runtime. Close sysprog21#6
Thank @Bennctu for contributing! |
According to issue #6, I replace original image compositing and trapezoid rasterization with Pixman function. Briefly, include:
Close #6
We compare the results of the original method with those rewritten using the Pixman library:
(Left image: original method, and right image: Pixman method)