GitHub - rfgplk/ajax-simd: a zero-overhead (and common sense) C++ data-parallel types library

ajax 🐘

zero-overhead (and common sense) C++ data-parallel types library

ajax is a C++23 header-only SIMD abstraction library that provides thin, zero-overhead wrappers around x86 SIMD intrinsics, allowing developers to write expressive, type-safe, and performance-critical data-parallel code.

API

// all main public-facing functions are found in src/simd.hpp
// T selects the bit width (e.g. i128, f256) and F the primitive/lane width (e.g. __v32)
v128<T, F> // 128-bit SSE vectors
v256<T, F> // 256-bit AVX/AVX2 vectors
v512<T, F> // 512-bit AVX-512 vectors
// 128-bit aliases
using v8 = v128<i128, __v8>;
using v16 = v128<i128, __v16>;
using v32 = v128<i128, __v32>;
using v64 = v128<i128, __v64>;
using vfloat = v128<f128, __vf>;
using vdouble = v128<d128, __vd>;

// 256-bit
using w8 = v256<i256, __v8>;
using w16 = v256<i256, __v16>;
using w32 = v256<i256, __v32>;
using w64 = v256<i256, __v64>;
using wfloat = v256<f256, __vf>;
using wdouble = v256<d256, __vd>;
// 512-bit
using z8 = v512<i512, __v8>;
using z16 = v512<i512, __v16>;
using z32 = v512<i512, __v32>;
using z64 = v512<i512, __v64>;
using zfloat = v512<f512, __vf>;
using zdouble = v512<d512, __vd>;

// Also supports Eigen-like naming convention
using packet16c = v128<i128, __v8>;
using packet8s = v128<i128, __v16>;
using packet4i = v128<i128, __v32>;
using packet2l = v128<i128, __v64>;
using packet4f = v128<f128, __vf>;
using packet2d = v128<d128, __vd>;

// usage is designed to mirror std::array/std::vector (or similar serial containers), with minimal differences
ajax::w32 arr = {};
arr += 5;
arr -= 20;
arr = { 5, 10, 15, 20, 25, 30, 35, 40 };
if(arr[0] == 5) { /* true */ }
arr = (int)5;
arr = (char)5;
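For comparison, the scalar add and subtract steps in the usage snippet above could be written against raw intrinsics roughly as follows. This is only a sketch under stated assumptions: it does not use ajax, it sticks to standard SSE2 intrinsics on 128-bit vectors so that it builds on any x86_64 compiler without extra flags, `add_to_all` is a hypothetical helper name, and it presumes that `arr += 5` adds the scalar to every lane.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <emmintrin.h>  // SSE2 intrinsics, part of the x86_64 baseline

// Hypothetical helper (not part of ajax): adds `k` to every element of
// `data`, four 32-bit lanes at a time, using raw SSE2 intrinsics.
inline void add_to_all(std::int32_t* data, std::size_t n, std::int32_t k) {
    assert(n % 4 == 0 && "sketch handles whole 128-bit vectors only");
    const __m128i vk = _mm_set1_epi32(k);  // broadcast k into all four lanes
    for (std::size_t i = 0; i < n; i += 4) {
        __m128i* p = reinterpret_cast<__m128i*>(data + i);
        const __m128i v = _mm_loadu_si128(p);       // unaligned load of 4 lanes
        _mm_storeu_si128(p, _mm_add_epi32(v, vk));  // lane-wise add, store back
    }
}
```

Calling `add_to_all(a, 8, 5)` followed by `add_to_all(a, 8, -20)` on an eight-element array plays the role of `arr += 5; arr -= 20;`. The explicit loads, stores, and broadcasts are the bookkeeping that a container-style wrapper is meant to hide.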

Features

written for x86_64
C++23, header-only, zero-overhead design
strongly typed SIMD abstractions
supports 128-bit, 256-bit, and 512-bit vector widths
direct mapping to native x86 intrinsics
compile-time dispatch via concepts and constraints
no runtime cost and no dynamic allocation, enabling constant folding and compile-time evaluation
minimal abstraction over intrinsic registers
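To illustrate what "direct mapping to native x86 intrinsics" with compile-time dispatch can look like, here is a minimal sketch. It is not ajax's implementation: `vec128` and `splat` are hypothetical names, only 128-bit integer lanes are covered, and the dispatch uses `if constexpr` on the lane size so that the example also compiles as C++17, whereas ajax itself states that it uses concepts and constraints.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <emmintrin.h>  // SSE2 intrinsics, part of the x86_64 baseline
#include <type_traits>

// Hypothetical wrapper (not ajax's v128): the lane type picks the
// matching SSE2 intrinsic at compile time, so no branch exists at run time.
template <typename Lane>
struct vec128 {
    static_assert(std::is_integral<Lane>::value, "integer lanes only in this sketch");
    __m128i reg;

    // Broadcast one scalar into every lane.
    static vec128 splat(Lane x) {
        if constexpr (sizeof(Lane) == 1) return {_mm_set1_epi8(static_cast<char>(x))};
        else if constexpr (sizeof(Lane) == 2) return {_mm_set1_epi16(static_cast<short>(x))};
        else if constexpr (sizeof(Lane) == 4) return {_mm_set1_epi32(static_cast<int>(x))};
        else return {_mm_set1_epi64x(static_cast<long long>(x))};
    }

    // Lane-wise addition; each instantiation contains exactly one intrinsic.
    vec128& operator+=(vec128 rhs) {
        if constexpr (sizeof(Lane) == 1) reg = _mm_add_epi8(reg, rhs.reg);
        else if constexpr (sizeof(Lane) == 2) reg = _mm_add_epi16(reg, rhs.reg);
        else if constexpr (sizeof(Lane) == 4) reg = _mm_add_epi32(reg, rhs.reg);
        else reg = _mm_add_epi64(reg, rhs.reg);
        return *this;
    }

    // Read one lane by spilling the register to an aligned buffer.
    Lane operator[](std::size_t i) const {
        assert(i < 16 / sizeof(Lane));
        alignas(16) Lane tmp[16 / sizeof(Lane)];
        _mm_store_si128(reinterpret_cast<__m128i*>(tmp), reg);
        return tmp[i];
    }
};
```

With this sketch, `vec128<std::int32_t>` resolves `+=` to `_mm_add_epi32` and `vec128<std::int8_t>` resolves it to `_mm_add_epi8`. The struct holds only the register, so the wrapper adds no storage beyond the `__m128i` itself.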

Motivation

ajax was developed to provide a minimal, no-nonsense SIMD layer that preserves the full power of native intrinsics while improving readability, correctness, and composability. The lack of common-sense, easily usable SIMD libraries that strike a practical balance between abstraction and control was glaring; existing solutions either expose raw intrinsics with little structure or introduce layers of indirection that impact performance while offering little value in terms of development speed.

In particular, the introduction of std::simd to experimental C++ was found to be overly cumbersome and, in many cases, illogical. Its design diverges significantly from natural container-like semantics, introducing complexity that makes low-level data-parallel programming harder rather than simpler. Instead of feeling like a straightforward extension of fundamental types, it often requires adapting algorithms to fit the abstraction.

License

Licensed under the Boost Software License.
