LLVM and SPIRV-LLVM-Translator pulldown (WW10 2024) #12939

Merged: 794 commits, Mar 7, 2024
df52521
[RemoveDIs] Follow up to 6b62a9135a2, fix a ICmpInst constructor call
jmorse Mar 4, 2024
f32c6b2
[lldb/Test] Fix oversight in ProcessEventDataTest::SetUp (NFC) (#83895)
medismailben Mar 4, 2024
abe4677
[InstCombine] Fix infinite loop due to incorrect `DoesConsume` (#82973)
dtcxzyw Mar 4, 2024
79e8f29
[lldb/lua] Fix Lua building on Windows (#83871)
amordo Mar 4, 2024
0597644
[mlir][transform] replace original op to loop ops (#83537)
HerrCai0907 Mar 4, 2024
930e7ff
[AArch64] Optimize abs, neg and copysign for fp16/bf16
majnemer Mar 4, 2024
1c9125c
AMDGPU: Proper use of HasImageInsts in vimage inst definitions, NFC (…
changpeng Mar 4, 2024
3b5965e
[clang-format][doc] fix documentation for clang-format (#83415)
PeterChou1 Mar 4, 2024
72cf95d
[SPIR-V] Fix warning -Wsometimes-uninitialized (#83901)
michalpaszkowski Mar 4, 2024
93e423f
[libc][test] update constants used in stdbit test (#83893)
nickdesaulniers Mar 4, 2024
be3eeea
[AArch64] Use SHLLv4i16 to shift+widen BF16 to F32.
majnemer Mar 4, 2024
bb97c99
[OpenACC] Enable serial/kernels Compute Constructs
erichkeane Mar 4, 2024
e10dc60
[mlir][sparse][nfc] fixed typo in "translate" (#83891)
aartbik Mar 4, 2024
2e93ee6
Update Benchmark (#83488)
mtrofin Mar 4, 2024
d95e6d0
[mlir] GEMM Hopper Tensor Core Integration Test (#81478)
grypp Mar 4, 2024
922a431
[profdata][nfc] Disable several tests on Windows (#83907)
alanzhao1 Mar 4, 2024
aec6a04
Revert "Update Benchmark (#83488)"
mtrofin Mar 4, 2024
6325dd5
[HLSL][SPIR-V] Add SV_DispatchThreadID semantic support (#82536)
sudonatalie Mar 4, 2024
df9ba13
[LV] Handle scalable VFs in optimizeForVFAndUF (#82669)
preames Mar 4, 2024
eaa0d3b
[libc][test][stdbit] fix has_single_bit test names (#83904)
nickdesaulniers Mar 4, 2024
a5b7971
Reapply "Update Benchmark (#83488)" (#83916)
mtrofin Mar 4, 2024
8848258
[SROA] Unfold gep of index phi (round 2) (#83494)
aeubanks Mar 4, 2024
691fc7c
[mlir][sparse] add dim/lvl information to sparse_tensor.print (#83913)
aartbik Mar 4, 2024
8cc8fda
[AArch64] Also promote vector bf16 INT_TP_FP to f32
d0k Mar 4, 2024
52b69aa
[mlir][sparse] support sparsifying batch levels (#83898)
PeimingLiu Mar 4, 2024
10ccde3
[InstallAPI] Collect symbols from ObjC Ivars (#83632)
cyndyishida Mar 4, 2024
3bc0ff2
[Analysis] Move `DomConditionCache::findAffectedValues` to a new file…
goldsteinn Feb 26, 2024
6ee46ab
[Analysis] Share `findAffectedValues` between DomConditionCache and A…
goldsteinn Feb 26, 2024
db3bbe0
[Analysis] Unify most of the tracking between AssumptionCache and Dom…
goldsteinn Feb 26, 2024
82cc2a6
[libc] Include stdfix.h in baremetal targets (#83900)
petrhosek Mar 4, 2024
6fd27d5
[libc++] Don't generate the modulemap file (#80352)
ldionne Mar 4, 2024
4d80df0
[libc++] Do not forward-declare syncstream outside experimental (#82511)
ldionne Mar 4, 2024
5174b38
[libc++] Use __wrap_iter in string_view and array in the unstable ABI…
ldionne Mar 4, 2024
40081a4
[libc++] Fix diagnostic for <stdatomic.h> before C++23 (#83351)
ldionne Mar 4, 2024
07b1aeb
[LLVM][NFC] Ignore including the GNUInstallDirs on the GPU (#83910)
jhuber6 Mar 4, 2024
5000e4c
[lldb/crashlog] Fix breaking changes in textual report format (#83861)
medismailben Mar 4, 2024
488ac3d
[lldb] Enable a test that was never enabled (#83925)
felipepiovezan Mar 4, 2024
05390df
[mlir][sparse] migration to sparse_tensor.print (#83926)
aartbik Mar 4, 2024
8e56fb8
[clang-tidy] CRTP Constructor Accessibility Check (#82403)
isuckatcs Mar 5, 2024
1ebbf97
[LLVM] Partially revert GPU change to include dirs
jhuber6 Mar 5, 2024
bdfebc3
[X86] Use update_mir_test_checks.py to generate CHECK lines in masked…
topperc Mar 5, 2024
8d6e867
[LSR][term-fold] Ensure the simple recurrence is from the current loo…
patrick-rivos Mar 5, 2024
57592e9
[Clang] Fixes of builtin definitions after PR #68324. (#81022)
michele-scandale Mar 5, 2024
81617f8
[Driver][RISCV] Forward --no-relax option to linker for RISC-V on *BS…
brad0 Mar 5, 2024
3e40c96
[X86] Resolve FIXME: Add FPCW as a rounding control register (#82452)
AZero13 Mar 5, 2024
275fe3a
[mlir][sparse] support complex type for sparse_tensor.print (#83934)
aartbik Mar 5, 2024
a5c90e4
[LoongArch] Switch to the Machine Scheduler (#83759)
wangleiat Mar 5, 2024
6f5c4f2
[mlir][vector]Add Vector bitwidth target to Linearize Vectorizable an…
bviyer Mar 5, 2024
ccf0c8d
[BOLT] Add reading support for Linux kernel exception table (#83100)
maksfb Mar 5, 2024
1a67dee
Revert "[ORC][JITLink] Add Intel VTune support to JITLink (#81826)"
rupprecht Mar 5, 2024
564b81d
Add support for x87 registers on GISel register selection (#83528)
MalaySanghi Mar 5, 2024
377feae
[X86][NFC] Clang-format X86DisassemblerDecoder.h
KanRobert Mar 5, 2024
65a8e3a
[MLIR] Fix crash in notifyBlockInserted() debug output (NFC)
joker-eph Mar 5, 2024
0fbe45b
[RISCV] Add support of Sscofpmf (#83831)
wangpc-pp Mar 5, 2024
9dab2e3
[clang][Sema] Warn on return of pointer/reference to compound literal…
JOE1994 Mar 5, 2024
7cd3268
[X86][TableGen] Fix the mnemonic table for CMPCCXADD
KanRobert Mar 5, 2024
cec2073
[clang][Interp] Diagnose comparisions against weak function pointers
tbaederr Mar 4, 2024
d016712
[bzl] Remove obsolete `output_to_genfiles = True` (#83944)
rupprecht Mar 5, 2024
662d821
[mlir][sparse] migrate datastructure tests to sparse_tensor.print (#8…
aartbik Mar 5, 2024
ed62758
[PowerPC][NFC] Update aix-tls-xcoff-reloc.ll (#83764)
Mar 5, 2024
0c47363
[InstCombine] Simplify nested selects with implied condition (#83739)
dtcxzyw Mar 5, 2024
3cb999c
[RISCV] Remove note of Sscofpmf and add a newline after FeatureStdExt…
wangpc-pp Mar 5, 2024
80f9458
[libc++][sstream] P2495R3: Interfacing `stringstream`s with `string_v…
H-G-Hristov Mar 5, 2024
9b672de
[clang][Builtins] Parse clang extended vectors types. (#83584)
fpetrogalli Mar 5, 2024
90e9e96
[mlir][NFC] Apply rule of five to *Pass classes (#80998)
andrey-golubev Mar 5, 2024
a36b73e
Apply clang-tidy fixes for modernize-loop-convert in Barvinok.cpp (NFC)
joker-eph Mar 1, 2024
9915418
Apply clang-tidy fixes for modernize-use-emplace in Barvinok.cpp (NFC)
joker-eph Mar 1, 2024
2db8b94
Apply clang-tidy fixes for performance-unnecessary-value-param in Bar…
joker-eph Mar 1, 2024
1934fc6
Apply clang-tidy fixes for readability-container-size-empty in Barvin…
joker-eph Mar 1, 2024
1837579
Apply clang-tidy fixes for readability-simplify-boolean-expr in Integ…
joker-eph Mar 1, 2024
46f65e4
[mlir]use correct iterator when eraseOp (#83444)
HerrCai0907 Mar 5, 2024
eaa9ef6
[lld][LoongArch] Support the R_LARCH_{ADD,SUB}_ULEB128 relocation typ…
MQ-mengqing Mar 5, 2024
0e337c6
Replace copy with a reference. (#82485)
MalaySanghi Mar 5, 2024
3105cfe
[FileCheck] Fix parsing empty global and pseudo variable names (#83667)
kovdan01 Mar 5, 2024
ad5aea3
Revert "[flang] disable memory-allocation-opt.fir test on windows" (#…
jeanPerier Mar 5, 2024
74dfded
[flang] Deallocate structure constructor allocatable components (#83824)
jeanPerier Mar 5, 2024
2984699
[flang] Implement passing of assumed-type actual arguments. (#83851)
jeanPerier Mar 5, 2024
9a894e7
[Github Automation] Allow colon after cherry-pick command (#81002)
Sh0g0-1758 Mar 5, 2024
ff66e9b
[Docs] Update frontend perfomance tips about loads/stores (#83833)
nikic Mar 5, 2024
d773c00
[Clang] Fix looking for immediate calls in default arguments. (#80690)
cor3ntin Mar 5, 2024
690bf64
[clang] Support `__is_trivially_copyable(int()&)==false` (#81298)
AMP999 Mar 5, 2024
6862233
[LangRef] Add aligned attribute description into masked.compress/expa…
yetingk Mar 5, 2024
88414c8
[analyzer][NFC] Remove dead code (#83968)
steakhal Mar 5, 2024
c5d16e7
[IR] Update comment in Verifier::visitPHINode
jayfoad Mar 5, 2024
16f2a1f
[mlir] Apply ClangTidy findings
akuegel Mar 5, 2024
56abb8d
[AArch64] Be stricter about insert/extract index
huntergr-arm Mar 5, 2024
0709eeb
[mlir] Followup fix, added noexcept in wrong place
akuegel Mar 5, 2024
b585c43
[Flang][OpenMP] : Add a temporary lowering for workshare directive (#…
kiranchandramohan Mar 5, 2024
a668846
[DAGCombiner] Handle extending EXTRACT_VECTOR_ELTs in calculateBytePr…
lukel97 Mar 5, 2024
762f762
[AMDGPU] Rename get_MUBUF_ps and use it for MTBUF too. NFC. (#83991)
jayfoad Mar 5, 2024
923ddf6
[ObjC] Check entire chain of superclasses to see if class layout is s…
AZero13 Mar 5, 2024
191f767
[X86] 2007-03-15-GEP-Idx-Sink.ll - regenerate test checks
RKSimon Mar 5, 2024
49f9505
[X86] pr59305.ll - replace "X86-64" check prefix with "X64"
RKSimon Mar 5, 2024
d1d2932
[Support][NFC] Add test documenting that empty `Regex` pattern matche…
martinboehme Mar 5, 2024
2089596
[NVPTX] Remove sub.s16x2 instruction
d0k Mar 5, 2024
94a0dd5
[InstCombine] Fix Failure to convert vector fp comparisons that can b…
SahilPatidar Mar 5, 2024
de1f338
[TableGen] Fix wrong codegen of BothFusionPredicateWithMCInstPredicat…
wangpc-pp Mar 5, 2024
4693efe
[AMDGPU] Remove Base_MUBUF_Real_Atomic_gfx11. NFC. (#83994)
jayfoad Mar 5, 2024
67a7a5e
[AMDGPU] Only use the BUF Base_ prefix for multiple architectures. NFC.
jayfoad Mar 5, 2024
90e97e7
[libc] Disable verbose logging messages on hermetic tests (#83954)
jhuber6 Mar 5, 2024
341d674
[LLVM][AArch64][CodeGen] Mark FFR as a reserved register. (#83437)
paulwalker-arm Mar 5, 2024
9f29401
Merge from 'sycl' to 'sycl-web'
Mar 5, 2024
c00c901
[clang] Use separator for large numeric values in overflow diagnostic…
Atousa Mar 5, 2024
d95a0d7
[DAG] Teach SelectionDAGBuilder to read parameter alignment of compre…
yetingk Mar 5, 2024
d51fcd4
[InstCombine] Handle scalable splat in `getFlippedStrictnessPredicate…
dtcxzyw Mar 5, 2024
e84182a
[X86][Inline] Skip inline asm in inlining target feature check (#83820)
nikic Mar 5, 2024
894f52f
[AMDGPU] Use BUF multiclasses to reduce repetition. NFC. (#84003)
jayfoad Mar 5, 2024
a642eb8
[Clang][Sema] Fix crash when using name of UnresolvedUsingValueDecl w…
sdkrystian Mar 5, 2024
0d8e16a
[analyzer][NFC] Make CheckerDocumentation checker in-sync with actual…
steakhal Mar 5, 2024
4ce737b
[clang] Sequence C++20 Parenthesized List Init (#83476)
vapdrs Mar 5, 2024
4cf8b29
[AMDGPU][PromoteAlloca] Correctly handle a variable vector index (#83…
bcahoon Mar 5, 2024
d9b435c
[lld][test] Fix sanitizer buildbot failure
SixWeining Mar 5, 2024
a1a590e
[InstCombine] Fix miscompilation in PR83947 (#83993)
dtcxzyw Mar 5, 2024
b2c16e7
Revert "[ARM] R11 not pushed adjacent to link register with PAC-M and…
jwestwood921 Mar 5, 2024
f3be842
[mlir] Expose MLIR_ROCM_CONVERSIONS_ENABLED in mlir-config.h. (#83977)
ingomueller-net Mar 5, 2024
1c2b79a
[SLP]Add runtime stride support for strided loads.
alexey-bataev Mar 5, 2024
a64975f
[mlir][irdl] Add support for basic structural constraints in tblgen-t…
math-fehr Mar 5, 2024
fac791d
Export `LLVM_VERSION_MAJOR` CMake variable as a directory property (#…
bjacob Mar 5, 2024
a691f65
[mlir][py] better support for arith.constant construction (#83259)
ftynse Mar 5, 2024
2fe81ed
[NFC][RemoveDIs] Insert instruction using iterators in Transforms/
jmorse Mar 4, 2024
a41bcb3
[lld][LoongArch] Fix handleUleb128
SixWeining Mar 5, 2024
6409c21
[SCEVExpander] Use PoisoningVH for OrigFlags
nikic Mar 5, 2024
08e036e
[PhaseOrdering][X86] Add test coverage for #67803
RKSimon Mar 5, 2024
da63746
[infra] Add git to Linux container (#82687)
sudonatalie Mar 5, 2024
ec7062d
[clangd] Add metric for rename decl kind (#83867)
DavidGoldman Mar 5, 2024
88d82b7
[libc] fix more readability-identifier-naming lints (#83914)
nickdesaulniers Mar 5, 2024
640c857
[libc] fix readability-identifier-naming in memory_utils/utils.h (#83…
nickdesaulniers Mar 5, 2024
2aa22ca
[libc] suppress readability-identifier-naming for std::numeric_limits…
nickdesaulniers Mar 5, 2024
169824b
[SLP][NFC]SPlit test/Transforms/SLPVectorizer/AArch64/getelementptr.ll,
alexey-bataev Mar 5, 2024
3b84b6f
[CostModel][X86] Add test coverage for 'concat subvector' style shuffles
RKSimon Mar 5, 2024
afb05cd
[Flang][MLIR][OpenMP] Create a deferred declare target marking proces…
agozillon Mar 5, 2024
7df43cc
[clang] Add -Wmissing-designated-field-initializers (#81364)
vvd170501 Mar 5, 2024
761b88b
Merge from 'main' to 'sycl-web' (207 commits)
jyu2-git Mar 5, 2024
e49479b
[AMDGPU] Remove unneeded BUF _impl multiclasses. NFC. (#84034)
jayfoad Mar 5, 2024
26058e6
[DAG] select (sext m), (add X, C), X --> (add X, (and C, (sext m)))) …
elhewaty Mar 5, 2024
6b5888c
[AArch64] Verify ldp/stp alignment stricter (#83948)
ytmukai Mar 5, 2024
a8cb9db
[SystemZ] Use proper relocation for TLS variable debug info (#83975)
uweigand Mar 5, 2024
1e828f8
[SROA]: Only defer trying partial sized ptr or ptr vector types
jrbyrnes Feb 19, 2024
f33f66b
[NFC][RemoveDIs] Always use iterators for inserting PHIs
jmorse Mar 5, 2024
f836048
[gn] port 6fd27d5b0321f (no more module.modulemap.in)
nico Mar 5, 2024
1b1aea7
AMDGPU: Make s_wait_samplecnt(_bvhcnt) dependent on hasImageInsts, NF…
changpeng Mar 5, 2024
a730ed7
[SLP]Improve minbitwidth analysis.
alexey-bataev Mar 5, 2024
643b31d
[HLSL] implement `mad` intrinsic (#83826)
farzonl Mar 5, 2024
06fea93
[TextAPI] add missing platforms for translating triples to tapi targets
cyndyishida Mar 5, 2024
b2ca23a
[HLSL] implement exp intrinsic (#83832)
farzonl Mar 5, 2024
2807ea6
[HLSL] implement the any intrinsic (#83903)
farzonl Mar 5, 2024
9f67f19
[gn] Remove ScudoBenchmarks
nico Mar 5, 2024
233f750
[flang] Catch more bad pointer initialization targets (#83731)
klausler Mar 5, 2024
041638c
[libc][stdbit] implement stdc_bit_width (C23) (#83892)
nickdesaulniers Mar 5, 2024
aa4e3d5
[gn build] Port 1a67dee08913
llvmgnsyncbot Mar 5, 2024
97e9780
[gn build] Port 8e56fb824a43
llvmgnsyncbot Mar 5, 2024
caad379
[gn build] Port a8cb9db5f59d
llvmgnsyncbot Mar 5, 2024
ee044d5
[clang] Diagnose config_macros before building modules (#83641)
Bigcheese Mar 5, 2024
083d8aa
[SLP][NFC]Use TargetTransformInfo:: instead of TTI:: in BoUpSLP to avoid
alexey-bataev Mar 5, 2024
1b812f9
[lldb] Log to system log instead of stderr from Host::SystemLog (#83366)
JDevlieghere Mar 5, 2024
3cef82d
[flang] Fix bogus error message about invalid polymorphic entity (#83…
klausler Mar 5, 2024
61c0677
[KnownBits] Add API for `nuw` flag in `computeForAddSub`; NFC
goldsteinn Feb 29, 2024
17162b6
[KnownBits] Make `nuw` and `nsw` support in `computeForAddSub` optimal
goldsteinn Feb 29, 2024
9a20612
[flang] NULL(NULL(NULL(...(NULL()...))) means NULL() (#83738)
klausler Mar 5, 2024
d35f2c4
Remove constant local variable (#83850)
Mar 5, 2024
069aee0
[flang] Rework F'2023 constraint C1167 checking (#83888)
klausler Mar 5, 2024
cc7544e
[benchmark] Delete WORKSPACE bzl files (#84013)
rupprecht Mar 5, 2024
a9304ed
Fix remaining build failures with GCC 8.3 (#83266)
RoboTux Mar 5, 2024
13cd0a9
[flang] Skim usage before marking unknown module externals as subrout…
klausler Mar 5, 2024
b3b408b
[Fuchsia] Include baremetal ARM builtins and libc (#83949)
petrhosek Mar 5, 2024
83ca78d
[flang] Emit "raw" name for procedure interface in module file (#83915)
klausler Mar 5, 2024
2f343fc
[lldb] Print a message when background tasks take a while to complete…
JDevlieghere Mar 5, 2024
c8b3edc
Revert "[SLP][NFC]Use TargetTransformInfo:: instead of TTI:: in BoUpS…
alexey-bataev Mar 5, 2024
aae152f
Revert "[SLP]Improve minbitwidth analysis."
alexey-bataev Mar 5, 2024
64faa52
Disable clang-tidy misc-include-cleaner (#83945)
joker-eph Mar 5, 2024
b6ca602
[mlir][sparse] migrate tests to sparse_tensor.print (#84055)
aartbik Mar 5, 2024
b8c6252
[libc++abi] Always re-export std:: exception types from libc++abi (#8…
ldionne Mar 5, 2024
eccc717
[SystemZ] [z/OS] Emit offset to PPA2 in separate MCSection (#84043)
Everybody0523 Mar 5, 2024
4a4fb93
Use the new ThreadPoolInterface base class instead of the concrete im…
joker-eph Mar 5, 2024
13bb726
[OpenACC] Fix typo in StmtOpenACC.cpp header.
erichkeane Mar 5, 2024
9faca1e
[compiler-rt/darwin] Disable building sanitizers on platforms without…
rohit-rao Mar 5, 2024
7de6f61
[SPIR-V] Memory leak fix in SPIRVEmitIntrinsics (#83015)
bwlodarcz Mar 5, 2024
a5095b9
[RISCV] Add test for incorrect FP build vector lowering. NFC
topperc Mar 5, 2024
5a52662
[HLSL] implement the rcp intrinsic (#83857)
farzonl Mar 5, 2024
50d848d
[flang] Added lowering and runtime for COMPLEX(16) intrinsics. (#83874)
vzakhari Mar 5, 2024
bf631c6
AMDGPU: Copy SubtargetPredicate from pseudo for DSDIR_Real (#84057)
changpeng Mar 5, 2024
201572e
[AArch64] Implement -fno-plt for SelectionDAG/GlobalISel
MaskRay Mar 5, 2024
f51ade2
[BOLT] Add reading support for Linux kernel .parainstructions section…
maksfb Mar 5, 2024
e96c0c1
[InstCombine] Fix shift calculation in InstCombineCasts (#84027)
dianqk Mar 5, 2024
58d8805
[RISCV] Always use signed APSInt in getExactInteger. (#84070)
topperc Mar 5, 2024
e77a473
[ORC][MachO] Simplify use of LC_BUILD_VERSION in JITDylib headers.
Mar 5, 2024
d6c52c1
AMDGPU: Define HasExpOrExportInsts for export instruction definitions…
changpeng Mar 5, 2024
6f11c95
Revert "[AArch64] Verify ldp/stp alignment stricter" (#84096)
fmayer Mar 5, 2024
55c466d
[X86][AVX512BF16] Add a few missing insert/extract patterns
d0k Mar 5, 2024
0207270
[RISCV] Don't remove extends for i1 indices in mgather/mscatter (#83951)
lukel97 Mar 6, 2024
6c39e3f
[clang][RISCV] Reorder sema check for RVV type (#83553)
4vtomat Mar 6, 2024
2a1b09f
[RISCV][SiFive] Add RISCVUsage for SiFive Intelligence Extensions (#8…
4vtomat Mar 6, 2024
11f74cd
[RISCV] Improve error message when the extension is not supported (#8…
4vtomat Mar 6, 2024
aeda1a6
[Clang][LoongArch] Precommit test for fix wrong return value type of …
wangleiat Mar 5, 2024
8b326d5
[clang-tidy] fix false negative in cppcoreguidelines-missing-std-forw…
jcsxky Mar 6, 2024
889d99a
[TTI] Add alignment argument to TTI for compress/expand support (#83516)
nikolaypanchenko Mar 6, 2024
31c304b
[SCEV] Migrate some tests to be autogenerated
preames Mar 6, 2024
716042a
Rename llvm::ThreadPool -> llvm::DefaultThreadPool (NFC) (#83702)
joker-eph Mar 6, 2024
2f479b8
[Clang][LoongArch] Fix wrong return value type of __iocsrrd_h (#84100)
wangleiat Mar 6, 2024
987fe6f
[SCEV] Migrate a couple tests to be auto generated
preames Mar 6, 2024
d32bcbf
[lldb] Fix build failure in Debugger.cpp (NFC)
DamonFool Mar 6, 2024
f439c71
[InstrProf][NFC] Fix -Wimplicit-fallthrough warning in InstrProf.cpp …
wangleiat Mar 6, 2024
49ec8b7
AMDGPU: Define and Use HasInterpInsts for interp inst definitions (#8…
changpeng Mar 6, 2024
85388a0
[RISCV] Move RISCVVType namespace to TargetParser (#83222)
wangpc-pp Mar 6, 2024
96fc548
Revert "[mlir][py] better support for arith.constant construction" (#…
joker-eph Mar 6, 2024
929ceec
[gn build] Port 85388a06b602
llvmgnsyncbot Mar 6, 2024
0fa04b6
[libomptarget] Fix libomptarget.rtl.amdgpu.so installation
ye-luo Mar 6, 2024
ae709c1
[RISCV] Add TargetParser to MCA (#84109)
wangpc-pp Mar 6, 2024
403b9cf
[WebAssembly] Use RefTypeMem2Local instead of Mem2Reg (#83196)
aheejin Mar 6, 2024
433b711
[CUDA] Correctly set CUDA default architecture (#84017)
jhuber6 Mar 6, 2024
a30233f
[libc] Fix standalone cross compiling build for the GPU (#84042)
jhuber6 Mar 6, 2024
0f3628a
AMDGPU: Correct cycle counts for f64 mfma on gfx940 (#83782)
arsenm Mar 6, 2024
5549b01
[clangd] Make all calls to format::getStyle() go through getFormatSty…
HighCommander4 Mar 6, 2024
d1aec79
[clangd] Remove calls to getFileLoc() in declToSym() (#83532)
HighCommander4 Mar 6, 2024
b930b14
[mlir][complex] Support fast math flag in converting complex.atan2 op…
Lewuathe Mar 6, 2024
7bad74e
[clangd] Show argument names for function pointer struct fields (#69011)
Qwinci Mar 6, 2024
5fb3311
[RISCV] Use uint32_t for NumOfVReg in getVLENFactoredAmount. (#84110)
topperc Mar 6, 2024
2a13422
Convert many LivePhysRegs uses to LiveRegUnits (#83905)
AZero13 Mar 6, 2024
3f7aa04
[nfc] Fix RTTI for `InstrProf` intrinsics (#83511)
mtrofin Mar 6, 2024
9e759f3
[AArch64] Fix fptoi/itofp for bf16
majnemer Mar 6, 2024
eceb24c
[RISCV] Hoist immediate addresses from loads/stores (#83644)
francisvm Mar 6, 2024
228c737
[RISCV] Add tests for vsetvli/vsetvlimax with different SEW/LMUL
wangpc-pp Feb 18, 2024
6082263
Restore "Implement convergence control in MIR using SelectionDAG (#71…
ssahasra Mar 6, 2024
95b52ec
[RISCV] Take SEW/LMUL into account for value tracking of vsetvli[max]…
wangpc-pp Mar 6, 2024
6bf6128
[gn build] Port 60822637bf00
llvmgnsyncbot Mar 6, 2024
5ff3f66
[ELF] Improve wildcard tests for input section descriptions
MaskRay Mar 6, 2024
d3e79e4
[ELF] Improve wildcard test
MaskRay Mar 6, 2024
d3df2a8
[C++20] [Modules] Handle transitive import in the module properly
ChuanqiXu9 Mar 6, 2024
bec7ad9
[RISCV] Add tests for vw{add,sub,mul} with nested extend. NFC
lukel97 Mar 6, 2024
7a0accc
[clang] Add some CodeGen tests for CWG 4xx issues (#83715)
Endilll Mar 6, 2024
9f45c5e
[InstCombine] Fix infinite loop in select equivalence fold (#84036)
nikic Mar 6, 2024
d773f0e
Merge from 'sycl' to 'sycl-web'
Mar 6, 2024
6cdf596
[InstCombine] If inst in unreachable refers to an inst change it to p…
ParkHanbum Mar 6, 2024
aced81c
[C23] Implement N3018: The constexpr specifier for object definitions…
Fznamznon Mar 6, 2024
6e27dd4
[mlir][gpu] Replace MLIR_GPU_TO_HSACO_PASS_ENABLE by more generic one…
ingomueller-net Mar 6, 2024
d70254a
[mlir][nvvm] Add missing include to llvm-config.h. (#83998)
ingomueller-net Mar 6, 2024
0e5bde0
[mlir] Remove noexcept again from move constructors.
akuegel Mar 6, 2024
52d5b8e
[AMDGPU] Don't form sext/abs/neg fp8 cvt (#83843)
Pierre-vh Mar 6, 2024
471a612
[mlir][drr] Add warning for simple case of mismatched variadic. (#84040)
jpienaar Mar 6, 2024
ac74d9e
[clang][NFC] Regroup declarations in `Sema` (#82217)
Endilll Mar 6, 2024
0d6e9bd
Merge from 'main' to 'sycl-web' (128 commits)
jyu2-git Mar 6, 2024
ee3fb55
Merge from 'sycl' to 'sycl-web' (5 commits)
Mar 6, 2024
7dfc12d
Merge from 'main' to 'sycl-web' (29 commits)
jyu2-git Mar 6, 2024
4f88a10
Remove internal values for SPV_INTEL_maximum_registers (#2387)
vmaksimo Feb 29, 2024
31826ee
Implement lowering of llvm.bitreverse.* (#2345)
LU-JOHN Mar 4, 2024
89b4bc9
Use unordered_map for better performance (#2356)
LU-JOHN Mar 4, 2024
c7b3545
Fix TaskSequenceCreateINTEL instruction verification (#2384)
vmaksimo Mar 5, 2024
949feed
Remove extra semicolon (#2401)
svenvh Mar 5, 2024
4a58a77
Remove tail from atomic_compare_exchange_strong_explicit call (#2395)
wenju-he Mar 5, 2024
2483d62
Handle OpVectorShuffle with differing vector sizes (#2391)
svenvh Mar 6, 2024
8 changes: 0 additions & 8 deletions .arcconfig

This file was deleted.

15 changes: 0 additions & 15 deletions .arclint

This file was deleted.

2 changes: 1 addition & 1 deletion .clang-tidy
@@ -1,4 +1,4 @@
-Checks: '-*,clang-diagnostic-*,llvm-*,misc-*,-misc-const-correctness,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-misc-no-recursion,-misc-use-anonymous-namespace,readability-identifier-naming'
+Checks: '-*,clang-diagnostic-*,llvm-*,misc-*,-misc-const-correctness,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-misc-no-recursion,-misc-use-anonymous-namespace,readability-identifier-naming,-misc-include-cleaner'
 CheckOptions:
   - key: readability-identifier-naming.ClassCase
     value: CamelCase
3 changes: 3 additions & 0 deletions .github/new-prs-labeler.yml
@@ -869,6 +869,9 @@ backend:SystemZ:
 third-party:unittests:
   - third-party/unittests/**

+third-party:benchmark:
+  - third-party/benchmark/**
+
 llvm:binary-utilities:
   - llvm/docs/CommandGuide/llvm-*
   - llvm/include/llvm/BinaryFormat/**
6 changes: 5 additions & 1 deletion .github/workflows/build-ci-container.yml
@@ -77,14 +77,18 @@ jobs:
           cp ./.github/workflows/containers/github-action-ci/storage.conf ~/.config/containers/storage.conf
           podman info

+      # Download the container image into /mnt/podman rather than
+      # $GITHUB_WORKSPACE to avoid space limitations on the default drive
+      # and use the permissions setup for /mnt/podman.
       - name: Download stage1-toolchain
         uses: actions/download-artifact@v4
         with:
           name: stage1-toolchain
+          path: /mnt/podman

       - name: Load stage1-toolchain
         run: |
-          podman load -i stage1-toolchain.tar
+          podman load -i /mnt/podman/stage1-toolchain.tar

       - name: Build Container
         working-directory: ./.github/workflows/containers/github-action-ci/

@@ -12,11 +12,13 @@ COPY --from=stage2-toolchain $LLVM_SYSROOT $LLVM_SYSROOT
 # Need to install curl for hendrikmuhs/ccache-action
 # Need nodejs for some of the GitHub actions.
 # Need perl-modules for clang analyzer tests.
+# Need git for SPIRV-Tools tests.
 RUN apt-get update && \
     apt-get install -y \
     binutils \
     cmake \
     curl \
+    git \
     libstdc++-11-dev \
     ninja-build \
     nodejs \
8 changes: 8 additions & 0 deletions bolt/include/bolt/Core/BinaryFunction.h
@@ -2056,6 +2056,14 @@ class BinaryFunction {
   /// Returns false if disassembly failed.
   Error disassemble();

+  /// An external interface to register a branch while the function is in
+  /// disassembled state. Allows to make custom modifications to the
+  /// disassembler. E.g., a pre-CFG pass can add an instruction and register
+  /// a branch that will later be used during the CFG construction.
+  ///
+  /// Return a label at the branch destination.
+  MCSymbol *registerBranch(uint64_t Src, uint64_t Dst);
+
   Error handlePCRelOperand(MCInst &Instruction, uint64_t Address,
                            uint64_t Size);

7 changes: 3 additions & 4 deletions bolt/include/bolt/Core/ParallelUtilities.h
@@ -18,6 +18,7 @@

 #include "bolt/Core/MCPlusBuilder.h"
 #include "llvm/Support/CommandLine.h"
+#include "llvm/Support/ThreadPool.h"

 using namespace llvm;

@@ -28,8 +29,6 @@ extern cl::opt<unsigned> TaskCount;
 } // namespace opts

 namespace llvm {
-class ThreadPool;
-
 namespace bolt {
 class BinaryContext;
 class BinaryFunction;
@@ -50,8 +49,8 @@ enum SchedulingPolicy {
   SP_BB_QUADRATIC, /// cost is estimated by the square of the BB count
 };

-/// Return the managed thread pool and initialize it if not initiliazed.
-ThreadPool &getThreadPool();
+/// Return the managed thread pool and initialize it if not initialized.
+ThreadPoolInterface &getThreadPool();

 /// Perform the work on each BinaryFunction except those that are accepted
 /// by SkipPredicate, scheduling heuristic is based on SchedPolicy.
24 changes: 17 additions & 7 deletions bolt/lib/Core/BinaryFunction.cpp
@@ -1445,6 +1445,16 @@ Error BinaryFunction::disassemble() {
   return Error::success();
 }

+MCSymbol *BinaryFunction::registerBranch(uint64_t Src, uint64_t Dst) {
+  assert(CurrentState == State::Disassembled &&
+         "Cannot register branch unless function is in disassembled state.");
+  assert(containsAddress(Src) && containsAddress(Dst) &&
+         "Cannot register external branch.");
+  MCSymbol *Target = getOrCreateLocalLabel(Dst);
+  TakenBranches.emplace_back(Src - getAddress(), Dst - getAddress());
+  return Target;
+}
+
 bool BinaryFunction::scanExternalRefs() {
   bool Success = true;
   bool DisassemblyFailed = false;
@@ -1759,13 +1769,6 @@ void BinaryFunction::postProcessJumpTables() {
       }
     }
   }
-
-  // Remove duplicates branches. We can get a bunch of them from jump tables.
-  // Without doing jump table value profiling we don't have use for extra
-  // (duplicate) branches.
-  llvm::sort(TakenBranches);
-  auto NewEnd = std::unique(TakenBranches.begin(), TakenBranches.end());
-  TakenBranches.erase(NewEnd, TakenBranches.end());
 }

 bool BinaryFunction::validateExternallyReferencedOffsets() {
@@ -2128,6 +2131,13 @@ Error BinaryFunction::buildCFG(MCPlusBuilder::AllocatorIdTy AllocatorId) {
   // e.g. exit(3), etc. Otherwise we'll see a false fall-through
   // blocks.

+  // Remove duplicates branches. We can get a bunch of them from jump tables.
+  // Without doing jump table value profiling we don't have a use for extra
+  // (duplicate) branches.
+  llvm::sort(TakenBranches);
+  auto NewEnd = std::unique(TakenBranches.begin(), TakenBranches.end());
+  TakenBranches.erase(NewEnd, TakenBranches.end());
+
   for (std::pair<uint32_t, uint32_t> &Branch : TakenBranches) {
     LLVM_DEBUG(dbgs() << "registering branch [0x"
                       << Twine::utohexstr(Branch.first) << "] -> [0x"
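The BinaryFunction.cpp hunks interact: registerBranch() appends an offset pair to TakenBranches while the function is in the disassembled state, and the sort/unique/erase pass now runs at the start of buildCFG() rather than at the end of postProcessJumpTables(). One plausible reading is that the later deduplication also covers branches registered after jump-table post-processing, although the diff does not state the motivation. The sketch below models only that behaviour. `ToyFunction` and its members are hypothetical stand-ins rather than BOLT's `BinaryFunction`, and `std::sort` is used in place of `llvm::sort` to avoid an LLVM dependency.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical stand-in for the parts of BinaryFunction touched by the diff.
struct ToyFunction {
  uint64_t Address; // start address of the function
  uint64_t Size;    // size of the function in bytes
  // Branches stored as (source offset, destination offset) from Address.
  std::vector<std::pair<uint32_t, uint32_t>> TakenBranches;

  bool containsAddress(uint64_t A) const {
    return A >= Address && A < Address + Size;
  }

  // Record a branch between two addresses inside the function.
  void registerBranch(uint64_t Src, uint64_t Dst) {
    assert(containsAddress(Src) && containsAddress(Dst) &&
           "Cannot register external branch.");
    TakenBranches.emplace_back(static_cast<uint32_t>(Src - Address),
                               static_cast<uint32_t>(Dst - Address));
  }

  // Sort, then drop adjacent duplicates: the same sort/unique/erase idiom
  // that the diff moves into buildCFG().
  void dedupeTakenBranches() {
    std::sort(TakenBranches.begin(), TakenBranches.end());
    auto NewEnd = std::unique(TakenBranches.begin(), TakenBranches.end());
    TakenBranches.erase(NewEnd, TakenBranches.end());
  }
};
```

Because `std::unique` only removes adjacent equal elements, the sort has to come first; registering the same branch twice and then calling `dedupeTakenBranches()` leaves a single entry.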
10 changes: 5 additions & 5 deletions bolt/lib/Core/ParallelUtilities.cpp
@@ -49,7 +49,7 @@ namespace ParallelUtilities {

 namespace {
 /// A single thread pool that is used to run parallel tasks
-std::unique_ptr<ThreadPool> ThreadPoolPtr;
+std::unique_ptr<DefaultThreadPool> ThreadPoolPtr;

 unsigned computeCostFor(const BinaryFunction &BF,
                         const PredicateTy &SkipPredicate,
@@ -102,11 +102,11 @@ inline unsigned estimateTotalCost(const BinaryContext &BC,

 } // namespace

-ThreadPool &getThreadPool() {
+ThreadPoolInterface &getThreadPool() {
   if (ThreadPoolPtr.get())
     return *ThreadPoolPtr;

-  ThreadPoolPtr = std::make_unique<ThreadPool>(
+  ThreadPoolPtr = std::make_unique<DefaultThreadPool>(
       llvm::hardware_concurrency(opts::ThreadCount));
   return *ThreadPoolPtr;
 }
@@ -145,7 +145,7 @@ void runOnEachFunction(BinaryContext &BC, SchedulingPolicy SchedPolicy,
       TotalCost > BlocksCount ? TotalCost / BlocksCount : 1;

   // Divide work into blocks of equal cost
-  ThreadPool &Pool = getThreadPool();
+  ThreadPoolInterface &Pool = getThreadPool();
   auto BlockBegin = BC.getBinaryFunctions().begin();
   unsigned CurrentCost = 0;

@@ -202,7 +202,7 @@ void runOnEachFunctionWithUniqueAllocId(
       TotalCost > BlocksCount ? TotalCost / BlocksCount : 1;

   // Divide work into blocks of equal cost
-  ThreadPool &Pool = getThreadPool();
+  ThreadPoolInterface &Pool = getThreadPool();
   auto BlockBegin = BC.getBinaryFunctions().begin();
   unsigned CurrentCost = 0;
   unsigned AllocId = 1;
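The ParallelUtilities changes follow one pattern: the owning pointer holds the concrete pool type (`DefaultThreadPool`), while getThreadPool() and every caller see only the abstract `ThreadPoolInterface`. The sketch below shows that shape in isolation, under stated assumptions. `PoolInterface`, `InlinePool`, and `getPool` are hypothetical names invented for illustration, not LLVM's classes, and the toy pool runs tasks on the calling thread when `wait()` is called instead of using worker threads.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical abstract interface; callers depend only on this type.
class PoolInterface {
public:
  virtual ~PoolInterface() = default;
  virtual void async(std::function<void()> Task) = 0;
  virtual void wait() = 0;
};

// Hypothetical concrete pool. It queues tasks and runs them inside wait()
// on the calling thread, which keeps the example deterministic.
class InlinePool : public PoolInterface {
  std::vector<std::function<void()>> Pending;

public:
  void async(std::function<void()> Task) override {
    Pending.push_back(std::move(Task));
  }
  void wait() override {
    for (auto &Task : Pending)
      Task();
    Pending.clear();
  }
};

namespace {
// The owner names the concrete type, as ThreadPoolPtr does in the diff.
std::unique_ptr<InlinePool> PoolPtr;
} // namespace

// Lazily create the pool on first use and hand it out through the interface.
PoolInterface &getPool() {
  if (!PoolPtr)
    PoolPtr = std::make_unique<InlinePool>();
  return *PoolPtr;
}
```

With this split, swapping the concrete pool only touches the owning declaration and the `make_unique` call; call sites such as `Pool.async(...)` followed by `Pool.wait()` stay unchanged.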
8 changes: 8 additions & 0 deletions bolt/lib/Core/Relocation.cpp
@@ -381,6 +381,14 @@ static uint64_t encodeValueAArch64(uint64_t Type, uint64_t Value, uint64_t PC) {
     // OP 1001_01 goes in bits 31:26 of BL.
     Value = ((Value >> 2) & 0x3ffffff) | 0x94000000ULL;
     break;
+  case ELF::R_AARCH64_JUMP26:
+    Value -= PC;
+    assert(isInt<28>(Value) &&
+           "only PC +/- 128MB is allowed for direct branch");
+    // Immediate goes in bits 25:0 of B.
+    // OP 0001_01 goes in bits 31:26 of B.
+    Value = ((Value >> 2) & 0x3ffffff) | 0x14000000ULL;
+    break;
   }
   return Value;
 }
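The new R_AARCH64_JUMP26 case can be checked by hand: the byte offset from PC to the target is divided by 4, truncated to 26 bits, and OR-ed with the opcode bits 0001_01 in bits 31:26. The standalone sketch below reproduces that arithmetic outside of BOLT. `fitsInSigned28` and `encodeAArch64B` are hypothetical helper names, and the range check is written out by hand instead of using LLVM's `isInt<28>`.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: true if V fits in a signed 28-bit field, which is
// the +/- 128MB byte range of a direct branch.
inline bool fitsInSigned28(int64_t V) {
  return V >= -(int64_t{1} << 27) && V < (int64_t{1} << 27);
}

// Hypothetical helper mirroring the R_AARCH64_JUMP26 case in the diff:
// build the 32-bit encoding of `B Target` for an instruction located at PC.
inline uint32_t encodeAArch64B(uint64_t Target, uint64_t PC) {
  const int64_t Offset = static_cast<int64_t>(Target - PC);
  assert(fitsInSigned28(Offset) &&
         "only PC +/- 128MB is allowed for direct branch");
  // The word offset goes in bits 25:0; 0x14000000 sets bits 31:26 to 0001_01.
  const uint64_t Imm26 = (static_cast<uint64_t>(Offset) >> 2) & 0x3ffffff;
  return static_cast<uint32_t>(Imm26 | 0x14000000ULL);
}
```

For example, a branch 8 bytes forward encodes as 0x14000002, and a branch 4 bytes backward encodes as 0x17ffffff, because the two's-complement word offset -1 fills all 26 immediate bits.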
2 changes: 1 addition & 1 deletion bolt/lib/Passes/IdenticalCodeFolding.cpp
@@ -397,7 +397,7 @@ Error IdenticalCodeFolding::runOnFunctions(BinaryContext &BC) {
     Timer SinglePass("single fold pass", "single fold pass");
     LLVM_DEBUG(SinglePass.startTimer());

-    ThreadPool *ThPool;
+    ThreadPoolInterface *ThPool;
     if (!opts::NoThreads)
       ThPool = &ParallelUtilities::getThreadPool();

2 changes: 1 addition & 1 deletion bolt/lib/Rewrite/DWARFRewriter.cpp
@@ -784,7 +784,7 @@ void DWARFRewriter::updateDebugInfo() {
     }
   } else {
     // Update unit debug info in parallel
-    ThreadPool &ThreadPool = ParallelUtilities::getThreadPool();
+    ThreadPoolInterface &ThreadPool = ParallelUtilities::getThreadPool();
     for (std::unique_ptr<DWARFUnit> &CU : BC.DwCtx->compile_units())
       ThreadPool.async(processUnitDIE, CU.get(), &DIEBlder);
     ThreadPool.wait();