Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge changes from develop in preparation of the 0.3.29 release #5066

Merged
merged 376 commits into from
Jan 12, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
376 commits
Select commit Hold shift + click to select a range
bee123e
fix: add missing NO_AFFINITY checks
NickelWenzel Oct 9, 2024
0b7fb5c
CGEMM & ZGEMM using C code.
Oct 9, 2024
9762464
Fix CBLAS interface filling in the wrong triangle for Row-Major
martin-frbg Oct 9, 2024
dbd8376
Merge pull request #4926 from NickelWenzel/fix_arm64_windows_and_uwp
martin-frbg Oct 9, 2024
e52d9b4
Merge pull request #4928 from austinpagan/czgemm_in_c
martin-frbg Oct 9, 2024
b89fb96
Update Android NDK install path for M1/armv7 crossbuild
martin-frbg Oct 10, 2024
e4bc5e4
remove stray quote
martin-frbg Oct 10, 2024
e0ad20f
Merge pull request #4932 from martin-frbg/cirrusosxndk
martin-frbg Oct 10, 2024
550bc77
Fix expectation values for CblasRowMajor order
martin-frbg Oct 10, 2024
7ba6591
Merge branch 'OpenMathLib:develop' into issue4905
martin-frbg Oct 10, 2024
5a79446
Merge pull request #4918 from HaoZeke/testFixes
martin-frbg Oct 10, 2024
18a23c2
Merge pull request #4929 from martin-frbg/issue4905
martin-frbg Oct 11, 2024
c8f53b8
Merge remote-tracking branch 'origin/develop' into vectorizeBF16GEMV
ChipKerchner Oct 11, 2024
1d51ca5
Change multi-threading logic for SBGEMV to be the same as SGEMV.
ChipKerchner Oct 11, 2024
0082240
Merge branch 'thread_sbgemv' into vectorizeBF16GEMV
ChipKerchner Oct 11, 2024
3184b7f
Merge pull request #4933 from ChipKerchner/thread_sbgemv
martin-frbg Oct 12, 2024
a53a197
Merge remote-tracking branch 'origin/develop' into vectorizeBF16GEMV
ChipKerchner Oct 12, 2024
f8e113f
Replace types with include file.
ChipKerchner Oct 13, 2024
36bd3ee
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Pow…
ChipKerchner Oct 13, 2024
2391dc1
Merge branch 'vectorizeBF16GEMV' of github.ibm.com:PowerAppLibs/OpenB…
ChipKerchner Oct 13, 2024
a47b3c8
Fix unroll parameter selection for MIPS64_GENERIC
martin-frbg Oct 13, 2024
c979c1d
Merge pull request #4936 from martin-frbg/fixmips64generic
martin-frbg Oct 14, 2024
a659f40
Fix leading dimension for B (Reference-LAPACK PR 1064)
martin-frbg Oct 14, 2024
7018c1b
Fix leading dimension for B (Reference-LAPACK PR 1064)
martin-frbg Oct 14, 2024
27ed6da
Fix leading dimension for B (Reference-LAPACK PR 1064)
martin-frbg Oct 14, 2024
22628f1
Fix leading dimension for B (Reference-LAPACK PR 1064)
martin-frbg Oct 14, 2024
8483a71
Merge pull request #4937 from martin-frbg/lapack1064
martin-frbg Oct 14, 2024
6a60eb1
Merge pull request #4924 from XiWeiGu/la64_readme
martin-frbg Oct 16, 2024
457d1c6
remove unused CI badges, wiki->docs, xianyi->OpenMathLib
martin-frbg Oct 17, 2024
15edb44
LoongArch64: Opt somatcopy_rt with LASX
XiWeiGu Oct 14, 2024
acf6cab
LoongArch64: Opt somatcopy_rn with LASX
XiWeiGu Oct 17, 2024
b371293
LoongArch64: Opt somatcopy_cn with LASX
XiWeiGu Oct 17, 2024
bb31bbe
LoongArch64: Opt somatcopy_ct with LASX
XiWeiGu Oct 17, 2024
ab71a1e
Better VSX.
ChipKerchner Oct 17, 2024
15a5759
Merge pull request #4944 from ChipKerchner/vectorizeBF16GEMV
martin-frbg Oct 17, 2024
33078d1
stress importance of TARGET setting in DYNAMIC_ARCH builds
martin-frbg Oct 17, 2024
3fc1225
Merge branch 'OpenMathLib:develop' into update_readme
martin-frbg Oct 17, 2024
a938972
Merge pull request #4943 from martin-frbg/update_readme
martin-frbg Oct 17, 2024
ffaa576
Bench: Add omatcopy
XiWeiGu Oct 17, 2024
016bdb9
Merge pull request #4946 from XiWeiGu/la64_omatcopy_lasx
martin-frbg Oct 18, 2024
8672077
write HAVE_SVE to config where applicable
martin-frbg Oct 18, 2024
c4bb4e7
NeoverseN2 has SVE too
martin-frbg Oct 18, 2024
a93ec74
Merge pull request #4948 from martin-frbg/fixhavesve
martin-frbg Oct 18, 2024
4dba6ce
work around mingw32-gfortran 14.2 miscompiling CBLAS1 tests
martin-frbg Oct 20, 2024
8a0cd5f
Merge pull request #4949 from martin-frbg/mingw32-14.2
martin-frbg Oct 20, 2024
b6ec73e
Fix AIX build
ayappanec Oct 21, 2024
020cce1
Fix build issues with gcc compiler as well
ayappanec Oct 23, 2024
72461f1
Merge pull request #4950 from ayappanec/fix-aix-build
martin-frbg Oct 23, 2024
79f4bbd
fix: return types of *trtrs routines
NickelWenzel Oct 24, 2024
cb48505
optimize gemv forwarding on ARM64 systems
Oct 25, 2024
8f59538
gh-actions: Test LoongArch64 with gcc14 from Ubuntu 24.04
XiWeiGu Oct 25, 2024
ac73682
Merge pull request #4955 from cdaley/optimize_gemv_forwarding
martin-frbg Oct 25, 2024
815cb24
remove unused INLINE macro definitions
TiborGY Aug 16, 2023
0e6a2cc
bump the minimum_required version instead
martin-frbg Oct 25, 2024
a8bb105
Merge pull request #4848 from haampie/fix/cmake-min-version
martin-frbg Oct 25, 2024
f66e6d3
Merge pull request #4953 from NickelWenzel/fix_trtrs_return_types
martin-frbg Oct 25, 2024
73c6a28
x86_64: opt somatcopy_ct with AVX
XiWeiGu Oct 29, 2024
2718b37
Update CONTRIBUTORS.md
CDAC-SSDG Oct 30, 2024
0667cf6
Added optimized scal routine files
CDAC-SSDG Oct 30, 2024
b8bc2a7
Added sve optimized kernels for swap routine
SushilPratap04 Oct 30, 2024
7822ae9
Added sve kernels for rot routine.
SushilPratap04 Oct 30, 2024
fa880ab
Update KERNEL.ARMV8SVE
SushilPratap04 Oct 30, 2024
668e28a
Delete kernel/arm64/rot.c
Juliya32 Oct 30, 2024
d90ee00
Delete kernel/arm64/rot_kernel_c.c
Juliya32 Oct 30, 2024
012fe4d
Delete kernel/arm64/rot_kernel_sve.c
Juliya32 Oct 30, 2024
3b2421c
Add files via upload
Juliya32 Oct 30, 2024
24b5cca
Merge pull request #4202 from TiborGY/inlines_pt2
martin-frbg Oct 30, 2024
0cf656f
Add copies of GEMMT under its new name GEMMTR
martin-frbg Oct 30, 2024
c3e7d08
Copy GEMMT to its new name GEMMTR
martin-frbg Oct 30, 2024
c9d1a3b
Merge pull request #4954 from XiWeiGu/la64_update_gh_actions
martin-frbg Oct 30, 2024
598bd21
Merge pull request #4958 from XiWeiGu/x86_64_opt_somatcopy_ct_avx
martin-frbg Oct 30, 2024
c634114
Merge pull request #4960 from martin-frbg/gemmtr
martin-frbg Oct 30, 2024
d3272e5
explicitly link to OpenMP
h-vetinari Jun 30, 2024
453b9e4
Merge pull request #4961 from h-vetinari/flang
martin-frbg Oct 31, 2024
87a1815
Update version of upload-artifacts again
martin-frbg Nov 6, 2024
794f75a
Merge pull request #4965 from martin-frbg/homebrewjob-2
martin-frbg Nov 6, 2024
2332ea7
fix misleading indentation
martin-frbg Nov 6, 2024
24c5152
Merge pull request #4966 from martin-frbg/getrf_indent
martin-frbg Nov 6, 2024
50da5c2
Add Apple M4 as VORTEX with HAVE_SME
martin-frbg Nov 12, 2024
2057cd7
Merge pull request #4971 from martin-frbg/m4init
martin-frbg Nov 13, 2024
760bf7a
Update Fortran return for complex data types (Cray and Nvidia compilers)
cenewcombe Nov 13, 2024
10cf06d
Merge branch 'OpenMathLib:develop' into develop
cenewcombe Nov 13, 2024
3e7e312
Merge pull request #4974 from cenewcombe/develop
martin-frbg Nov 13, 2024
cea9df3
Update Cray compiler options and calling convention
martin-frbg Nov 13, 2024
2e2f952
Merge pull request #4975 from martin-frbg/fixup4974
martin-frbg Nov 14, 2024
926e56e
Align GEMM3M parameters for GENERIC with ZGEMM and add P/Q/R
martin-frbg Nov 14, 2024
2a290df
forward GEMM3M calls for GENERIC targets to the regular C/ZGEMM for now
martin-frbg Nov 14, 2024
d04686a
Re-enable the EXPRECISION option for non-Windows x86/x86_64
martin-frbg Nov 14, 2024
4060dd4
Add dummy implementations of openblas_get/set_affinity
martin-frbg Nov 15, 2024
e334b79
Merge pull request #4977 from martin-frbg/issue4973
martin-frbg Nov 17, 2024
9db51f7
Remove any optimization flags from DEBUG builds on POWER architecture
martin-frbg Nov 17, 2024
18014b0
Merge pull request #4979 from martin-frbg/issue4978-2
martin-frbg Nov 17, 2024
bfaf5b9
Restore libsuffix support in pkg-config file
svillemot Nov 20, 2024
a0131e5
doc: update README to link to the html docs and fix links
rgommers Nov 21, 2024
0b3db03
added optimizations for RISC-V YIELDING
Nov 22, 2024
c520ed1
Merge pull request #4984 from rgommers/docs-link
martin-frbg Nov 22, 2024
3a63bba
Add compiler version notes and mention the f2c fallback LAPACK
martin-frbg Nov 22, 2024
760a537
Update build instructions for WoA (use LLVM19 and its flang-new)
martin-frbg Nov 22, 2024
009c1e0
fix download link for the current WoA binary of LLVM
martin-frbg Nov 23, 2024
8481301
Merge pull request #4987 from martin-frbg/issue3973
martin-frbg Nov 23, 2024
82088cb
Merge pull request #4986 from martin-frbg/readme_compilers
martin-frbg Nov 23, 2024
7452af4
CI (MinGW): Remove work-around with NO_AVX512 that was needed for old…
mmuetzel Aug 5, 2023
71963a7
Merge pull request #4985 from CheryDan/RISCV/sched
martin-frbg Nov 26, 2024
8e8003a
Merge pull request #4180 from mmuetzel/cmake
martin-frbg Nov 26, 2024
f5e6b5b
CI (MinGW): Remove CLANG32 environment from build matrix.
mmuetzel Nov 26, 2024
35f2e6a
Merge pull request #4992 from mmuetzel/ci-msys2
martin-frbg Nov 26, 2024
57a51d7
translate CMAKE_SYSTEM_NAME in compilations on or for IOS
martin-frbg Nov 27, 2024
0578a89
Merge pull request #4993 from martin-frbg/issue4991
martin-frbg Nov 27, 2024
0c440f8
disable multithreading for small workloads
martin-frbg Nov 27, 2024
4918bee
Loop-unrolled transposed [SD]GEMV kernels for A64FX and Neoverse V1
iha-taisei Dec 2, 2024
dc90563
arm: Declare symbols as .type function
dg0yt Dec 3, 2024
93eb42f
Fix redefinition of FAILED
dg0yt Dec 3, 2024
5f65846
Merge pull request #4998 from dg0yt/arm-type-function
martin-frbg Dec 4, 2024
a8b1705
CMake build has wrong PIC flag for NAG
mathomp4 Nov 26, 2024
2eaf285
Use F_COMPILER name
mathomp4 Nov 26, 2024
9c5d201
Merge pull request #4999 from dg0yt/macro-failed
martin-frbg Dec 4, 2024
be19966
Fixes for NAG CMake
mathomp4 Dec 4, 2024
35334ed
Fixes for Fortran Standards violations for lapack-netlib
mathomp4 Dec 4, 2024
d3b2036
Move to use ERROR STOP instead of ABORT
mathomp4 Dec 4, 2024
c4e8bac
Fix indent
mathomp4 Dec 4, 2024
1a6ecda
utilize /proc/cpuinfo on NetBSD too
martin-frbg Dec 4, 2024
a791912
handle uname returning evbarm on NetBSD
martin-frbg Dec 4, 2024
4ba471d
Merge pull request #5003 from mathomp4/bugfix/nag-pic
martin-frbg Dec 4, 2024
5dc4d7d
Merge pull request #5005 from martin-frbg/evbarm
martin-frbg Dec 5, 2024
5fe983d
retire the thunderx2 nrm2 kernels for now due to NAN and inaccuracies
martin-frbg Dec 5, 2024
3345007
retire the thunderx2 NRM2 kernels due to reported inaccuracies and NAN
martin-frbg Dec 5, 2024
230e665
Merge pull request #4996 from iha-taisei/sdgemv_sve_unroll
martin-frbg Dec 5, 2024
81666de
Merge pull request #5007 from martin-frbg/issue5006
martin-frbg Dec 5, 2024
0f8ff82
Add build notes for Windows and flang from gh Discussion 5008
martin-frbg Dec 6, 2024
72f7b70
Merge pull request #5009 from martin-frbg/pybenchdoc
martin-frbg Dec 6, 2024
5aea097
add missing lapack 3.11+ symbols
martin-frbg Dec 10, 2024
61d5aec
remove typo
martin-frbg Dec 10, 2024
89f02ed
Merge pull request #5014 from martin-frbg/issue5013
martin-frbg Dec 11, 2024
b9f51a5
Delete kernel/arm64/rot.c
CDAC-SSDG Dec 13, 2024
10857c9
Delete kernel/arm64/rot_kernel_c.c
CDAC-SSDG Dec 13, 2024
f62519c
Delete kernel/arm64/rot_kernel_sve.c
CDAC-SSDG Dec 13, 2024
5540f21
Delete kernel/arm64/scal.c
CDAC-SSDG Dec 13, 2024
95a9701
Delete kernel/arm64/scal_kernel_c.c
CDAC-SSDG Dec 13, 2024
3b7b746
Delete kernel/arm64/scal_kernel_sve.c
CDAC-SSDG Dec 13, 2024
f6416c0
Delete kernel/arm64/swap.c
CDAC-SSDG Dec 13, 2024
c17c19f
Delete kernel/arm64/swap_kernel_c.c
CDAC-SSDG Dec 13, 2024
7658501
Delete kernel/arm64/swap_kernel_sve.c
CDAC-SSDG Dec 13, 2024
41912f9
Update CONTRIBUTORS.md
CDAC-SSDG Dec 13, 2024
06ffd41
Update KERNEL.ARMV8SVE
CDAC-SSDG Dec 13, 2024
dd71e42
Added Updated swap and rot sve kernels.
CDAC-SSDG Dec 13, 2024
3368a4e
Update swap_kernel_sve.c
SushilPratap04 Dec 13, 2024
229d8a0
Merge pull request #4959 from CDAC-Bengaluru/level-1-sve
martin-frbg Dec 13, 2024
d00cc40
Replaced the __riscv_vid_v_i32m2 and __riscv_vid_v_i64m2 with __riscv…
Dec 18, 2024
48caf23
Fix build warning about discarding volatile qualifier in memory.c
rgommers Dec 18, 2024
765ad8b
Fix guard around `alloc_hugetlb`, fixes compile warning
rgommers Dec 18, 2024
d3cc8c6
Merge pull request #5022 from tingboliao/develop
martin-frbg Dec 18, 2024
e460512
Update WoA build instructions from rewording in issue #5001
martin-frbg Dec 18, 2024
a93d3db
fix formatting of WoA section
martin-frbg Dec 18, 2024
70dddac
Merge pull request #5023 from rgommers/fix-warnings
martin-frbg Dec 19, 2024
f2be482
Merge pull request #5024 from martin-frbg/issue5001
martin-frbg Dec 19, 2024
1c4401e
Add target-specific options to enable SVE with the NVIDIA compiler
martin-frbg Dec 19, 2024
37a4ca7
Merge pull request #5025 from martin-frbg/nvidia_arm64
martin-frbg Dec 20, 2024
32319a3
Add options for Intel oneAPI 2025.0 ifx on Windows
martin-frbg Dec 23, 2024
30188a5
Don't assume underlined symbols for ifx; make cpuid.S inclusion condi…
martin-frbg Dec 23, 2024
d78fbe4
Assume no underline suffixes on symbols when compiling with ifx on Wi…
martin-frbg Dec 23, 2024
5d81e51
Assume no underline suffixes on symbols when compiling with ifx on Wi…
martin-frbg Dec 23, 2024
5c9417d
Assume no underline suffixes on symbols when compiling with ifx on Wi…
martin-frbg Dec 23, 2024
64c6c79
Assume no underline suffixes on symbols when compiling with Intel ifx…
martin-frbg Dec 23, 2024
05fe49d
Rename local copy functions to avoid name clash with the standard BLA…
martin-frbg Dec 23, 2024
e6fd629
Expressly declare the .S extension for assembly (documented as standa…
martin-frbg Dec 23, 2024
0bea1cf
Optimize the zgemm_tcopy_4_rvv function to be compatible with the sit…
Dec 24, 2024
c4c3d9e
Merge remote-tracking branch 'refs/remotes/origin/develop' into develop
Dec 24, 2024
fbf594b
Guard against empty CMAKE_Fortran_COMPILER_ID
martin-frbg Dec 24, 2024
6af4e76
Merge pull request #5029 from martin-frbg/issue5020
martin-frbg Dec 24, 2024
762fa1a
fix link to faq
david-cortes Dec 24, 2024
17803e7
Merge pull request #5031 from david-cortes/fix_doc_links
martin-frbg Dec 24, 2024
df42f79
docs: update extensions and install pages with last wiki edits
rgommers Dec 26, 2024
d5e2555
Improve OpenBLASConfig.cmake contents
martin-frbg Dec 29, 2024
d863dcf
Merge pull request #5033 from rgommers/doc-port-last-wiki-edits
martin-frbg Dec 29, 2024
36b0fb3
Merge pull request #5035 from martin-frbg/issue4396
martin-frbg Dec 30, 2024
c125866
Merge branch 'OpenMathLib:develop' into m3m_exprec
martin-frbg Dec 30, 2024
73527aa
Merge pull request #5030 from tingboliao/develop
martin-frbg Dec 30, 2024
718fb73
Merge pull request #4976 from martin-frbg/m3m_exprec
martin-frbg Dec 30, 2024
fff2e21
Add LAPACK-TEST errors topic
martin-frbg Dec 30, 2024
a107547
Merge pull request #4994 from martin-frbg/issue4886
martin-frbg Dec 30, 2024
c37509c
Optimize the nrm2_rvv function to further improve performance.
Dec 31, 2024
9297c46
Merge pull request #5036 from martin-frbg/issue4032
martin-frbg Dec 31, 2024
0a2d9aa
Merge pull request #4982 from svillemot/develop
martin-frbg Dec 31, 2024
6ad793d
Fix naming of suffixed libraries in the cmake and pkgconfig files
martin-frbg Dec 31, 2024
85a3332
Merge pull request #5039 from martin-frbg/fixgmakenaming
martin-frbg Dec 31, 2024
e9ff70b
Add an install_tests target to facilitate testing on cross-compiled t…
martin-frbg Dec 31, 2024
2f86913
Merge pull request #5040 from martin-frbg/issue922
martin-frbg Dec 31, 2024
a63282a
Merge pull request #5037 from tingboliao/develop
martin-frbg Dec 31, 2024
be807c9
Identify all cores, group by performance and report the fastest TARGET
martin-frbg Jan 1, 2025
3c3d1c4
Identify all cores and select the most performant one as TARGET
martin-frbg Jan 1, 2025
ed95791
fix conflicting variables
martin-frbg Jan 1, 2025
a182251
fix typo
martin-frbg Jan 1, 2025
14c72d6
Add the test cases of rot to improve unit testing.
Jan 2, 2025
7fd73a4
Fix accidentally dropped cpu ids and add MacOS performance groups
martin-frbg Jan 2, 2025
e4f83d4
Merge pull request #5041 from martin-frbg/issue2715
martin-frbg Jan 2, 2025
108bf59
Create harmonyos.yml
martin-frbg Jan 3, 2025
67bbde7
Update harmonyos.yml
martin-frbg Jan 3, 2025
593427c
Update harmonyos.yml
martin-frbg Jan 3, 2025
14e6835
Update harmonyos.yml
martin-frbg Jan 3, 2025
52c7695
Merge pull request #5044 from martin-frbg/issue5043
martin-frbg Jan 3, 2025
9b9c0aa
temporarily disable the default S/DSCAL kernel
martin-frbg Jan 3, 2025
a71041e
Merge pull request #5045 from martin-frbg/issue5034
martin-frbg Jan 3, 2025
afd5725
Add build instructions for Huawei's HarmonyOS from #5043
martin-frbg Jan 3, 2025
6f9b9e2
Merge pull request #5046 from martin-frbg/doc5044
martin-frbg Jan 3, 2025
33b9e5b
Add a documentation page for the runtime variables
martin-frbg Jan 3, 2025
b6c906f
Add OPENBLAS_L2_SIZE and improve formatting
martin-frbg Jan 4, 2025
81e1be8
Revert "temporarily disable the default S/DSCAL kernel"
barracuda156 Jan 4, 2025
229efa4
scal.S: use r11 on 32-bit Darwin on powerpc
barracuda156 Jan 4, 2025
0be97ed
Merge pull request #5047 from martin-frbg/issue3967
martin-frbg Jan 4, 2025
970e48e
docs: improve readability of the Build system page
rgommers Jan 4, 2025
d4addc0
docs: improve description of library, data type and toolchain build v…
rgommers Jan 4, 2025
c526b10
docs: add library and symbol name build variables
rgommers Jan 4, 2025
ed11415
docs: add the build variables for BLAS/LAPACK functionality
rgommers Jan 4, 2025
5aa1845
docs: fix two broken links related to MSVC
rgommers Jan 4, 2025
f764d76
docs: improve the Makefile dependency graph
rgommers Jan 4, 2025
c0bf48f
docs: remove warning on the Build system page
rgommers Jan 4, 2025
1833e68
docs: improve rendering of "Runtime variables" page
rgommers Jan 4, 2025
eda80f4
docs: improve rendering of Windows on Arm instructions
rgommers Jan 4, 2025
f697cfe
docs: improve the rendering of the HarmonyOS build instructions
rgommers Jan 4, 2025
91d861a
Merge pull request #5048 from barracuda156/powerpc
martin-frbg Jan 4, 2025
74be676
Merge pull request #5049 from rgommers/docs-buildsystem-page
martin-frbg Jan 4, 2025
8385e02
Do not check LDVT when VT is not going to be referenced
martin-frbg Jan 6, 2025
05dce05
Update Makefile.riscv64
fleclairTT Jan 7, 2025
d0350a0
Merge pull request #1 from fleclairTT/fleclairTT-remove-fast-math
fleclairTT Jan 7, 2025
69d3541
Merge pull request #5055 from fleclairTT/develop
martin-frbg Jan 7, 2025
0a5dbf1
Optimize the omatcopy_cn and zomatcopy_cn kernels with RVV 1.0 intrin…
Jan 8, 2025
2891fd8
Replace while loop with for
martin-frbg Jan 8, 2025
09e75f1
fix absurd typo
martin-frbg Jan 8, 2025
a9eec23
Update FreeBSD jobs to 14.1
martin-frbg Jan 9, 2025
b67a963
gcc remains at 13 even for freebsd-14.1
martin-frbg Jan 9, 2025
21013fd
Merge pull request #5058 from martin-frbg/cirrusbsd3
martin-frbg Jan 9, 2025
a588ea9
Merge pull request #5052 from martin-frbg/issue5051
martin-frbg Jan 9, 2025
8cc32f5
Merge branch 'OpenMathLib:develop' into issue5050
martin-frbg Jan 9, 2025
d91d4fa
convert the beta=0 branch to a for loop as well
martin-frbg Jan 9, 2025
e074858
LoongArch64: Update dsymv LASX version
XiWeiGu Jan 10, 2025
20a8e48
LoongArch64: Update ssymv LASX version
XiWeiGu Jan 10, 2025
c31f148
Merge pull request #5061 from XiWeiGu/la64_update_symv
martin-frbg Jan 10, 2025
0b9de3e
Merge pull request #5042 from tingboliao/develop
martin-frbg Jan 10, 2025
30f7a41
Merge pull request #5056 from tingboliao/dev_omatcopy_20250108
martin-frbg Jan 11, 2025
d74eb02
Merge pull request #5057 from martin-frbg/issue5050
martin-frbg Jan 11, 2025
4c1a236
Remove comparison that is always false (Reference-LAPACK PR 1062)
martin-frbg Jan 11, 2025
5527eda
Merge pull request #5063 from martin-frbg/lapack1062
martin-frbg Jan 11, 2025
ed51699
replace ?larft with a recursive implementation (Reference-LAPACK PR 1…
martin-frbg Jan 11, 2025
0c4b4cd
move the non-recursive original ?larft here (Reference-LAPACK PR 1080)
martin-frbg Jan 11, 2025
459fa81
Create subdirectory for the old non-recursive ?larft
martin-frbg Jan 11, 2025
d035e80
move the original non-recursive ?LARFT here (Reference-LAPACK PR 1080)
martin-frbg Jan 11, 2025
ce66ffe
Update the Changelog for version 0.3.29
martin-frbg Jan 11, 2025
f422845
Merge pull request #5064 from martin-frbg/lapack1080
martin-frbg Jan 12, 2025
20f6114
add descriptions of build/runtime vars to 0.3.29 improvements
martin-frbg Jan 12, 2025
7f5b703
Merge pull request #5065 from martin-frbg/changelog0329
martin-frbg Jan 12, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
25 changes: 9 additions & 16 deletions .cirrus.yml
Original file line number Diff line number Diff line change
Expand Up @@ -89,20 +89,13 @@ task:
type: text/plain

macos_instance:
image: ghcr.io/cirruslabs/macos-monterey-xcode:latest
image: ghcr.io/cirruslabs/macos-sonoma-xcode:latest
task:
name: AppleM1/LLVM armv7-androidndk xbuild
compile_script:
- brew install android-ndk
- export #PATH=/opt/homebrew/opt/llvm/bin:$PATH
- export #LDFLAGS="-L/opt/homebrew/opt/llvm/lib"
- export #CPPFLAGS="-I/opt/homebrew/opt/llvm/include"
- ls /System/Volumes/Data/opt/homebrew
- ls -l /System/Volumes/Data/opt/homebrew/Caskroom/android-ndk
- find /System/Volumes/Data/opt/homebrew -name "armv7a-linux-androideabi*-ranlib"
- #export CC=/Applications/Xcode-13.4.1.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/clang
- #export CFLAGS="-O2 -unwindlib=none -Wno-macro-redefined -isysroot /Applications/Xcode-13.4.1.app/Contents/Developer/Platforms/iPhoneOS.platform/Developer/SDKs/iPhoneOS16.0.sdk -arch arm64 -miphoneos-version-min=10.0"
- export CC=/System/Volumes/Data/opt/homebrew/Caskroom/android-ndk/26d/AndroidNDK*.app/Contents/NDK/toolchains/llvm/prebuilt/darwin-x86_64/bin/armv7a-linux-androideabi23-clang
- brew install --cask android-ndk
- export ANDROID_NDK_HOME="/opt/homebrew/share/android-ndk"
- export CC=/opt/homebrew/share/android-ndk/toolchains/llvm/prebuilt/darwin-x86_64/bin/armv7a-linux-androideabi23-clang
- make TARGET=ARMV7 ARM_SOFTFP_ABI=1 NUM_THREADS=32 HOSTCC=clang NOFORTRAN=1 RANLIB="ls -l"
always:
config_artifacts:
Expand Down Expand Up @@ -132,9 +125,9 @@ task:
- make USE_OPENMP=1

FreeBSD_task:
name: FreeBSD-gcc12
name: FreeBSD-gcc
freebsd_instance:
image_family: freebsd-13-3
image_family: freebsd-14-1
install_script:
- pkg update -f && pkg upgrade -y && pkg install -y gmake gcc
compile_script:
Expand All @@ -143,9 +136,9 @@ FreeBSD_task:


FreeBSD_task:
name: freebsd-gcc12-ilp64
name: freebsd-gcc-ilp64
freebsd_instance:
image_family: freebsd-13-3
image_family: freebsd-14-1
install_script:
- pkg update -f && pkg upgrade -y && pkg install -y gmake gcc
compile_script:
Expand All @@ -155,7 +148,7 @@ FreeBSD_task:
FreeBSD_task:
name: FreeBSD-clang-openmp
freebsd_instance:
image_family: freebsd-13-3
image_family: freebsd-14-1
install_script:
- pkg update -f && pkg upgrade -y && pkg install -y gmake gcc
- ln -s /usr/local/lib/gcc13/libgfortran.so.5.0.0 /usr/lib/libgfortran.so
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/docs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ jobs:
python-version: "3.10"

- name: Install MkDocs and doc theme packages
run: pip install mkdocs mkdocs-material mkdocs-git-revision-date-localized-plugin
run: pip install mkdocs mkdocs-material mkdocs-git-revision-date-localized-plugin mkdocs-mermaid2-plugin

- name: Build docs site
run: mkdocs build
Expand Down
17 changes: 1 addition & 16 deletions .github/workflows/dynamic_arch.yml
Original file line number Diff line number Diff line change
Expand Up @@ -158,7 +158,7 @@ jobs:
strategy:
fail-fast: false
matrix:
msystem: [UCRT64, MINGW32, CLANG64, CLANG32]
msystem: [UCRT64, MINGW32, CLANG64]
idx: [int32, int64]
build-type: [Release]
include:
Expand All @@ -174,14 +174,6 @@ jobs:
idx: int32
target-prefix: mingw-w64-clang-x86_64
fc-pkg: fc
# Compiling with Flang 16 seems to cause test errors on machines
# with AVX512 instructions. Revisit after MSYS2 distributes Flang 17.
no-avx512-flags: -DNO_AVX512=1
- msystem: CLANG32
idx: int32
target-prefix: mingw-w64-clang-i686
fc-pkg: cc
c-lapack-flags: -DC_LAPACK=ON
- msystem: UCRT64
idx: int64
idx64-flags: -DBINARY=64 -DINTERFACE64=1
Expand All @@ -192,9 +184,6 @@ jobs:
idx64-flags: -DBINARY=64 -DINTERFACE64=1
target-prefix: mingw-w64-clang-x86_64
fc-pkg: fc
# Compiling with Flang 16 seems to cause test errors on machines
# with AVX512 instructions. Revisit after MSYS2 distributes Flang 17.
no-avx512-flags: -DNO_AVX512=1
- msystem: UCRT64
idx: int32
target-prefix: mingw-w64-ucrt-x86_64
Expand All @@ -203,8 +192,6 @@ jobs:
exclude:
- msystem: MINGW32
idx: int64
- msystem: CLANG32
idx: int64

defaults:
run:
Expand Down Expand Up @@ -280,8 +267,6 @@ jobs:
-DNUM_THREADS=64 \
-DTARGET=CORE2 \
${{ matrix.idx64-flags }} \
${{ matrix.c-lapack-flags }} \
${{ matrix.no-avx512-flags }} \
-DCMAKE_C_COMPILER_LAUNCHER=ccache \
-DCMAKE_Fortran_COMPILER_LAUNCHER=ccache \
..
Expand Down
37 changes: 37 additions & 0 deletions .github/workflows/harmonyos.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
name: harmonyos

on: [push, pull_request]

concurrency:
group: ${{ github.workflow }}-${{ github.head_ref || github.run_id }}
cancel-in-progress: true

permissions:
contents: read # to fetch code (actions/checkout)

jobs:
build:
if: "github.repository == 'OpenMathLib/OpenBLAS'"
runs-on: ubuntu-latest
env:
OHOS_NDK_CMAKE: $GITHUB_WORKSPACE/ohos-sdk/linux/native/build-tools/cmake/bin/cmake
COMMON_CMAKE_OPTIONS: |
-DCMAKE_TOOLCHAIN_FILE=$GITHUB_WORKSPACE/ohos-sdk/linux/native/build/cmake/ohos.toolchain.cmake \
-DCMAKE_INSTALL_PREFIX=install \
-DCMAKE_BUILD_TYPE=Release \
steps:
- uses: actions/checkout@v4
- name: ndk-install
run: |
wget https://repo.huaweicloud.com/harmonyos/os/4.1.1-Release/ohos-sdk-windows_linux-public.tar.gz
tar -xf ohos-sdk-windows_linux-public.tar.gz
cd ohos-sdk/linux
unzip -q native-linux-x64-4.1.7.8-Release.zip
cd -
- name: build-armv8
run: |
mkdir build && cd build
${{ env.OHOS_NDK_CMAKE }} ${{ env.COMMON_CMAKE_OPTIONS }} -DOHOS_ARCH="arm64-v8a" \
-DTARGET=ARMV8 -DNOFORTRAN=1 ..
${{ env.OHOS_NDK_CMAKE }} --build . -j $(nproc)

132 changes: 59 additions & 73 deletions .github/workflows/loongarch64.yml
Original file line number Diff line number Diff line change
Expand Up @@ -9,22 +9,31 @@ concurrency:
jobs:
TEST:
if: "github.repository == 'OpenMathLib/OpenBLAS'"
runs-on: ubuntu-latest
runs-on: ubuntu-24.04
strategy:
fail-fast: false
matrix:
include:
- target: LOONGSONGENERIC
triple: loongarch64-unknown-linux-gnu
triple: loongarch64-linux-gnu
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LOONGSONGENERIC
- target: LOONGSON3R5
triple: loongarch64-unknown-linux-gnu
triple: loongarch64-linux-gnu
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LOONGSON3R5
- target: LOONGSON2K1000
triple: loongarch64-unknown-linux-gnu
triple: loongarch64-linux-gnu
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LOONGSON2K1000
- target: LA64_GENERIC
triple: loongarch64-linux-gnu
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LA64_GENERIC
- target: LA464
triple: loongarch64-linux-gnu
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LA464
- target: LA264
triple: loongarch64-linux-gnu
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LA264
- target: DYNAMIC_ARCH
triple: loongarch64-unknown-linux-gnu
triple: loongarch64-linux-gnu
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=GENERIC

steps:
Expand All @@ -33,33 +42,9 @@ jobs:

- name: Install APT deps
run: |
sudo apt-get update
sudo apt-get install autoconf automake autotools-dev ninja-build make ccache

- name: Download and install loongarch64-toolchain
run: |
wget https://github.com/sunhaiyong1978/CLFS-for-LoongArch/releases/download/8.1/CLFS-loongarch64-8.1-x86_64-cross-tools-gcc-glibc.tar.xz
#wget https://github.com/loongson/build-tools/releases/download/2023.08.08/CLFS-loongarch64-8.1-x86_64-cross-tools-gcc-glibc.tar.xz
tar -xf CLFS-loongarch64-8.1-x86_64-cross-tools-gcc-glibc.tar.xz -C /opt

- name: Checkout qemu
uses: actions/checkout@v3
with:
repository: qemu/qemu
path: qemu
ref: master

- name: Install qemu
run: |
cd qemu
./configure --prefix=$GITHUB_WORKSPACE/qemu-install --target-list=loongarch64-linux-user --disable-system --static
make -j$(nproc)
make install

- name: Set env
run: |
echo "LD_LIBRARY_PATH=/opt/cross-tools/target/usr/lib64:/opt/cross-tools/loongarch64-unknown-linux-gnu/lib64:$LD_LIBRARY_PATH" >> $GITHUB_ENV
echo "PATH=$GITHUB_WORKSPACE:/opt/cross-tools/bin:$PATH" >> $GITHUB_ENV
sudo apt-get update && \
sudo apt-get install autoconf automake autotools-dev ninja-build make ccache qemu-user-static \
gcc-14-loongarch64-linux-gnu g++-14-loongarch64-linux-gnu gfortran-14-loongarch64-linux-gnu

- name: Compilation cache
uses: actions/cache@v3
Expand All @@ -80,54 +65,55 @@ jobs:
- name: Disable utest dsdot:dsdot_n_1
run: |
echo -n > utest/test_dsdot.c
echo "Due to the qemu versions 7.2 causing utest cases to fail,"
echo "Due to the current version of qemu causing utest cases to fail,"
echo "the utest dsdot:dsdot_n_1 have been temporarily disabled."

- name: Build OpenBLAS
run: make CC='ccache ${{ matrix.triple }}-gcc -static' FC='ccache ${{ matrix.triple }}-gfortran -static' ${{ matrix.opts }} HOSTCC='ccache gcc' -j$(nproc)
run: |
make CC='ccache ${{ matrix.triple }}-gcc-14 -static' FC='ccache ${{ matrix.triple }}-gfortran-14 -static' \
RANLIB='ccache ${{ matrix.triple }}-gcc-ranlib-14' ${{ matrix.opts }} HOSTCC='ccache gcc' -j$(nproc)

- name: Test
run: |
export PATH=$GITHUB_WORKSPACE/qemu-install/bin/:$PATH
qemu-loongarch64 ./utest/openblas_utest
qemu-loongarch64 ./utest/openblas_utest_ext
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xscblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xdcblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xccblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xzcblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xscblat2 < ./ctest/sin2
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xdcblat2 < ./ctest/din2
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xccblat2 < ./ctest/cin2
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xzcblat2 < ./ctest/zin2
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xscblat3 < ./ctest/sin3
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xdcblat3 < ./ctest/din3
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xccblat3 < ./ctest/cin3
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./ctest/xzcblat3 < ./ctest/zin3
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/sblat1
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/dblat1
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/cblat1
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/zblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/sblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/dblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/cblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/zblat1
qemu-loongarch64-static ./utest/openblas_utest
qemu-loongarch64-static ./utest/openblas_utest_ext
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xscblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xdcblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xccblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xzcblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xscblat2 < ./ctest/sin2
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xdcblat2 < ./ctest/din2
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xccblat2 < ./ctest/cin2
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xzcblat2 < ./ctest/zin2
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xscblat3 < ./ctest/sin3
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xdcblat3 < ./ctest/din3
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xccblat3 < ./ctest/cin3
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./ctest/xzcblat3 < ./ctest/zin3
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/sblat1
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/dblat1
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/cblat1
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/zblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/sblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/dblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/cblat1
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/zblat1
rm -f ./test/?BLAT2.SUMM
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/sblat2 < ./test/sblat2.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/dblat2 < ./test/dblat2.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/cblat2 < ./test/cblat2.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/zblat2 < ./test/zblat2.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/sblat2 < ./test/sblat2.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/dblat2 < ./test/dblat2.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/cblat2 < ./test/cblat2.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/zblat2 < ./test/zblat2.dat
rm -f ./test/?BLAT2.SUMM
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/sblat2 < ./test/sblat2.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/dblat2 < ./test/dblat2.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/cblat2 < ./test/cblat2.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/zblat2 < ./test/zblat2.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/sblat2 < ./test/sblat2.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/dblat2 < ./test/dblat2.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/cblat2 < ./test/cblat2.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/zblat2 < ./test/zblat2.dat
rm -f ./test/?BLAT3.SUMM
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/sblat3 < ./test/sblat3.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/dblat3 < ./test/dblat3.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/cblat3 < ./test/cblat3.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64 ./test/zblat3 < ./test/zblat3.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/sblat3 < ./test/sblat3.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/dblat3 < ./test/dblat3.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/cblat3 < ./test/cblat3.dat
OPENBLAS_NUM_THREADS=1 OMP_NUM_THREADS=1 qemu-loongarch64-static ./test/zblat3 < ./test/zblat3.dat
rm -f ./test/?BLAT3.SUMM
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/sblat3 < ./test/sblat3.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/dblat3 < ./test/dblat3.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/cblat3 < ./test/cblat3.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64 ./test/zblat3 < ./test/zblat3.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/sblat3 < ./test/sblat3.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/dblat3 < ./test/dblat3.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/cblat3 < ./test/cblat3.dat
OPENBLAS_NUM_THREADS=2 qemu-loongarch64-static ./test/zblat3 < ./test/zblat3.dat
6 changes: 6 additions & 0 deletions .github/workflows/loongarch64_clang.yml
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,12 @@ jobs:
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LOONGSON3R5
- target: LOONGSON2K1000
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LOONGSON2K1000
- target: LA64_GENERIC
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LA64_GENERIC
- target: LA464
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LA464
- target: LA264
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=LA264
- target: DYNAMIC_ARCH
opts: NO_SHARED=1 DYNAMIC_ARCH=1 TARGET=GENERIC

Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/nightly-Homebrew-build.yml
Original file line number Diff line number Diff line change
Expand Up @@ -69,7 +69,7 @@ jobs:
mv *.bottle.tar.gz bottles

- name: Upload bottle
uses: actions/upload-artifact@v1
uses: actions/upload-artifact@v4
with:
name: openblas--HEAD.catalina.bottle.tar.gz
path: bottles
Expand Down
18 changes: 14 additions & 4 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2,13 +2,14 @@
## Author: Hank Anderson <[email protected]>
##

cmake_minimum_required(VERSION 2.8.5)
cmake_minimum_required(VERSION 3.16.0)

set (CMAKE_ASM_SOURCE_FILE_EXTENSIONS "S")
project(OpenBLAS C ASM)

set(OpenBLAS_MAJOR_VERSION 0)
set(OpenBLAS_MINOR_VERSION 3)
set(OpenBLAS_PATCH_VERSION 28)
set(OpenBLAS_PATCH_VERSION 28.dev)

set(OpenBLAS_VERSION "${OpenBLAS_MAJOR_VERSION}.${OpenBLAS_MINOR_VERSION}.${OpenBLAS_PATCH_VERSION}")

Expand Down Expand Up @@ -102,6 +103,10 @@ endif()

message(WARNING "CMake support is experimental. It does not yet support all build options and may not produce the same Makefiles that OpenBLAS ships with.")

if (USE_OPENMP)
find_package(OpenMP REQUIRED)
endif ()

include("${PROJECT_SOURCE_DIR}/cmake/utils.cmake")
include("${PROJECT_SOURCE_DIR}/cmake/system.cmake")

Expand Down Expand Up @@ -258,8 +263,13 @@ if (${CMAKE_SYSTEM_NAME} MATCHES "AIX|Android|Linux|FreeBSD|OpenBSD|NetBSD|Drago
endif()
endif()

if (APPLE AND BUILD_SHARED_LIBS)
set(CMAKE_MACOSX_RPATH ON)
if (USE_OPENMP)
if(BUILD_STATIC_LIBS)
target_link_libraries(${OpenBLAS_LIBNAME}_static OpenMP::OpenMP_C)
endif()
if(BUILD_SHARED_LIBS)
target_link_libraries(${OpenBLAS_LIBNAME}_shared OpenMP::OpenMP_C)
endif()
endif()

# Seems that this hack doesn't required since macOS 11 Big Sur
Expand Down
Loading