Merge for v2 #77

Merged
merged 200 commits into from
Dec 11, 2020
3c47a93
Start tet1d module
Dec 20, 2019
6e8cf68
Update tet1d module
Jan 6, 2020
480e6a8
Add CUDA support for tet1d module
Jan 14, 2020
f32c5e7
Add scalar input support in tet1d module
Jan 17, 2020
0d9a2ee
simplify command line arguments for hatch.py
Jan 20, 2020
9be62f9
Add support for generation of modules
Jan 20, 2020
afc76ac
Make tet1d a module into the new system
Jan 20, 2020
0816949
Merge branch 'master' into tet_1d
Jan 20, 2020
85daa16
Add scalar version of some function from libm + reinterpret
Jan 23, 2020
0ccb815
Fix refactoring
Jan 23, 2020
d80a8f6
Before merging master
Jan 29, 2020
edc79bd
Merge branch 'master' into tet_1d
Jan 29, 2020
20ca2f7
Fixes after merge
Jan 29, 2020
d0f7c68
For backup
Feb 19, 2020
99f095d
Fix CUDA
Feb 19, 2020
66e775c
Add forgotten files
Feb 20, 2020
d5e973c
Working => backup
Mar 5, 2020
69160ec
Fixes
Mar 6, 2020
c378928
All tests are passing f16 included
Mar 13, 2020
7c26ab2
For backup
Mar 15, 2020
46acd2f
COVID-19
Mar 17, 2020
32e25bc
Fixes
Mar 18, 2020
d8bd09a
Fixes
Mar 19, 2020
a9a6490
Fixes: all test compile with nvcc
Mar 23, 2020
1d5c842
ROCm support, addition on f32 and f16 are compiling
Mar 29, 2020
0e33332
TET1D tests are compiling with both nvcc and hipcc
Mar 30, 2020
f960e51
Merge CUDA and ROCm when code is the same
Apr 1, 2020
7832e1b
Forgot files
Apr 1, 2020
3ad694e
Merge branch 'master' into tet_1d
Apr 1, 2020
e6c7c09
Now we can list generated files
Apr 1, 2020
8882b39
Forgot to merge nsimd.h
Apr 1, 2020
ec4d0d3
Forgot to push
Apr 4, 2020
7b07e48
Update .gitignore with the new file generated by the tet1d module.
Apr 6, 2020
58bbd4b
Return allocated arrays for tests
Apr 6, 2020
d294984
Increase the minimum size of the tests array
Apr 6, 2020
0b537ad
Fix segfault
Apr 7, 2020
a7db980
Fix segfault
Apr 7, 2020
8125024
Add mask[oz]_load[zu] and mask_store[au] operators for CPU
Apr 12, 2020
113f303
For backup
Apr 13, 2020
82a48a1
For backup
Apr 14, 2020
f4c0987
Fix for SSE
Apr 16, 2020
d93e130
Fix fma for C89
Apr 16, 2020
6b37c3b
Remove warning from GCC when using long long in C98 and C++98
Apr 16, 2020
d174b63
Fix warnings for C98 and C++98 and AVX512
Apr 16, 2020
74e48fe
Add set1l, iota, mask_for_loop_tail for ARM
Apr 16, 2020
53f31f6
Before merging master
Apr 18, 2020
ec793e8
Merge branch 'master' into spmd
Apr 18, 2020
47be0f9
Fix ARM mask[oz]_load[au]
Apr 18, 2020
eb854f9
Fixes for ARM SVE
Apr 18, 2020
a834608
Fix warning when using __f16's
Apr 18, 2020
901140f
Add alignment-templated masked loads/stores
Apr 20, 2020
b1b65de
Rewrite friendly_but_not_optimized stuff
Apr 22, 2020
f46e44c
Forgot file
Apr 22, 2020
f150f43
Fix ARM
Apr 22, 2020
82219ff
Fix ARM
Apr 22, 2020
7da09ed
Cosmetic
Apr 23, 2020
ab2430d
Backup
Apr 25, 2020
b4a5321
Backup
Apr 30, 2020
27ae566
Backup
May 6, 2020
a4a4c5f
Backup
May 8, 2020
41bb6ee
Forgot file
May 8, 2020
125751b
For backup
May 9, 2020
a8854e7
For backup
May 9, 2020
c3a7358
Refactoring of documentation
May 16, 2020
91f3c31
Add build.nsconfig + fix warning in fixed_point exp
May 17, 2020
576fe1f
Fix warning in SPMd module
May 17, 2020
4a13a61
Add forgotten file
May 19, 2020
f5202c7
Fixes for CUDA
May 21, 2020
8d128ba
Fixes for CPU
May 21, 2020
3cf0a18
Fixes
May 22, 2020
0afe30c
Add gather/scatter for cpu and x86
May 28, 2020
0bcb4f9
Add gather/scatter for arm (not tested yet)
May 28, 2020
ac3387a
Fix gather/scatter for arm
May 29, 2020
dfe3571
Deactivate tet1d module
May 29, 2020
91b68ef
Cleanup
May 29, 2020
57a7a29
Add scripts for building
Jun 2, 2020
f9773af
Fix setup and build script for Linux
Jun 3, 2020
c8b1d2e
Changing computer
Jun 3, 2020
fd0bc60
Backup
Jun 4, 2020
f9ce017
Fix script/setup.sh
Jun 5, 2020
a8229ac
Fixes for fixed size SVE
Jun 5, 2020
46a254a
Fix Windows scripts
Jun 6, 2020
7470b10
Fix scripts for Linux
Jun 7, 2020
8a01c6d
Fix Makefile.nix for md2html
Jun 8, 2020
747e978
Fix Makefile.win for md2html
Jun 8, 2020
891cf19
Fix generation of documentation
Jun 8, 2020
4b3907f
Add mask scatter for cpu
Jun 12, 2020
808ecf1
Add mask_scatter for x86
Jun 13, 2020
0ef6e02
Forgot a file
Jun 13, 2020
7dfdc10
Add mask_scatter for arm
Jun 13, 2020
9261960
Add masked gather for cpu
Jun 15, 2020
fec28da
Add masked gather for x86
Jun 16, 2020
a5391bd
Add masked gather for arm
Jun 16, 2020
d68aec6
Fix masked gather for f16's
Jun 16, 2020
42aeac6
Adapt SVE typedefs to new GCC 10
Jun 16, 2020
19ab895
Fixes for x86
Jun 17, 2020
dd1cd4a
Fix tet1d tests for CUDA
Jun 17, 2020
fe21e8d
Fixes for HIP
Jun 18, 2020
1ee8081
Fix warning for ROCm/HIP
Jun 18, 2020
7d1628f
Various fixes
Jun 19, 2020
b9e8377
Fix tests for rec11, rec8, rsqrt11 and rsqrt8
Jun 19, 2020
7982cc7
Fix rec11, rec8, rsqrt11, rsqrt8 tests
Jun 20, 2020
127ff98
Improve gather/scatter for neon128 and aarch64
Jun 20, 2020
86abc6f
Add gather_linear + scatter_linear and remove masked gather and scatter
Jun 24, 2020
0eb97d3
Add linear gather + scatter
Jun 24, 2020
4a4b58f
Fix gather_linear for neon128 + aarch64
Jun 25, 2020
de5c778
Improve gather on aarch64 + neon128
Jun 25, 2020
2ef6154
Add documentation for module TET1d
Jun 29, 2020
9b96ee9
Update README
Jun 29, 2020
48a3e13
Add documentation for module TET1d
Jun 29, 2020
f321ea5
Improve README with nsconfig stuff
Jun 30, 2020
79cb600
Improve README
Jun 30, 2020
0ca1cb1
Improve README
Jun 30, 2020
f34c03f
Improve README
Jun 30, 2020
45a824b
Improve README
Jul 1, 2020
7fe1b71
Fix warning for armclang
Jul 2, 2020
a9c7f4b
Fix warning when compiling with Clang and C++98/03
Jul 13, 2020
08820bb
Fix generation of benches
Jul 14, 2020
7b3e748
For backup
Jul 17, 2020
2d36e78
First version (not finished yet)
Jul 18, 2020
c0f1bdd
Add support for non closed operators
Jul 21, 2020
d105b1a
Improve doc
Jul 26, 2020
e73920c
Improve documentation
Aug 1, 2020
2c3da76
More fixes
Aug 1, 2020
01aa9ef
Fix broken link in README
Aug 2, 2020
c9d667f
Add CONTRIBUTING.md
Aug 3, 2020
339a7f1
Improve documentation
Aug 6, 2020
fcc7b21
Improve documentation
Aug 6, 2020
de049b1
Improve documentation + simplify scoped_aligned_mem_for
Aug 6, 2020
7609ac0
Fix scoped_aligned_mem
Aug 7, 2020
b6b0b94
Fixed errors in nsimd.h
Aug 7, 2020
b4404cc
Improve documentation
Aug 11, 2020
a3fcd97
Improve documentation
Aug 12, 2020
946651c
Improve documentation
Aug 13, 2020
72418bd
Replace some print left by common.myprint
Aug 13, 2020
fa0625d
Fixed multiple declarations
Aug 24, 2020
0b4f299
Let benches generate despite the new function set1l
Jun 2, 2020
58f1854
Add a module offering a vectorized random generator
Aug 24, 2020
f77a5d1
Only generate rand module if flags passed from hatch are correct
Aug 26, 2020
211428c
Removed F-strings
Sep 14, 2020
28a5c9e
Fix build.nsconfig
rodolphe-c Sep 17, 2020
4c4df4a
Fix generation of rand module
Sep 21, 2020
0fcfa9a
Building the library does not require C++14 anymore, C++98 is more th…
Sep 21, 2020
1dfb24d
Update README
Sep 21, 2020
8f27c7d
Update README
Sep 21, 2020
e468fb6
Setup.sh clone nstools using the same protocol as nsimd
Sep 29, 2020
1d1d00d
Add possibility to ignore tests/benches/...
Sep 29, 2020
1ad8ed2
Add C++20 concepts to nsimd.h
Oct 1, 2020
f317ebb
Add C++20 concepts to cxx_adv_api.hpp
Oct 5, 2020
3624827
Add C++20 concepts to Python-generated functions
Oct 12, 2020
fdc1ef9
Fix C++20 concepts
Oct 13, 2020
e55a7ea
Prepare support for oneAPI
Oct 14, 2020
f7d75d9
Add C++20 concepts doc
Oct 14, 2020
c0bdf3b
Modify the rand module to allow generation with python 3.5 and earlier
Oct 14, 2020
b46f330
Improve doc + rename module rand --> random
Oct 21, 2020
a07141e
Fix menu of doc of random module
Oct 21, 2020
ad45c85
Fix availability of scoped_mem...
Oct 23, 2020
eecde37
Fix tests to_pack*
Oct 27, 2020
a2f31c1
Tests are dependant of the SIMD architecture
Nov 10, 2020
a3eaf08
Improvements for Intel + Fixes for KNL
Nov 13, 2020
ad1e727
More fixes for KNL and C89
Nov 15, 2020
917c3cc
More fixes
Nov 22, 2020
28e6344
Fix fms/fnms for aarch64
Nov 27, 2020
60dd572
Fixes for SVE
Nov 27, 2020
c6ca5a7
Fix warning when compiling for 32-bits targets
Nov 27, 2020
e59e4e1
Cleaning in tests generation
Nov 29, 2020
0fe0ef2
Fix ULP bounds for some operators
Nov 29, 2020
415eca2
Almost all tests are passing on 32-bits platform
Nov 30, 2020
30faa8a
No more warning for 32-bits compilations
Nov 30, 2020
2835544
Forgot a file
Nov 30, 2020
f683f27
Fix last errors in philox
Nov 30, 2020
6fb3f46
First version of quick'n'dirty CI
Nov 30, 2020
72870bb
Fix warnings
Dec 2, 2020
98395f4
Fix more warnings
Dec 3, 2020
0b1fbea
Fix Python generation for module/random
Dec 3, 2020
0b99216
Fix fnms for SSE2 and SSE42
Dec 3, 2020
5c5d72c
Try again to fix warnings for GCC
Dec 3, 2020
c25e77f
Fix warnings for Clang
Dec 5, 2020
8176137
Add variable to compile for a given CUDA GPU
Dec 6, 2020
def6d79
Fix warnings for ROCm/HIP
Dec 6, 2020
a9c3bdd
Fix CUDA f16 implementation
Dec 7, 2020
75388b0
Fix CUDA f16 implementation
Dec 7, 2020
37d1689
Fix CUDA f16 implementation
Dec 7, 2020
f620ba9
Reduce size of arrays for GPU testing
Dec 7, 2020
34f5bed
Reduce size of arrays for GPU testing
Dec 7, 2020
cd895c6
Compile .so with nvcc and hipcc for binary compatibility
Dec 8, 2020
6a49e6e
Fix build.nsconfig
Dec 8, 2020
2aee251
Fix build.nsconfig
Dec 8, 2020
a398f38
Fix build.nsconfig
Dec 8, 2020
0bcb59e
Fix build.nsconfig
Dec 8, 2020
536b622
Improve CI script + add static in NSIMD_INLINE
Dec 9, 2020
2afdc1d
Fix build.nsconfig for HIP
Dec 9, 2020
e40e522
Last fixes
Dec 9, 2020
6ddd56d
Fix issue: __popcnt64 not available in 32-bits mode
Dec 9, 2020
1b01abb
Fix DLL specifier of *logulps*
Dec 9, 2020
2b9059d
Fix MSVC 32-bits related issues
Dec 9, 2020
3fbaea3
Cosmetic
Dec 11, 2020
018b487
Add __vectorcall for MSVC 32-bits
Dec 11, 2020
d6f06e2
Update .gitignore
Dec 11, 2020
5f2045b
Merge branch 'master' into spmd
Dec 11, 2020
2 changes: 2 additions & 0 deletions .clang-format
@@ -0,0 +1,2 @@
Standard: Cpp03
ColumnLimit: 79
45 changes: 38 additions & 7 deletions .gitignore
@@ -1,34 +1,65 @@
## Build system
build
# Common build dirs
build*/

## Auto-generated
# Dependencies
nstools/

# Binaries
*.o
*.so
*.pyc
*.exe
*.dll
*.dylib

# Generated files
## API
src/api_*.cpp
src/api_*

## Plateform specific code
include/nsimd/arm
include/nsimd/cpu
include/nsimd/cxx_adv_api_functions.hpp
include/nsimd/friendly_but_not_optimized.hpp
include/nsimd/functions.h
include/nsimd/ppc
include/nsimd/x86
src/api_*

## Tests
tests/c_base
tests/cxx_base
tests/cxx_adv
tests/modules/tet1d/
tests/modules/fixed_point/
tests/modules/rand/*.cpp
tests/modules/spmd/
tests/modules/random/

## Benches
benches/cxx_adv
_deps
_install
doc/html

## Modules
include/nsimd/modules/tet1d/
include/nsimd/modules/spmd/
include/nsimd/modules/fixed_point/
include/nsimd/scalar_utilities.h

## Doc
doc/html
doc/markdown/overview.md
doc/markdown/api.md
doc/markdown/api_*.md
doc/markdown/module_fixed_point_api*.md
doc/markdown/module_fixed_point_overview.md
doc/markdown/module_spmd_api*.md
doc/markdown/module_spmd_overview.md
doc/markdown/module_memory_management_overview.md
doc/md2html
doc/tmp.html

## Ulps
ulps/

## CI
_ci/
286 changes: 0 additions & 286 deletions CMakeLists.txt

This file was deleted.
