Commits on Source
3231
11d4c1ac
Improve error message for prefech-on-indirect-access, add test (#104 on gitlab)
Nov 23, 2017
cfd5b944
Introduce separate UnableToConvertToAffineExpression exception
Nov 23, 2017
1e52119a
Fix py2 incompatibility in test_prefetch_through_indirect_access
Nov 23, 2017
3e1faa27
added truediv to guarded poly
Jan 30, 2018
f89abc95
added count_granularity to Op
Jan 30, 2018
feee7c96
added count_granularity to MemAccess
Jan 30, 2018
f82702de
setting count_granularity in MemAccessCounter
Jan 30, 2018
8b4750d5
get_mem_access_map() using count_granularity in counting
Jan 30, 2018
62932b14
added wsize argument to get_mem_access_map() for count_granularity
Jan 30, 2018
3c84a241
updated stats tests to use/test count_granularity
Jan 30, 2018
70d17d3b
updated tutorial doctests for addition of count_granularity, still needs explanations
Jan 30, 2018
152e17ea
changed default count granularity to None instead of thread, updated tests and...
Jan 30, 2018
88e505b0
flake8 fixes
Jan 30, 2018
c5cff697
factoring out m in polynomial
Jan 30, 2018
bc1b9f71
changed dependency url
Jan 30, 2018
777eb04a
renamed warp->subgroup
Jan 30, 2018
80c6f8b4
renaming thread->workitem
Jan 30, 2018
46f9acab
renaming thread->workitem in tutorial
Jan 30, 2018
9ba5d96d
renaming warp->subgroup in tutorial
Jan 30, 2018
61595cb3
inheriting from record in Op and MemAccess
Jan 30, 2018
3d8945c3
line too long, shortened
Jan 30, 2018
db6d9a4a
updated docstrings
Jan 30, 2018
93bda9b2
removing finished TODO
Jan 30, 2018
d1433a2a
no longer modifying maps in place when converting to numpy types
Jan 30, 2018
dcd7259f
fixing formatting problems
Jan 30, 2018
6dfc346b
ensuring count_granularity values are valid in Op.__init__ and MemAccess.__init__
Jan 30, 2018
2705f321
fixing more flake8 issues
Jan 30, 2018
8c75f8ee
changed truediv->floordiv, added ceildiv
Jan 30, 2018
0c6fa601
fixed count_granularity rounding behavior for groups not evenly divisible by...
Jan 30, 2018
aa7fb37a
removed now-unused div functions in GuardedPwQPolynomial
Jan 30, 2018
f591dac0
fixed flake8 issues
Jan 30, 2018
d1df5444
fixed flake8 issue
Jan 30, 2018
fffeb48b
updated doctests for count_granularity rounding changes
Jan 30, 2018
0df9212c
renamed kernel
Jan 30, 2018
ca62d049
putting unnecessary default arg back into call to count_insn_runs
Jan 30, 2018
9b760d4a
made test_mem_access_counter_mixed handle non-barvinok counting
Jan 30, 2018
5e94786c
added warning, get_insn_count uses upper bound for group size
Jan 30, 2018
b2636837
re-worded warning
Jan 30, 2018
ef79671a
made docstring comment more precise about usage of counts
Feb 08, 2018
d56044f8
can't pass kwarg to get_insn_count when using @memoize_in, so passing...
Feb 08, 2018
7e237cf0
added unused subgroup_size argument to get_op_map and get_syncronization_map...
Feb 08, 2018
6fbf29d7
fixed flake8 issue
Feb 08, 2018
f790e439
comment documenting reason for overriding Record.__repr__
Feb 09, 2018
3f8d6a2c
remember instructions sorting between recursions
Feb 16, 2018
039177e9
invalidate instruction sorting when active group changes
Feb 16, 2018
479d89b8
made CountGranularity class to contain cg strings
Feb 19, 2018
c5a06ed8
attempt to get subgroup size from device, don't guess unless explicitly told
Feb 19, 2018
8af713d3
fixing flake8 issues
Feb 19, 2018
6ffb9e3a
passing subgroup size to get_mem_access_map in tutorial
Feb 19, 2018
66693459
added CountGranularity to loopy.__init__
Feb 19, 2018
dd57c360
fixed flake8 issue
Feb 20, 2018
d1a587c3
added CountGranularity to __all__
Feb 20, 2018
4af3e4be
passing subgroup_size (now required) in test_gnuma_hroiz_kernel
Feb 20, 2018
b03a9222
defined work-item, sub-group, and group
Feb 20, 2018
b5024933
made docstring indentation more consistent
Feb 20, 2018
8406772c
more subgroup_size and count_granularity doc
Feb 20, 2018
4243a563
renamed group->work-group to match opencl terminology
Feb 20, 2018
c44ebfd0
fixed merge conflicts in tutorial.rst
Feb 20, 2018
f8cf6fcf
added CountGranularity.ALL to list all granularities
Feb 20, 2018
98d7b546
using enum instead of string for count granularity in tutorial
Feb 20, 2018
abf3f04b
reorganized subgroup_size processing/guessing for clarity
Feb 21, 2018
509692c6
fixing flake8 issues
Feb 21, 2018
309775a7
updated subgroup_size explanation slightly
Feb 21, 2018
d00bfdee
improved docs
Feb 23, 2018
60ed8850
Bump pytools requirement
Feb 23, 2018
c4425fed
Merge branch 'gc-before-scheduler-return' into 'master'
Feb 24, 2018
68a108ba
string->str in docs
Feb 24, 2018
4691e39b
Merge branch 'count-granularity2' into 'master'
Feb 24, 2018
cadc7fce
no longer ignoring local stores in mem access counting
Feb 25, 2018
54982a9d
Merge branch 'include-local-stores-in-mem-access-map' into 'master'
Feb 25, 2018
8a2f855a
Fix, test half-complex-half-not conditionals
Feb 26, 2018
c3045342
Merge branch 'fix-half-complex-conditionals' into 'master'
Feb 26, 2018
b7e39fd0
Fix pytest script-based test invocation
Feb 27, 2018
c3a384bd
Remove unused domain parameter finder
Feb 27, 2018
70e7c816
Merge branch 'remove-domain-par-finder' into 'master'
Feb 27, 2018
681b4133
invalidate inst_ids_to_try when active group changes
Feb 28, 2018
68cbf7ab
Merge branch 'insn_sort_cache' into 'master'
Feb 28, 2018
713a8244
now collecting all gmem strides for all local ids in the index
Mar 03, 2018
428547a2
fixing flake8 issues
Mar 04, 2018
fa12cdb4
switched lid_strides from list to dict in order to distinguish between two...
Mar 04, 2018
19525df9
updated docstrings
Mar 04, 2018
a42e7ffb
fixing grammar in doc string
Mar 04, 2018
24abd998
updated tutorial for lid_strides change
Mar 04, 2018
358f2a8b
removed extra blank line
Mar 04, 2018
5f433a50
corrected lid_strides doc
Mar 04, 2018
6a71d56b
Merge branch 'adding-all-local-strides-to-mem-access' into 'master'
Mar 06, 2018
2c8b8576
Speed up overlap checking by batching/caching access range
Mar 08, 2018
9d50fa92
Merge branch 'speed-up-access-range-overlap-check' into 'master'
Mar 08, 2018
7460728d
Stop using apparently-deprecated Set.add in save/load transform
Mar 10, 2018
ea41fefc
Switch from AMD CPU to Intel CPU tests
Mar 10, 2018
55374b52
Stop testing on proprietary CL implementations altogether
Mar 10, 2018
32bf4dd2
Merge branch 'set-add-to-set-add-dims' into 'master'
Mar 10, 2018
8ed23652
Try hard to find a git revision to use as cache key
Mar 11, 2018
5b5b5cb3
Add missing paren in comment
Mar 11, 2018
b1bcb00e
Merge branch 'use-git-rev-as-cache-key' into 'master'
Mar 11, 2018
5835b35d
Allow two CentOS build retries on failure
Mar 11, 2018
f21ed48e
Prevent import from breaking version-number finding in doc build
Mar 11, 2018
21c0b651
Added support for ScopedFunctions
Mar 13, 2018
47a73915
Added support for scoping functions at creation time.
Mar 13, 2018
0a7c4263
Checked that the functions are scoped.
Mar 13, 2018
3,131 additional commits have been omitted to prevent performance issues.
Loading
Loading