Commits on Source (4947)
b3375f95
Bump version
Jan 30, 2016
f16c6bef
created memoized insn count function
Feb 06, 2016
f6688709
working on memoizing, get_dram_access_poly currently broken
Feb 07, 2016
c3085dea
Merge branch 'master' of github.com:jdsteve2/loopy into uniform
Feb 08, 2016
27b94c94
Make loopy kernel Python-hashable
Feb 08, 2016
8a747238
Finish up uniform load counting
Feb 08, 2016
32ae06ce
Add rename_argument
Feb 11, 2016
24442143
rename_argument docs
Feb 11, 2016
9b6b7dcc
Consider substitution rules in remove_unused_arguments
Feb 12, 2016
6584d232
Consider substitution rules and reduction inames in remove_unused_inames
Feb 12, 2016
ec7562d0
added special func counting
Feb 18, 2016
8c3c641a
Tweak ispc target for ispc 1.9
Feb 18, 2016
4c99d2c8
Add ispc stream harness
Feb 18, 2016
562b11d6
Merge porter:src/loopy
Feb 18, 2016
eec284dd
Take into account strides in counting fallback
Feb 23, 2016
ef12b19f
Merge remote-tracking branch 'upstream/master'
Feb 23, 2016
844ff292
Start to clean out the Augean stables of loopy docs
Feb 24, 2016
17785183
ISPC: Don't assume a local size is present
Feb 24, 2016
f6985106
Don't import pyopencl at the top level in pyopencl target
Feb 24, 2016
994b2600
Improve split_iname docs
Feb 24, 2016
3db1c3f3
Add (undocumented) separate_loop_head_tail_slab
Feb 24, 2016
10d550be
Tweak ispc test harness
Feb 24, 2016
e8707f94
Avoid AMD CPU CL for ref test
Feb 25, 2016
e6623cb9
Blacklist Intel CPU CL as well
Feb 25, 2016
9e187a7c
For loop generation: Use loopy PODs to generate correct types
Feb 25, 2016
fe922f94
ISPC Target: build custom type registry
Feb 25, 2016
fb3a9f2c
ISPC stream harness seems to get ICC performance (fingers crossed :)
Feb 25, 2016
943f16b2
More tweaking the ISPC stream harness
Feb 25, 2016
872e78b8
Fix botched timing in ISPC harness (vrrrroom?)
Feb 25, 2016
00e9cd3d
ISPC run harness: Initialize arrays, bring performance in line with non-goldilocks ICC perf
Feb 25, 2016
727f0ddc
Remove unused harness codegen code
Feb 25, 2016
17e883b9
Fix pip install docs
Feb 25, 2016
86d7b15b
Don't hardcode the ispc binary location
Feb 25, 2016
d2c85cdb
XFail a test on AMD
Feb 26, 2016
0e00a7ec
XFail axpy test on AMD (really?)
Feb 26, 2016
e850b82e
Fix usage of loopy POD in for loop codegen
Feb 26, 2016
c6547382
Add support for vendor blacklisting in autotest
Feb 26, 2016
0335c148
Blacklist pocl for test_magma_fermi_matrix_mul
Feb 26, 2016
e54d5008
Blacklist AMD for axpy ref test
Feb 26, 2016
17bf626a
Merge porter:src/loopy
Feb 26, 2016
7ce132e7
Add data_flow argument to fuser
Feb 26, 2016
1e04e149
Fix missed use of expand_subst'ed kernel in assignment_to_subst
Feb 26, 2016
25ebd6d1
Preserve insn order in fusion
Feb 26, 2016
23600777
Use islpy.SuppressedWarnings to avoid terminal noise
Feb 27, 2016
229f649d
PEP8 fix
Feb 27, 2016
71123367
Fix, enable self-check for gNUMA diff
Feb 27, 2016
766de527
Add support for colorization in loopy terminal output
Feb 28, 2016
6e176aed
Better dead-end errors, scheduler debug colors
Feb 28, 2016
2d6f6c4d
generate_code: call prior codegen steps if not done already
Feb 28, 2016
d27f70c1
Fix terminal color stubs
Feb 28, 2016
272ec1a9
Fix doctests for terminal color
Feb 28, 2016
e0f7f9f8
Enforce uniqueness of reduction inames
Feb 28, 2016
a15d6e60
Remove debug output
Feb 28, 2016
b372f216
Make identity mappers work on reduction inames
Feb 29, 2016
cbe3d249
Fix tests for simul_reduce
Feb 29, 2016
f7b38a4a
Fix Poisson test
Feb 29, 2016
971f3759
Add make_reduction_inames_unique
Feb 29, 2016
b88c9990
remove_unused_arguments: Consider uses in data shapes/offsets/strides
Mar 01, 2016
538e29e9
Move shared ispc tools to loopy.tools
Mar 03, 2016
9ea045bb
Merge branch 'master' of ssh://github.com/inducer/loopy
Mar 03, 2016
a5eb463f
Fix specifying targets in parse_fortran
Mar 03, 2016
c74b1b69
Fix generation of (empty) launch specs in ISPC
Mar 03, 2016
e336b4da
Fix declaration of temporaries in ISPC target
Mar 03, 2016
16400720
Typo fix
Mar 03, 2016
c017b0f2
Add 'sequential' flag to to_batched
Mar 03, 2016
9a9ac204
Merge remote-tracking branch 'upstream/master'
Mar 09, 2016
3fd9a701
changed call type to func:name in op counting
Mar 10, 2016
be06a916
Merge pull request #19 from jdsteve2/master
Mar 10, 2016
0494c442
replaced consec, uniform, and nonconsec subscript types with DataAccess class
Mar 12, 2016
09420000
subscript counter only looking for lid0 now, if not found, setting stride to maxsize
Mar 13, 2016
6d49fcad
Add Python sparse matvec example
Mar 14, 2016
a9951731
For 1D arrays, allow undetermined shape in shape determination
Mar 15, 2016
2dbeb387
now calculating strides greater than 1
Mar 15, 2016
0573d545
added StridedGmemAccess class, now used as key in gmem access dicts
Mar 15, 2016
c0dcf557
replaced operation tuples with TypedOp class
Mar 15, 2016
0f2a2519
Merge remote-tracking branch 'upstream/master'
Mar 16, 2016
ca39958d
Remove debug print
Mar 16, 2016
bfd67b90
Enable/define language representation of atomic ops
Mar 16, 2016
be7c7963
Enable persistent hashing for loopy types
Mar 16, 2016
63369be5
Fix mis-implemented __ne__
Mar 16, 2016
dc2c7736
More numpy type system fallout
Mar 16, 2016
0c23b682
Fix types in stats gathering
Mar 16, 2016
78230b72
Some missing 'NumpyType' wrappers in nbody
Mar 16, 2016
e7142044
Fix image data fetch in brave new type system
Mar 16, 2016
8c673827
Bump data model to void broken caches
Mar 16, 2016
ebeac660
Mark atomics unvectorizable
Mar 16, 2016
8efe8657
More type system fixery regarding vectors
Mar 16, 2016
75842489
Some missing NumpyDtype wrapping
Mar 21, 2016
3e188fa7
Improve distinguishability between numpy dtypes and loopy types
Mar 21, 2016
86292def
Fix another bare-numpy-type escape
Mar 21, 2016
256cea87
Fix pow() code generation for new type system
Mar 21, 2016
a47e6677
Fix tracing after type system apocalypse
Mar 21, 2016
92b3ef6c
Support generating atomic operations (at least for OpenCL 1.x)
Mar 21, 2016
147d27f5
Type system fixes
Mar 21, 2016
8d294c35
Skip int64 test on old PyOpenCL
Mar 22, 2016
60cd3aad
Introduce a notion of 'scope' for temporary variables, in preparation for global temporaries
Mar 22, 2016
e39085bd
Allow atomics on local variables
Mar 22, 2016
b320b9c5
Add atomic ops to tutorial
Mar 22, 2016
e652f9ac
Doc reference fixes
Mar 22, 2016
4b460f71
Switch 'breadth' tests to use POCL
Mar 22, 2016
4,847 additional commits have been omitted to prevent performance issues.