Commits on Source
6186
8f088865
SEM test updates.
Nov 08, 2011
ee602dc2
Merge git://github.com/inducer/loopy
Nov 08, 2011
6ef2c660
Add edit_code flag to automated testing harness.
Nov 08, 2011
b5d7ff2d
Move CSE check to scheduling entrypoint.
Nov 08, 2011
41a88c2c
Properly recurse in CSE expansion.
Nov 08, 2011
7c8dcfa0
Give a better error message for non-matching matching variables.
Nov 08, 2011
be92e8ba
Test hackery.
Nov 09, 2011
5d0d02c8
Add lmem SEM Laplacian back to code.
Nov 09, 2011
73b991f1
Try to schedule entirely without boosting first before allowing it.
Nov 09, 2011
12452330
Cache result of dim_{max,min}.
Nov 09, 2011
1af94f84
An initial attempt at dealing with equality constraints.
Nov 09, 2011
abbded75
Kill CSEs. Add substitution rules, precomputes.
Nov 10, 2011
69bd5756
Fixes for the subst rule/precompute rewrite.
Nov 10, 2011
6f7ba19a
Many more fixes to subst-precompute.
Nov 11, 2011
3a86e395
Attempt simple hull if outcome of get_footprint is not convex.
Nov 11, 2011
bfd2cc5f
Expand reductions outer-to-inner using a queue.
Nov 11, 2011
7f257cfb
Adapt iname dependency finding to properly exclude reduction inames.
Nov 11, 2011
de8e8429
Improvements to FEM assembly.
Nov 11, 2011
e9baab6d
Preserve tags when duplicating reduction inames, plus more FEM assembly fixes.
Nov 11, 2011
9ad444e6
Pick (more) sensible names for reduction inames. Remove non-ND FEM quadrature.
Nov 11, 2011
3cd2819c
Add some documentation. Expose 'with_annotation' flag.
Nov 11, 2011
2f5da323
Add a better output format for kernels.
Nov 11, 2011
847ffd67
Upgrade precomputation to make storage dims and sweep dims independent, allow iname sweeps.
Nov 11, 2011
e2e48066
Fix exposure of with_annotation flag.
Nov 11, 2011
c700eb6d
Switch ill-advised outer 'unroll' on split automatic axis back to sequential.
Nov 11, 2011
8fe68119
Fix doc upload script.
Nov 11, 2011
6d6de311
Many scattered fixes. Full-prefetch SEM with ILP generates code.
Nov 12, 2011
c05844ee
Fix up FEM quadrature example.
Nov 12, 2011
c7913fcf
One more FEM tweak.
Nov 12, 2011
421c41d9
Make better message about device used for ref calculation.
Nov 12, 2011
ac02f82d
Only include floor_div code if needed.
Nov 20, 2011
bf38b67c
First shot at instance-based subst rule precomputation.
Nov 20, 2011
0d8961db
Tim's SEM tweaks.
Nov 20, 2011
04ff7222
Merge branch 'master' of ssh://forge.tiker.net:2234/loopy
Nov 20, 2011
0327e979
Use PyOpenCL to translate dtypes into C types.
Jan 16, 2012
5e72d5e3
Hack auto test rig to support vector types, add flag about output filling.
Jan 16, 2012
4b34ba94
Parse vectorized reductions. Spelling: 'sum_vec_float4'
Jan 16, 2012
26a50987
Test float4 functionality.
Jan 16, 2012
9e2640d2
Fix prefill criterion.
Jan 17, 2012
c8064886
Fix FEM assembly code.
Jan 17, 2012
8ba555dc
Expose reduction realization to the user. Add more notes.
Jan 18, 2012
0902408b
Fix Fig 3.3 variant of FEM assembly.
Jan 18, 2012
72fdc4d2
Provide interactive assistance if the scheduler fails.
Jan 18, 2012
9e30e55e
Implement complex-valued arithmetic.
Jan 23, 2012
8dd4ed28
Meeting notes w/ Tim at Reagan.
Feb 05, 2012
ea359a6a
Make switch to turn off complex-valued expression codegen.
Feb 05, 2012
ff0933a5
"Properly" parse instructions with multiple equal signs.
Feb 05, 2012
59587042
Add parsing for "if()".
Feb 05, 2012
cda499e4
Add some doc hints that I'm likely to forget.
Feb 05, 2012
3933b49a
Provide a better error message for inappropriate assignment lhs.
Feb 05, 2012
8e4493cd
Default sweep_dims kwarg on add_prefetch() to empty.
Feb 05, 2012
b16f4dc8
Parse "N" as a shape/stride spec in kernel input.
Feb 12, 2012
04e4ee91
Before ILP refactor.
Feb 20, 2012
de71343d
Make ILP go through regular loop emission, implement ilp.seq.
Feb 21, 2012
fd78a38d
Remove TODO from FEM assembly ilp.seq example.
Feb 21, 2012
d20e0267
Add footprint generators for prefetch.
Mar 02, 2012
d68e69a8
Add SEM test case from Reagan airport with Tim.
Mar 02, 2012
5def2580
Tweaks to (help) the n-body test case.
Mar 02, 2012
6d8feb56
Multiple things, mainly related to prefetch.
Mar 03, 2012
c2e5f5cd
auto_test_vs_ref: (somewhat) adaptively choose trip count for timing.
Mar 25, 2012
a4ee6435
Print error value in auto_test_vs_ref().
Mar 26, 2012
fdd57589
Some documentation work.
Mar 26, 2012
82da94f8
Factor out find_all_insn_inames() into separate function.
Mar 26, 2012
6b4d220f
Rename find_{readers,writers} -> {reader,writer}_map.
Mar 26, 2012
db4194c1
Avoid write races when determining whether a temporary should be local.
Mar 26, 2012
6a40ca21
Use '$' to tag prefetches, precomputes.
Mar 26, 2012
e7c6ea1f
More doc work.
Mar 26, 2012
655b07d2
Improve the user interface of precompute(), by unifying subst_name and footprint_generators.
Mar 26, 2012
9ecdddce
Fix a variety of bugs in tagged subst rule use.
Mar 26, 2012
7b4424f2
Fix precompute/prefetch bugs that kept the n-body test from working.
Mar 26, 2012
6e56a13e
Fix tag matching condition in precompute().
Mar 26, 2012
91f88bcb
Introduce TypedCSE.
Mar 29, 2012
c4a53779
Fix criterion for when a temporary wants to be local. (respect no-parallel-inames case)
Mar 29, 2012
35278764
Accept a list of strings for the preamble. Helps with different indentations.
Mar 29, 2012
5979b90a
Be more flexible when accepting shape tuples of arrays: ("n", expr) is now ok.
Mar 29, 2012
b3cc950a
Allow the user to supply type information about functions.
Mar 29, 2012
7529d1a4
Let TypeInferenceMapper take its own temporary_variables array. (Also muck...
Mar 29, 2012
7ca32e27
Upon kernel construction, if an insn doesn't have an id, make one.
Mar 29, 2012
0b7362ae
Fixes to auto-test.
Mar 29, 2012
070a6483
Expand CSEs. Factor make_kernel into multiple parts in a separate file.
Mar 29, 2012
0ff6e477
Fix a parenthesization bug in complex expression generation.
Mar 29, 2012
df7dbba8
Find sensible default for whether to allow complex-valued variables.
Mar 30, 2012
a0ab7ceb
Make convenient interface for calling CompiledKernel instances.
Mar 30, 2012
b60f753d
Do not default in-creation duplicated inames to local. Bad idea.
Mar 30, 2012
dd451906
Fix terminology in preprocess.mark_local.
Mar 30, 2012
f55f17e0
CompiledKernel: return outputs as a separate list.
Mar 31, 2012
52af11d2
Add correct type inference for struct member lookup.
Apr 02, 2012
a9606c7f
Generate correct code for widening to complex on assignment.
Apr 02, 2012
1c48b2c1
Fix CSE error message in expr codegen.
Apr 02, 2012
c021313e
CompiledKernel: Default to host output if only host input encountered.
Apr 02, 2012
94b1b76e
Add LoopKernel.get_var_descriptor().
Apr 02, 2012
5021fd9e
Add a facility for type inference on temporary variables.
Apr 02, 2012
06c0bfc2
Better __repr__ for ArrayArg.
Apr 05, 2012
4ebc1af1
Make checking in CompiledKernel use assert.
Apr 05, 2012
b481ad2a
Report errors for incorrectly indexed arguments.
Apr 05, 2012
97ceed45
CompiledKernel: Provide error in case of unneeded keyword arguments.
Apr 06, 2012
9b751a81
Rename ArrayArg -> GlobalArg, ConstantArrayArg -> ConstantArg.
Apr 17, 2012
e2a84f85
Add deduplicating preambles mechanism.
Apr 17, 2012
4342e827
Defer temporary type inference until preprocessing.
Apr 17, 2012
009b410f
Allow ()-shaped global args that work properly.
Apr 17, 2012
6,086 additional commits have been omitted to prevent performance issues.