Commits on Source
6158
c7913fcf
One more FEM tweak.
Nov 12, 2011
421c41d9
Make better message about device used for ref calculation.
Nov 12, 2011
ac02f82d
Only include floor_div code if needed.
Nov 20, 2011
bf38b67c
First shot at instance-based subst rule precomputation.
Nov 20, 2011
0d8961db
Tim's SEM tweaks.
Nov 20, 2011
04ff7222
Merge branch 'master' of ssh://forge.tiker.net:2234/loopy
Nov 20, 2011
0327e979
Use PyOpenCL to translate dtypes into C types.
Jan 16, 2012
5e72d5e3
Hack auto test rig to support vector types, add flag about output filling.
Jan 16, 2012
4b34ba94
Parse vectorized reductions. Spelling: 'sum_vec_float4'
Jan 16, 2012
26a50987
Test float4 functionality.
Jan 16, 2012
9e2640d2
Fix prefill criterion.
Jan 17, 2012
c8064886
Fix FEM assembly code.
Jan 17, 2012
8ba555dc
Expose reduction realization to the user. Add more notes.
Jan 18, 2012
0902408b
Fix Fig 3.3 variant of FEM assembly.
Jan 18, 2012
72fdc4d2
Provide interactive assistance if the scheduler fails.
Jan 18, 2012
9e30e55e
Implement complex-valued arithmetic.
Jan 23, 2012
8dd4ed28
Meeting notes w/ Tim at Reagan.
Feb 05, 2012
ea359a6a
Make switch to turn off complex-valued expression codegen.
Feb 05, 2012
ff0933a5
"Properly" parse instructions with multiple equal signs.
Feb 05, 2012
59587042
Add parsing for "if()".
Feb 05, 2012
cda499e4
Add some doc hints that I'm likely to forget.
Feb 05, 2012
3933b49a
Provide a better error message for inappropriate assignment lhs.
Feb 05, 2012
8e4493cd
Default sweep_dims kwarg on add_prefetch() to empty.
Feb 05, 2012
b16f4dc8
Parse "N" as a shape/stride spec in kernel input.
Feb 12, 2012
04e4ee91
Before ILP refactor.
Feb 20, 2012
de71343d
Make ILP go through regular loop emission, implement ilp.seq.
Feb 21, 2012
fd78a38d
Remove TODO from FEM assembly ilp.seq example.
Feb 21, 2012
d20e0267
Add footprint generators for prefetch.
Mar 02, 2012
d68e69a8
Add SEM test case from Reagan airport with Tim.
Mar 02, 2012
5def2580
Tweaks to (help) the n-body test case.
Mar 02, 2012
6d8feb56
Multiple things, mainly related to prefetch.
Mar 03, 2012
c2e5f5cd
auto_test_vs_ref: (somewhat) adaptively choose trip count for timing.
Mar 25, 2012
a4ee6435
Print error value in auto_test_vs_ref().
Mar 26, 2012
fdd57589
Some documentation work.
Mar 26, 2012
82da94f8
Factor out find_all_insn_inames() into separate function.
Mar 26, 2012
6b4d220f
Rename find_{readers,writers} -> {reader,writer}_map.
Mar 26, 2012
db4194c1
Avoid write races when determining whether a temporary should be local.
Mar 26, 2012
6a40ca21
Use '$' to tag prefetches, precomputes.
Mar 26, 2012
e7c6ea1f
More doc work.
Mar 26, 2012
655b07d2
Improve the user interface of precompute(), by unifying subst_name and footprint_generators.
Mar 26, 2012
9ecdddce
Fix a variety of bugs in tagged subst rule use.
Mar 26, 2012
7b4424f2
Fix precompute/prefetch bugs that kept the n-body test from working.
Mar 26, 2012
6e56a13e
Fix tag matching condition in precompute().
Mar 26, 2012
91f88bcb
Introduce TypedCSE.
Mar 29, 2012
c4a53779
Fix criterion for when a temporary wants to be local. (respect no-parallel-inames case)
Mar 29, 2012
35278764
Accept a list of strings for the preamble. Helps with different indentations.
Mar 29, 2012
5979b90a
Be more flexible when accepting shape tuples of arrays: ("n", expr) is now ok.
Mar 29, 2012
b3cc950a
Allow the user to supply type information about functions.
Mar 29, 2012
7529d1a4
Let TypeInferenceMapper take its own temporary_variables array. (Also muck...
Mar 29, 2012
7ca32e27
Upon kernel construction, if an insn doesn't have an id, make one.
Mar 29, 2012
0b7362ae
Fixes to auto-test.
Mar 29, 2012
070a6483
Expand CSEs. Factor make_kernel into multiple parts in a separate file.
Mar 29, 2012
0ff6e477
Fix a parenthesization bug in complex expression generation.
Mar 29, 2012
df7dbba8
Find sensible default for whether to allow complex-valued variables.
Mar 30, 2012
a0ab7ceb
Make convenient interface for calling CompiledKernel instances.
Mar 30, 2012
b60f753d
Do not default in-creation duplicated inames to local. Bad idea.
Mar 30, 2012
dd451906
Fix terminology in preprocess.mark_local.
Mar 30, 2012
f55f17e0
CompiledKernel: return outputs as a separate list.
Mar 31, 2012
52af11d2
Add correct type inference for struct member lookup.
Apr 02, 2012
a9606c7f
Generate correct code for widening to complex on assignment.
Apr 02, 2012
1c48b2c1
Fix CSE error message in expr codegen.
Apr 02, 2012
c021313e
CompiledKernel: Default to host output if only host input encountered.
Apr 02, 2012
94b1b76e
Add LoopKernel.get_var_descriptor().
Apr 02, 2012
5021fd9e
Add a facility for type inference on temporary variables.
Apr 02, 2012
06c0bfc2
Better __repr__ for ArrayArg.
Apr 05, 2012
4ebc1af1
Make checking in CompiledKernel use assert.
Apr 05, 2012
b481ad2a
Report errors for incorrectly indexed arguments.
Apr 05, 2012
97ceed45
CompiledKernel: Provide error in case of unneeded keyword arguments.
Apr 06, 2012
9b751a81
Rename ArrayArg -> GlobalArg, ConstantArrayArg -> ConstantArg.
Apr 17, 2012
e2a84f85
Add deduplicating preambles mechanism.
Apr 17, 2012
4342e827
Defer temporary type inference until preprocessing.
Apr 17, 2012
009b410f
Allow ()-shaped global args that work properly.
Apr 17, 2012
aaa27768
Add type inference on reduction expressions.
Apr 17, 2012
a9e2e164
Allow more than one function return type inference function.
Apr 17, 2012
796f5530
Hack reduction machinery enough to implement argmax.
Apr 17, 2012
bedbc2a9
Add "no_run" to CompiledKernel execution call, for debugging.
Apr 18, 2012
791820ca
Fix type inference for temporaries.
Apr 18, 2012
627fdc03
Duplicate reduction inames before expanding CSEs.
Apr 18, 2012
8bb953ff
Improve ArrayArg deprecation warning.
Apr 18, 2012
d18db766
Fix CSE type assignment if no type specified.
Apr 18, 2012
53c3a882
Add result dtype getters to reductions.
Apr 18, 2012
c60c9495
Enable the use of type inference for reduction.
Apr 18, 2012
e1192c05
Some whitespace fixes.
Apr 18, 2012
8f4f32cd
Cache bounds checks in build_loop_nest.
Apr 19, 2012
d35842be
print_highlighted_code -> get_highlighted_code.
Apr 25, 2012
014a06cb
Fix code generation for scalar global array indices without shape info.
Apr 25, 2012
497978b0
Fix bounds checking cache in control structure generation.
Apr 25, 2012
93f09b9f
Minor doc addition.
Apr 26, 2012
68c8912f
Add symbol (i.e. variable/constant) mangling.
May 05, 2012
32edb1e6
Add function mangler registration.
May 08, 2012
57aedc85
CSE-on-creation: Don't create variables aliasing other variables.
May 10, 2012
52f0da69
Compiled: Don't flag KeyboardInterrupt as a build failure.
May 10, 2012
c2536330
Match code style of mixed complex/real code to real-only.
May 10, 2012
2fd1fe21
Allow manglers to demand argument casts. Add manglers for pyopencl-complex.h.
May 10, 2012
51f50b66
Use better file name to get syntax highlighting on edit_code=True.
May 13, 2012
33157609
Improve progress feedback from scheduler.
May 13, 2012
e0363bfb
Add memo note about groupwise scheduling.
May 16, 2012
05f97b43
Initial work towards making loopy work on multiple (potentially nested) domains.
May 20, 2012
6815239e
Add Tim's DG loopy prototypes.
Jun 21, 2012
c09a975a
Allow passing a scheduled-kernel generator to CompiledKernel.
Jun 25, 2012
6,058 additional commits have been omitted to prevent performance issues.