Commits on Source
6349
5fa87e7c
Turn off slabs in plain matmul test.
Sep 10, 2011
af8c75e0
Add a zoo of Magma-like mxm kernels.
Sep 10, 2011
c03cf818
Use an appropriate size in axpy test.
Sep 10, 2011
5d2c4d05
Merge box:src-deb/loopy
Sep 10, 2011
a1838e53
Sketch out new UI.
Sep 10, 2011
683ced82
Merge box:src-deb/loopy
Sep 10, 2011
a86fd84e
Towards insn-based loopy.
Sep 24, 2011
dbf5012f
Allow reductions on multiple variables, plus many fixes.
Sep 24, 2011
44305ffa
Many fixes for late reduction realization.
Sep 24, 2011
e3fcfca6
Automatic dependency finding.
Sep 24, 2011
cb92057b
First cut of 'obstacle course' scheduling.
Sep 24, 2011
3e24063b
Much more schedule hacking--not far from working.
Sep 26, 2011
ab8b37d1
Build out scheduler.
Sep 27, 2011
02cc8491
MEMO updates.
Sep 28, 2011
f29c680a
Automatic axis finding. Rename WORK_ITEM_IDX -> LOCAL_IDX.
Oct 02, 2011
accffe38
Inching towards codegen.
Oct 09, 2011
a439e009
First passing test for new-style loopy!
Oct 11, 2011
8925c5d4
Clean up.
Oct 12, 2011
c9e35248
Make syntax for explicit loop dependencies.
Oct 12, 2011
86d6bece
Fix insn parser again.
Oct 12, 2011
a1a552c4
Remove reduction tags. Slicker reduction syntax.
Oct 12, 2011
00872d1d
Change label syntax. Add syntax for forced iname deps.
Oct 12, 2011
ac32b82b
Highlight source code on output if Pygments is present.
Oct 12, 2011
e606dd46
First cut of conditional hoisting.
Oct 12, 2011
e2e44a1d
Check for non-use of hardware axes.
Oct 12, 2011
624cef06
Restrict condition hoisting to actually used inames.
Oct 12, 2011
25d000fc
Add remove_inames_for_shared_hw_axes.
Oct 12, 2011
b2104ff2
Clean up tag, module names.
Oct 15, 2011
9058893b
Be smarter about (split and such) automatic local dimensions.
Oct 16, 2011
1b8354b2
Fix a bug in barrier insertion. (missing [0]) Shuffle code around.
Oct 16, 2011
6f140ea6
Iname dependency cleanups.
Oct 17, 2011
2513ee98
Automatically find idempotent instructions. Minor tweaks.
Oct 17, 2011
dd2b6570
Reimplement add_prefetch().
Oct 17, 2011
4a1d2e03
Fix up axpy test.
Oct 19, 2011
80e1e9dd
Eliminate last occurrence of project_out.
Oct 22, 2011
eede2432
Remove some dead code from unroll.
Oct 22, 2011
8fad429b
Add isl_helpers.iname_rel_aff.
Oct 22, 2011
fba77761
Base slab decomposition on absolute loop bounds.
Oct 22, 2011
4cc7bd0e
Remove deprecated bounds.get_bounds().
Oct 22, 2011
c962f82c
Clarify semantics of generate_bounds_checks vs get_bounds_constraints.
Oct 22, 2011
8c769337
Add static_value_of_pw_aff(), use it for lower bounds in unroll.
Oct 22, 2011
e963f741
Add CPU, GPU variants of axpy.
Oct 22, 2011
de13c6e1
Allow loop scheduling prioritization.
Oct 22, 2011
10aec918
Add sanity check for implemented domain, use new islpy operators.
Oct 22, 2011
d05f2d27
Allow None for implemented_domain in GeneratedInstruction.
Oct 22, 2011
3bb12005
Fix sanity check for matmul test.
Oct 22, 2011
340676c4
Adapt barrier generation to upward code metadata propagation.
Oct 22, 2011
4ebc5c98
Integrate conditions generated at instruction level with the sanity check.
Oct 22, 2011
1bfded08
Fix up more tests to run with new-style loopy.
Oct 22, 2011
3e099068
Towards ILP: Only allow non-ILP parallel inames in hoisted conditionals.
Oct 23, 2011
6c1bee78
Towards ILP: Duplicate reduction accumulators for ILP.
Oct 23, 2011
1aacbf22
Towards ILP: Implement ILP instance generation.
Oct 23, 2011
357ad97e
Check for write races and access to inactive inames.
Oct 23, 2011
26ced0af
Scheduler: Be less eager in scheduling of idempotent insns inside extra loops.
Oct 23, 2011
e35bf0d0
Fix Reduction() stringification.
Oct 23, 2011
afed4a24
Centralize sanity checking in loopy.check.
Oct 23, 2011
530bec87
Fix check_for_inactive_iname_access.
Oct 23, 2011
e413c8ca
Better error message for wrong invocation of reduction ops.
Oct 24, 2011
51c2369e
Add a preemptive check for write races to realize_cse().
Oct 24, 2011
a4c4969c
MEMO updates.
Oct 24, 2011
3cecf117
Rename loopy.codegen.{dispatch->control}.
Oct 24, 2011
85194627
Revamp conditional hoist algorithm.
Oct 25, 2011
fb8e9afb
Provide better error messages in a few situations.
Oct 25, 2011
7b91b1a3
Properly recurse in FunctionToPrimitiveMapper.
Oct 25, 2011
7ef26e17
Fix iname_to_tag lookup in realize_reduction to use get().
Oct 25, 2011
e03ce30c
Make sure TemporaryVariable.nbytes computes an int.
Oct 25, 2011
76c59c5c
Whitespace changes.
Oct 25, 2011
cc5a489a
IndexVariableFinder: Add flag include_reduction_inames.
Oct 25, 2011
0026eb8b
Use loopy's own SubstitutionMapper (not pymbolic's substitute) in realize_cse().
Oct 25, 2011
c619c54f
Fix for isl 0.08: set intersect doesn't align spaces automatically--do it manually.
Oct 25, 2011
9b8de2d6
Only test that reduction depends on *one* of its inames, not all of them.
Oct 25, 2011
16290f2b
Create default/empty assumptions as parameter domain.
Oct 25, 2011
44c1beaa
Guess which iname should be l.0 by ranking, not by pointing at one.
Oct 25, 2011
e352f730
Split off preprocess_kernel from scheduling.
Oct 25, 2011
b201a4e2
Consolidate MEMO.
Oct 25, 2011
63893d99
More MEMO hacking.
Oct 25, 2011
45657b41
Factor out loopy.isl_helpers.duplicate_axes from CSE realization.
Oct 25, 2011
b4e8ee54
Split out kernel preprocessing into separate file.
Oct 25, 2011
a721b3cf
Fix dependencies of main reduction instruction: Must depend on all reduction axes.
Oct 25, 2011
3213b9cb
Make a parse syntax for the 'None' tag.
Oct 25, 2011
fa54e763
Add iname duplication from parsed instructions.
Oct 25, 2011
76becd30
Add spectral-element tests.
Oct 25, 2011
94107a5e
Allow array declaration from within an instruction.
Oct 26, 2011
020203ae
Give a good error message for missing variables in get_problems().
Oct 26, 2011
426c0319
Implement reduction iname uniquification.
Oct 28, 2011
4fc947f6
Fix storage order in SEM example.
Oct 28, 2011
3f8f1593
Look at individual variable writes/reads to see if barriers are needed.
Oct 29, 2011
7a368750
Give user control over whether reduction inames are duplicated.
Oct 29, 2011
94eec370
Rename idempotent->boostable. Be more restrictive in marking insns boostable.
Oct 29, 2011
17ddb114
Allow unroll of sequential loops.
Oct 29, 2011
4853b6d5
Fix image arguments.
Oct 29, 2011
480223d2
Add transpose test.
Oct 29, 2011
487e9c3a
Revive fancy_matmul. Fix assert child_iname <= parent_iname condition.
Oct 29, 2011
ca93d84e
A few fixes. Some code shifting. Loosen up owed_barriers checking.
Oct 29, 2011
b8c98645
Add user interface for dim length prescription, test for workgroup prescribed too small.
Oct 29, 2011
e3c575dd
Some test shuffling.
Oct 29, 2011
5d6b9aea
Various fixes, keep insn dependencies as sets.
Oct 30, 2011
5b5cacd1
Implement dimension joining.
Oct 30, 2011
73951d70
Make a better implementation of duplicate_axes().
Oct 30, 2011
724bf1c3
Allow multiple references to a CSE with different indices in each.
Oct 31, 2011
6,249 additional commits have been omitted to prevent performance issues.