Commits on Source
6390
bb5b0325
Some test changes.
Jul 21, 2011
55c79f28
Make a way to pass in CL build options.
Jul 21, 2011
a5c5a1a9
Generalize image access to float64.
Jul 21, 2011
4d112561
Print kernel code if compile fails.
Jul 21, 2011
ed9f7589
Fix lower slab bounds.
Jul 21, 2011
394e8fe0
Clean up constraint solving, fix the "19" bug.
Jul 21, 2011
631efdd5
Specialize int floor div for positive denominators.
Jul 21, 2011
dea47d64
Be smarter about choosing conflict-free lmem layout.
Jul 22, 2011
e4fe5396
Fix, better messages from conflict detector.
Jul 22, 2011
b7682eaf
Padding of work item axes. A mockup of DG. 'edit' flag.
Jul 22, 2011
cbad2f81
Revive image matmul.
Jul 22, 2011
758a66db
Remove version from islpy dependency.
Jul 22, 2011
e92d16d7
Retire multi-var unroll. Add incr unroll. Fix a bug in sequential fetch.
Jul 22, 2011
d123027b
ILP generating, still subtly broken. (ILP 8 bug)
Jul 23, 2011
b08fb8cb
Minor testing tweaks, still debugging ILP.
Aug 04, 2011
72331a20
Fix block_shift_constraint for new version of isl.
Aug 07, 2011
a6a85dca
Only define ILP'd loop variables when they actually have a valid value.
Aug 07, 2011
5454d56c
Test tweaks for ILP example.
Aug 07, 2011
221bbe2a
Split loopy into multiple, smaller files.
Aug 08, 2011
6ef0f2e5
Add automated test.
Aug 08, 2011
29f26f72
Avoid lmem bank conflicts only on hardware sporting real lmem.
Aug 08, 2011
fb42deff
Emit prettier code when ILP is not in use.
Aug 08, 2011
bf722a8f
Only warn about L1/LMEM overruns on non-genuine-lmem architectures.
Aug 08, 2011
9500d86d
Change get_invalid_reason to get_problems, plus dg example tweaks.
Aug 08, 2011
39f63b41
Ensure all work items hit all barriers. Add condition hoisting. Tests pass.
Aug 09, 2011
1ae906bf
Introduce notion of a 'unique' tag. (ILP et al are not unique, for example.)
Aug 09, 2011
bf21cb86
Add documentation rudiments.
Aug 09, 2011
fc0076d2
Allow use of constant memory.
Aug 09, 2011
6f48d80b
Clean up loop bound handling. Don't remove div constraints wholesale.
Aug 10, 2011
ec65fed7
Mini-addition to TODO list.
Aug 10, 2011
6ae4c97f
Fetch dimension merging. Code readability. No spurious barriers.
Aug 14, 2011
da7957b2
Add distribute_setup.py.
Aug 22, 2011
81060c93
Fix setup.py.
Aug 22, 2011
125a5286
Make axpy work.
Aug 24, 2011
71d0d6f6
Quadrature example, todo additions, minor debug msg fix.
Aug 25, 2011
cd6480ed
Make iname_to_tag more usable.
Aug 25, 2011
111a4f1d
Lots of post-CFEM-quadrature changes.
Aug 28, 2011
aed3a1f3
Fitch local fetch op in CFEM quadrature.
Aug 28, 2011
8fd88781
Todo list tweaks.
Sep 10, 2011
3323cff3
Fix schedule hinting.
Sep 10, 2011
6de3f0e9
Parameter sanity checking, no_slabs parameter in split_dimension.
Sep 10, 2011
5fa87e7c
Turn off slabs in plain matmul test.
Sep 10, 2011
af8c75e0
Add a zoo of Magma-like mxm kernels.
Sep 10, 2011
c03cf818
Use an appropriate size in axpy test.
Sep 10, 2011
5d2c4d05
Merge box:src-deb/loopy
Sep 10, 2011
a1838e53
Sketch out new UI.
Sep 10, 2011
683ced82
Merge box:src-deb/loopy
Sep 10, 2011
a86fd84e
Towards insn-based loopy.
Sep 24, 2011
dbf5012f
Allow reductions on multiple variables, plus many fixes.
Sep 24, 2011
44305ffa
Many fixes for late reduction realization.
Sep 24, 2011
e3fcfca6
Automatic dependency finding.
Sep 24, 2011
cb92057b
First cut of 'obstacle course' scheduling.
Sep 24, 2011
3e24063b
Much more schedule hacking--not far from working.
Sep 26, 2011
ab8b37d1
Build out scheduler.
Sep 27, 2011
02cc8491
MEMO updates.
Sep 28, 2011
f29c680a
Automatic axis finding. Rename WORK_ITEM_IDX -> LOCAL_IDX.
Oct 02, 2011
accffe38
Inching towards codegen.
Oct 09, 2011
a439e009
First passing test for new-style loopy!
Oct 11, 2011
8925c5d4
Clean up.
Oct 12, 2011
c9e35248
Make syntax for explicit loop dependencies.
Oct 12, 2011
86d6bece
Fix insn parser again.
Oct 12, 2011
a1a552c4
Remove reduction tags. Slicker reduction syntax.
Oct 12, 2011
00872d1d
Change label syntax. Add syntax for forced iname deps.
Oct 12, 2011
ac32b82b
Highlight source code on output if Pygments is present.
Oct 12, 2011
e606dd46
First cut of conditional hoisting.
Oct 12, 2011
e2e44a1d
Check for non-use of hardware axes.
Oct 12, 2011
624cef06
Restrict condition hoisting to actually used inames.
Oct 12, 2011
25d000fc
Add remove_inames_for_shared_hw_axes.
Oct 12, 2011
b2104ff2
Clean up tag, module names.
Oct 15, 2011
9058893b
Be smarter about (split and such) automatic local dimensions.
Oct 16, 2011
1b8354b2
Fix a bug in barrier insertion. (missing [0]) Shuffle code around.
Oct 16, 2011
6f140ea6
Iname dependency cleanups.
Oct 17, 2011
2513ee98
Automatically find idempotent instructions. Minor tweaks.
Oct 17, 2011
dd2b6570
Reimplement add_prefetch().
Oct 17, 2011
4a1d2e03
Fix up axpy test.
Oct 19, 2011
80e1e9dd
Eliminate last occurrence of project_out.
Oct 22, 2011
eede2432
Remove some dead code from unroll.
Oct 22, 2011
8fad429b
Add isl_helpers.iname_rel_aff.
Oct 22, 2011
fba77761
Base slab decomposition on absolute loop bounds.
Oct 22, 2011
4cc7bd0e
Remove deprecated bounds.get_bounds().
Oct 22, 2011
c962f82c
Clarify semantics of generate_bounds_checks vs get_bounds_constraints.
Oct 22, 2011
8c769337
Add static_value_of_pw_aff(), use it for lower bounds in unroll.
Oct 22, 2011
e963f741
Add CPU, GPU variants of axpy.
Oct 22, 2011
de13c6e1
Allow loop scheduling prioritization.
Oct 22, 2011
10aec918
Add sanity check for implemented domain, use new islpy operators.
Oct 22, 2011
d05f2d27
Allow None for implemented_domain in GeneratedInstruction.
Oct 22, 2011
3bb12005
Fix sanity check for matmul test.
Oct 22, 2011
340676c4
Adapt barrier generation to upward code metadata propagation.
Oct 22, 2011
4ebc5c98
Integrate conditions generated at instruction level with the sanity check.
Oct 22, 2011
1bfded08
Fix up more tests to run with new-style loopy.
Oct 22, 2011
3e099068
Towards ILP: Only allow non-ILP parallel inames in hoisted conditionals.
Oct 23, 2011
6c1bee78
Towards ILP: Duplicate reduction accumulators for ILP.
Oct 23, 2011
1aacbf22
Towards ILP: Implement ILP instance generation.
Oct 23, 2011
357ad97e
Check for write races and access to inactive inames.
Oct 23, 2011
26ced0af
Scheduler: Be less eager in scheduling of idempotent insns inside extra loops.
Oct 23, 2011
e35bf0d0
Fix Reduction() stringification.
Oct 23, 2011
afed4a24
Centralize sanity checking in loopy.check.
Oct 23, 2011
530bec87
Fix check_for_inactive_iname_access.
Oct 23, 2011
e413c8ca
Better error message for wrong invocation of reduction ops.
Oct 24, 2011
51c2369e
Add a preemptive check for write races to realize_cse().
Oct 24, 2011
6,290 additional commits have been omitted to prevent performance issues.
Loading
Loading