Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
P
pyopencl
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Andreas Klöckner
pyopencl
Commits
1c39545e
Commit
1c39545e
authored
6 years ago
by
Andreas Klöckner
Browse files
Options
Downloads
Plain Diff
Merge branch 'intel-gpu-fixes' into 'master'
Intel gpu fixes See merge request
inducer/pyopencl!42
parents
5d37e4aa
8c277f47
No related branches found
Branches containing commit
No related tags found
1 merge request
!42
Intel gpu fixes
Pipeline
#13085
passed with warnings
6 years ago
Stage: test
Changes
3
Pipelines
2
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
pyopencl/scan.py
+10
-1
10 additions, 1 deletion
pyopencl/scan.py
test/test_clrandom.py
+3
-0
3 additions, 0 deletions
test/test_clrandom.py
test/test_wrapper.py
+1
-0
1 addition, 0 deletions
test/test_wrapper.py
with
14 additions
and
1 deletion
pyopencl/scan.py
+
10
−
1
View file @
1c39545e
...
...
@@ -1227,8 +1227,17 @@ class GenericScanKernel(_GenericScanKernelBase):
max_scan_wg_size
=
min
(
dev
.
max_work_group_size
for
dev
in
self
.
devices
)
wg_size_multiples
=
64
# Intel beignet fails "Out of shared local memory" in test_scan int64
# and asserts in test_sort with this enabled:
# https://github.com/inducer/pyopencl/pull/238
# A beignet bug report (outside of pyopencl) suggests packed structs
# (which this is) can even give wrong results:
# https://bugs.freedesktop.org/show_bug.cgi?id=98717
# TODO: does this also affect Intel Compute Runtime?
use_bank_conflict_avoidance
=
(
self
.
dtype
.
itemsize
>
4
and
self
.
dtype
.
itemsize
%
8
==
0
and
is_gpu
)
self
.
dtype
.
itemsize
>
4
and
self
.
dtype
.
itemsize
%
8
==
0
and
is_gpu
and
"
beignet
"
not
in
self
.
devices
[
0
].
platform
.
version
.
lower
())
# k_group_size should be a power of two because of in-kernel
# division by that number.
...
...
This diff is collapsed.
Click to expand it.
test/test_clrandom.py
+
3
−
0
View file @
1c39545e
...
...
@@ -31,6 +31,7 @@ import pyopencl.clrandom as clrandom
from
pyopencl.tools
import
(
# noqa
pytest_generate_tests_for_pyopencl
as
pytest_generate_tests
)
from
pyopencl.characterize
import
has_double_support
try
:
import
faulthandler
...
...
@@ -59,6 +60,8 @@ def make_ranlux_generator(cl_ctx):
cltypes
.
float4
])
def
test_clrandom_dtypes
(
ctx_factory
,
rng_class
,
dtype
):
cl_ctx
=
ctx_factory
()
if
dtype
==
np
.
float64
and
not
has_double_support
(
cl_ctx
.
devices
[
0
]):
pytest
.
skip
(
"
double precision not supported on this device
"
)
rng
=
rng_class
(
cl_ctx
)
size
=
10
...
...
This diff is collapsed.
Click to expand it.
test/test_wrapper.py
+
1
−
0
View file @
1c39545e
...
...
@@ -953,6 +953,7 @@ def test_coarse_grain_svm(ctx_factory):
dev
=
ctx
.
devices
[
0
]
has_svm
=
(
ctx
.
_get_cl_version
()
>=
(
2
,
0
)
and
ctx
.
devices
[
0
].
_get_cl_version
()
>=
(
2
,
0
)
and
cl
.
get_cl_header_version
()
>=
(
2
,
0
))
if
dev
.
platform
.
name
==
"
Portable Computing Language
"
:
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment