naga/back/hlsl/mod.rs
1/*!
2Backend for [HLSL][hlsl] (High-Level Shading Language).
3
4# Supported shader model versions:
5- 5.0
6- 5.1
7- 6.0
8
9# Layout of values in `uniform` buffers
10
11WGSL's ["Internal Layout of Values"][ilov] rules specify how each WGSL
12type should be stored in `uniform` and `storage` buffers. The HLSL we
13generate must access values in that form, even when it is not what
14HLSL would use normally.
15
16Matching the WGSL memory layout is a concern only for `uniform`
17variables. WGSL `storage` buffers are translated as HLSL
18`ByteAddressBuffers`, for which we generate `Load` and `Store` method
19calls with explicit byte offsets. WGSL pipeline inputs must be scalars
20or vectors; they cannot be matrices, which is where the interesting
21problems arise. However, when an affected type appears in a struct
22definition, the transformations described here are applied without
23consideration of where the struct is used.
24
25Access to storage buffers is implemented in `storage.rs`. Access to
26uniform buffers is implemented where applicable in `writer.rs`.
27
28## Row- and column-major ordering for matrices
29
30WGSL specifies that matrices in uniform buffers are stored in
31column-major order. This matches HLSL's default, so one might expect
32things to be straightforward. Unfortunately, WGSL and HLSL disagree on
33what indexing a matrix means: in WGSL, `m[i]` retrieves the `i`'th
34*column* of `m`, whereas in HLSL it retrieves the `i`'th *row*. We
35want to avoid translating `m[i]` into some complicated reassembly of a
36vector from individually fetched components, so this is a problem.
37
38However, with a bit of trickery, it is possible to use HLSL's `m[i]`
39as the translation of WGSL's `m[i]`:
40
41- We declare all matrices in uniform buffers in HLSL with the
42 `row_major` qualifier, and transpose the row and column counts: a
43 WGSL `mat3x4<f32>`, say, becomes an HLSL `row_major float3x4`. (Note
44 that WGSL and HLSL type names put the row and column in reverse
45 order.) Since the HLSL type is the transpose of how WebGPU directs
46 the user to store the data, HLSL will load all matrices transposed.
47
48- Since matrices are transposed, an HLSL indexing expression retrieves
49 the "columns" of the intended WGSL value, as desired.
50
51- For vector-matrix multiplication, since `mul(transpose(m), v)` is
52 equivalent to `mul(v, m)` (note the reversal of the arguments), and
53 `mul(v, transpose(m))` is equivalent to `mul(m, v)`, we can
54 translate WGSL `m * v` and `v * m` to HLSL by simply reversing the
55 arguments to `mul`.
56
57## Padding in two-row matrices
58
59An HLSL `row_major floatKx2` matrix has padding between its rows that
60the WGSL `matKx2<f32>` matrix it represents does not. HLSL stores all
61matrix rows [aligned on 16-byte boundaries][16bb], whereas WGSL says
62that the columns of a `matKx2<f32>` need only be [aligned as required
63for `vec2<f32>`][ilov], which is [eight-byte alignment][8bb].
64
65To compensate for this, any time a `matKx2<f32>` appears in a WGSL
66`uniform` value or as part of a struct/array, we actually emit `K`
67separate `float2` members, and assemble/disassemble the matrix from its
68columns (in WGSL; rows in HLSL) upon load and store.
69
70For example, the following WGSL struct type:
71
72```ignore
73struct Baz {
74 m: mat3x2<f32>,
75}
76```
77
78is rendered as the HLSL struct type:
79
80```ignore
81struct Baz {
82 float2 m_0; float2 m_1; float2 m_2;
83};
84```
85
86The `wrapped_struct_matrix` functions in `help.rs` generate HLSL
87helper functions to access such members, converting between the stored
88form and the HLSL matrix types appropriately. For example, for reading
89the member `m` of the `Baz` struct above, we emit:
90
91```ignore
92float3x2 GetMatmOnBaz(Baz obj) {
93 return float3x2(obj.m_0, obj.m_1, obj.m_2);
94}
95```
96
97We also emit an analogous `Set` function, as well as functions for
98accessing individual columns by dynamic index.
99
100## Sampler Handling
101
102Due to limitations in how sampler heaps work in D3D12, we need to access samplers
103through a layer of indirection. Instead of directly binding samplers, we bind the entire
104sampler heap as both a standard and a comparison sampler heap. We then use a sampler
105index buffer for each bind group. This buffer is accessed in the shader to get the actual
106sampler index within the heap. See the wgpu_hal dx12 backend documentation for more
107information.
108
109# External textures
110
111Support for [`crate::ImageClass::External`] textures is implemented by lowering
112each external texture global variable to 3 `Texture2D<float4>`s, and a `cbuffer`
113of type `NagaExternalTextureParams`. This provides up to 3 planes of texture
114data (for example single planar RGBA, or separate Y, Cb, and Cr planes), and the
115parameters buffer containing information describing how to handle these
116correctly. The bind target to use for each of these globals is specified via
117[`Options::external_texture_binding_map`].
118
119External textures are supported by WGSL's `textureDimensions()`,
120`textureLoad()`, and `textureSampleBaseClampToEdge()` built-in functions. These
121are implemented using helper functions. See the following functions for how
122these are generated:
123 * `Writer::write_wrapped_image_query_function`
124 * `Writer::write_wrapped_image_load_function`
125 * `Writer::write_wrapped_image_sample_function`
126
127Ideally the set of global variables could be wrapped in a single struct that
128could conveniently be passed around. But, alas, HLSL does not allow structs to
129have `Texture2D` members. Fortunately, however, external textures can only be
130used as arguments to either built-in or user-defined functions. We therefore
131expand any external texture function argument to four consecutive arguments (3
132textures and the params struct) when declaring user-defined functions, and
133ensure our built-in function implementations take the same arguments. Then,
134whenever we need to emit an external texture in `Writer::write_expr`, which
135fortunately can only ever be for a global variable or function argument, we
136simply emit the variable name of each of the three textures and the parameters
137struct in a comma-separated list. This won't win any awards for elegance, but
138it works for our purposes.
139
140[hlsl]: https://docs.microsoft.com/en-us/windows/win32/direct3dhlsl/dx-graphics-hlsl
141[ilov]: https://gpuweb.github.io/gpuweb/wgsl/#internal-value-layout
142[16bb]: https://github.com/microsoft/DirectXShaderCompiler/wiki/Buffer-Packing#constant-buffer-packing
143[8bb]: https://gpuweb.github.io/gpuweb/wgsl/#alignment-and-size
144*/
145
146mod conv;
147mod help;
148mod keywords;
149mod ray;
150mod storage;
151mod writer;
152
153use alloc::{string::String, vec::Vec};
154use core::fmt::Error as FmtError;
155
156use thiserror::Error;
157
158use crate::{back, ir, proc};
159
160/// Direct3D 12 binding information for a global variable.
161///
162/// This type provides the HLSL-specific information Naga needs to declare and
163/// access an HLSL global variable that cannot be derived from the `Module`
164/// itself.
165///
166/// An HLSL global variable declaration includes details that the Direct3D API
167/// will use to refer to it. For example:
168///
169/// RWByteAddressBuffer s_sasm : register(u0, space2);
170///
171/// This defines a global `s_sasm` that a Direct3D root signature would refer to
172/// as register `0` in register space `2` in a `UAV` descriptor range. Naga can
173/// infer the register's descriptor range type from the variable's address class
174/// (writable [`Storage`] variables are implemented by Direct3D Unordered Access
175/// Views, the `u` register type), but the register number and register space
176/// must be supplied by the user.
177///
178/// The [`back::hlsl::Options`] structure provides `BindTarget`s for various
179/// situations in which Naga may need to generate an HLSL global variable, like
180/// [`binding_map`] for Naga global variables, or [`push_constants_target`] for
181/// a module's sole [`PushConstant`] variable. See those fields' documentation
182/// for details.
183///
184/// [`Storage`]: crate::ir::AddressSpace::Storage
185/// [`back::hlsl::Options`]: Options
186/// [`binding_map`]: Options::binding_map
187/// [`push_constants_target`]: Options::push_constants_target
188/// [`PushConstant`]: crate::ir::AddressSpace::PushConstant
189#[derive(Copy, Clone, Debug, Default, PartialEq, Eq, Hash)]
190#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
191#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
192pub struct BindTarget {
193 pub space: u8,
194 /// For regular bindings this is the register number.
195 ///
196 /// For sampler bindings, this is the index to use into the bind group's sampler index buffer.
197 pub register: u32,
198 /// If the binding is an unsized binding array, this overrides the size.
199 pub binding_array_size: Option<u32>,
200 /// This is the index in the buffer at [`Options::dynamic_storage_buffer_offsets_targets`].
201 pub dynamic_storage_buffer_offsets_index: Option<u32>,
202 /// This is a hint that we need to restrict indexing of vectors, matrices and arrays.
203 ///
204 /// If [`Options::restrict_indexing`] is also `true`, we will restrict indexing.
205 #[cfg_attr(any(feature = "serialize", feature = "deserialize"), serde(default))]
206 pub restrict_indexing: bool,
207}
208
209#[derive(Clone, Debug, Default, PartialEq, Eq, Hash)]
210#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
211#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
212/// BindTarget for dynamic storage buffer offsets
213pub struct OffsetsBindTarget {
214 pub space: u8,
215 pub register: u32,
216 pub size: u32,
217}
218
219#[cfg(any(feature = "serialize", feature = "deserialize"))]
220#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
221#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
222struct BindingMapSerialization {
223 resource_binding: crate::ResourceBinding,
224 bind_target: BindTarget,
225}
226
227#[cfg(feature = "deserialize")]
228fn deserialize_binding_map<'de, D>(deserializer: D) -> Result<BindingMap, D::Error>
229where
230 D: serde::Deserializer<'de>,
231{
232 use serde::Deserialize;
233
234 let vec = Vec::<BindingMapSerialization>::deserialize(deserializer)?;
235 let mut map = BindingMap::default();
236 for item in vec {
237 map.insert(item.resource_binding, item.bind_target);
238 }
239 Ok(map)
240}
241
242// Using `BTreeMap` instead of `HashMap` so that we can hash itself.
243pub type BindingMap = alloc::collections::BTreeMap<crate::ResourceBinding, BindTarget>;
244
245/// A HLSL shader model version.
246#[allow(non_snake_case, non_camel_case_types)]
247#[derive(Copy, Clone, Debug, Hash, Eq, PartialEq, PartialOrd)]
248#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
249#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
250pub enum ShaderModel {
251 V5_0,
252 V5_1,
253 V6_0,
254 V6_1,
255 V6_2,
256 V6_3,
257 V6_4,
258 V6_5,
259 V6_6,
260 V6_7,
261}
262
263impl ShaderModel {
264 pub const fn to_str(self) -> &'static str {
265 match self {
266 Self::V5_0 => "5_0",
267 Self::V5_1 => "5_1",
268 Self::V6_0 => "6_0",
269 Self::V6_1 => "6_1",
270 Self::V6_2 => "6_2",
271 Self::V6_3 => "6_3",
272 Self::V6_4 => "6_4",
273 Self::V6_5 => "6_5",
274 Self::V6_6 => "6_6",
275 Self::V6_7 => "6_7",
276 }
277 }
278}
279
280impl crate::ShaderStage {
281 pub const fn to_hlsl_str(self) -> &'static str {
282 match self {
283 Self::Vertex => "vs",
284 Self::Fragment => "ps",
285 Self::Compute => "cs",
286 Self::Task => "as",
287 Self::Mesh => "ms",
288 }
289 }
290}
291
292impl crate::ImageDimension {
293 const fn to_hlsl_str(self) -> &'static str {
294 match self {
295 Self::D1 => "1D",
296 Self::D2 => "2D",
297 Self::D3 => "3D",
298 Self::Cube => "Cube",
299 }
300 }
301}
302
303#[derive(Clone, Copy, Debug, Hash, Eq, Ord, PartialEq, PartialOrd)]
304#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
305#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
306pub struct SamplerIndexBufferKey {
307 pub group: u32,
308}
309
310#[derive(Clone, Debug, Hash, PartialEq, Eq)]
311#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
312#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
313#[cfg_attr(feature = "deserialize", serde(default))]
314pub struct SamplerHeapBindTargets {
315 pub standard_samplers: BindTarget,
316 pub comparison_samplers: BindTarget,
317}
318
319impl Default for SamplerHeapBindTargets {
320 fn default() -> Self {
321 Self {
322 standard_samplers: BindTarget {
323 space: 0,
324 register: 0,
325 binding_array_size: None,
326 dynamic_storage_buffer_offsets_index: None,
327 restrict_indexing: false,
328 },
329 comparison_samplers: BindTarget {
330 space: 1,
331 register: 0,
332 binding_array_size: None,
333 dynamic_storage_buffer_offsets_index: None,
334 restrict_indexing: false,
335 },
336 }
337 }
338}
339
340#[cfg(any(feature = "serialize", feature = "deserialize"))]
341#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
342#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
343struct SamplerIndexBufferBindingSerialization {
344 group: u32,
345 bind_target: BindTarget,
346}
347
348#[cfg(feature = "deserialize")]
349fn deserialize_sampler_index_buffer_bindings<'de, D>(
350 deserializer: D,
351) -> Result<SamplerIndexBufferBindingMap, D::Error>
352where
353 D: serde::Deserializer<'de>,
354{
355 use serde::Deserialize;
356
357 let vec = Vec::<SamplerIndexBufferBindingSerialization>::deserialize(deserializer)?;
358 let mut map = SamplerIndexBufferBindingMap::default();
359 for item in vec {
360 map.insert(
361 SamplerIndexBufferKey { group: item.group },
362 item.bind_target,
363 );
364 }
365 Ok(map)
366}
367
368// We use a BTreeMap here so that we can hash it.
369pub type SamplerIndexBufferBindingMap =
370 alloc::collections::BTreeMap<SamplerIndexBufferKey, BindTarget>;
371
372#[cfg(any(feature = "serialize", feature = "deserialize"))]
373#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
374#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
375struct DynamicStorageBufferOffsetTargetSerialization {
376 index: u32,
377 bind_target: OffsetsBindTarget,
378}
379
380#[cfg(feature = "deserialize")]
381fn deserialize_storage_buffer_offsets<'de, D>(
382 deserializer: D,
383) -> Result<DynamicStorageBufferOffsetsTargets, D::Error>
384where
385 D: serde::Deserializer<'de>,
386{
387 use serde::Deserialize;
388
389 let vec = Vec::<DynamicStorageBufferOffsetTargetSerialization>::deserialize(deserializer)?;
390 let mut map = DynamicStorageBufferOffsetsTargets::default();
391 for item in vec {
392 map.insert(item.index, item.bind_target);
393 }
394 Ok(map)
395}
396
397pub type DynamicStorageBufferOffsetsTargets = alloc::collections::BTreeMap<u32, OffsetsBindTarget>;
398
399/// HLSL binding information for a Naga [`External`] image global variable.
400///
401/// See the module documentation's section on [External textures][mod] for details.
402///
403/// [`External`]: crate::ir::ImageClass::External
404/// [mod]: #external-textures
405#[derive(Copy, Clone, Debug, Default, PartialEq, Eq, Hash)]
406#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
407#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
408pub struct ExternalTextureBindTarget {
409 /// HLSL binding information for the individual plane textures.
410 ///
411 /// Each of these should refer to an HLSL `Texture2D<float4>` holding one
412 /// plane of data for the external texture. The exact meaning of each plane
413 /// varies at runtime depending on where the external texture's data
414 /// originated.
415 pub planes: [BindTarget; 3],
416
417 /// HLSL binding information for a buffer holding the sampling parameters.
418 ///
419 /// This should refer to a cbuffer of type `NagaExternalTextureParams`, that
420 /// the code Naga generates for `textureSampleBaseClampToEdge` consults to
421 /// decide how to combine the data in [`planes`] to get the result required
422 /// by the spec.
423 ///
424 /// [`planes`]: Self::planes
425 pub params: BindTarget,
426}
427
428#[cfg(any(feature = "serialize", feature = "deserialize"))]
429#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
430#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
431struct ExternalTextureBindingMapSerialization {
432 resource_binding: crate::ResourceBinding,
433 bind_target: ExternalTextureBindTarget,
434}
435
436#[cfg(feature = "deserialize")]
437fn deserialize_external_texture_binding_map<'de, D>(
438 deserializer: D,
439) -> Result<ExternalTextureBindingMap, D::Error>
440where
441 D: serde::Deserializer<'de>,
442{
443 use serde::Deserialize;
444
445 let vec = Vec::<ExternalTextureBindingMapSerialization>::deserialize(deserializer)?;
446 let mut map = ExternalTextureBindingMap::default();
447 for item in vec {
448 map.insert(item.resource_binding, item.bind_target);
449 }
450 Ok(map)
451}
452pub type ExternalTextureBindingMap =
453 alloc::collections::BTreeMap<crate::ResourceBinding, ExternalTextureBindTarget>;
454
455/// Shorthand result used internally by the backend
456type BackendResult = Result<(), Error>;
457
458#[derive(Clone, Debug, PartialEq, thiserror::Error)]
459#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
460#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
461pub enum EntryPointError {
462 #[error("mapping of {0:?} is missing")]
463 MissingBinding(crate::ResourceBinding),
464}
465
466/// Configuration used in the [`Writer`].
467#[derive(Clone, Debug, Hash, PartialEq, Eq)]
468#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
469#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
470#[cfg_attr(feature = "deserialize", serde(default))]
471pub struct Options {
472 /// The hlsl shader model to be used
473 pub shader_model: ShaderModel,
474
475 /// HLSL binding information for each Naga global variable.
476 ///
477 /// This maps Naga [`GlobalVariable`]'s [`ResourceBinding`]s to a
478 /// [`BindTarget`] specifying its register number and space, along with
479 /// other details necessary to generate a full HLSL declaration for it,
480 /// or to access its value.
481 ///
482 /// This must provide a [`BindTarget`] for every [`GlobalVariable`] in the
483 /// [`Module`] that has a [`binding`].
484 ///
485 /// [`GlobalVariable`]: crate::ir::GlobalVariable
486 /// [`ResourceBinding`]: crate::ir::ResourceBinding
487 /// [`Module`]: crate::ir::Module
488 /// [`binding`]: crate::ir::GlobalVariable::binding
489 #[cfg_attr(
490 feature = "deserialize",
491 serde(deserialize_with = "deserialize_binding_map")
492 )]
493 pub binding_map: BindingMap,
494
495 /// Don't panic on missing bindings, instead generate any HLSL.
496 pub fake_missing_bindings: bool,
497 /// Add special constants to `SV_VertexIndex` and `SV_InstanceIndex`,
498 /// to make them work like in Vulkan/Metal, with help of the host.
499 pub special_constants_binding: Option<BindTarget>,
500
501 /// HLSL binding information for the [`PushConstant`] global, if present.
502 ///
503 /// If a module contains a global in the [`PushConstant`] address space, the
504 /// `dx12` backend stores its value directly in the root signature as a
505 /// series of [`D3D12_ROOT_PARAMETER_TYPE_32BIT_CONSTANTS`], whose binding
506 /// information is given here.
507 ///
508 /// [`PushConstant`]: crate::ir::AddressSpace::PushConstant
509 /// [`D3D12_ROOT_PARAMETER_TYPE_32BIT_CONSTANTS`]: https://learn.microsoft.com/en-us/windows/win32/api/d3d12/ne-d3d12-d3d12_root_parameter_type
510 pub push_constants_target: Option<BindTarget>,
511
512 /// HLSL binding information for the sampler heap and comparison sampler heap.
513 pub sampler_heap_target: SamplerHeapBindTargets,
514
515 /// Mapping of each bind group's sampler index buffer to a bind target.
516 #[cfg_attr(
517 feature = "deserialize",
518 serde(deserialize_with = "deserialize_sampler_index_buffer_bindings")
519 )]
520 pub sampler_buffer_binding_map: SamplerIndexBufferBindingMap,
521 /// Bind target for dynamic storage buffer offsets
522 #[cfg_attr(
523 feature = "deserialize",
524 serde(deserialize_with = "deserialize_storage_buffer_offsets")
525 )]
526 pub dynamic_storage_buffer_offsets_targets: DynamicStorageBufferOffsetsTargets,
527 #[cfg_attr(
528 feature = "deserialize",
529 serde(deserialize_with = "deserialize_external_texture_binding_map")
530 )]
531
532 /// HLSL binding information for [`External`] image global variables.
533 ///
534 /// See [`ExternalTextureBindTarget`] for details.
535 ///
536 /// [`External`]: crate::ir::ImageClass::External
537 pub external_texture_binding_map: ExternalTextureBindingMap,
538
539 /// Should workgroup variables be zero initialized (by polyfilling)?
540 pub zero_initialize_workgroup_memory: bool,
541 /// Should we restrict indexing of vectors, matrices and arrays?
542 pub restrict_indexing: bool,
543 /// If set, loops will have code injected into them, forcing the compiler
544 /// to think the number of iterations is bounded.
545 pub force_loop_bounding: bool,
546}
547
548impl Default for Options {
549 fn default() -> Self {
550 Options {
551 shader_model: ShaderModel::V5_1,
552 binding_map: BindingMap::default(),
553 fake_missing_bindings: true,
554 special_constants_binding: None,
555 sampler_heap_target: SamplerHeapBindTargets::default(),
556 sampler_buffer_binding_map: alloc::collections::BTreeMap::default(),
557 push_constants_target: None,
558 dynamic_storage_buffer_offsets_targets: alloc::collections::BTreeMap::new(),
559 external_texture_binding_map: ExternalTextureBindingMap::default(),
560 zero_initialize_workgroup_memory: true,
561 restrict_indexing: true,
562 force_loop_bounding: true,
563 }
564 }
565}
566
567impl Options {
568 fn resolve_resource_binding(
569 &self,
570 res_binding: &crate::ResourceBinding,
571 ) -> Result<BindTarget, EntryPointError> {
572 match self.binding_map.get(res_binding) {
573 Some(target) => Ok(*target),
574 None if self.fake_missing_bindings => Ok(BindTarget {
575 space: res_binding.group as u8,
576 register: res_binding.binding,
577 binding_array_size: None,
578 dynamic_storage_buffer_offsets_index: None,
579 restrict_indexing: false,
580 }),
581 None => Err(EntryPointError::MissingBinding(*res_binding)),
582 }
583 }
584
585 fn resolve_external_texture_resource_binding(
586 &self,
587 res_binding: &crate::ResourceBinding,
588 ) -> Result<ExternalTextureBindTarget, EntryPointError> {
589 match self.external_texture_binding_map.get(res_binding) {
590 Some(target) => Ok(*target),
591 None if self.fake_missing_bindings => {
592 let fake = BindTarget {
593 space: res_binding.group as u8,
594 register: res_binding.binding,
595 binding_array_size: None,
596 dynamic_storage_buffer_offsets_index: None,
597 restrict_indexing: false,
598 };
599 Ok(ExternalTextureBindTarget {
600 planes: [fake, fake, fake],
601 params: fake,
602 })
603 }
604 None => Err(EntryPointError::MissingBinding(*res_binding)),
605 }
606 }
607}
608
609/// Reflection info for entry point names.
610#[derive(Default)]
611pub struct ReflectionInfo {
612 /// Mapping of the entry point names.
613 ///
614 /// Each item in the array corresponds to an entry point index. The real entry point name may be different if one of the
615 /// reserved words are used.
616 ///
617 /// Note: Some entry points may fail translation because of missing bindings.
618 pub entry_point_names: Vec<Result<String, EntryPointError>>,
619}
620
621/// A subset of options that are meant to be changed per pipeline.
622#[derive(Debug, Default, Clone)]
623#[cfg_attr(feature = "serialize", derive(serde::Serialize))]
624#[cfg_attr(feature = "deserialize", derive(serde::Deserialize))]
625#[cfg_attr(feature = "deserialize", serde(default))]
626pub struct PipelineOptions {
627 /// The entry point to write.
628 ///
629 /// Entry points are identified by a shader stage specification,
630 /// and a name.
631 ///
632 /// If `None`, all entry points will be written. If `Some` and the entry
633 /// point is not found, an error will be thrown while writing.
634 pub entry_point: Option<(ir::ShaderStage, String)>,
635}
636
637#[derive(Error, Debug)]
638pub enum Error {
639 #[error(transparent)]
640 IoError(#[from] FmtError),
641 #[error("A scalar with an unsupported width was requested: {0:?}")]
642 UnsupportedScalar(crate::Scalar),
643 #[error("{0}")]
644 Unimplemented(String), // TODO: Error used only during development
645 #[error("{0}")]
646 Custom(String),
647 #[error("overrides should not be present at this stage")]
648 Override,
649 #[error(transparent)]
650 ResolveArraySizeError(#[from] proc::ResolveArraySizeError),
651 #[error("entry point with stage {0:?} and name '{1}' not found")]
652 EntryPointNotFound(ir::ShaderStage, String),
653}
654
655#[derive(PartialEq, Eq, Hash)]
656enum WrappedType {
657 ZeroValue(help::WrappedZeroValue),
658 ArrayLength(help::WrappedArrayLength),
659 ImageSample(help::WrappedImageSample),
660 ImageQuery(help::WrappedImageQuery),
661 ImageLoad(help::WrappedImageLoad),
662 ImageLoadScalar(crate::Scalar),
663 Constructor(help::WrappedConstructor),
664 StructMatrixAccess(help::WrappedStructMatrixAccess),
665 MatCx2(help::WrappedMatCx2),
666 Math(help::WrappedMath),
667 UnaryOp(help::WrappedUnaryOp),
668 BinaryOp(help::WrappedBinaryOp),
669 Cast(help::WrappedCast),
670}
671
672#[derive(Default)]
673struct Wrapped {
674 types: crate::FastHashSet<WrappedType>,
675 /// If true, the sampler heaps have been written out.
676 sampler_heaps: bool,
677 // Mapping from SamplerIndexBufferKey to the name the namer returned.
678 sampler_index_buffers: crate::FastHashMap<SamplerIndexBufferKey, String>,
679}
680
681impl Wrapped {
682 fn insert(&mut self, r#type: WrappedType) -> bool {
683 self.types.insert(r#type)
684 }
685
686 fn clear(&mut self) {
687 self.types.clear();
688 }
689}
690
691/// A fragment entry point to be considered when generating HLSL for the output interface of vertex
692/// entry points.
693///
694/// This is provided as an optional parameter to [`Writer::write`].
695///
696/// If this is provided, vertex outputs will be removed if they are not inputs of this fragment
697/// entry point. This is necessary for generating correct HLSL when some of the vertex shader
698/// outputs are not consumed by the fragment shader.
699pub struct FragmentEntryPoint<'a> {
700 module: &'a crate::Module,
701 func: &'a crate::Function,
702}
703
704impl<'a> FragmentEntryPoint<'a> {
705 /// Returns `None` if the entry point with the provided name can't be found or isn't a fragment
706 /// entry point.
707 pub fn new(module: &'a crate::Module, ep_name: &'a str) -> Option<Self> {
708 module
709 .entry_points
710 .iter()
711 .find(|ep| ep.name == ep_name)
712 .filter(|ep| ep.stage == crate::ShaderStage::Fragment)
713 .map(|ep| Self {
714 module,
715 func: &ep.function,
716 })
717 }
718}
719
720pub struct Writer<'a, W> {
721 out: W,
722 names: crate::FastHashMap<proc::NameKey, String>,
723 namer: proc::Namer,
724 /// HLSL backend options
725 options: &'a Options,
726 /// Per-stage backend options
727 pipeline_options: &'a PipelineOptions,
728 /// Information about entry point arguments and result types.
729 entry_point_io: crate::FastHashMap<usize, writer::EntryPointInterface>,
730 /// Set of expressions that have associated temporary variables
731 named_expressions: crate::NamedExpressions,
732 wrapped: Wrapped,
733 written_committed_intersection: bool,
734 written_candidate_intersection: bool,
735 continue_ctx: back::continue_forward::ContinueCtx,
736
737 /// A reference to some part of a global variable, lowered to a series of
738 /// byte offset calculations.
739 ///
740 /// See the [`storage`] module for background on why we need this.
741 ///
742 /// Each [`SubAccess`] in the vector is a lowering of some [`Access`] or
743 /// [`AccessIndex`] expression to the level of byte strides and offsets. See
744 /// [`SubAccess`] for details.
745 ///
746 /// This field is a member of [`Writer`] solely to allow re-use of
747 /// the `Vec`'s dynamic allocation. The value is no longer needed
748 /// once HLSL for the access has been generated.
749 ///
750 /// [`Storage`]: crate::AddressSpace::Storage
751 /// [`SubAccess`]: storage::SubAccess
752 /// [`Access`]: crate::Expression::Access
753 /// [`AccessIndex`]: crate::Expression::AccessIndex
754 temp_access_chain: Vec<storage::SubAccess>,
755 need_bake_expressions: back::NeedBakeExpressions,
756}