Celeritas
0.5.0-86+4a8eea4
|
Dynamically allocate arbitrary data on a stack. More...
#include <StackAllocator.hh>
Public Types | |
Type aliases | |
using | value_type = T |
using | result_type = value_type * |
using | Data = StackAllocatorData< T, Ownership::reference, MemSpace::native > |
Public Member Functions | |
CELER_FUNCTION | StackAllocator (Data const &data) |
Construct with defaults. | |
CELER_FUNCTION size_type | capacity () const |
Get the maximum number of values that can be allocated. | |
CELER_FUNCTION void | clear () |
Clear the stack allocator. More... | |
CELER_FUNCTION result_type | operator() (size_type count) |
Allocate space for a given number of items. More... | |
CELER_FUNCTION size_type | size () const |
Get the number of items currently present. More... | |
CELER_FUNCTION Span< value_type > | get () |
View all allocated data. More... | |
CELER_FUNCTION Span< value_type const > | get () const |
View all allocated data (const). More... | |
Dynamically allocate arbitrary data on a stack.
The stack allocator view acts as a functor and accessor to the allocated data. It enables very fast on-device dynamic allocation of data, such as secondaries or detector hits. As an example, inside a hypothetical physics Interactor class, you could create two particles with the following code:
A later kernel could then iterate over the secondaries to apply cutoffs:
You cannot safely access the current size of the stack in the same kernel that's modifying it – if the stack attempts to allocate beyond the end, then the size()
call will reflect that overflowed state, rather than the corrected size reflecting the failed allocation.
A third kernel with a single thread would then be responsible for clearing the data:
These separate kernel launches are needed as grid-level synchronization points.
const_reference
to the StackAllocatorData. Then the rule will be "you can't create a StackAllocator in the same kernel
that you directly access a StackAllocation".
|
inline |
Clear the stack allocator.
This sets the size to zero. It should ideally only be called by a single thread (though multiple threads resetting it should also be OK), but cannot be used in the same kernel that is allocating or viewing it. This is because the access times between different threads or thread-blocks is indeterminate inside of a single kernel.
|
inline |
View all allocated data.
This cannot be called while any running kernel could be modifiying the size.
|
inline |
View all allocated data (const).
This cannot be called while any running kernel could be modifiying the size.
|
inline |
Allocate space for a given number of items.
Returns NULL if allocation failed due to out-of-memory. Ensures that the shared size reflects the amount of data allocated.
|
inline |
Get the number of items currently present.
This value may not be meaningful (may be less than "actual" size) if called in the same kernel as other threads that are allocating.