struct AllocRequest : public faiss::gpu::AllocInfo
Information on what/where an allocation is, along with how big it should be.
inline AllocRequest(const AllocInfo &info, size_t sz)
std::string toString() const
Returns a string representation of this request.
size_t size = 0
The size in bytes of the allocation.
int device = 0
The device on which the allocation is happening.
cudaStream_t stream = nullptr
The stream on which new work on the memory will be ordered (e.g., if a piece of memory cached and to be returned for this call was last used on stream 3 and a new memory request is for stream 4, the memory manager will synchronize stream 4 to wait for the completion of stream 3 via events or other stream synchronization.
The memory manager guarantees that the returned memory is free to use without data races on this stream specified.
- inline AllocRequest()