This release includes fixes from the previous cuDNN v7.x.x releases as well as the following additional changes. These release notes apply to both cuDNN and JetPack users unless an item is specifically marked "(not applicable for Jetson platforms)". For previous cuDNN release notes, see the cuDNN Archived Documentation.

The following features and enhancements have been added to this release. Performance has been improved for several APIs, including cudnnAddTensor, cudnnOpTensor, cudnnActivationForward, and cudnnActivationBackward. The cuDNN datatype references and APIs have been separated from the cuDNN Developer Guide into a new cuDNN API document. Best Practices For Using cuDNN 3D Convolutions has been published.

For the latest compatible software versions of the OS, CUDA, the CUDA driver, and the NVIDIA hardware, see the cuDNN Support Matrix for v7.6.5.

RNN and multi-head attention API calls may exhibit non-deterministic behavior when the cuDNN 7.6.5 library is built with CUDA Toolkit 10.2 or higher. This is the result of new buffer management and heuristics in the cuBLAS library. As described in the Results Reproducibility section of the cuBLAS Library User Guide, numerical results may not be deterministic when cuBLAS APIs are launched in more than one CUDA stream via the same cuBLAS handle. This is caused by the two buffer sizes (16 KB and 4 MB) used in the default configuration.
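The non-determinism described for RNN and multi-head attention calls comes from the cuBLAS workspace buffers, so one possible mitigation is to pin the workspace configuration before cuBLAS is initialized. The sketch below is a minimal illustration under stated assumptions: the CUBLAS_WORKSPACE_CONFIG variable and the values ":16:8" and ":4096:8" come from the Results Reproducibility section of the cuBLAS Library User Guide, not from the notes above, and the helper names are hypothetical. Check the guide for your toolkit version before relying on it.

```python
import os
import subprocess
import sys

# Assumption: these values are the ones the cuBLAS Library User Guide lists
# for reproducible results across streams.
#   ":16:8"   -> eight 16 KiB buffers (may limit performance)
#   ":4096:8" -> eight 4 MiB buffers (uses more GPU memory)
SMALL_WORKSPACE = ":16:8"
LARGE_WORKSPACE = ":4096:8"


def cublas_fixed_workspace_env(config=LARGE_WORKSPACE, base_env=None):
    """Return a copy of the environment with CUBLAS_WORKSPACE_CONFIG set.

    Hypothetical helper: it only prepares the environment mapping and
    does not touch cuBLAS or cuDNN itself.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CUBLAS_WORKSPACE_CONFIG"] = config
    return env


def run_with_fixed_workspace(argv, config=LARGE_WORKSPACE):
    """Run argv in a child process so the variable is set before cuBLAS loads."""
    env = cublas_fixed_workspace_env(config)
    return subprocess.run(argv, env=env, check=True)


if __name__ == "__main__":
    # Example: the child process simply echoes the variable it inherited.
    child = "import os; print(os.environ['CUBLAS_WORKSPACE_CONFIG'])"
    run_with_fixed_workspace([sys.executable, "-c", child])
```

Launching the workload in a child process is a deliberate choice here: the variable has to be present before the cuBLAS handle is created, and setting it in an already running process may be too late.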