timemory 3.3.0
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
extern.hpp
Go to the documentation of this file.
1// MIT License
2//
3// Copyright (c) 2020, The Regents of the University of California,
4// through Lawrence Berkeley National Laboratory (subject to receipt of any
5// required approvals from the U.S. Dept. of Energy). All rights reserved.
6//
7// Permission is hereby granted, free of charge, to any person obtaining a copy
8// of this software and associated documentation files (the "Software"), to deal
9// in the Software without restriction, including without limitation the rights
10// to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
11// copies of the Software, and to permit persons to whom the Software is
12// furnished to do so, subject to the following conditions:
13//
14// The above copyright notice and this permission notice shall be included in all
15// copies or substantial portions of the Software.
16//
17// THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
18// IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
19// FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
20// AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
21// LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
22// OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
23// SOFTWARE.
24
25/**
26 * \file timemory/components/roofline/extern.hpp
27 * \brief Include the extern declarations for roofline components
28 */
29
30#pragma once
31
35
36#if defined(TIMEMORY_USE_PAPI)
39
40TIMEMORY_EXTERN_COMPONENT(cpu_roofline_sp_flops, true,
41 std::pair<std::vector<long long>, double>)
42TIMEMORY_EXTERN_COMPONENT(cpu_roofline_dp_flops, true,
43 std::pair<std::vector<long long>, double>)
44TIMEMORY_EXTERN_COMPONENT(cpu_roofline_flops, true,
45 std::pair<std::vector<long long>, double>)
46extern template struct tim::component::cpu_roofline<float>;
47extern template struct tim::component::cpu_roofline<double>;
48extern template struct tim::component::cpu_roofline<float, double>;
49#endif
50
51#if defined(TIMEMORY_USE_CUPTI)
54
55# if defined(TIMEMORY_USE_CUDA_HALF)
58 std::tuple<typename ::tim::component::cupti_activity::value_type,
59 typename ::tim::component::cupti_counters::value_type>)
60# endif
63 std::tuple<typename ::tim::component::cupti_activity::value_type,
64 typename ::tim::component::cupti_counters::value_type>)
67 std::tuple<typename ::tim::component::cupti_activity::value_type,
68 typename ::tim::component::cupti_counters::value_type>)
71 std::tuple<typename ::tim::component::cupti_activity::value_type,
72 typename ::tim::component::cupti_counters::value_type>)
73#endif
74
75#if defined(TIMEMORY_USE_CUPTI)
76extern template struct tim::component::gpu_roofline<float>;
77extern template struct tim::component::gpu_roofline<double>;
78# if defined(TIMEMORY_USE_CUDA_HALF)
81# else
83# endif
84#endif
Include the extern declarations for cupti components.
#define TIMEMORY_DECLARE_EXTERN_COMPONENT(NAME, HAS_DATA,...)
Definition: macros.hpp:720
Include the extern declarations for papi components.
Implementation of the roofline component(s)
STL namespace.
gpu_roofline< float, double > gpu_roofline_flops
Definition: types.hpp:86
gpu_roofline< cuda::fp16_t > gpu_roofline_hp_flops
A specialization of tim::component::gpu_roofline for 16-bit floating point operations (depending on a...
Definition: types.hpp:64
gpu_roofline< double > gpu_roofline_dp_flops
A specialization of tim::component::gpu_roofline for 64-bit floating point operations.
Definition: types.hpp:76
gpu_roofline< float > gpu_roofline_sp_flops
A specialization of tim::component::gpu_roofline for 32-bit floating point operations.
Definition: types.hpp:70
cpu_roofline< double > cpu_roofline_dp_flops
A specialization of tim::component::cpu_roofline for 64-bit floating point operations.
Definition: types.hpp:51
cpu_roofline< float > cpu_roofline_sp_flops
A specialization of tim::component::cpu_roofline for 32-bit floating point operations.
Definition: types.hpp:45
cpu_roofline< float, double > cpu_roofline_flops
Definition: types.hpp:58
Definition: kokkosp.cpp:39
Combines hardware counters and timers and executes the empirical roofline toolkit during application ...
typename typename typename
Definition: types.hpp:226