File tree
488 files changed, +23178 -10463 lines changed
- benchmarks
- cpp
- python
- cpp
- include/tensorrt_llm
- batch_manager
- common
- executor
- runtime
- tensorrt_llm
- batch_manager
- aarch64-linux-gnu
- x86_64-linux-gnu
- common
- cutlass_extensions/include/cutlass_extensions
- epilogue
- thread
- threadblock
- gemm
- device
- kernel
- threadblock
- warp
- transform/threadblock
- executor
- aarch64-linux-gnu
- x86_64-linux-gnu
- kernels
- contextFusedMultiHeadAttention
- cutlass_kernels
- fpA_intB_gemm
- launchers
- int8_gemm
- moe_gemm
- decoderMaskedMultiheadAttention
- mixtureOfExperts
- onlineSoftmaxBeamsearchKernels
- parallelDecoding
- unfusedAttentionKernels
- weightOnlyBatchedGemv
- sm90
- layers
- plugins
- api
- bertAttentionPlugin
- common
- gemmPlugin
- gptAttentionCommon
- gptAttentionPlugin
- identityPlugin
- layernormQuantizationPlugin
- lookupPlugin
- loraPlugin
- mixtureOfExperts
- ncclPlugin
- quantizePerTokenPlugin
- quantizeTensorPlugin
- rmsnormQuantizationPlugin
- selectiveScanPlugin
- smoothQuantGemmPlugin
- weightOnlyGroupwiseQuantMatmulPlugin
- weightOnlyQuantMatmulPlugin
- pybind
- batch_manager
- utils
- runtime
- utils
- thop
- tests
- kernels
- sampling
- layers
- resources/scripts
- runtime
- docker
- common
- docs/source
- examples
- baichuan
- bloom
- chatglm
- cpp_library
- enc_dec
- bart
- falcon
- gemma
- gpt
- gptj
- gptneox
- high-level-api
- internlm
- llama
- mamba
- medusa
- mixtral
- mpt
- multimodal
- openai_triton/manual_plugin
- opt
- phi
- quantization
- qwen
- qwenvl
- server
- skywork
- whisper
- scripts
- tensorrt_llm
- auto_parallel
- tensor_parallel
- plugin_nodes
- commands
- hlapi
- layers
- models
- enc_dec
- gemma
- gpt
- llama
- mamba
- mpt
- phi
- qwen
- plugin
- quantization
- runtime
- tools/plugin_gen/templates
- tests
- attention
- bindings
- functional
- hlapi
- model
- model_api
- quantization
- utils