Skip to content

Question: Disable Optimizations for TensorRT #4075

@YixuanSeanZhou

Description

@YixuanSeanZhou

Question

Because there are so many optimizations that TRT performs, sometimes it is very hard to isolate the issue if we see regression in model accuracy. I know we have the builder_optimization_level flag, but it seems to only control which kernel is used when executing the model.

I wonder if there is more fine-grained control? For example, I want to prevent fusions, or prevent removing dead code.

To give more context: In my specific use case, I am interested in isolating whether resolving Q/DQ nodes can causes regression in model. What I am interested to achieve is to only enable Q/DQ resolution and disable all other optimizations. Is this achievable?

Thanks in advance

Metadata

Metadata

Assignees

Labels

InvestigatingIssue is under investigation by TensorRT devsModule:Engine BuildIssues with building TensorRT engines

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions