fixes instruction properties #1000

gitoleg · 2019-10-15T20:04:37Z

The instructions properties are computed from the semantics tags
provided by a backend and from the BIL provided by a lifter.
There was a bug the algorithm that was analyzing the BIL code,
which, in particular, lead to a classification of a machine instruction
as a barrier, when it wasn't.

The essence of the bug

When the semantic tags were inferred from the BIL we didn't accumulate
newly inferred tags to the list of existing ones but, instead, were
overriding it. As a result, if an instruction contained more than one
jump only the last jump was taken into account. This is a rare case,
that was happening with the rep prefixed instructions (which had three)
jumps with the latter being unconditional.
Therefore such instructions were classified as barriers.

Solution

First, we accumulate tags to make sure we won't override anything.
Second, we hold an invariant that instruction can't possess both
Conditional_branch and Unconditional_branch tags at the same time.
That means that an instruction with a conditional jump will preserve this
property, even in the presence of unconditional jumps.

Better solution

Going back to the rep prefixed instructions, they shouldn't be
classified as jumps at all, and the right solution for them would be
using the while statement (or repeat from the Core Theory), so that
they will be classified as data having only data effects.
But this is a story for a completely different PR

The instructions properties are computed from the semantics tags provided by a backend and from the BIL provided by a lifter. There was a bug the algorithm that was analyzing the BIL code, which, in particular, lead to a classification of a machine instruction as a barrier, when it wasn't. When the semantic tags were inferred from the BIL we didn't accumulate newly inferred tags to the list of existing ones but, instead, were overriding it. As a result, if an instruction contained more than one jump only the last jump was taken into account. This is a rare case, that was happening with the rep prefixed instructions (which had three) jumps with the latter being unconditional. Therefore such instructions were classified as barriers. First, we accumulate tags to make sure we won't override anything. Second, we hold an invariant that instruction can't possess both `Conditional_branch` and `Unconditional_branch` tags at the same time. That means that an instruction with a conditional jump will preserve this property, even in the presence of unconditional jumps. Going back to the rep prefixed instructions, they shouldn't be classified as jumps at all, and the right solution for them would be using the while statement (or repeat from the Core Theory), so that they will be classified as data having only data effects. But this is a story for a completely different PR

ivg approved these changes Oct 15, 2019

View reviewed changes

ivg merged commit 2db6590 into BinaryAnalysisPlatform:master Oct 16, 2019

ivg mentioned this pull request Nov 14, 2019

BAP 2.0.0 release ocaml/opam-repository#15284

Merged

gitoleg deleted the fix-instruction-properties branch May 13, 2020 21:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fixes instruction properties #1000

fixes instruction properties #1000

Uh oh!

gitoleg commented Oct 15, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fixes instruction properties #1000

fixes instruction properties #1000

Uh oh!

Conversation

gitoleg commented Oct 15, 2019

The essence of the bug

Solution

Better solution

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants