4

Anyone know of any tools or scripts that can help in corpus distillation ? I know of Peach Minset, but not other than that. Appreciate if anyone could share.

perror
  • 19,083
  • 29
  • 87
  • 150
d123
  • 201
  • 1
  • 6

2 Answers2

3

Some time ago I wrote minblox for that exact purpose. It relies on DynamoRIO. Compared to minset which uses pin tool, there isn't much of a difference. Tho I think actual set minimization part works faster than minset.

Minblox tool is comprised of two parts.

  • A DynamoRIO instrumentation part (libbbcoverage) tasked with recording all basic block executed during application execution.
  • minblox.py - Python script that runs the DynamoRIO instrumentation and analyzes the log files to minimize the sample set.

Though, do bear in mind that I've only tested this for a specific case I needed it, so your mileage might vary.

0xea
  • 4,904
  • 1
  • 23
  • 30
1

As far as I know, there is nothing like this public. However, you can use DrCov and do yourself the same thing as Minset does and even more powerful things. Indeed, I commonly use this DynamoRIO's tool, DrCov, for doing this exact same thing and others.

joxeankoret
  • 4,488
  • 2
  • 21
  • 35