Enhancing Flaky Test Detection Using iDFlakies and TuscanE

Authors

  • Amanda Nguyen, Department of Computer Science, George Mason University, Fairfax, VA
  • Md. Mahmudul Hasan Pious, Department of Computer Science, George Mason University, Fairfax, VA
  • August Shi, Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, TX
  • Wing Lam, Department of Computer Science, George Mason University, Fairfax, VA

Abstract

Flaky tests are software tests that produce inconsistent results, i.e., tests that can both pass and fail on the same version of the code. Prior work found that order-dependent (OD) tests are one of the most prominent categories of flaky tests. OD tests are tests whose outcomes depend on the order in which they are run, while non-order-dependent (NOD) tests are flaky tests whose outcomes do not depend on the order in which they are run. Two tools from prior work for detecting flaky tests are iDFlakies and TuscanE. iDFlakies detects flaky tests by rerunning tests in various orders and categorizes each detected flaky test as OD or NOD. Previous work has shown that iDFlakies can detect some flaky tests by running tests in randomly generated test orders. TuscanE systematically generates test orders such that, once all generated orders have been run, the detection of certain OD tests is guaranteed. To improve the detection of OD tests in iDFlakies, we integrate the two tools so that OD tests can be detected more efficiently. Our evaluation shows that the combined tool, which systematically generates test orders, detects OD tests faster than iDFlakies with random test orders.
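
To make the notion of an OD test concrete, the following is a minimal Python sketch under stated assumptions, not the implementation of iDFlakies or TuscanE. The test names, the shared `state` dictionary, and the helpers `run_order` and `find_od_tests` are all hypothetical. For simplicity the sketch enumerates every permutation of a three-test suite, which is feasible only for toy suites and is not how TuscanE generates its orders.

```python
from itertools import permutations

# Hypothetical shared state that the toy tests read and write.
state = {"cache": None}

def test_writer():
    # Sets shared state as a side effect; always passes.
    state["cache"] = "ready"
    return True

def test_reader():
    # Passes only if test_writer ran earlier in the same run.
    return state["cache"] == "ready"

def test_independent():
    # Touches no shared state; passes in every order.
    return 1 + 1 == 2

TESTS = {
    "test_writer": test_writer,
    "test_reader": test_reader,
    "test_independent": test_independent,
}

def run_order(order):
    """Run the tests in the given order from a fresh shared state."""
    state["cache"] = None
    return {name: TESTS[name]() for name in order}

def find_od_tests(orders):
    """Return tests that both pass and fail across the given orders."""
    outcomes = {name: set() for name in TESTS}
    for order in orders:
        for name, passed in run_order(order).items():
            outcomes[name].add(passed)
    return sorted(name for name, seen in outcomes.items() if len(seen) > 1)

# Exhaustive enumeration of orders (an assumption made for this toy suite).
od_tests = find_od_tests(permutations(TESTS))
print(od_tests)
```

Because the toy tests are deterministic, any test with differing outcomes across orders must be order-dependent. A real detector would additionally rerun a failing order to check that the failure is reproducible in that order before labeling the test OD rather than NOD.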

Published

2025-09-25

Issue

Section

College of Engineering and Computing: Department of Computer Science