Avula, Sai KrishnaSai KrishnaAvulaMondal, ShouvickShouvickMondal2025-08-312025-08-312024-07-10[9798400706585]10.1145/3663529.36637972-s2.0-85199036860https://d8.irins.org/handle/IITG2025/28832Modern software repositories serve as valuable sources of information for understanding and addressing software bugs. In this paper, we present MineCPP, a tool designed for large-scale bug-fixing dataset generation, extending the capabilities of a recently proposed approach, namely Minecraft. MineCPP not only captures bug locations and types across multiple programming languages but introduces novel features like offset of a bug in a buggy source file, the sequence of syntactic constructs up to and including the location of the bug, etc. We discuss architectural and operational aspects of MineCPP, and show how it can be used to automatically mine GitHub repositories. A Graphical User Interface (GUI) further enhances user experience by providing interactive visualizations and quantitative analyses, facilitating fine-grained insights about the structure of bug fix pairs. MineCPP serves as a helpful solution for researchers, practitioners, and developers seeking comprehensive bug-fixing datasets and insights into coding practices. Tool demonstration is available at https://youtu.be/ln99irvbADE.trueBug Fixes | Coding Effort | LLMs | Mining Software RepositoriesMineCPP: Mining Bug Fix Pairs and Their StructuresConference Paperhttps://doi.org/10.1145/3663529.3663797552-55610 July 20240cpConference Proceeding0