Resources
Mining Software Engineering Data Bibliography
-- Resources--
Surveys
- Raymond P.L. Buse, Thomas Zimmermann. Analytics for Software Development. In Proceedings of the FSE/SDP Workshop on the Future of Software Engineering Research (FoSER 2010), Santa Fe, New Mexico, USA, November 2010.
- Ahmed E. Hassan and Tao Xie. Software Intelligence: Future of Mining Software Engineering Data. In Proceedings of FSE/SDP Workshop on the Future of Software Engineering Research (FoSER 2010), Santa Fe, NM, November 2010.
- Tao Xie, Suresh Thummalapenta, David Lo, and Chao Liu. Data Mining for Software Engineering. IEEE Computer, 42(8), pp.35-42, August 2009.
- Michael W. Godfrey, Ahmed E. Hassan, James D. Herbsleb, Gail C. Murphy, Martin P. Robillard, Premkumar T. Devanbu, Audris Mockus, Dewayne E. Perry, David Notkin: Future of Mining Software Archives: A Roundtable. IEEE Software 26(1): 67-70 (2009)
- Ahmed E. Hassan: The Road Ahead for Mining Software Repositories. In Proceedings of the Future of Software Maintenance (FoSM) at the 24th IEEE International Conference on Software Maintenance (ICSM) 2008, Beijing, China.
- Huzefa Kagdi, Michael L. Collard, and Jonathan I. Maletic. A Survey and Taxonomy of Approaches for Mining Software Repositories in the Context of Software Evolution, in the Journal of Software Maintenance and Evolution: Research and Practice (JSME), Vol. 19, No. 2, pp. 77-131, March/April 2007.
Bibliography on Mining Software Engineering Data
Resources
- International Workshop on Mining Software Repositories 2007 MSR 2006 MSR 2005 MSR 2004
- TSE special issue on MSR
- UIUC course: Data Mining for Security, Performance, and Software Robustness course
- UIUC's Automatic On-the-fly Detection, Characterization, Recovery, and Correction of Software Bugs in Production Runs (NSF/ITR) Probe group web Probe publications
- Mining Software Engineering Data links
- Component Protocol Inference links
Available mining tools:
- Resources at Illinois
- Machine learning/data mining tools: Infer.NET Weka UIUC software UIUC illimine Other UIUC tools Himalaya data mining tools released by Cornell Researchers Java data mining algorithms Jian Pei's papers CP-MinerSubdue More Graph Mining Tools FFSM: Fast Frequent Subgraph Mining Yadas Clustering Info-Fuzzy Network Machine Learning in Java (MLJ) Liverpool tools YALEJAM Bayesian Network tools in Java (BNJ)JavaBayes jBNC UnBBayes Redwood BioJava.org BioPerl.org GenomeWeb statdistlib SemanticWebMining Chunnan SMILE GGAT -Genetic Algorithm
- Frequent SubGraphs: Pathway
- Finite state machine learner: Kbehavior HMM mini-toolkit by Anand PFSA learner
- Concept analysis tools: Concepts kaba Formal Concept Analysis Formal Concept Analysis Resources tockit toscanaj FCA java tool
- Genetic algorithms: JGAP
- Bytecode analysis tools: Soot@McGill BCEL MBEL bytecode viewers: Jad Jclasslib
- Text analysis: William W. Cohen
- Minorthird, an open-source Java package of information extraction and text classification learning tools.
- SecondString is another open-source Java package, of approximate string matching techniques
- Specification Inference/Code Mining Tools (projects) (mining tool resources): LSC Miner@NUS Nort@NUS Objectflattener@UNIMIB Abadu@Saarland Lever@UIUC TrendProf@Berkeley Hear@UCDavis Google Eclipse Search Programming By Sketch@Berkeley API Birthmark@Saarland Sourcerer@UCI Codegenie@UCI Quack@MIT Fossology.org
- Mining execution traces: Daikon@MIT DIDUCE@Stanford kBehavior@UNIMIB Terracotta@U. Virignia
- Mining version histories: Evolizer@UZurich SemDiff@McGill LSDiff@UTAustin TeMo@Delft eROSE@Saarland
- Mining static code bases: Tikanga@Saarland U. CP-Miner@UIUC LtRrules@OhioU Prospector@UCBerkeley JavaSketch@UCBerkeley Strathcona@Calgary Sniff@UCBerkeley
- Searching code: Tools@UCI Datasets@UCI Sourcerer@UCI Code Conjurer koders-loganalysis@UCI
- Concept Analysis Tools: Colibri-Java@Saarland Colibri/Concepts@Saarland KABA@PASSAU
- Visualization Tools: HeapDBG@IMDEA iSpace@vDarmstadt MVT@HUT EPOSoft@Saarland CCVisu@SFU