Data Mining Behavioral Transitions in Open Source Repositories

Open-source repository data can be mined automatically with sequence mining methods to provide high-level feedback on project status. Projects hosted on GitHub.com are acquired, sequence-mined, clustered, and analyzed by regression to characterize them. The results can be presented to project managers as part of a display generated by an automated monitoring system that provides high-level feedback in real time. This work is a preliminary step in a larger research project aimed at understanding and monitoring FLOSS projects using this process modeling approach.
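
To make the idea of a behavioral transition concrete, the following is a minimal sketch and not the pipeline used in this work. It assumes each project's history has already been reduced to an ordered list of event-type labels. The labels, the `history` list, and the function names are hypothetical and chosen only for illustration. The sketch counts consecutive event pairs and normalizes them into first-order transition probabilities, the kind of per-project feature vector that a later clustering or regression step could take as input.

```python
from collections import Counter


def transition_counts(events):
    """Count consecutive (source, destination) event-type pairs in one history."""
    return Counter(zip(events, events[1:]))


def transition_probabilities(events):
    """Normalize pair counts by how often each source event starts a transition."""
    counts = transition_counts(events)
    totals = Counter()
    for (src, _dst), n in counts.items():
        totals[src] += n
    return {pair: n / totals[pair[0]] for pair, n in counts.items()}


# Hypothetical event log for one project (labels are illustrative assumptions).
history = ["push", "push", "issue_opened", "push", "issue_closed", "push"]
probs = transition_probabilities(history)

for (src, dst), p in sorted(probs.items()):
    print(f"{src} -> {dst}: {p:.2f}")
```

Each project then yields a dictionary keyed by event pair. Aligning these dictionaries over a shared set of pairs gives fixed-length vectors that can be compared across projects.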