Weka (Java-based)
Scikit-Learn (Python-based)
The Elements of Statistical Learning: Data Mining, Inference, and Prediction.
Second Edition
Trevor Hastie, Robert Tibshirani, Jerome Friedman
S. Shalev-Shwartz Online Learning and Convex Optimization DOI: 10.1561/2200000018
Wolpert, David (1996), "The Lack of A Priori Distinctions between Learning Algorithms", Neural Computation, pp. 1341-1390.
D. Haussler. Overview of the Probably Approximately Correct (PAC) Learning Framework.
Active Learning Toolbox for Matlab
Another Active Learning Toolbox for Matlab
Videos by the mathematical monk
Tom Mitchell's Introduction to Machine Learning (CMU Course 10-701), Spring 2011, including video lectures.
Tom Mitchell and Nina Balcan's Introduction to Machine Learning (CMU Course 10-601), Spring 2015, including video lectures.