Similarity measures for binary data