A clustering method, medium, and apparatus using region division templates are provided. To extract the semantic concepts included in a photo more reliably, an input photo is divided into region images by using region division templates, and multiple content-based feature values are extracted from each region image. The confidence degree of the input image with respect to each local semantic concept, which is defined by using the feature values, is then measured. Based on these confidence degrees, the local semantic concepts of the photo are merged so that a more reliable local semantic concept is extracted. By using the merged local semantic concept, the confidence degree of each global semantic concept is measured, and the multiple category concepts included in the input photo are extracted according to that confidence degree. Photo data can thereby be used quickly and effectively to generate an album.
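The flow described above (divide by templates, extract features, score local concepts, merge, score global concepts, select categories) can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the source: the five templates, the single mean-intensity feature, the toy "bright" and "dark" local concepts, the maximum-over-regions merging rule, the weighted-sum global scoring, the category names, and the 0.75 threshold.

```python
from typing import Callable, Dict, List, Tuple

Image = List[List[float]]                 # grayscale pixel rows, values in [0, 255]
Box = Tuple[float, float, float, float]   # (top, left, bottom, right) as fractions

# Hypothetical region division templates: the whole image and its four halves.
TEMPLATES: Dict[str, Box] = {
    "whole": (0.0, 0.0, 1.0, 1.0),
    "top": (0.0, 0.0, 0.5, 1.0),
    "bottom": (0.5, 0.0, 1.0, 1.0),
    "left": (0.0, 0.0, 1.0, 0.5),
    "right": (0.0, 0.5, 1.0, 1.0),
}

# Toy local semantic concepts, each scored from one feature value (assumption).
LOCAL_CONCEPTS: Dict[str, Callable[[float], float]] = {
    "bright": lambda mean: mean / 255.0,
    "dark": lambda mean: 1.0 - mean / 255.0,
}

# Toy global concepts as weights over merged local confidences (assumption).
GLOBAL_CONCEPTS: Dict[str, Dict[str, float]] = {
    "daylight": {"bright": 1.0, "dark": 0.0},
    "night": {"bright": 0.0, "dark": 1.0},
    "high_contrast": {"bright": 0.5, "dark": 0.5},
}


def crop(image: Image, box: Box) -> Image:
    """Cut the region image selected by one template."""
    height, width = len(image), len(image[0])
    top, left, bottom, right = box
    rows = image[int(top * height):int(bottom * height)]
    return [row[int(left * width):int(right * width)] for row in rows]


def mean_intensity(region: Image) -> float:
    """Stand-in for the content-based feature values of a region."""
    pixels = [p for row in region for p in row]
    return sum(pixels) / len(pixels)


def local_confidences(image: Image) -> Dict[str, Dict[str, float]]:
    """Confidence of every local concept in every template region."""
    result = {}
    for name, box in TEMPLATES.items():
        feature = mean_intensity(crop(image, box))
        result[name] = {c: score(feature) for c, score in LOCAL_CONCEPTS.items()}
    return result


def merge_local(conf: Dict[str, Dict[str, float]]) -> Dict[str, float]:
    """Merge per-region confidences; taking the maximum is an assumed rule."""
    return {c: max(region[c] for region in conf.values()) for c in LOCAL_CONCEPTS}


def categories(image: Image, threshold: float = 0.75) -> List[str]:
    """Return every global concept whose confidence reaches the threshold."""
    merged = merge_local(local_confidences(image))
    scores = {
        g: sum(w * merged[c] for c, w in weights.items())
        for g, weights in GLOBAL_CONCEPTS.items()
    }
    return sorted(g for g, s in scores.items() if s >= threshold)
```

In this sketch a uniformly bright image yields a single category, while an image with a bright top half and a dark bottom half yields several, because each half-image template isolates a region in which one local concept is scored with full confidence. That is the intended role of the templates: a concept that is diluted over the whole image can still be detected reliably in one region.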