Skip to content
Pusat Penelitian, Pengabdian kepada Masyarakat dan Publikasi Internasional
twitter
youtube
instagram
Pusat Penelitian, Pengabdian kepada Masyarakat dan Publikasi Internasional
Call Support 0822-7473-7806
Email Support [email protected]
Location Jl. Kolam No. 1 Medan Estate
  • Beranda
  • Tentang
    • Profil
    • Visi dan Misi
    • Struktur Organisasi
    • Pimpinan Pusat
    • Program Kerja
    • Sasaran, Program Strategis dan IK
  • Berita Kegiatan
  • Layanan & Informasi
    • Aplikasi
      • UMA
        • Penjaminan Mutu
        • Himpunan Aplikasi Online
        • Jurnal Ilmiah Online
        • Repositori UMA
        • Open Access Public Catalog
      • Unit
        • Aplikasi Penelitian & Pengabdian (LIPAN)
        • SWAMP-D
        • SUSITAO
        • SINTA Verifikator
        • BIMA Kemdiktisaintek
    • Arsip Digital
    • Helpdesk
    • Pendanaan
      • Penelitian
        • Penelitian Pendanaan Nasional
        • Penelitian Kerjasama Internasional
      • Pengabdian Kepada Masyarakat
        • PKM Pendanaan Nasional
    • Publikasi
      • Internasional Bereputasi
    • Reviewer Penelitian dan PKM
  • Kerjasama
  • Jadwal Kegiatan

Hierarchical Clustering: Building Nested Groups in Unsupervised Learning

Posted on September 24, 2025September 30, 2025 by Fachrur Rozi
0

Introduction

Clustering is a fundamental unsupervised learning technique used to group similar data points together. While K-Means requires us to predefine the number of clusters, Hierarchical Clustering provides a more flexible approach by building a hierarchy of clusters. This method is particularly useful when the number of clusters is unknown or when we want to visualize relationships between data points.

What Is Hierarchical Clustering?

Hierarchical Clustering is an unsupervised machine learning algorithm that builds a tree-like structure of clusters, known as a dendrogram. It works by either merging smaller clusters into larger ones (agglomerative) or splitting larger clusters into smaller ones (divisive).

Key Concepts:

  • Agglomerative Clustering (Bottom-Up): Starts with each data point as its own cluster and merges them step by step.
  • Divisive Clustering (Top-Down): Starts with all data points in one cluster and splits them iteratively.
  • Dendrogram: A tree diagram that shows how clusters are merged or split.
  • Linkage Criteria: Determines how distances between clusters are calculated (single linkage, complete linkage, average linkage, Ward’s method).

How Hierarchical Clustering Works

  1. Calculate the distance (similarity) between all data points.
  2. Merge the two closest clusters.
  3. Recalculate distances between the new cluster and remaining clusters.
  4. Repeat until all points are in a single cluster.
  5. Use the dendrogram to decide the optimal number of clusters.

Applications of Hierarchical Clustering

  • Biology: Grouping genes or species based on similarity.
  • Marketing: Customer segmentation without predefining cluster numbers.
  • Text Mining: Organizing documents by topic similarity.
  • Image Processing: Grouping pixels or image features.
  • Social Networks: Detecting communities in graphs.

Advantages of Hierarchical Clustering

  • No need to predefine K: The number of clusters can be chosen from the dendrogram.
  • Interpretability: Dendrograms provide visual insights into cluster relationships.
  • Flexibility: Works with different distance metrics (Euclidean, Manhattan, cosine similarity).
  • Deterministic: Always produces the same result (unlike K-Means, which depends on initialization).

Challenges and Limitations

  • Computationally expensive: O(n²) complexity, not ideal for very large datasets.
  • Sensitive to noise and outliers: Can distort the dendrogram.
  • Imbalanced clusters: May produce clusters of very different sizes.
  • Irreversible merges/splits: Early decisions cannot be undone.

Improvements and Variants

  • BIRCH (Balanced Iterative Reducing and Clustering using Hierarchies): Scales hierarchical clustering to large datasets.
  • CURE (Clustering Using Representatives): Handles outliers better by using representative points.
  • Hybrid approaches: Combining K-Means with hierarchical methods for efficiency.

Conclusion

Hierarchical Clustering provides a powerful way to uncover nested structures in data. With its ability to visualize relationships through dendrograms and flexibility in not requiring a predefined number of clusters, it remains a valuable tool in exploratory data analysis. Although it struggles with large datasets, improved variants make it suitable for a wide range of real-world applications.

Tags: 2025, Dosen Terbaik, Kampus Internasional, Kampus Terakreditasi, Kampus Unggulan, Mahasiswa Berprestasi, Sustainable University, UMA Keren, Universitas Swasta, Universitas Terbaik

Berita Terbaru
UMA Kukuhkan Posisi sebagai Kampus Swasta Terbaik di Sumut Versi SJR
Universitas Medan Area kembali mencatatkan pencapaian membanggakan di tingkat nasional dengan meraih predikat sebagai perguruan tinggi swasta terbaik di Sumatera...
UMA Terima Kunjungan STIE Graha Kirana: Perkuat Kolaborasi Tridharma dan Pengelolaan HKI
Medan, 24 April 2026 — Universitas Medan Area (UMA) menerima kunjungan akademik dari Sekolah Tinggi Ilmu Ekonomi (STIE) Graha Kirana...
KAMPUS I
Jalan Kolam Nomor 1 Medan Estate / Jalan Gedung PBSI, Medan 20223
(061) 7360168 CALL CENTER : 0811-6013-888
[email protected]
KAMPUS II
Jalan Sei Serayu No. 70 A / Jalan Setia Budi No. 79 B, Medan 20112
(061) 42402994
[email protected]

Statistik Pengunjung

  • 1
  • 36
  • 30
  • 21,764
  • 23,723
@Copyright 2026 BPDI | Universitas Medan Area

This will close in 10 seconds