Skip to content
Pusat Penelitian, Pengabdian kepada Masyarakat dan Publikasi Internasional
twitter
youtube
instagram
Pusat Penelitian, Pengabdian kepada Masyarakat dan Publikasi Internasional
Call Support 0822-7473-7806
Email Support [email protected]
Location Jl. Kolam No. 1 Medan Estate
  • Beranda
  • Tentang
    • Profil
    • Visi dan Misi
    • Struktur Organisasi
    • Pimpinan Pusat
    • Program Kerja
    • Sasaran, Program Strategis dan IK
  • Berita Kegiatan
  • Layanan & Informasi
    • Aplikasi
      • UMA
        • Penjaminan Mutu
        • Himpunan Aplikasi Online
        • Jurnal Ilmiah Online
        • Repositori UMA
        • Open Access Public Catalog
      • Unit
        • Aplikasi Penelitian & Pengabdian (LIPAN)
        • SWAMP-D
        • SUSITAO
        • SINTA Verifikator
        • BIMA Kemdiktisaintek
    • Arsip Digital
    • Helpdesk
    • Pendanaan
      • Penelitian
        • Penelitian Pendanaan Nasional
        • Penelitian Kerjasama Internasional
      • Pengabdian Kepada Masyarakat
        • PKM Pendanaan Nasional
    • Publikasi
      • Internasional Bereputasi
    • Reviewer Penelitian dan PKM
  • Kerjasama
  • Jadwal Kegiatan

Classification in Data Science: Turning Data into Decisions

Posted on June 9, 2025June 28, 2025 by Fachrur Rozi
0

In the landscape of data science, where patterns drive predictions, classification stands out as one of the most widely used techniques for making sense of labeled data. Whether you’re sorting emails as spam or not, diagnosing diseases, or approving credit applications, classification is the logic engine behind the decision-making.


What is Classification?

Classification is a supervised machine learning task where the goal is to predict a categorical label (class) for new input data based on previously labeled training data.

Examples:

  • Email → Spam or Not Spam
  • Transaction → Fraudulent or Legitimate
  • Image → Cat, Dog, or Bird
  • Patient Data → Has Disease or Not

The model learns from past examples and applies that knowledge to unseen data.


How Classification Works

  1. Training Data: A dataset with features (independent variables) and known labels (dependent variable).
  2. Model Training: Algorithms learn from this data to identify patterns.
  3. Prediction: The trained model is used to classify new, unseen data points.
  4. Evaluation: Accuracy, precision, recall, F1-score, and confusion matrix help assess model performance.

Popular Classification Algorithms

Algorithm Best For
Logistic Regression Binary classification, interpretable
K-Nearest Neighbors Simple models, small datasets
Decision Tree Rule-based logic, visualization
Random Forest High accuracy, ensemble modeling
Naive Bayes Text classification, spam filters
Support Vector Machine (SVM) Clear margin of separation
Neural Networks Complex relationships, deep learning
Gradient Boosting (e.g., XGBoost, LightGBM) High accuracy, tabular data

Binary vs. Multiclass Classification

  • Binary Classification: Two possible outcomes
    Example: Predicting if a loan will default or not.
  • Multiclass Classification: More than two outcomes
    Example: Classifying a fruit as apple, banana, or orange.
  • Multilabel Classification: Each instance can belong to multiple classes
    Example: Tagging a movie as both “comedy” and “romance”.

Example: Classification in Python with Scikit-Learn

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

# Load data
X, y = load_iris(return_X_y=True)

# Split data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# Train model
model = RandomForestClassifier()
model.fit(X_train, y_train)

# Predict and evaluate
y_pred = model.predict(X_test)
print("Accuracy:", accuracy_score(y_test, y_pred))

Evaluation Metrics

  • Accuracy – Correct predictions / total predictions
  • Precision – True Positives / (True Positives + False Positives)
  • Recall (Sensitivity) – True Positives / (True Positives + False Negatives)
  • F1 Score – Harmonic mean of precision and recall
  • Confusion Matrix – Table summarizing prediction results

Applications of Classification

Field Use Case
Healthcare Disease diagnosis
Finance Credit risk assessment
Marketing Customer segmentation
Cybersecurity Intrusion detection systems
E-commerce Product recommendation (as a sub-step)
Email Services Spam filtering

Challenges in Classification

  • Imbalanced classes (e.g., 95% healthy, 5% sick)
  • Overfitting – when the model learns noise instead of patterns
  • Feature selection – irrelevant features can reduce performance
  • Interpretability – especially in deep learning models

Conclusion

Classification is at the heart of many intelligent systems. Its ability to turn labeled data into actionable decisions makes it a cornerstone of applied machine learning. From diagnosing disease to filtering spam, classification helps transform data into clarity and control.

Tags: Digital University, Kampus Internasional, Kampus Terakreditasi, Kampus Terbaik, Mahasiswa Berprestasi, Penelitian, Sustainable University, UMA Terbaik, Universitas Terbaik

Berita Terbaru
UMA Kukuhkan Posisi sebagai Kampus Swasta Terbaik di Sumut Versi SJR
Universitas Medan Area kembali mencatatkan pencapaian membanggakan di tingkat nasional dengan meraih predikat sebagai perguruan tinggi swasta terbaik di Sumatera...
UMA Terima Kunjungan STIE Graha Kirana: Perkuat Kolaborasi Tridharma dan Pengelolaan HKI
Medan, 24 April 2026 — Universitas Medan Area (UMA) menerima kunjungan akademik dari Sekolah Tinggi Ilmu Ekonomi (STIE) Graha Kirana...
KAMPUS I
Jalan Kolam Nomor 1 Medan Estate / Jalan Gedung PBSI, Medan 20223
(061) 7360168 CALL CENTER : 0811-6013-888
[email protected]
KAMPUS II
Jalan Sei Serayu No. 70 A / Jalan Setia Budi No. 79 B, Medan 20112
(061) 42402994
[email protected]

Statistik Pengunjung

  • 0
  • 1
  • 1
  • 21,729
  • 23,694
@Copyright 2026 BPDI | Universitas Medan Area

This will close in 10 seconds