🎲 Probaility Mass Function (Discrete Data)

Used when outcomes are countable — like coin flips, number of visitors, or email opens.

DistributionVariable TypeShape / NatureTypical Use CaseData Science Example
BernoulliBinary (0/1)Two outcomes onlySuccess/FailureSpam (1) or Not Spam (0)
BinomialCount of successes in n trialsSymmetric for p=0.5Series of coin flipsA/B test results
PoissonCount of events in time or spaceSkewed rightNumber of arrivalsWebsite hits/hour
GeometricTrials until first successExponentially decreasingWaiting for first eventFirst customer purchase
Negative BinomialTrials until r-th successSimilar to geometricRepeat experimentsMarketing conversions
MultinomialCategorical outcomes (>2)Multiple discrete barsMulti-class probabilitiesNLP topic modeling
HypergeometricSampling without replacementFinite populationDefects in batchQuality control

🧩 PMF Graph Look: Bars or spikes — each bar = probability of one discrete value.


🌊 Probaility Density Function (Continuous Data)

Used when values are measurable and continuous — like height, time, or temperature

Distribution Variable Type Shape / Nature Typical Use Case Data Science Example
Normal (Gaussian) Continuous Bell curve, symmetric Natural phenomena Regression errors, GMM
Uniform Continuous Flat line Equal probability Random sampling, simulation
Exponential Continuous (positive) Rapid decay Time between events Time-to-failure prediction
Log-Normal Continuous (positive) Right-skewed Positive-only variables Income, transaction values
Gamma Continuous (positive) Right-skewed, flexible Duration modeling Reliability analysis
Beta Continuous (0–1) Bounded (0–1) Probabilities, ratios Bayesian modeling
Chi-Square Continuous (positive) Right-skewed Variance modeling Hypothesis testing
Weibull Continuous (positive) Variable skew Lifetimes, failure Survival analysis
Pareto Continuous (positive) Heavy-tailed Inequality, extremes Customer lifetime value
Cauchy / t-distribution Continuous Heavy tails Outlier-robust modeling Bayesian inference

🧩 PDF Graph Look: Smooth curve — area under curve = total probability = 1.


💡 Quick Comparison Table

CategoryUsed ForKey ExamplesGraph Shape
PMFDiscrete probabilityBernoulli, Poisson, BinomialSpikes / Bars
PDFContinuous probabilityNormal, Exponential, BetaSmooth curve



🧠 In Data Science Pipelines

Model TypeUnderlying DistributionFunction Used
Classification (binary/multi-class)Bernoulli, MultinomialPMF
RegressionNormal (residuals)PDF
ClusteringGaussian (each cluster)PDF
Anomaly DetectionGaussian, ParetoPDF/CDF
Reliability / SurvivalWeibull, GammaPDF + CDF
Logistic RegressionLogisticCDF
Bayesian InferenceBeta, NormalPDF