Vagueness & Uncertainty — Fuzzy + Probabilistic Logic

chatbot methods-of-ai

Companion files for exam prep:

📋 lernzettel_vagueness-uncertainty_30-04-26 — compact 1-page exam sheet

✏️ quiz_vagueness-uncertainty_30-04-26 — 7 questions

🗂 Source slides: Methods of AI, Session 08 (SoSe 2026)

Core Ideas
Glossary
Fuzzy Set Theory
Zadeh operations
t-norms and s-norms — axioms
Algebraic & Quotient norms
Visual — membership function shapes
Fuzzy algebra in code
Visualisations (Python) — 4 figures
Probabilistic Logic
Kolmogorov axioms & derived properties
Conditional probability
Probabilistic knowledge bases
Formulas & Notation
Common Exam Traps
Worked examples
Quick Comparison Table
ALGORITHMS / TECHNIQUES ⭐
- Fuzzy inference pipeline
- Defuzzification
- Bayesian update
- Dempster-Shafer (supplementary)
Where Fuzzy Logic is used today
Where Fuzzy Logic was replaced — and by what
See also

Core Ideas

Classical logic: statements are either true or false — can’t capture partial truth or uncertainty.
Fuzzy logic (Lotfi Zadeh, 1965) addresses vagueness — partial truth (“the ball is reddish”). Truth values live in [0, 1] and represent degree of membership.
Probabilistic logic addresses uncertainty — complete truth we don’t know yet (“probably rain”). Truth values live in [0, 1] and represent probability of a crisp fact.
Fuzzy and probabilistic logic both use [0,1], but the semantics differ: degree of membership ≠ likelihood.
Probabilistic logic preserves more classical properties than fuzzy logic — tautologies always get probability 1, contradictions get 0; fuzzy logic does not guarantee this (intentional for vague predicates).

Glossary — important vocabulary ⭐

Vagueness — predicate boundary is fuzzy. “Tall”, “red”, “heap”. Heap paradox: if removing one grain keeps it a heap, then 0 grains is a heap — contradiction in classical logic.

Uncertainty — predicate boundary is sharp, but we lack information to decide. “It is raining in Berlin” is either true or false; we just don’t know.

Membership function μ_A(x) — function U → [0,1] assigning each x its degree of membership in fuzzy set A.

Fuzzy set A = (U, μ_A) — universe U plus its membership function. Classical (crisp) sets are the special case μ_A: U → {0,1}.

t-norm t(x,y) — binary operator on [0,1] generalising AND. Axioms: neutral element 1, commutative, associative, monotone increasing (if x ≤ x’ and y ≤ y’ then t(x,y) ≤ t(x’,y’)).

s-norm s(x,y) (a.k.a. t-conorm) — generalises OR. Axioms: neutral element 0, commutative, associative, monotone increasing.

Probability measure P — function on events satisfying Kolmogorov’s axioms (see below).

Conditional probability P(B|A) — probability of B given A has occurred, defined only when P(A) > 0.

Tautology T — formula true in every interpretation. In probabilistic logic always gets P(T) = 1. In fuzzy logic not guaranteed.

Contradiction C — formula false in every interpretation. Probabilistic: P(C) = 0. Fuzzy: not guaranteed.

Probabilistic knowledge base — set of probabilistic formulas F : p (P(F) = p) and/or conditional formulas (G|F)[p] (P(G|F) = p).

Fuzzy Set Theory

A fuzzy set generalises a classical (crisp) set:

Classical set A ⊆ U is fully described by its characteristic function χ_A : U → {0, 1}.
A fuzzy set A is the pair (U, μ_A) where μ_A : U → [0, 1].
μ_A(x) = 0.6 means “x belongs to A to degree 0.6”.

Classical: Tall(x) is true or false. Fuzzy: Tall(180 cm) might have membership degree 0.7 — partially tall.

Common membership function shapes:

Triangular — rises linearly to 1, falls linearly back to 0
Trapezoidal — rises, plateaus at 1, falls
Gaussian — bell curve centered on the “ideal” value
Sigmoidal — s-shaped, used for “saturation” concepts

Standard fuzzy operations (Zadeh) ⭐

Operation	Formula	Note
Intersection A ∩ B	μ_{A∩B}(x) = min(μ_A(x), μ_B(x))	t-norm
Union A ∪ B	μ_{A∪B}(x) = max(μ_A(x), μ_B(x))	s-norm
Complement A^C	μ_{A^C}(x) = 1 − μ_A(x)	standard negation

Non-classical behaviour:

A ∩ A^C ≠ ∅: with μ_A(x) = 0.6 we get min(0.6, 0.4) = 0.4 > 0.
A ∪ A^C ≠ U: max(0.6, 0.4) = 0.6 < 1.

This is intentional — for vague statements “red” and “not red”, “x is red or not red” need not be fully true.

t-norms and s-norms (generalising AND/OR)

A t-norm t : [0,1]² → [0,1] satisfies:

Neutral element 1: t(x, 1) = x
Commutativity: t(x, y) = t(y, x)
Associativity: t(x, t(y, z)) = t(t(x, y), z)
Monotonicity: x ≤ x’ and y ≤ y’ ⇒ t(x, y) ≤ t(x’, y’)

An s-norm (t-conorm) s : [0,1]² → [0,1] satisfies the same axioms with neutral element 0 instead of 1.

Concrete t-norm / s-norm families

Name	t-norm	s-norm
Standard (Zadeh)	min(x, y)	max(x, y)
Algebraic (product, Goguen)	x·y	x + y − x·y
Quotient (Hamacher form)	xy / (x + y − xy)	(x + y − 2xy) / (1 − xy)
Łukasiewicz	max(0, x + y − 1)	min(1, x + y)
Drastic (smallest non-trivial)	1 if x = y = 1 else 0	—
Largest t-norm	min(x, y)	—

Useful inequality. For any t-norm t and any x, y ∈ [0,1]: t(x, y) ≤ min(x, y). For any s-norm s: s(x, y) ≥ max(x, y). So min/max are the “extremal” Zadeh choices.

⚠️ Exam trap: the t-norm min and the s-norm max are NOT distributive over each other in general — fuzzy logic doesn’t obey all classical Boolean laws. Specifically, the law of excluded middle fails (x ⊔ ¬x ≠ 1).

Visual — the four membership function shapes

🐍 Code anzeigen / ausblenden

# Pyodide / Obsidian Execute Code: install matplotlib first.
# In normal Python (terminal / Jupyter), delete the next 2 lines.
import micropip
await micropip.install("matplotlib")
 
import numpy as np
import matplotlib.pyplot as plt
 
x = np.linspace(0, 10, 500)
 
def triangular(x, a, b, c):
    return np.maximum(np.minimum((x - a) / (b - a), (c - x) / (c - b)), 0)
 
def trapezoidal(x, a, b, c, d):
    return np.maximum(np.minimum(np.minimum((x - a) / (b - a), 1), (d - x) / (d - c)), 0)
 
def gaussian(x, mean, sigma):
    return np.exp(-0.5 * ((x - mean) / sigma) ** 2)
 
def sigmoid(x, midpoint, slope):
    return 1 / (1 + np.exp(-slope * (x - midpoint)))
 
plt.figure(figsize=(10, 5))
plt.plot(x, triangular(x, 2, 5, 8),       lw=2, label='triangular (a=2, b=5, c=8)')
plt.plot(x, trapezoidal(x, 1, 3, 7, 9),   lw=2, label='trapezoidal (1, 3, 7, 9)')
plt.plot(x, gaussian(x, 5, 1.2),          lw=2, label='gaussian (μ=5, σ=1.2)')
plt.plot(x, sigmoid(x, 5, 1.5),           lw=2, label='sigmoid (mid=5, slope=1.5)')
plt.xlabel('x'); plt.ylabel('μ(x) — membership degree')
plt.title('Common fuzzy membership functions')
plt.legend(); plt.grid(alpha=0.3); plt.ylim(-0.05, 1.05); plt.tight_layout(); plt.show()

What to see:

Triangular — simplest, used when you only need “definitely belongs at this point, fades linearly elsewhere”
Trapezoidal — has a plateau of full membership; common for “tall person = 175–195 cm”
Gaussian — smooth, differentiable (matters for fuzzy neural networks)
Sigmoidal — saturation behaviour; for predicates like “old” (no upper bound)

Real fuzzy controllers usually use triangular or trapezoidal for speed; Gaussian when you need gradients (e.g. ANFIS — Adaptive Neuro-Fuzzy Inference Systems).

See the algebra concretely

🐍 Code anzeigen / ausblenden

import numpy as np
 
# Two fuzzy membership values
x, y = 0.6, 0.4
 
print(f"Fuzzy AND (t-norm):")
print(f"  min(x, y)         = {min(x, y):.3f}     (Zadeh)")
print(f"  x * y             = {x * y:.3f}     (Goguen / product)")
print(f"  max(0, x + y - 1) = {max(0, x + y - 1):.3f}     (Łukasiewicz)")
 
print(f"\nFuzzy OR (s-norm):")
print(f"  max(x, y)         = {max(x, y):.3f}     (Zadeh)")
print(f"  x + y - x*y       = {x + y - x*y:.3f}     (algebraic sum)")
print(f"  min(1, x + y)     = {min(1, x + y):.3f}     (Łukasiewicz)")
 
print(f"\nFuzzy NOT (complement):")
print(f"  1 - x             = {1 - x:.3f}")
 
print(f"\nNon-classical behavior:")
print(f"  μ(A ⊓ A^c) using min  = {min(x, 1 - x):.3f}    (classically should be 0)")
print(f"  μ(A ⊔ A^c) using max  = {max(x, 1 - x):.3f}    (classically should be 1)")

Output:

Fuzzy AND (t-norm):
  min(x, y)         = 0.400     (Zadeh)
  x * y             = 0.240     (Goguen / product)
  max(0, x + y - 1) = 0.000     (Łukasiewicz)

Fuzzy OR (s-norm):
  max(x, y)         = 0.600     (Zadeh)
  x + y - x*y       = 0.760     (algebraic sum)
  min(1, x + y)     = 1.000     (Łukasiewicz)

Non-classical behavior:
  μ(A ⊓ A^c) using min  = 0.400    (classically should be 0)
  μ(A ⊔ A^c) using max  = 0.600    (classically should be 1)

→ Excluded middle (A ∨ ¬A = 1) and non-contradiction (A ∧ ¬A = 0) fail in fuzzy logic. This is by design — vague predicates allow partial overlap with their negation.

Visualisations (Python)

These four figures make the fuzzy logic concepts above concrete. Each toggle contains a self-contained Pyodide block (matplotlib installed on the fly) — open them in Obsidian with the Execute Code plugin to render.

🐍 Figure 1 — Membership functions for "cold / warm / hot" temperature

import micropip
await micropip.install("matplotlib")
 
import numpy as np
import matplotlib.pyplot as plt
 
T = np.linspace(0, 40, 500)
 
def trapezoid(x, a, b, c, d):
    # rises a→b, plateau b→c, falls c→d
    left  = np.clip((x - a) / max(b - a, 1e-9), 0, 1)
    right = np.clip((d - x) / max(d - c, 1e-9), 0, 1)
    return np.minimum(left, right)
 
def triangular(x, a, b, c):
    return np.maximum(np.minimum((x - a) / (b - a), (c - x) / (c - b)), 0)
 
def gaussian(x, mean, sigma):
    return np.exp(-0.5 * ((x - mean) / sigma) ** 2)
 
mu_cold = trapezoid(T, -5, 0, 10, 16)        # cold: definitely cold up to 10 °C
mu_warm = triangular(T, 12, 22, 30)          # warm: triangular peak at 22 °C
mu_hot  = gaussian(T, 32, 4)                 # hot: bell centred on 32 °C
 
fig, ax = plt.subplots(figsize=(10, 5))
ax.plot(T, mu_cold, lw=2.5, color="#3498db", label="μ_cold (trapezoidal)")
ax.plot(T, mu_warm, lw=2.5, color="#f39c12", label="μ_warm (triangular)")
ax.plot(T, mu_hot,  lw=2.5, color="#e74c3c", label="μ_hot (gaussian)")
 
# Annotate the overlap zone — "warm" leaks into "cold" and "hot"
ax.axvspan(12, 16, color="#3498db", alpha=0.08)
ax.axvspan(26, 30, color="#e74c3c", alpha=0.08)
ax.text(14, 0.55, "cold ∩ warm\n(overlap)", ha="center", fontsize=8, color="#555")
ax.text(28, 0.55, "warm ∩ hot\n(overlap)", ha="center", fontsize=8, color="#555")
 
ax.set_xlabel("Temperature (°C)")
ax.set_ylabel("μ(T) — degree of membership")
ax.set_title('Three fuzzy concepts over one variable — a vague "warm" overlaps its neighbours')
ax.legend(loc="upper right")
ax.grid(alpha=0.3); ax.set_ylim(-0.05, 1.08)
plt.tight_layout(); plt.show()

What to see. Each curve is one vague concept. Around T = 14 °C the temperature is both somewhat cold (μ_cold ≈ 0.3) and somewhat warm (μ_warm ≈ 0.2). That overlap is the whole point of fuzzy logic — classical sets would have to draw a hard line at, say, 15 °C and call 14.99 °C “cold” but 15.01 °C “warm”. Notice the three shapes (trapezoidal / triangular / gaussian) all encode the same kind of vague predicate with different boundary behaviour: trapezoidal has a flat plateau of full membership, triangular peaks at one point, gaussian is smooth and differentiable.

🐍 Figure 2 — Min vs algebraic product: two valid t-norms that disagree

import micropip
await micropip.install("matplotlib")
 
import numpy as np
import matplotlib.pyplot as plt
 
x = np.linspace(0, 10, 500)
 
def triangular(x, a, b, c):
    return np.maximum(np.minimum((x - a) / (b - a), (c - x) / (c - b)), 0)
 
mu_A = triangular(x, 1, 4, 7)
mu_B = triangular(x, 3, 6, 9)
 
intersection_min  = np.minimum(mu_A, mu_B)      # Zadeh t-norm
intersection_prod = mu_A * mu_B                  # Goguen / algebraic t-norm
 
fig, ax = plt.subplots(figsize=(10, 5))
ax.plot(x, mu_A, lw=1.5, color="#888", ls="--", label="μ_A(x)")
ax.plot(x, mu_B, lw=1.5, color="#444", ls="--", label="μ_B(x)")
ax.plot(x, intersection_min,  lw=2.8, color="#27ae60",
        label="μ_{A∩B} = min(μ_A, μ_B)  (Zadeh)")
ax.plot(x, intersection_prod, lw=2.8, color="#9b59b6",
        label="μ_{A∩B} = μ_A · μ_B      (algebraic)")
 
# Fill the gap between the two t-norms — that's where they disagree
ax.fill_between(x, intersection_prod, intersection_min,
                where=intersection_min > intersection_prod,
                color="#e67e22", alpha=0.18,
                label="gap: min ≥ product (always)")
 
ax.set_xlabel("x")
ax.set_ylabel("μ(x)")
ax.set_title("Two valid t-norms on the same μ_A, μ_B — they disagree everywhere except at 0 and 1")
ax.legend(loc="upper right", fontsize=9)
ax.grid(alpha=0.3); ax.set_ylim(-0.02, 1.05)
plt.tight_layout(); plt.show()

What to see. Both green (min) and purple (product) satisfy all four t-norm axioms — neutral element 1, commutativity, associativity, monotonicity — yet they give different answers for “A AND B”. The orange band is the gap: min ≥ product for all x, y ∈ [0,1] (general inequality t(x,y) ≤ min(x,y) for any t-norm). At the extremes (μ = 0 or μ = 1) both agree, matching classical AND. The choice of t-norm is a design decision — Zadeh’s min is the textbook default and preserves more “non-classical” structure (e.g. idempotence: min(x,x) = x), while the product is differentiable and behaves more like probabilities of independent events.

🐍 Figure 3 — A ∨ ¬A: fuzzy dips below 1, classical never does

import micropip
await micropip.install("matplotlib")
 
import numpy as np
import matplotlib.pyplot as plt
 
x = np.linspace(0, 10, 500)
 
def triangular(x, a, b, c):
    return np.maximum(np.minimum((x - a) / (b - a), (c - x) / (c - b)), 0)
 
mu_A      = triangular(x, 2, 5, 8)
mu_notA   = 1 - mu_A
excluded_middle_fuzzy = np.maximum(mu_A, mu_notA)   # Zadeh OR
excluded_middle_classical = np.ones_like(x)         # always 1
 
fig, ax = plt.subplots(figsize=(10, 5))
ax.plot(x, mu_A,    lw=1.5, color="#3498db", ls="--", label="μ_A(x)")
ax.plot(x, mu_notA, lw=1.5, color="#e74c3c", ls="--", label="μ_¬A(x) = 1 − μ_A(x)")
ax.plot(x, excluded_middle_classical, lw=2.5, color="#7f8c8d",
        label="classical: A ∨ ¬A = 1 always")
ax.plot(x, excluded_middle_fuzzy, lw=3, color="#27ae60",
        label="fuzzy (Zadeh): max(μ_A, μ_¬A)")
 
# Highlight the dip — where excluded middle fails in fuzzy logic
dip_mask = excluded_middle_fuzzy < 0.99
ax.fill_between(x, excluded_middle_fuzzy, 1, where=dip_mask,
                color="#f1c40f", alpha=0.35,
                label="dip: fuzzy < 1 (excluded middle fails)")
 
# Annotate the minimum
min_idx = np.argmin(excluded_middle_fuzzy)
ax.annotate(f"min = {excluded_middle_fuzzy[min_idx]:.2f}\nat μ_A = 0.5",
            xy=(x[min_idx], excluded_middle_fuzzy[min_idx]),
            xytext=(x[min_idx] + 1.2, 0.35),
            fontsize=9, color="#444",
            arrowprops=dict(arrowstyle="->", color="#444"))
 
ax.set_xlabel("x")
ax.set_ylabel("truth value")
ax.set_title("Law of excluded middle in fuzzy vs classical logic")
ax.legend(loc="lower center", fontsize=9)
ax.grid(alpha=0.3); ax.set_ylim(-0.02, 1.1)
plt.tight_layout(); plt.show()

What to see. In classical logic A ∨ ¬A is the textbook tautology — always 1, no exceptions (grey line). In Zadeh fuzzy logic the same formula becomes max(μ_A, 1 − μ_A), which dips down to 0.5 exactly where μ_A = 0.5 — the most ambiguous point. The yellow region is the “vagueness gap”: every point where the fuzzy interpretation falls short of the classical tautology. This isn’t a bug — it’s the defining feature that lets fuzzy logic talk about predicates with no sharp boundary (“reddish”, “tall”, “warm”). The flip side: tautologies are no longer guaranteed 1, which is why probabilistic logic (P(T) = 1 always) is closer to classical logic than fuzzy logic is.

🐍 Figure 4 — Defuzzification: centroid vs max-membership

import micropip
await micropip.install("matplotlib")
 
import numpy as np
import matplotlib.pyplot as plt
 
# Imagine the aggregated output of a fuzzy controller (e.g. "fan speed")
z = np.linspace(0, 10, 1000)
 
def triangular(x, a, b, c):
    return np.maximum(np.minimum((x - a) / (b - a), (c - x) / (c - b)), 0)
 
# Bimodal aggregated set — two clipped rules
mu_low  = np.minimum(triangular(z, 1, 3, 5), 0.6)   # rule 1 clipped at 0.6
mu_high = np.minimum(triangular(z, 6, 8, 10), 0.9)  # rule 2 clipped at 0.9
mu_agg  = np.maximum(mu_low, mu_high)               # Zadeh union
 
# Centroid (centre-of-gravity)
centroid = np.trapz(z * mu_agg, z) / np.trapz(mu_agg, z)
 
# Max-membership (mean of maxima)
max_val = mu_agg.max()
max_mask = mu_agg >= max_val - 1e-9
mom = z[max_mask].mean()
 
fig, ax = plt.subplots(figsize=(10, 5))
ax.fill_between(z, 0, mu_agg, color="#3498db", alpha=0.25,
                label="μ_agg(z) — aggregated output")
ax.plot(z, mu_agg, lw=2, color="#2c3e50")
 
ax.axvline(centroid, color="#27ae60", lw=2.5, ls="-",
           label=f"centroid z* = {centroid:.2f}")
ax.axvline(mom, color="#e74c3c", lw=2.5, ls="--",
           label=f"max-membership z* = {mom:.2f}")
 
ax.annotate("centroid pulled toward\nthe heavier left lobe",
            xy=(centroid, 0.15), xytext=(centroid - 2.5, 0.45),
            fontsize=8, color="#27ae60",
            arrowprops=dict(arrowstyle="->", color="#27ae60"))
ax.annotate("max-membership\nsees only the peak",
            xy=(mom, max_val), xytext=(mom + 0.3, 0.55),
            fontsize=8, color="#e74c3c",
            arrowprops=dict(arrowstyle="->", color="#e74c3c"))
 
ax.set_xlabel("z (crisp output domain — e.g. fan speed %)")
ax.set_ylabel("μ_agg(z)")
ax.set_title("Defuzzification — centroid and max-membership give different crisp values")
ax.legend(loc="upper left", fontsize=9)
ax.grid(alpha=0.3); ax.set_ylim(-0.02, 1.05)
plt.tight_layout(); plt.show()

What to see. The blue fuzzy set is the aggregated output of two clipped fuzzy rules — a typical bimodal shape after Mamdani inference. Centroid (green) takes the centre of gravity of the entire shape, sitting somewhere between the two lobes — it “knows” both rules fired. Max-membership (red) returns the average of the x-values where μ reaches its maximum — it ignores the smaller lobe entirely and reports a point near the right peak. Both are valid defuzzification methods; they can disagree by several units on the same set. Real fuzzy controllers usually pick centroid for smoothness, max-membership for speed and intuitive “winner-take-all” semantics.

Probabilistic Logic

Propositional atoms are mapped to events in a probability space (Ω, Σ, P).

Kolmogorov’s axioms

Non-negativity: P(E) ≥ 0 for every event E.
Normalisation: P(Ω) = 1.
Finite additivity: for disjoint events A, B: P(A ∪ B) = P(A) + P(B). (σ-additivity for countable disjoint families.)

Derived properties

P(∅) = 0
P(A^C) = 1 − P(A)
A ⊆ B ⇒ P(A) ≤ P(B) (monotonicity)
Inclusion–exclusion: P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
Tautology T: P(T) = P(E_T) = P(Ω) = 1
Contradiction C: P(C) = P(E_C) = P(∅) = 0
For fuzzy logics we do not have this property — A ∨ ¬A need not be 1 (slide 70).

Conditional probability

P(B|A) = P(B ∩ A) / P(A) (requires P(A) > 0)

Same definition in probabilistic logic: P(G|F) = P(G ∧ F) / P(F).

Probabilistic knowledge bases

Two formula types:

F : p — unconditional. P satisfies it iff P(F) = p.
(G | F)[p] — conditional. P satisfies it iff P(G | F) = p (requires P(F) > 0).

A KB is satisfiable if at least one probability measure satisfies every formula. Reasoning means deriving bounds like P(B) ≤ 0.5 from the KB (see worked example below).

Formulas & Notation

Symbol	Meaning
μ_A(x)	Degree of membership of x in fuzzy set A
t(x, y)	t-norm (generalises AND)
s(x, y)	s-norm / t-conorm (generalises OR)
(Ω, Σ, P)	Probability space
P(E)	Probability of event E
P(B\|A)	Conditional probability of B given A
F : p	Probabilistic formula — P(F) = p
(G\|F)[p]	Conditional formula — P(G\|F) = p
T, C	Tautology, contradiction

Common Exam Traps ⚠️

Vagueness ≠ uncertainty. Vagueness = predicate is partial (degree); uncertainty = predicate is sharp, we just don’t know which side x falls on.
Fuzzy: A ∨ ¬A is NOT necessarily 1. With μ_A(x) = 0.6: I(red ∨ ¬red) = max(0.6, 0.4) = 0.6. Intentional for vague predicates.
Probabilistic: tautologies always get P = 1. Contradictions always P = 0. This is one of the main differences from fuzzy logic (Session 08 slide 70).
min and max are one valid Zadeh choice for t-norm / s-norm — not the only ones. Algebraic (x·y, x+y−xy) is equally valid.
Algebraic t-norm ≠ min. x·y ≤ min(x, y), strict when both < 1.
Heap paradox motivates fuzzy logic — classical logic + induction produces a contradiction.
Conditional KB satisfaction: P satisfies (G|F)[p] iff P(G|F) = p, which requires P(F) > 0 (otherwise undefined).
t-norm and s-norm have different neutral elements — 1 for t-norm, 0 for s-norm. Don’t mix up.
Centroid vs. max-membership defuzzification can disagree on the same aggregated fuzzy set.
⚠️ Dempster-Shafer is NOT in Session 08 slides — it is sometimes asked about externally but is not taught in this lecture (see the supplementary section below).

⚠️ Fuzzy vs. Probability — the big distinction

This is the #1 exam trap on Vagueness.

	Fuzzy Logic	Probability
What it captures	Vagueness — predicate has no sharp boundary	Uncertainty — fact is crisp but we don’t know
Statement	”The ball is reddish” (μ_red(ball) = 0.7)	“The ball is red, with probability 0.7”
Operations	`min` / `max` for ∧ / ∨	`·` / `+` for independent events
Complement	μ_A^c(x) = 1 − μ_A(x)	P(¬A) = 1 − P(A)
Excluded middle	A ⊓ A^c ≠ ∅ (can have nonzero μ)	P(A ∩ ¬A) = 0 always
Universe	A ⊔ A^c ≠ U (in general)	P(A ∪ ¬A) = 1 always

Concrete example: “Bob is tall.”

Fuzzy: Bob is 180 cm; he’s tall to degree 0.7. There is no fact of the matter about whether he’s “really” tall.
Probabilistic: I don’t know Bob’s height; I estimate a 70% chance he’s tall (where “tall” is a crisp threshold).

If you ever confuse these on the exam, you lose the entire question. Memorize the example.

Worked examples

Heap paradox (motivates fuzzy logic)

Premise 1: “A pile of 1 000 000 grains of sand is a heap.”
Premise 2: “If n grains form a heap, then n − 1 grains also form a heap.”
By induction → “0 grains form a heap.” Contradiction.

Fuzzy fix. “Is a heap” is a fuzzy predicate with μ_heap(n) decreasing smoothly from 1 (large n) to 0 (small n). The induction step no longer preserves full truth: μ_heap(n − 1) ≈ μ_heap(n), but the tiny loss compounds — and that is fine in fuzzy logic.

Conditional KB satisfaction (slide 76)

KB: Bird : 0.8, (Penguin | Bird)[0.2], (Flies | Bird)[0.7], (Bird | Penguin)[1], (Flies | Penguin)[0].

Derive P(Penguin) ≥ 0.16:

P(Penguin) = P((Bird ∨ ¬Bird) ∧ Penguin)                (Tautology)
           = P((Bird ∧ Penguin) ∨ (¬Bird ∧ Penguin))   (Distributivity)
           = P(Bird ∧ Penguin) + P(¬Bird ∧ Penguin)     (Additivity)
           = P(Penguin | Bird) · P(Bird)                (Conditional)
             + P(Penguin | ¬Bird) · P(¬Bird)
           ≥ P(Penguin | Bird) · P(Bird)                (Non-negativity)
           = 0.2 · 0.8 = 0.16

This is the canonical pattern: split with a tautology A ∨ ¬A, apply distributivity + additivity, then bound with the conditional formula.

Quick Comparison Table

	Classical Logic	Fuzzy Logic	Probabilistic Logic
Truth values	{0, 1}	[0, 1]	[0, 1]
Propositions	True or false	True to a degree	True or false
Truth values represent	True or False	Degree of membership	Probability of truth
A ∨ ¬A	Always 1	Can be < 1	Always 1
A ∧ ¬A	Always 0	Can be > 0	Always 0
Addresses	—	Vagueness	Uncertainty

(“Truth values represent” row matches Session 08 slide 75 “Summary” verbatim.)

ALGORITHMS / TECHNIQUES (full reference) ⭐

Procedural techniques. Items 1–3 follow directly from the Session 08 slides; item 4 (Dempster-Shafer) is not in Session 08 and is included here as supplementary external material only — it is not exam-relevant for the lecture content.

1. Fuzzy inference (rule evaluation)

Setting. A fuzzy rule IF X is A AND Y is B THEN Z is C plus crisp inputs x₀, y₀.

Step 1 — Fuzzification.
    α_A = μ_A(x₀)         # input matches premise A to degree α_A
    α_B = μ_B(y₀)
 
Step 2 — Aggregate premises with a t-norm.
    α = t(α_A, α_B)       # Zadeh: α = min(α_A, α_B)
                          # Algebraic: α = α_A · α_B
 
Step 3 — Apply to consequent (implication / clipping).
    μ_C'(z) = min(α, μ_C(z))      # Mamdani-style: clip C at height α
 
Step 4 — Combine multiple rules with an s-norm.
    μ_aggregate(z) = max_r μ_C_r'(z)     # union of clipped consequents
                                         # (Zadeh) or use a different s-norm
 
Step 5 — Defuzzify (see #2) to obtain a crisp output z*.

Choice of t-norm / s-norm decides the inference style: Zadeh min/max is the textbook default; algebraic product / probabilistic sum is common in control applications.

2. Defuzzification

Turning the aggregated output fuzzy set μ_agg : Z → [0,1] back into a single crisp value z*.

2a. Centroid (centre-of-gravity)

z* = ( ∫ z · μ_agg(z) dz ) / ( ∫ μ_agg(z) dz )

Discrete form: z* = Σ_i z_i · μ_agg(z_i) / Σ_i μ_agg(z_i).

Pros. Smooth, takes the whole shape into account.
Cons. Expensive; can yield a value with low membership if μ_agg is bimodal.

2b. Max-membership (mean of maxima)

M = { z | μ_agg(z) = max_z' μ_agg(z') }
z* = mean(M)                # or smallest / largest element

Pros. Cheap, intuitive.
Cons. Ignores the shape — different aggregate sets can give the same answer.

2c. Other defuzzification methods

Method	How it works	When it matters
Centroid (CoG)	Center of gravity of the output membership function	Default in most control systems
Mean of Maxima (MoM)	Average of the x-values where μ is maximal	Faster, less smooth
First / Last Maximum	First (or last) x where μ reaches max	Used when ties matter
Smallest of Maximum	Smallest x at the max value	Conservative controllers

⚠️ Centroid and max-membership can disagree on the same aggregated set — common exam question. Centroid is pulled by mass; MoM sees only the peak (see Q123–125 in the questions hub).

3. Bayesian update via conditional probability

Given:  prior P(H), likelihood P(E | H), P(E | ¬H)
Goal:   posterior P(H | E) after observing E
 
P(E)     = P(E | H) · P(H) + P(E | ¬H) · P(¬H)        # total probability
P(H | E) = P(E | H) · P(H) / P(E)                     # Bayes' rule

Used in probabilistic logic to update belief in hypothesis H given evidence E. Special case of the conditional-probability definition P(B|A) = P(B ∩ A) / P(A).

4. Dempster-Shafer combination rule (supplementary) ⚠️

⚠️ Not in Session 08 slides — included only for completeness because external questions (e.g. Q56–63) reference it. Do not assume this material is exam-relevant for Methods of AI unless the lecturer explicitly added it.

Setting. Frame of discernment Θ. Two mass functions m₁, m₂ : 2^Θ → [0,1] with Σ_A m_i(A) = 1 and m_i(∅) = 0. Each represents one independent source of evidence.

Combined mass via Dempster’s rule:

K = Σ_{B ∩ C = ∅}  m₁(B) · m₂(C)              # conflict mass
 
m₁₂(A) = ( 1 / (1 − K) ) · Σ_{B ∩ C = A}  m₁(B) · m₂(C)   for A ≠ ∅
m₁₂(∅) = 0

Belief & Plausibility:

Bel(A) = Σ_{B ⊆ A} m(B)
Pl(A)  = Σ_{B ∩ A ≠ ∅} m(B)
        = 1 − Bel(¬A)

Use. Combining evidence from independent sources when you cannot or do not want to commit to a single probability distribution. Reduces to Bayesian update when all masses are on singletons.

Where Fuzzy Logic is used today

Industrial control systems — Sendai subway (Japan, 1987) was the first famous deployment; smooth braking/acceleration via fuzzy controller. Still in production.
Consumer electronics — washing machines (load + dirt sensing), rice cookers, camera autofocus, air conditioners. Hugely popular in Japan.
Anti-lock braking (ABS) in cars — Bosch and others use fuzzy-logic-style controllers for traction control.
Camera image processing — exposure, white balance — fuzzy classifiers for scene type.
Medical decision support — diagnosis systems using fuzzy rules for symptoms (e.g. “high fever AND moderate pain”).
Stock trading systems — fuzzy rules for technical analysis indicators.

Where Fuzzy Logic was replaced — and by what

Domain	Was fuzzy, now …	Why
Pattern recognition / classification	Neural networks	NNs learn membership functions from data instead of needing them hand-designed
Control of complex nonlinear systems	Reinforcement Learning + neural controllers	Model-free RL can learn controllers without needing to specify rules
Natural language understanding	Transformer LLMs	LLMs implicitly handle vagueness without explicit fuzzy formalism
Decision support with uncertainty	Bayesian networks, probabilistic graphical models	When uncertainty (not vagueness) dominates, probability is the right tool

Where fuzzy logic still stands: when you need an interpretable controller (rules can be read by humans), when training data is scarce, and when the system has to handle genuine vagueness rather than uncertainty.

Brain Online

Explorer

Fuzzy Logic

Vagueness & Uncertainty — Fuzzy + Probabilistic Logic

Table of contents

Core Ideas

Glossary — important vocabulary ⭐

Fuzzy Set Theory

Standard fuzzy operations (Zadeh) ⭐

t-norms and s-norms (generalising AND/OR)

Concrete t-norm / s-norm families

Visual — the four membership function shapes

See the algebra concretely

Visualisations (Python)

Probabilistic Logic

Kolmogorov’s axioms

Derived properties

Conditional probability

Probabilistic knowledge bases

Formulas & Notation

Common Exam Traps ⚠️

⚠️ Fuzzy vs. Probability — the big distinction

Worked examples

Heap paradox (motivates fuzzy logic)

Conditional KB satisfaction (slide 76)

Quick Comparison Table

ALGORITHMS / TECHNIQUES (full reference) ⭐

1. Fuzzy inference (rule evaluation)

2. Defuzzification

2a. Centroid (centre-of-gravity)

2b. Max-membership (mean of maxima)

2c. Other defuzzification methods

3. Bayesian update via conditional probability

4. Dempster-Shafer combination rule (supplementary) ⚠️

Where Fuzzy Logic is used today

Where Fuzzy Logic was replaced — and by what

See also

Backlinks

Mika

✨ Features

⚙️ Einstellungen

📚 Chat-Verlauf

📖 Citation Manager

✍️ Writing Assistant

Inhaltsverzeichnis