API Reference

Examples

>>> ks = space_from_prerequisites(
...     ["add", "sub", "mul"],
...     [("add", "sub"), ("sub", "mul")],
... )
>>> ks.n_states
4
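The count of 4 follows from downward closure on the chain add → sub → mul. The following is a minimal standalone sketch in plain Python, not the library's implementation: `downward_closed_states` is a hypothetical helper, and it assumes a pair (a, b) means "a is a prerequisite of b", as in SurmiseRelation.

```python
from itertools import combinations


def downward_closed_states(items, prerequisites):
    """Enumerate subsets in which every item's direct prerequisites are present."""
    states = []
    for size in range(len(items) + 1):
        for subset in combinations(items, size):
            state = frozenset(subset)
            # (a, b) means "a is a prerequisite of b": b in state requires a in state.
            if all(a in state for a, b in prerequisites if b in state):
                states.append(state)
    return states


states = downward_closed_states(
    ["add", "sub", "mul"],
    [("add", "sub"), ("sub", "mul")],
)
print(len(states))  # 4: {}, {add}, {add, sub}, {add, sub, mul}
```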

knowledgespaces.api.structure_from_skill_map(skill_map, skill_prerequisites=None)

Derive a knowledge structure from a skill map (CbKST).

Note: with conjunctive skill maps (items requiring multiple skills) the result is not necessarily union-closed and therefore may not be a knowledge space. Use is_knowledge_space to check.

Parameters:

skill_map (dict[str, list[str]]): For each item, the list of skills it requires. Example: {"q1": ["s_add"], "q2": ["s_add", "s_carry"]}
skill_prerequisites (list[tuple[str, str]] | None): Pairs (a, b) meaning "skill a is a prerequisite of skill b". If None, skills are treated as independent.

Returns:

KnowledgeStructure: The derived knowledge structure (may or may not be a space).

Return type: KnowledgeStructure

Examples

>>> ks = structure_from_skill_map(
...     {"q1": ["s1"], "q2": ["s1", "s2"]},
...     [("s1", "s2")],
... )
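To see why a conjunctive skill map may fail to produce a knowledge space, here is a small standalone sketch. It does not call the library; `item_states` is a hypothetical helper, and skills are assumed independent (no skill prerequisites). Each subset of skills is mapped to the items it solves, and the resulting family is tested for one missing union.

```python
from itertools import combinations


def item_states(skill_map, skills):
    """Map every subset of skills to the set of items it solves (conjunctive model)."""
    states = set()
    for size in range(len(skills) + 1):
        for subset in combinations(sorted(skills), size):
            mastered = set(subset)
            # An item is solved only if ALL of its required skills are mastered.
            states.add(frozenset(
                item for item, required in skill_map.items()
                if set(required) <= mastered
            ))
    return states


skill_map = {"q1": ["s1"], "q2": ["s2"], "q3": ["s1", "s2"]}
states = item_states(skill_map, {"s1", "s2"})
union = frozenset({"q1"}) | frozenset({"q2"})
print(union in states)  # False: {q1, q2} is not a state, so the family is not union-closed
```

Mastering both skills unlocks q3 as well, so the state {q1, q2} never occurs even though {q1} and {q2} both do.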

knowledgespaces.api.space_from_surmise_function(clauses)

Build a knowledge space from a surmise function (multiple clauses).

Each item maps to one or more clauses — alternative sets of prerequisites. This generalises space_from_prerequisites, which only allows one prerequisite set per item (ordinal case).

Parameters:

clauses (dict[str, list[list[str]]]): For each item, a list of clauses. Each clause is a list of items (must include the item itself).

Example:

{
    "a": [["a"]],
    "b": [["b", "d"], ["a", "b", "c"]],
    "d": [["b", "d"]],
    ...
}

Returns:

KnowledgeStructure: The derived knowledge space (union-closed).

Return type: KnowledgeStructure

Examples

>>> ks = space_from_surmise_function({
...     "a": [["a"]],
...     "b": [["a", "b"], ["b", "c"]],
...     "c": [["c"]],
... })

knowledgespaces.api.space_from_skill_map(skill_map, skill_prerequisites=None)

Deprecated: use structure_from_skill_map() instead.

This function was renamed because with conjunctive skill maps the result is not necessarily a knowledge space.

Parameters:

skill_map (dict[str, list[str]])
skill_prerequisites (list[tuple[str, str]] | None)

Return type: KnowledgeStructure

knowledgespaces.api.assess(structure, responses, beta=0.1, eta=0.2, prior=None)

Assess a student’s knowledge state from their responses.

Parameters:

structure (KnowledgeStructure): The knowledge structure.

responses (dict[str, bool] | list[tuple[str, bool]]): Observed responses. Two formats are accepted:

dict: one observation per item, e.g. {"add": True, "sub": False}
list of tuples: multiple observations allowed per item (from different instances), e.g. [("add", True), ("add", True), ("sub", False)]

In the list format, the same item can appear multiple times. Each observation updates the posterior independently (local independence assumption).

beta (float | dict[str, float]): Slip parameter (scalar or per-item).

eta (float | dict[str, float]): Guess parameter (scalar or per-item).

prior (dict[frozenset[str], float] | None): Optional prior over states (e.g. from a previous EM fit). If None, a uniform prior is used.

Returns:

dict: Keys: "state" (most likely state as a set), "probability", "mastery" (per-item mastery probabilities), "inner_fringe", "outer_fringe".

Return type: dict

Examples

One observation per item:

result = assess(structure, {"add": True, "sub": True, "mul": False})

Multiple instances of the same item:

result = assess(structure, [("add", True), ("add", True), ("sub", False)])
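The posterior update behind these calls can be illustrated with a standalone sketch. It assumes the standard BLIM reading of the parameters (beta is the probability of a wrong answer on a mastered item, eta the probability of a correct answer on an unmastered one) and a uniform prior; `blim_posterior` is a hypothetical helper, not the library's code.

```python
def blim_posterior(states, responses, beta=0.1, eta=0.2):
    """Posterior over states from (item, correct) observations, uniform prior."""
    posterior = {state: 1.0 / len(states) for state in states}
    for item, correct in responses:
        for state in states:
            if item in state:
                # Mastered item: a wrong answer happens only through a slip (beta).
                likelihood = 1.0 - beta if correct else beta
            else:
                # Unmastered item: a right answer happens only through a guess (eta).
                likelihood = eta if correct else 1.0 - eta
            posterior[state] *= likelihood
        # Renormalise after each observation (local independence).
        total = sum(posterior.values())
        posterior = {state: p / total for state, p in posterior.items()}
    return posterior


states = [
    frozenset(),
    frozenset({"add"}),
    frozenset({"add", "sub"}),
    frozenset({"add", "sub", "mul"}),
]
posterior = blim_posterior(states, [("add", True), ("sub", True), ("mul", False)])
best = max(posterior, key=posterior.get)
print(sorted(best), round(posterior[best], 3))  # ['add', 'sub'] 0.716
```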

knowledgespaces.api.assess_from_fit(structure, fit, responses)

Assess using estimated parameters from fit_blim().

Equivalent to assess(structure, responses, beta=..., eta=..., prior=...) with values taken from the fit result.

Parameters:

structure (KnowledgeStructure): The knowledge structure (the same one used for fitting).
fit (dict): Output of fit_blim().
responses (dict[str, bool] | list[tuple[str, bool]]): Observed responses (same format as assess()).

Return type: dict

knowledgespaces.api.adaptive_assess(structure, ask_fn, *, instances=None, beta=0.1, eta=0.2, prior=None, threshold=0.85, max_questions=25)

Run a complete adaptive assessment.

Parameters:

structure (KnowledgeStructure): The knowledge structure.
ask_fn (Callable[[str], bool]): If instances is None: takes an item name (str) and returns bool. If instances is provided: takes an instance ID (str) and returns bool.
instances (dict[str, list[str]] | None): Optional mapping {item: [instance_id, …]} for multi-instance assessment. When provided, the engine selects the best un-asked instance (not item) and passes its ID to ask_fn. Different instances of the same item are treated as equivalent by the BLIM.
beta, eta (float | dict[str, float]): BLIM parameters (scalar or per-item).
prior (dict[frozenset[str], float] | None): Optional prior over states. If None, uniform.
threshold (float): Stop when the most likely state reaches this probability.
max_questions (int): Maximum number of questions to ask.

Returns:

dict: Keys: "state", "probability", "mastery", "inner_fringe", "outer_fringe", "questions_asked" (int), "history" (list of (instance_or_item, item, response) tuples).

Return type: dict

Examples

Simple (one instance per item):

result = adaptive_assess(structure, lambda item: item in {"add", "sub"})

With multiple instances:

result = adaptive_assess(
    structure,
    lambda inst_id: ask_student(inst_id),
    instances={
        "addition": ["3+2", "7+5", "12+9"],
        "subtraction": ["8-3", "15-7"],
    },
)
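The control flow of an adaptive assessment (stop on threshold or max_questions, otherwise ask and update) can be sketched without the library. The item-selection rule is not documented here, so the half-split heuristic below is an assumption, as are the helper names `update`, `mastery` and `adaptive_sketch`; each item is asked at most once.

```python
def update(posterior, item, correct, beta=0.1, eta=0.2):
    """One BLIM update of the posterior for a single observed response."""
    new = {}
    for state, p in posterior.items():
        if item in state:
            new[state] = p * ((1 - beta) if correct else beta)
        else:
            new[state] = p * (eta if correct else (1 - eta))
    total = sum(new.values())
    return {state: p / total for state, p in new.items()}


def mastery(posterior, item):
    """Probability that the item belongs to the student's state."""
    return sum(p for state, p in posterior.items() if item in state)


def adaptive_sketch(states, items, ask_fn, threshold=0.85, max_questions=25):
    posterior = {state: 1 / len(states) for state in states}
    remaining = sorted(items)
    history = []
    while remaining and len(history) < max_questions:
        if max(posterior.values()) >= threshold:
            break
        # Half-split heuristic (an assumption): ask the item whose mastery
        # probability is closest to 0.5.
        item = min(remaining, key=lambda q: abs(mastery(posterior, q) - 0.5))
        remaining.remove(item)
        response = ask_fn(item)
        history.append((item, response))
        posterior = update(posterior, item, response)
    best = max(posterior, key=posterior.get)
    return {"state": best, "probability": posterior[best], "history": history}


states = [
    frozenset(),
    frozenset({"add"}),
    frozenset({"add", "sub"}),
    frozenset({"add", "sub", "mul"}),
]
# Simulated student who answers correctly exactly on "add" and "sub".
result = adaptive_sketch(states, ["add", "sub", "mul"], lambda item: item in {"add", "sub"})
print(sorted(result["state"]), len(result["history"]))
```

With only three items and the default error rates, the sketch runs out of items before the 0.85 threshold is reached, which is one reason multiple instances per item can be useful.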

knowledgespaces.api.adaptive_assess_from_fit(structure, fit, ask_fn, *, instances=None, threshold=0.85, max_questions=25)

Run adaptive assessment using parameters from fit_blim().

Equivalent to adaptive_assess(structure, ask_fn, beta=..., eta=..., prior=...) with values taken from the fit result.

Parameters:

structure (KnowledgeStructure): The knowledge structure (the same one used for fitting).
fit (dict): Output of fit_blim().
ask_fn (Callable[[str], bool]): Question function (same as adaptive_assess()).
instances, threshold, max_questions: Passed through to adaptive_assess().

Return type: dict

knowledgespaces.api.fit_blim(structure, items, responses, counts=None)

Estimate BLIM parameters from response data via EM.

Parameters:

structure (KnowledgeStructure): The knowledge structure.
items (list[str]): Item names (column labels for the response matrix).
responses (ndarray | list[list[int]]): Binary response matrix, shape (n_patterns, n_items).
counts (ndarray | list[int] | None): Optional frequency of each pattern.

Returns:

dict: Keys: "beta" (dict item → float), "eta" (dict item → float), "pi" (dict frozenset → float, state prior probabilities), "states" (list of frozensets, ordered as in pi), "log_likelihood" (float), "converged" (bool), "n_iterations" (int), "gof" (dict with G2, df, p_value, npar, AIC, BIC).

Return type: dict

Warning

Emits knowledgespaces.estimation.ConvergenceWarning if EM exhausts max_iter without meeting the convergence tolerance. The returned estimate is still usable but may be a local optimum or require more iterations / multiple restarts.

Examples

>>> result = fit_blim(structure, ["a","b","c"], [[1,1,1],[1,1,0],[1,0,0],[0,0,0]])
>>> result["converged"]
True
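When raw data has one row per respondent, it can be collapsed into unique patterns plus frequencies before fitting. The sketch below uses only the standard library and assumes that counts[i] is meant to be the number of respondents who produced responses[i]; the variable names are illustrative.

```python
from collections import Counter

# One row per respondent, columns ordered as in `items`.
raw = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
    [0, 0, 0],
]

# Counter keeps first-seen order, so patterns and counts stay aligned.
tally = Counter(tuple(row) for row in raw)
patterns = [list(pattern) for pattern in tally]
counts = [tally[tuple(pattern)] for pattern in patterns]

print(patterns)  # [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(counts)    # [1, 2, 1, 2]
```

Under that assumption, the two lists could then be passed as fit_blim(structure, items, patterns, counts=counts).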

knowledgespaces.structures

Surmise relations on item domains.

A surmise relation is a quasi-order (reflexive and transitive) on a set of items that encodes prerequisite dependencies: if (a, b) is in the relation, then mastering item a is a prerequisite for mastering item b. On a discriminative item domain — the usual case when QUERY is run on items (rather than instances or skills) — no two distinct items are equivalent, so the relation is additionally antisymmetric, i.e. a partial order. Mutually prerequisite (equivalent) items arise with skills/competencies and are handled by competence-based KST (see knowledgespaces.derivation), not by item-level QUERY.

Storage and views: The class stores only the strict cover (pairs with a != b); reflexivity is implicit. Membership (in) and to_matrix() expose the reflexive relation: (x, x) is always a member and the matrix diagonal is always 1. The prerequisite/successor accessors (prerequisites_of(), successors_of(), to_adjacency_dict()) return the direct, strict relation (no self, no transitive closure); call transitive_closure() for all transitive prerequisites. Antisymmetry is not enforced at construction; verify it with is_antisymmetric().
References: Doignon, J.-P., & Falmagne, J.-C. (1999). Knowledge Spaces. Springer. Falmagne, J.-C., & Doignon, J.-P. (2011). Learning Spaces. Springer.

class knowledgespaces.structures.relations.SurmiseRelation(items, relations)

A surmise relation (quasi-order) representing prerequisite dependencies.

The relation is stored as a set of directed pairs (a, b) meaning ‘a is a prerequisite of b’. Self-loops are excluded from storage (reflexivity is implicit, so membership and to_matrix() are reflexive). On a discriminative item domain the relation is a partial order (antisymmetric); antisymmetry is not enforced at construction time — use is_antisymmetric() to verify.

Parameters:

items (Collection[str]): The domain of items.
relations (Collection[tuple[str, str]]): Pairs (a, b) where a is a prerequisite of b. Self-loops (a, a) are silently ignored.

property items: frozenset[str]

property relations: frozenset[tuple[str, str]]

property size: int

Number of items in the domain.

transitive_closure()

Compute the transitive closure.

Returns a new SurmiseRelation containing all transitively implied pairs. If (a, b) and (b, c) are in the relation, (a, c) is added.

Uses an adjacency-set approach: for each item, the set of successors is iteratively expanded until a fixed point is reached.

Return type: SurmiseRelation
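The fixed-point idea can be sketched on plain sets of pairs. This is an illustrative stand-in rather than the class's actual code; the function below is a hypothetical free function, and it drops self-loops to mirror the strict storage described above.

```python
def transitive_closure(items, pairs):
    """Expand each item's successor set until nothing changes."""
    successors = {item: set() for item in items}
    for a, b in pairs:
        if a != b:
            successors[a].add(b)

    changed = True
    while changed:
        changed = False
        for item in items:
            # Everything reachable in two steps becomes reachable in one.
            reachable = set()
            for nxt in successors[item]:
                reachable |= successors[nxt]
            reachable.discard(item)  # keep the stored relation strict
            if not reachable <= successors[item]:
                successors[item] |= reachable
                changed = True

    return {(a, b) for a in items for b in successors[a]}


closure = transitive_closure(
    ["add", "sub", "mul"],
    [("add", "sub"), ("sub", "mul")],
)
print(sorted(closure))  # [('add', 'mul'), ('add', 'sub'), ('sub', 'mul')]
```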

transitive_reduction()

Compute the transitive reduction (Hasse diagram).

Returns a new SurmiseRelation containing only the direct prerequisite edges — removes any edge (a, c) when there exists an intermediate item b such that (a, b) and (b, c) are both in the transitive closure.

Raises:

ValueError: If the relation is not antisymmetric (its closure contains a cycle of mutually-prerequisite items). The transitive reduction is only well defined for partial orders; on a cyclic relation it would silently delete real edges. Merge equivalent items first, or check with is_antisymmetric().

Return type: SurmiseRelation
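A minimal sketch of the reduction on plain pairs, including the antisymmetry guard, might look as follows. The helper names are hypothetical and the code is not taken from the library.

```python
def closure(pairs):
    """Transitive closure of a set of pairs, without self-loops."""
    result = {(a, b) for a, b in pairs if a != b}
    while True:
        extra = {
            (a, d)
            for a, b in result
            for c, d in result
            if b == c and a != d
        }
        if extra <= result:
            return result
        result |= extra


def transitive_reduction(pairs):
    full = closure(pairs)
    # A symmetric pair in the closure means a cycle of equivalent items.
    if any((b, a) in full for a, b in full):
        raise ValueError("relation is not antisymmetric; merge equivalent items first")
    items = {x for pair in full for x in pair}
    # Keep (a, c) only if no intermediate b has a -> b and b -> c in the closure.
    return {
        (a, c)
        for a, c in full
        if not any((a, b) in full and (b, c) in full for b in items)
    }


reduced = transitive_reduction([("add", "sub"), ("sub", "mul"), ("add", "mul")])
print(sorted(reduced))  # [('add', 'sub'), ('sub', 'mul')]
```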

prerequisites_of(item)

Return the prerequisites of an item in the stored relation.

Returns only items directly related in this instance’s edges. To get all transitive prerequisites, call transitive_closure() first, then query the result.

Parameters: item (str)
Return type: frozenset[str]

successors_of(item)

Return the successors of an item in the stored relation.

Returns only items directly related in this instance’s edges. To get all transitive successors, call transitive_closure() first, then query the result.

Parameters: item (str)
Return type: frozenset[str]

minimal_items()

Items with no prerequisites (bottom of the order).

Return type: frozenset[str]

maximal_items()

Items that are not a prerequisite of anything (top of the order).

Return type: frozenset[str]

levels()

Compute topological levels.

Level 0 items have no prerequisites. An item is at level n if the highest level among its prerequisites is n-1.

Returns:

dict[str, int]: Mapping from item to its level. Items in cycles are omitted.

Return type:

dict[str, int]
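The level rule can be sketched on plain pairs as follows. This is an illustration, not the method's source; in this version an item that depends on a cycle is omitted together with the cycle, which may or may not match the library's behaviour.

```python
def levels(items, pairs):
    """Level 0 = no prerequisites; otherwise 1 + the highest prerequisite level."""
    prereqs = {item: set() for item in items}
    for a, b in pairs:
        if a != b:
            prereqs[b].add(a)

    level = {}
    progress = True
    while progress:
        progress = False
        for item in items:
            if item in level:
                continue
            # An item can be placed once all its prerequisites are placed.
            if all(p in level for p in prereqs[item]):
                level[item] = 1 + max((level[p] for p in prereqs[item]), default=-1)
                progress = True
    return level


diamond = levels(
    ["add", "sub", "mul", "div"],
    [("add", "sub"), ("add", "mul"), ("sub", "div"), ("mul", "div")],
)
print(diamond)  # {'add': 0, 'sub': 1, 'mul': 1, 'div': 2}

# Items on a cycle never become placeable and are left out.
cyclic = levels(["x", "y", "z"], [("x", "y"), ("y", "x")])
print(cyclic)  # {'z': 0}
```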

is_antisymmetric()

Check if the relation is antisymmetric (i.e. encodes a partial order).

Antisymmetry is evaluated on the transitive closure: the relation is antisymmetric iff its closure contains no symmetric pair (b, a) for a closure pair (a, b) with a != b. This correctly detects cycles of any length (e.g. a -> b -> c -> a), not only directly-stored 2-cycles.

Return type: bool

to_adjacency_dict()

Return {item: set of its direct prerequisites}.

This is the strict (irreflexive) cover: an item is never listed as its own prerequisite, and transitive prerequisites are excluded. Call transitive_closure() first for all prerequisites. For the reflexive boolean form of the relation, use to_matrix().

Return type: dict[str, set[str]]

to_matrix()

Return (sorted items, reflexive adjacency matrix).

matrix[i][j] == 1 iff items[i] is a prerequisite of items[j] or i == j. The surmise relation is reflexive, so the diagonal is always 1 — consistent with membership testing ((x, x) in relation is always True). Off-diagonal entries reflect the stored cover only; call transitive_closure() first for the full transitive matrix.

Return type: tuple[list[str], list[list[int]]]
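As an illustration of the matrix layout described above (sorted items, diagonal of ones, off-diagonal entries from the stored pairs only), here is a standalone sketch with a hypothetical free function:

```python
def to_matrix(items, pairs):
    """Return (sorted items, reflexive 0/1 matrix) for the stored pairs."""
    ordered = sorted(items)
    index = {item: i for i, item in enumerate(ordered)}
    n = len(ordered)
    # Start from the identity: the relation is reflexive.
    matrix = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for a, b in pairs:
        matrix[index[a]][index[b]] = 1  # a is a prerequisite of b
    return ordered, matrix


ordered, matrix = to_matrix(
    {"add", "sub", "mul"},
    [("add", "sub"), ("sub", "mul")],
)
print(ordered)  # ['add', 'mul', 'sub']
print(matrix)   # [[1, 0, 1], [0, 1, 0], [0, 1, 1]]
```

The entry for (add, mul) stays 0 because only the stored cover is written; passing the transitive closure instead would fill it in.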

classmethod from_adjacency_matrix(items, matrix)

Create from an adjacency matrix.

matrix[i][j] = 1 means items[i] is a prerequisite of items[j].

Parameters:

items (list[str])
matrix (list[list[int]])

Return type: SurmiseRelation

classmethod from_prerequisites_dict(prereqs)

Create from {item: [its prerequisites]}.

Parameters:

prereqs (Mapping[str, Collection[str]]): For each item, the collection of its prerequisites.

Return type: SurmiseRelation

Knowledge structures, spaces, and learning spaces.

A knowledge structure on a domain Q is a family K of subsets of Q (called knowledge states) that contains at least the empty set and Q itself.

Special cases:

Knowledge space: closed under set union.
Closure space: closed under set intersection.
Learning space: a well-graded knowledge space (equivalently, an antimatroid).

References:

Doignon, J.-P., & Falmagne, J.-C. (1999). Knowledge Spaces. Springer-Verlag.

Falmagne, J.-C., & Doignon, J.-P. (2011). Learning Spaces. Springer-Verlag.

class knowledgespaces.structures.knowledge_structure.KnowledgeStructure(domain, states)

A family of knowledge states over a domain of items.

Parameters:

domain (Collection[str]): The set of all items. Must be non-empty: the knowledge-structure axioms (Falmagne & Doignon 2011, Def. 1.1.1) require a nonempty domain Q, and every downstream algorithm in this package (BLIM EM, QUERY, assessment) is undefined for |Q| = 0. An empty domain raises ValueError.
states (Collection[Collection[str]]): The knowledge states (subsets of domain). The empty set and the full domain are added automatically if missing.

property domain: frozenset[str]

property states: frozenset[frozenset[str]]

property n_items: int

property n_states: int

property is_knowledge_space: bool

True if closed under union.

property is_closure_space: bool

True if closed under intersection.

property is_well_graded: bool

True if every non-empty state has an immediate predecessor.

A state K has an immediate predecessor if there exists q in K such that K \ {q} is also a state.

Warning

This check verifies only the local well-gradedness condition (every state has at least one single-item predecessor). It is equivalent to the full definition (Falmagne & Doignon, 2011, Def. 1.44 / Thm 1.49) only if is_knowledge_space is True. For bare KnowledgeStructure instances that are not union-closed, the result may be a false positive with respect to the textbook definition, which additionally requires a tight path between every pair of comparable states. Prefer is_learning_space, which enforces both conditions, whenever well-gradedness is needed as a standalone guarantee.
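The gap between the local condition and union-closure can be seen on a five-state family. The sketch below is standalone; both helper functions are hypothetical and only mirror the definitions given in this section.

```python
def has_single_item_predecessors(states):
    """Local condition: every non-empty state loses one item to another state."""
    return all(
        any(state - {q} in states for q in state)
        for state in states
        if state
    )


def is_union_closed(states):
    return all(a | b in states for a in states for b in states)


states = {
    frozenset(s)
    for s in [(), ("a",), ("a", "b"), ("c",), ("a", "b", "c")]
}
print(has_single_item_predecessors(states))  # True
print(is_union_closed(states))               # False: {a} | {c} = {a, c} is missing
```

Here the local check passes even though the family is not a knowledge space, which is the situation the warning describes.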

property is_accessible: bool

True if every state is reachable from the empty set.

Reachability means there exists a chain from ∅ to the state where each step adds exactly one item.

property is_learning_space: bool

True if this is a well-graded knowledge space.

Equivalent to: knowledge space + well-graded. Equivalent to: no hanging states + ∅ and Q present.

inner_fringe(state)

Items whose removal yields another valid state.

These represent the ‘most recently consolidated’ items in the state.

Parameters: state (frozenset[str])
Return type: frozenset[str]

outer_fringe(state)

Items whose addition yields another valid state.

These represent the ‘next learnable’ items from this state.

Parameters: state (frozenset[str])
Return type: frozenset[str]
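Both fringes follow directly from their definitions. A standalone sketch on the chain structure add → sub → mul, using hypothetical free functions rather than the methods themselves:

```python
def inner_fringe(states, state):
    """Items whose removal from `state` yields another state."""
    return frozenset(q for q in state if state - {q} in states)


def outer_fringe(states, domain, state):
    """Items whose addition to `state` yields another state."""
    return frozenset(q for q in domain - state if state | {q} in states)


domain = frozenset({"add", "sub", "mul"})
states = {
    frozenset(),
    frozenset({"add"}),
    frozenset({"add", "sub"}),
    frozenset({"add", "sub", "mul"}),
}
current = frozenset({"add", "sub"})
print(sorted(inner_fringe(states, current)))          # ['sub']
print(sorted(outer_fringe(states, domain, current)))  # ['mul']
```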

hanging_states()

Non-empty states with empty inner fringe.

A hanging state cannot be reached incrementally — it indicates a structural problem in the learning space.

Return type: list[frozenset[str]]

atoms()

Atoms of the knowledge structure.

An atom at item q is a minimal state containing q — i.e. a state K such that q ∈ K and no proper subset of K that is also a state contains q. An item can have more than one atom.

In a closure space (closed under intersection) each item has exactly one atom (the intersection of all states containing it). In a knowledge space (closed under union) the collection of all atoms equals the base.

Returns:

dict[str, list[frozenset[str]]]: Mapping from each item to the list of its atoms (minimal states containing that item), sorted by size then content.

Return type:

dict[str, list[frozenset[str]]]

References

Doignon & Falmagne (1999), Knowledge Spaces, Def. 1.23. Falmagne & Doignon (2011), Learning Spaces, Chapter 1.
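A direct reading of the definition (minimal states containing q) can be sketched as follows. The function is a hypothetical stand-in, and the example family is a small knowledge space in which item b has two atoms.

```python
def atoms(states, domain):
    """For each item, the minimal states (by inclusion) that contain it."""
    result = {}
    for q in domain:
        containing = [s for s in states if q in s]
        minimal = [s for s in containing if not any(t < s for t in containing)]
        # Sort by size, then by content, for a stable order.
        result[q] = sorted(minimal, key=lambda s: (len(s), sorted(s)))
    return result


states = [
    frozenset(), frozenset("a"), frozenset("c"), frozenset("ac"),
    frozenset("ab"), frozenset("bc"), frozenset("abc"),
]
by_item = atoms(states, "abc")
print([sorted(s) for s in by_item["b"]])  # [['a', 'b'], ['b', 'c']]
```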

base()

Minimal generating family under union.

The base B of a knowledge space K is the smallest family such that K equals the closure of B under union (plus ∅). A state L ∈ K is in the base iff it cannot be expressed as the union of states of K that are strictly contained in L.

Only meaningful for knowledge spaces (closed under union).

Return type: list[frozenset[str]]
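The membership test for the base can be sketched by comparing each state with the union of all states strictly inside it. This is an illustration with a hypothetical free function; it presumes the input family is union-closed.

```python
def base(states):
    """States that are not the union of the states strictly contained in them."""
    result = []
    for state in states:
        if not state:
            continue  # the empty set is never part of the base
        smaller = [s for s in states if s < state]
        if frozenset().union(*smaller) != state:
            result.append(state)
    return sorted(result, key=lambda s: (len(s), sorted(s)))


states = [
    frozenset(), frozenset("a"), frozenset("c"), frozenset("ac"),
    frozenset("ab"), frozenset("bc"), frozenset("abc"),
]
print([sorted(s) for s in base(states)])
# [['a'], ['c'], ['a', 'b'], ['b', 'c']]
```

In this example {a, c} and {a, b, c} are dropped because each is a union of smaller states, and the four remaining states coincide with the atoms of the space.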

states_by_size()

Distribution of states by cardinality.

Return type: dict[int, int]

learning_paths(target=None, max_paths=100)

Find learning paths from ∅ to target.

A learning path is a maximal chain where each step adds one item.

Parameters:

target (frozenset[str] | None): Target state. Defaults to the full domain.
max_paths (int): Maximum number of paths to return (to avoid combinatorial explosion).

Return type:

list[list[frozenset[str]]]
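A depth-first enumeration of such chains, capped by max_paths, can be sketched as below. The function is a hypothetical free-standing version, not the method's source, and the search order is an arbitrary choice.

```python
def learning_paths(states, target, max_paths=100):
    """Chains from the empty state to `target`, adding one item per step."""
    paths = []

    def extend(path):
        if len(paths) >= max_paths:
            return  # cap reached: stop exploring
        current = path[-1]
        if current == target:
            paths.append(path)
            return
        for q in sorted(target - current):
            nxt = current | {q}
            if nxt in states:
                extend(path + [nxt])

    extend([frozenset()])
    return paths


states = {
    frozenset(), frozenset("a"), frozenset("c"), frozenset("ac"),
    frozenset("ab"), frozenset("bc"), frozenset("abc"),
}
paths = learning_paths(states, frozenset("abc"))
print(len(paths))  # 4
```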

classmethod from_surmise_relation(relation, *, max_items=None)

Build the ordinal knowledge structure from a surmise relation.

A state is valid iff for every item in the state, all its prerequisites are also in the state (downward closure).

The resulting structure is always closed under both union and intersection (it is a distributive lattice).

This enumerates 2^|Q| subsets of the domain; max_items overrides the default hard limit for advanced callers.

Parameters:

relation (SurmiseRelation)
max_items (int | None)

Return type: KnowledgeStructure

classmethod from_states(states)

Build from an explicit collection of states.

The domain is inferred as the union of all states.

Parameters: states (Collection[Collection[str]])
Return type: KnowledgeStructure

intersection_with(other)

Intersection of two structures: states present in both.

Parameters: other (KnowledgeStructure)
Return type: KnowledgeStructure

projection(sub_domain)

Project (restrict) the structure to a subset of items.

For each state K, the projected state is K ∩ sub_domain. Only items present in the original domain are retained.

Raises:

ValueError: If sub_domain contains items not in the original domain.

Parameters:

sub_domain (Collection[str])

Return type: KnowledgeStructure

surmise_function()

Derive the surmise function from this knowledge space.

For each item q, σ(q) is the set of atoms at q (minimal states containing q). Requires a knowledge space (union-closed) where every item has at least one atom (granularity).

Returns:

SurmiseFunction

Raises:

ValueError: If the structure is not a granular knowledge space.

Return type:

SurmiseFunction

References

Falmagne & Doignon (2011), Definition 5.2.1, Theorem 5.2.5.

surmise_relation()

Extract the surmise relation implied by this structure.

Item a is a prerequisite of b iff every state containing b also contains a.

Return type: SurmiseRelation
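The criterion "every state containing b also contains a" translates almost literally into code. The sketch below is a hypothetical free function working on plain sets; it returns every such pair, so the result is already transitively closed.

```python
def surmise_relation(states, domain):
    """Pairs (a, b) such that every state containing b also contains a."""
    return {
        (a, b)
        for a in domain
        for b in domain
        if a != b and all(a in state for state in states if b in state)
    }


states = [
    frozenset(),
    frozenset({"add"}),
    frozenset({"add", "sub"}),
    frozenset({"add", "sub", "mul"}),
]
pairs = surmise_relation(states, {"add", "sub", "mul"})
print(sorted(pairs))  # [('add', 'mul'), ('add', 'sub'), ('sub', 'mul')]
```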

Surmise functions (generalized surmise relations with multiple clauses).

A surmise function σ on a domain Q maps each item q to a family of subsets of Q called clauses. Each clause represents a possible minimal foundation for the mastery of q. When every item has exactly one clause, the surmise function reduces to a surmise relation (partial order).

The four axioms of a surmise function (Definition 5.1.2):

σ(q) ≠ ∅ for all q — at least one clause per item
q ∈ C for all C ∈ σ(q) — each clause contains its item
if q’ ∈ C ∈ σ(q), then ∃ C’ ∈ σ(q’) with C’ ⊆ C — refinement (transitivity analogue)
clauses for q are incomparable — no clause is a subset of another
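The four axioms above can be checked mechanically. The following sketch is a hypothetical validator, not the constructor's actual code; it reports violations as strings instead of raising.

```python
def axiom_violations(clauses):
    """Return a list of human-readable axiom violations (empty if none)."""
    sigma = {q: [frozenset(c) for c in family] for q, family in clauses.items()}
    problems = []
    for q, family in sigma.items():
        if not family:
            problems.append(f"{q}: no clause (axiom 1)")
        for clause in family:
            if q not in clause:
                problems.append(f"{q}: clause lacks its own item (axiom 2)")
            for r in clause:
                # Some clause of r must fit inside this clause.
                if not any(c <= clause for c in sigma.get(r, [])):
                    problems.append(f"{q}: no clause of {r} inside a clause (axiom 3)")
        if any(c1 < c2 for c1 in family for c2 in family):
            problems.append(f"{q}: comparable clauses (axiom 4)")
    return problems


valid = {
    "a": [{"a"}],
    "b": [{"b", "d"}, {"a", "b", "c"}, {"b", "c", "e"}],
    "c": [{"a", "b", "c"}, {"b", "c", "e"}],
    "d": [{"b", "d"}],
    "e": [{"b", "c", "e"}],
}
invalid = {"a": [{"a"}], "b": [{"b"}, {"a", "b"}]}

print(axiom_violations(valid))    # []
print(axiom_violations(invalid))  # ['b: comparable clauses (axiom 4)']
```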

References: Falmagne, J.-C., & Doignon, J.-P. (2011). Learning Spaces, Chapter 5 — Surmise Systems. Springer-Verlag.

class knowledgespaces.structures.surmise_function.SurmiseFunction(domain, clauses)

A surmise function on a domain of items.

Maps each item q to a family of clauses σ(q), where each clause is a frozenset of items representing an alternative foundation for mastering q.

Parameters:

domain (Collection[str]): The set of all items.
clauses (Mapping[str, Collection[Collection[str]]]): For each item, the collection of its clauses. Each clause is a collection of item names.

Raises:

ValueError: If any of the four surmise function axioms is violated, or if clauses reference items outside the domain.

Examples

>>> sf = SurmiseFunction(
...     {"a", "b", "c", "d", "e"},
...     {
...         "a": [{"a"}],
...         "b": [{"b", "d"}, {"a", "b", "c"}, {"b", "c", "e"}],
...         "c": [{"a", "b", "c"}, {"b", "c", "e"}],
...         "d": [{"b", "d"}],
...         "e": [{"b", "c", "e"}],
...     },
... )

property domain: frozenset[str]

The item domain Q.

property n_items: int

Number of items in the domain.

property is_ordinal: bool

True if every item has exactly one clause.

When ordinal, the surmise function is equivalent to a surmise relation (partial order).

property is_discriminative: bool

True if distinct items have distinct clause families.

σ(q) = σ(q’) implies q = q’ (Definition 5.1.2).

property is_acyclic: bool

True if the relation Rσ is acyclic (Definition 5.6.12).

Rσ is defined by: q Rσ q’ ⟺ ∃ C ∈ σ(q’) : q ∈ C. Acyclicity means no cycle q1 Rσ q2 Rσ … Rσ q1 with q1 ≠ qk.

clauses_for(item)

Return the clause family σ(item).

Parameters:

item (str): An item in the domain.

Returns:

frozenset[frozenset[str]]: The set of clauses for item.

Return type:

frozenset[frozenset[str]]

precedence_relation()

The precedence relation: r ≺ q ⟺ r ∈ ∩σ(q).

An item r precedes q iff r belongs to every clause for q. This is always a quasi order and generalizes the surmise relation (Definition 3.7.1 / §5.6.9).

Return type: set[tuple[str, str]]

to_knowledge_space(*, max_items=None)

Derive the knowledge space from this surmise function.

A set K ⊆ Q is a state iff for every q ∈ K, there exists a clause C ∈ σ(q) with C ⊆ K (Eq. 5.2, Definition 5.2.3).

The result is always a knowledge space (closed under union). When the surmise function satisfies all four axioms, each clause is an atom and the space is granular.

Parameters:

max_items (int | None): Hard limit override for the domain-size preflight check. This operation enumerates up to 2^|Q| candidate states, so by default it raises DomainTooLargeError for |Q| >= 25. Pass a larger value to opt out (at your own risk).

Returns:

KnowledgeStructure: The derived knowledge space.

Return type:

KnowledgeStructure

References

Falmagne & Doignon (2011), Theorem 5.2.5.
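The state criterion (every q in K has some clause C ⊆ K) can be applied by brute force on a small domain. This sketch is standalone and uses a hypothetical helper `states_from_clauses`; like the method it enumerates all 2^|Q| subsets, so it is only meant for tiny examples.

```python
from itertools import combinations


def states_from_clauses(clauses):
    """All subsets K such that each item of K has at least one clause inside K."""
    domain = sorted(clauses)
    states = set()
    for size in range(len(domain) + 1):
        for subset in combinations(domain, size):
            candidate = frozenset(subset)
            if all(
                any(set(clause) <= candidate for clause in clauses[q])
                for q in candidate
            ):
                states.add(candidate)
    return states


clauses = {
    "a": [["a"]],
    "b": [["a", "b"], ["b", "c"]],
    "c": [["c"]],
}
states = states_from_clauses(clauses)
print(len(states))  # 7: every subset of {a, b, c} except {b}
```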

to_surmise_relation()

Convert to a SurmiseRelation (only valid for ordinal case).

When each item has exactly one clause, the surmise function is equivalent to a quasi order (Definition 5.1.4).

Returns:

SurmiseRelation: The equivalent surmise relation.

Raises:

ValueError: If the surmise function is not ordinal.

Return type:

SurmiseRelation

classmethod from_knowledge_space(ks)

Derive the surmise function from a granular knowledge space.

For each item q, σ(q) = {atoms at q in K} (Definition 5.2.1).

Parameters:

ks (KnowledgeStructure): Must be a knowledge space (union-closed). The space must be granular (every item has at least one atom).

Returns:

SurmiseFunction

Raises:

ValueError: If the structure is not a knowledge space, or if some item has no atom (non-granular).

Return type:

SurmiseFunction

References

Falmagne & Doignon (2011), Definition 5.2.1, Theorem 5.2.5.

classmethod from_surmise_relation(relation)

Cast a surmise relation as a surmise function.

Each item q gets a single clause: {q} ∪ prerequisites(q) in the transitive closure (Definition 5.1.4).

Parameters:

relation (SurmiseRelation): A surmise relation (partial order on items).

Returns:

SurmiseFunction: An ordinal surmise function (one clause per item).

Return type:

SurmiseFunction

knowledgespaces.query

Core types for the QUERY algorithm.

Defines the Query object and the result containers used across all blocks of the algorithm.

class knowledgespaces.query.types.InferenceSource(*values)

How a query answer was determined.

class knowledgespaces.query.types.Query(antecedent, consequent)

A single query: ‘Does failing all items in A imply failing q?’

For pair queries (Block 1): antecedent has 1 item. For group queries (Block 2+): antecedent has 2+ items. For Qmax minimality test: antecedent = Q without {q}.

Parameters:

antecedent (frozenset[str]): The set A of items.
consequent (str): The target item q.

classmethod pair(a, q)

Create a pair query: a -> q.

Parameters:

a (str)
q (str)

Return type:

Query

classmethod group(A, q)

Create a group query: A -> q.

Parameters:

A (frozenset[str])
q (str)

Return type:

Query

class knowledgespaces.query.types.QueryAnswer(query, answer, source)

A resolved query with its answer and source.

Parameters:

query (Query)
answer (bool)
source (InferenceSource)

property was_asked: bool

True if this answer came from an expert, not inference.

Expert protocols for the QUERY algorithm.

An expert is any callable that answers prerequisite queries. This module defines the protocol and provides built-in implementations for testing and interactive use.

References: Koppen, M., & Doignon, J.-P. (1990). How to build a knowledge space by querying an expert. Journal of Mathematical Psychology, 34, 311-331.

class knowledgespaces.query.expert.Expert(*args, **kwargs)

Protocol for expert query functions.

An expert receives a Query and returns True (positive) or False (negative).

The semantics of a positive answer to Query(A, q) are: "If a student fails all items in A, they will also fail q." Equivalently: mastering q requires mastering at least one item in A.

class knowledgespaces.query.expert.PresetExpert(answers=None, default=False)

Expert with predetermined answers, for testing and replay.

Parameters:

answers (dict[tuple[frozenset[str], str], bool] | None): Mapping from (antecedent, consequent) to answer.
default (bool): Answer for queries not in the mapping.

classmethod from_relation(relations)

Create an expert that answers based on a known surmise relation.

A pair query (a -> q) is positive iff (a, q) is in the relation. A group query (A -> q) is positive iff some a in A has (a, q).

This simulates an expert who knows the true prerequisite structure. Suitable for testing the query algorithm against a known ground truth.

Parameters: relations (Collection[tuple[str, str]])
Return type: PresetExpert
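The answering rule stated above (a group query is positive iff some a in A has (a, q)) fits in one line. The helper below is a hypothetical illustration working on a plain set of pairs, not the class itself.

```python
def simulated_answer(relation, antecedent, consequent):
    """Positive iff at least one antecedent item is a known prerequisite."""
    return any((a, consequent) in relation for a in antecedent)


relation = {("add", "sub"), ("sub", "mul")}

pair_answer = simulated_answer(relation, {"add"}, "sub")
group_answer = simulated_answer(relation, {"add", "mul"}, "sub")
negative_answer = simulated_answer(relation, {"mul"}, "add")
print(pair_answer, group_answer, negative_answer)  # True True False
```

In this sketch only the pairs actually supplied are consulted, so ("add", "mul") would be answered negatively unless the transitive closure is passed in.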

class knowledgespaces.query.expert.CallbackExpert(fn)

Expert that delegates to a user-supplied function.

Parameters:

fn (Callable[[frozenset[str], str], bool]): A function that takes (antecedent_set, consequent) and returns bool.

exception knowledgespaces.query.expert.QueryNeeded(query)

Raised by ReplayExpert when no cached answer exists.

Attributes:

query (Query): The unanswered query that needs to be posed to the expert.

class knowledgespaces.query.expert.ReplayExpert(answers)

Expert that replays cached answers and raises QueryNeeded on a cache miss.

Useful for web applications and session resumption: replay all prior answers instantly, then pause at the next unanswered query.

Parameters:

answers (dict[tuple[frozenset[str], str], bool]): Mapping from (antecedent, consequent) to answer.
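The replay-then-pause pattern can be sketched with stand-in classes. Everything below is hypothetical (`QueryNeededSketch`, `ReplaySketch` and `run` are not library names); it only illustrates how a cache miss surfaces the next question and how a rerun picks up where the session stopped.

```python
class QueryNeededSketch(Exception):
    """Signals the first query that has no cached answer."""

    def __init__(self, query):
        super().__init__(f"unanswered query: {query}")
        self.query = query


class ReplaySketch:
    def __init__(self, answers):
        self.answers = dict(answers)

    def __call__(self, antecedent, consequent):
        key = (frozenset(antecedent), consequent)
        if key not in self.answers:
            raise QueryNeededSketch(key)
        return self.answers[key]


def run(expert):
    """Stand-in for a query procedure: asks two pair queries in a fixed order."""
    return [expert({"add"}, "sub"), expert({"sub"}, "mul")]


cache = {(frozenset({"add"}), "sub"): True}
try:
    run(ReplaySketch(cache))
except QueryNeededSketch as pending:
    # In a web application, this is where the question would be shown
    # to the human expert; here the answer is simply hard-coded.
    cache[pending.query] = True

result = run(ReplaySketch(cache))
print(result)  # [True, True]
```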

Block 1 of the QUERY algorithm: surmise relation discovery.

Extracts prerequisite relations between items by querying an expert with pair queries (a -> q) and optional Qmax minimality tests.

Implementation note — multi-phase optimization.

The standard QUERY Block 1 (Koppen & Doignon, 1990; Learning Spaces, Falmagne & Doignon, 2011, Ch. 15) iterates over all item pairs and uses transitivity + antisymmetry to prune redundant queries.

This implementation splits Block 1 into four phases as an optimization that maximises early inference opportunities:

Phase 0 (optional): Qmax test (group query Q \ {q} → q for each item), which identifies globally minimal items that need no pair queries.
Phase 1: Forward pass — pair queries with transitivity/antisymmetry pruning, processing items in input order.
Phase 2: Bottom-up pass — re-explores from minimal items outward, exploiting newly discovered transitivity.
Phase 3: Final verification sweep — covers any pair not yet resolved by Phases 1–2, guaranteeing completeness.

The result is identical to the standard single-pass algorithm: the same surmise relation is discovered, but typically with fewer expert queries thanks to the ordering strategy. Completeness is guaranteed by Phase 3.

Antisymmetry assumption (item-level QUERY).

Block 1 assumes the target surmise relation is a partial order (antisymmetric): once a prerequisite (q, r) is established, the reverse (r, q) is inferred to be False without re-querying the expert. This is the standard, valid assumption when QUERY is run on a discriminative item domain, where no two distinct items are equivalent. The inference is fully traceable, not silent: each inferred pair is logged with InferenceSource.ANTISYMMETRY, counted in Block1Stats.skipped_by_antisymmetry, and listed by Block1Result.assumed_antisymmetric_pairs. Equivalent (mutually prerequisite) items belong to competence-based KST (skills), not item-level QUERY: if your domain may contain equivalent items, collapse them first or model skills via knowledgespaces.derivation.
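How transitivity and the antisymmetry assumption save expert queries can be shown with a deliberately simplified single pass. This is not the four-phase implementation described above; `close` and `discover` are hypothetical helpers and the stats keys are illustrative.

```python
def close(relation):
    """Add every pair implied by transitivity, in place."""
    changed = True
    while changed:
        changed = False
        for a, b in list(relation):
            for c, d in list(relation):
                if b == c and a != d and (a, d) not in relation:
                    relation.add((a, d))
                    changed = True


def discover(items, ask):
    relation = set()  # established pairs (a, q): a is a prerequisite of q
    stats = {"asked": 0, "transitivity": 0, "antisymmetry": 0}
    for a in items:
        for q in items:
            if a == q:
                continue
            if (a, q) in relation:
                stats["transitivity"] += 1   # already implied, no query needed
            elif (q, a) in relation:
                stats["antisymmetry"] += 1   # reverse holds, so inferred False
            else:
                stats["asked"] += 1
                if ask(a, q):
                    relation.add((a, q))
                    close(relation)
    return relation, stats


truth = {("add", "sub"), ("sub", "mul"), ("add", "mul")}
relation, stats = discover(["sub", "add", "mul"], lambda a, q: (a, q) in truth)
print(stats)  # {'asked': 3, 'transitivity': 1, 'antisymmetry': 2}
```

Of the six ordered pairs, only three reach the simulated expert; the other three are settled by inference.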

References:

Koppen, M., & Doignon, J.-P. (1990). How to build a knowledge space by querying an expert. Journal of Mathematical Psychology, 34, 311-331.

Falmagne, J.-C., & Doignon, J.-P. (2011). Learning Spaces, Chapter 15. Springer-Verlag.

class knowledgespaces.query.block1.Block1Stats(pair_queries=0, group_queries=0, deduced_by_transitivity=0, skipped_by_antisymmetry=0)[source]¶

Statistics from Block 1 execution.

Parameters:

pair_queries (int)
group_queries (int)
deduced_by_transitivity (int)
skipped_by_antisymmetry (int)

class knowledgespaces.query.block1.Block1Result(relation, closure, minimal_global, minimal_local, stats, log=<factory>)[source]¶

Result of Block 1 query phase.

Parameters:

relation (SurmiseRelation)
closure (SurmiseRelation)
minimal_global (frozenset[str])
minimal_local (frozenset[str])
stats (Block1Stats)
log (list[QueryAnswer])

property assumed_antisymmetric_pairs: list[tuple[str, str]]¶

Reverse pairs (r, q) inferred False by the antisymmetry assumption.

These pairs were not put to the expert: each was inferred False because (q, r) is a prerequisite and the surmise relation is assumed antisymmetric (a partial order). Exposed for full transparency — see the module docstring’s “Antisymmetry assumption” note.

knowledgespaces.query.block1.run_block1(items, expert, *, use_qmax=True, max_items=None)[source]¶

Execute Block 1 of the QUERY algorithm.

Parameters:

itemslist[str]: The domain of items.
expertExpert: The expert to query.
use_qmaxbool: If True, run Phase 0 (Qmax minimality test). Default True.
max_itemsint | None: Hard limit override for the |Q| preflight check. Block 1 is O(|Q|^5) worst case; above ~25 items it is not practical.

Returns:

Block1Result: Discovered surmise relation, closure, minimal items, and stats.

Parameters:

items (list[str])
expert (Expert)
use_qmax (bool)
max_items (int | None)

Return type:

Block1Result

Block 2+ of the QUERY algorithm: learning space refinement.

Refines an ordinal space (from Block 1) into a learning space by querying group prerequisites. Block N tests antecedent sets of size N.

Block 2 handles antecedent size 2, Block 3 size 3, etc. The algorithm is identical at every level — only the antecedent cardinality changes.

Three inference mechanisms reduce expert queries:

Negative monotonicity: if q is globally minimal (Qmax), then (A,q)=NO.
Positive monotonicity: if any subset of A already implies q, then (A,q)=YES.
Structural test (Theorem 43): if answering YES would create hanging states, then the answer must be NO.
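
The two monotonicity rules can be illustrated with a small helper. This is a sketch only: infer_answer is a hypothetical name, the structural test is left out, and prior_positive is built from a made-up Block 1 closure in the (antecedent_set, consequent) form used by run_block_n.

```python
def infer_answer(antecedent, q, prior_positive, minimal_global):
    """Return True/False when (antecedent, q) can be inferred, else None."""
    if q in minimal_global:
        return False  # negative monotonicity: a globally minimal q is never implied
    for known, consequent in prior_positive:
        if consequent == q and known <= antecedent:
            return True  # positive monotonicity: a subset of A already implies q
    return None  # undecided: ask the expert (or apply the structural test)


closure = {("a", "c")}  # made-up Block 1 closure: a is a prerequisite of c
prior_positive = {(frozenset({x}), y) for (x, y) in closure}
minimal_global = frozenset({"a"})

implied = infer_answer(frozenset({"a", "b"}), "c", prior_positive, minimal_global)
blocked = infer_answer(frozenset({"b", "c"}), "a", prior_positive, minimal_global)
unknown = infer_answer(frozenset({"a", "c"}), "b", prior_positive, minimal_global)
```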

References:: Falmagne, J.-C., & Doignon, J.-P. (2011). Learning Spaces, Chapter 15. Springer-Verlag.

class knowledgespaces.query.block2.BlockNStats(expert_queries=0, inferred_negative_monotonicity=0, inferred_positive_monotonicity=0, inferred_structural=0)[source]¶

Statistics from a Block N execution.

Parameters:

expert_queries (int)
inferred_negative_monotonicity (int)
inferred_positive_monotonicity (int)
inferred_structural (int)

class knowledgespaces.query.block2.BlockNResult(structure, positive, negative, stats, log=<factory>)[source]¶

Result of a Block N query phase.

Parameters:

structure (KnowledgeStructure)
positive (set[tuple[frozenset[str], str]])
negative (set[tuple[frozenset[str], str]])
stats (BlockNStats)
log (list[QueryAnswer])

knowledgespaces.query.block2.run_block_n(items, structure, prior_positive, expert, *, antecedent_size=2, minimal_global=frozenset({}), max_items=None)[source]¶

Execute Block N of the QUERY algorithm.

Parameters:

itemslist[str]: The domain of items.
structureKnowledgeStructure: The current knowledge structure to refine.
prior_positiveset[tuple[frozenset[str], str]]: Positive relations from all previous blocks. Each entry is (antecedent_set, consequent). Single-item antecedents come from Block 1’s closure: {(frozenset({a}), b) for (a,b) in closure}.
expertExpert: The expert to query.
antecedent_sizeint: Size of antecedent sets to test. Default 2 (standard Block 2).
minimal_globalfrozenset[str]: Items certified globally minimal by Qmax in Block 1.
max_itemsint | None: Hard limit override for the |Q| preflight check. Block N scales as O(|Q| * C(|Q|-1, k)), where k is antecedent_size; values above ~25 are not practical.

Returns:

BlockNResult: Refined structure, discovered relations, and stats.

Parameters:

items (list[str])
structure (KnowledgeStructure)
prior_positive (set[tuple[frozenset[str], str]])
expert (Expert)
antecedent_size (int)
minimal_global (frozenset[str])
max_items (int | None)

Return type:

BlockNResult

Full QUERY pipeline: Block 1 → L1 → Block 2 → … → Block N.

Orchestrates the complete derivation of a learning space from expert queries.

class knowledgespaces.query.pipeline.QueryPipelineResult(structure, block1, block_n_results=<factory>)[source]¶

Result of the full QUERY pipeline.

Parameters:

structure (KnowledgeStructure)
block1 (Block1Result)
block_n_results (list[BlockNResult])

knowledgespaces.query.pipeline.run_query(items, expert, *, use_qmax=True, max_antecedent_size=2, max_items=None)[source]¶

Run the full QUERY algorithm.

Parameters:

itemslist[str]: The domain of items.
expertExpert: The expert to query.
use_qmaxbool: If True, use Qmax minimality test in Block 1.
max_antecedent_sizeint: Maximum antecedent size for group queries. Default 2 (Block 2 only). Set to 3 for Block 3, etc.
max_itemsint | None: Hard limit override for the |Q| preflight check. When None (default), DEFAULT_MAX_N_ITEMS is used. QUERY is exponential in |Q|; values above ~25 are not practical in a browser worker.

Returns:

QueryPipelineResult: The derived learning space with full traceability.

Raises:

ValueError: If items is empty. QUERY is defined only for a nonempty domain (Koppen 1993, §2; Falmagne & Doignon 2011, §15).

Parameters:

items (list[str])
expert (Expert)
use_qmax (bool)
max_antecedent_size (int)
max_items (int | None)

Return type:

QueryPipelineResult

knowledgespaces.derivation¶

Skill maps: the mapping from items to required skills.

A skill map μ assigns to each item q the set of skills μ(q) needed to solve it. The problem function p(C) = {q ∈ Q | μ(q) ⊆ C} maps a competence state to the set of solvable items.
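
As a minimal sketch of the definition, using plain dicts and sets rather than the SkillMap class (the function below is a standalone illustration, not the library method):

```python
def problem_function(mu, competence):
    """Return p(C): the items whose required skills all lie in `competence`."""
    return frozenset(q for q, skills in mu.items() if set(skills) <= set(competence))


mu = {"q1": ["s_add"], "q2": ["s_add", "s_carry"]}
solvable_partial = problem_function(mu, {"s_add"})
solvable_full = problem_function(mu, {"s_add", "s_carry"})
```

With only s_add available, q2 stays unsolvable because the map is conjunctive: every skill in μ(q2) must be present.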

References:: Doignon, J.-P., & Falmagne, J.-C. (1999). Knowledge Spaces, Chapter 5. Springer-Verlag.

class knowledgespaces.derivation.skill_map.SkillMap(items, skills, mapping)[source]¶

Mapping from items to the skills required to solve them.

Parameters:

itemsCollection[str]: The domain of items.
skillsCollection[str]: The set of all skills.
mappingMapping[str, Collection[str]]: For each item, the skills required to solve it: μ(q).

Raises:

ValueError: If an item references a skill not in the skills set, or if mapping keys don’t match items.

Parameters:

items (Collection[str])
skills (Collection[str])
mapping (Mapping[str, Collection[str]])

skills_for(item)[source]¶

Return μ(q): skills required by item q.

Parameters:: item (str)
Return type:: frozenset[str]

problem_function(competence)[source]¶

Compute p(C) = {q ∈ Q | μ(q) ⊆ C}.

An item is solvable iff ALL its required skills are present in the competence state.

Parameters:: competence (frozenset[str])
Return type:: frozenset[str]

to_matrix()[source]¶

Return (items, skills, binary matrix).

matrix[i][j] = 1 iff skill skills[j] is required by items[i].

Return type:: tuple[list[str], list[str], list[list[int]]]

classmethod from_matrix(items, skills, matrix)[source]¶

Create from a binary matrix.

matrix[i][j] = 1 means items[i] requires skills[j].

Raises:

ValueError: If matrix dimensions don’t match items/skills, or if values are not 0 or 1.

Parameters:

items (list[str])
skills (list[str])
matrix (list[list[int]])

Return type:

SkillMap
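
The matrix convention shared by to_matrix and from_matrix can be sketched with plain containers. The two functions below are standalone illustrations, assumed to mirror the documented behaviour; they are not the library's code.

```python
def to_matrix(items, skills, mapping):
    """matrix[i][j] = 1 iff skills[j] is required by items[i]."""
    return [[1 if s in mapping[q] else 0 for s in skills] for q in items]


def from_matrix(items, skills, matrix):
    """Rebuild the item -> skills mapping, validating shape and values."""
    if len(matrix) != len(items) or any(len(row) != len(skills) for row in matrix):
        raise ValueError("matrix dimensions do not match items/skills")
    if any(v not in (0, 1) for row in matrix for v in row):
        raise ValueError("matrix values must be 0 or 1")
    return {q: {s for s, v in zip(skills, row) if v == 1} for q, row in zip(items, matrix)}


items = ["q1", "q2"]
skills = ["s_add", "s_carry"]
mapping = {"q1": {"s_add"}, "q2": {"s_add", "s_carry"}}
matrix = to_matrix(items, skills, mapping)
roundtrip = from_matrix(items, skills, matrix)
```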

Competence-Based Knowledge Space Theory (CbKST) derivation.

Given a skill map μ (items → skills) and a surmise relation on skills, derives the knowledge structure on items through the problem function.

The pipeline:

1. Build competence structure C from skill prerequisites.
2. Apply problem function p(C) to each competence state.
3. Collect unique knowledge states → knowledge structure K.
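
The three steps can be sketched by brute force for a tiny skill set. This is an illustration under stated assumptions, not the library's algorithm: competence_states and derive_states are hypothetical helpers, every subset of skills is enumerated (exponential in the number of skills), and only skills mentioned in the skill map are considered.

```python
from itertools import combinations


def competence_states(skills, prerequisites):
    """Step 1: skill subsets closed under prerequisites (brute force)."""
    ordered = sorted(skills)
    states = []
    for size in range(len(ordered) + 1):
        for combo in combinations(ordered, size):
            state = frozenset(combo)
            # (a, b) means a is a prerequisite of b: b in state requires a in state.
            if all(a in state for a, b in prerequisites if b in state):
                states.append(state)
    return states


def derive_states(mu, prerequisites):
    """Steps 2 and 3: apply the problem function and collect unique results."""
    skills = {s for required in mu.values() for s in required}
    knowledge_states = set()
    for competence in competence_states(skills, prerequisites):
        solvable = frozenset(q for q, req in mu.items() if set(req) <= competence)
        knowledge_states.add(solvable)
    return knowledge_states


states = derive_states({"q1": ["s1"], "q2": ["s1", "s2"]}, [("s1", "s2")])
```

Here the competence states are {}, {s1} and {s1, s2}, which map to the knowledge states {}, {q1} and {q1, q2}.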

Additionally provides conversion from skill prerequisites to item prerequisites (surmise relation on items).

References:: Doignon, J.-P., & Falmagne, J.-C. (1999). Knowledge Spaces, Chapter 5. Springer-Verlag.

class knowledgespaces.derivation.cbkst.CBKSTResult(competence_structure, knowledge_structure, mapping, skill_map, skill_relation)[source]¶

Result of a CbKST derivation.

Parameters:

competence_structure (KnowledgeStructure)
knowledge_structure (KnowledgeStructure)
mapping (dict[frozenset[str], frozenset[str]])
skill_map (SkillMap)
skill_relation (SurmiseRelation)

knowledgespaces.derivation.cbkst.derive_knowledge_structure(skill_map, skill_relation)[source]¶

Derive a knowledge structure from a competence model.

Parameters:

skill_mapSkillMap: The mapping μ: items → required skills.
skill_relationSurmiseRelation: Prerequisite relation on skills (will be transitively closed). Must cover all skills referenced by the skill map.

Returns:

CBKSTResult: Contains the competence structure C, knowledge structure K, and the mapping from competence states to knowledge states.

Raises:

ValueError: If skill_map references skills not in skill_relation’s domain.

Parameters:

skill_map (SkillMap)
skill_relation (SurmiseRelation)

Return type:

CBKSTResult

knowledgespaces.derivation.cbkst.skill_to_item_relation(skill_map, skill_relation)[source]¶

Derive an item surmise relation from skills.

Item p is a prerequisite of item q iff every skill required by p is “covered” by q — meaning the skill is either directly required by q or is a prerequisite of a skill required by q.

Formally:

covers(q) = μ(q) ∪ {s : ∃t ∈ μ(q), (s,t) ∈ closure}
p ≺ q iff μ(p) ⊆ covers(q)

Note: the result is already transitively closed.
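
The formal definition translates almost directly into set operations. In this sketch item_relation is a hypothetical helper, closure is assumed to be an already transitively closed set of skill pairs, and reflexive pairs (p, p) are omitted from the output for readability.

```python
def item_relation(mu, closure):
    """Pairs (p, q), p != q, such that mu(p) is contained in covers(q)."""

    def covers(q):
        required = set(mu[q])
        # Add every skill that is a prerequisite of a skill required by q.
        return required | {s for (s, t) in closure if t in required}

    return {
        (p, q)
        for p in mu
        for q in mu
        if p != q and set(mu[p]) <= covers(q)
    }


mu = {"q1": {"s1"}, "q2": {"s2"}, "q3": {"s3"}}
closure = {("s1", "s2")}  # assumed already transitively closed
relation = item_relation(mu, closure)
```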

Parameters:

skill_mapSkillMap: The mapping μ: items → required skills.
skill_relationSurmiseRelation: Prerequisite relation on skills.

Returns:

SurmiseRelation: Surmise relation on items.

Raises:

ValueError: If skill_map references skills not in skill_relation’s domain.

Parameters:

skill_map (SkillMap)
skill_relation (SurmiseRelation)

Return type:

SurmiseRelation

knowledgespaces.assessment¶

Basic Local Independence Model (BLIM) and Bayesian state inference.

The BLIM defines the probability of a response pattern given a knowledge state, using two parameters:

beta (slip): P(incorrect | item mastered)
eta (guess): P(correct | item not mastered)

The model assumes local independence: responses to different items are conditionally independent given the knowledge state.
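
A minimal sketch of these two ideas, assuming scalar beta and eta shared by all items (the function names are illustrative and do not belong to the library):

```python
def item_likelihood(item, response, state, beta, eta):
    """P(response | state) for one item under the BLIM."""
    if item in state:
        return 1.0 - beta if response else beta  # mastered: incorrect only by slip
    return eta if response else 1.0 - eta  # not mastered: correct only by guess


def pattern_likelihood(responses, state, beta, eta):
    """Local independence: the pattern likelihood is a product over items."""
    prob = 1.0
    for item, response in responses.items():
        prob *= item_likelihood(item, response, state, beta, eta)
    return prob


p = pattern_likelihood({"a": True, "b": False}, frozenset({"a"}), beta=0.1, eta=0.2)
```

For the state {a}, a correct answer on a has probability 1 - beta = 0.9 and an incorrect answer on b has probability 1 - eta = 0.8, so the pattern has probability 0.72.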

StatePosterior maintains the Bayesian probability distribution over knowledge states and supports sequential updating.

References:

Doignon, J.-P., & Falmagne, J.-C. (1999). Knowledge Spaces, Chapter 12. Springer-Verlag.

Falmagne, J.-C., & Doignon, J.-P. (2011). Learning Spaces, Chapter 12. Springer-Verlag.

class knowledgespaces.assessment.blim.BLIMParams(beta, eta)[source]¶

Parameters for the Basic Local Independence Model.

Parameters:

betafloat or dict[str, float]: Slip probability: P(incorrect response | item mastered). A scalar applies the same value to every item; a dict maps each item to its own slip probability. Values must be in [0, 1).
etafloat or dict[str, float]: Guess probability: P(correct response | item not mastered). A scalar applies the same value to every item; a dict maps each item to its own guess probability. Values must be in [0, 1).

The constraint beta + eta < 1 (per item, when dicts are used) is the informative item condition: a respondent who has mastered the item must be more likely to answer it correctly than one who has not. It is necessary for the item to be informative, but does not by itself guarantee model identifiability (which depends on the knowledge structure).

Parameters:

beta (float | dict[str, float])
eta (float | dict[str, float])

class knowledgespaces.assessment.blim.BLIM(structure, params)[source]¶

Basic Local Independence Model on a knowledge structure.

Parameters:

structureKnowledgeStructure: The knowledge structure defining valid states.
paramsBLIMParams: Slip and guess parameters (scalar or per-item dict).

Parameters:

structure (KnowledgeStructure)
params (BLIMParams)

likelihood(item, response, state)[source]¶

Compute P(response | state) for a single item and state.

Parameters:

itemstr: The item being assessed.
responsebool: True for correct, False for incorrect.
statefrozenset[str]: The knowledge state. Must be a state in the structure.

Raises:

ValueError: If item is not in the domain or state is not in the structure.

Parameters:

item (str)
response (bool)
state (frozenset[str])

Return type:

float

likelihood_vector(item, response)[source]¶

Compute P(response | state) for all states.

Returns a 1D array of length n_states, where element i is P(response | states[i]).

Raises:

ValueError: If item is not in the domain.

Parameters:

item (str)
response (bool)

Return type:

ndarray

class knowledgespaces.assessment.blim.StatePosterior(blim, probabilities)[source]¶

Bayesian probability distribution over knowledge states.

Immutable: update() returns a new StatePosterior.

Parameters:

blimBLIM: The BLIM model defining states and likelihoods.
probabilitiesnp.ndarray: Probability for each state. Must sum to 1.

Parameters:

blim (BLIM)
probabilities (np.ndarray)

classmethod uniform(blim)[source]¶

Create a uniform prior (maximum uncertainty).

Parameters:: blim (BLIM)
Return type:: StatePosterior

classmethod from_prior(blim, prior)[source]¶

Create from an explicit prior mapping states to probabilities.

Parameters:

blimBLIM: The BLIM model.
priordict[frozenset[str], float]: Mapping from each state to its prior probability. Must cover exactly the structure’s states (no missing, no extra) and sum to 1.

Parameters:

blim (BLIM)
prior (dict[frozenset[str], float])

Return type:

StatePosterior

update(item, response)[source]¶

Return a new posterior after observing a response.

Applies Bayes’ theorem:

P(state | response) ∝ P(response | state) × P(state)

Parameters:

itemstr: The item that was assessed. Must be in the structure’s domain.
responsebool: True for correct, False for incorrect.

Raises:

ValueError: If item is not in the domain, or if the observation has zero probability under all states with nonzero prior (impossible evidence).

Parameters:

item (str)
response (bool)

Return type:

StatePosterior
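
The update rule can be sketched on a plain dict from states to probabilities. This is an illustration assuming scalar beta and eta; bayes_update is a hypothetical function, not the StatePosterior method.

```python
def bayes_update(prior, item, response, beta, eta):
    """Return a new dict state -> probability; `prior` is left untouched."""
    unnormalized = {}
    for state, p in prior.items():
        if item in state:
            lik = 1.0 - beta if response else beta
        else:
            lik = eta if response else 1.0 - eta
        unnormalized[state] = lik * p
    total = sum(unnormalized.values())
    if total == 0.0:
        raise ValueError("observation has zero probability under the prior")
    return {state: v / total for state, v in unnormalized.items()}


states = [frozenset(), frozenset({"a"}), frozenset({"a", "b"})]
prior = {s: 1 / 3 for s in states}
posterior = bayes_update(prior, "b", True, beta=0.1, eta=0.1)
```

A correct answer on b moves most of the mass to {a, b}, the only state containing b: its posterior probability is 0.9 / (0.1 + 0.1 + 0.9) = 9/11.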

property entropy: float¶: Shannon entropy (bits) of the current distribution.

property most_likely_state: tuple[frozenset[str], float]¶: Return (state, probability) of the most probable state.

marginal_mastery()[source]¶

Marginal probability of mastering each item.

For each item q: P(q mastered) = sum of P(state) for all states containing q.

Return type:: dict[str, float]
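
As a small worked example of this sum, using a plain dict in place of a StatePosterior (the helper below is illustrative):

```python
def marginal_mastery(posterior, items):
    """P(q mastered) = total probability of the states that contain q."""
    return {q: sum(p for state, p in posterior.items() if q in state) for q in items}


posterior = {frozenset(): 0.2, frozenset({"a"}): 0.3, frozenset({"a", "b"}): 0.5}
mastery = marginal_mastery(posterior, ["a", "b"])
```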

knowledgespaces.assessment.blim.shannon_entropy(probs)[source]¶

Shannon entropy in bits, handling zero probabilities.

Parameters:: probs (ndarray)
Return type:: float

Adaptive assessment: item selection policies and termination criteria.

Provides the Expected Information Gain (EIG) policy for selecting the most informative item at each step of an adaptive assessment.

References:: Cover, T. M., & Thomas, J. A. (2006). Elements of Information Theory, 2nd ed. Wiley.

class knowledgespaces.assessment.adaptive.ItemScore(item, score)[source]¶

Score of an item under a selection policy.

Parameters:

item (str)
score (float)

knowledgespaces.assessment.adaptive.select_item_eig(posterior, candidates=None, exclude=None)[source]¶

Select the item maximizing Expected Information Gain.

EIG(q) = H(current) - E[H(posterior after observing q)]

where the expectation is over both possible responses (correct/incorrect), weighted by their marginal probability.

Parameters:

posteriorStatePosterior: Current state distribution.
candidateslist[str] or None: Items to consider. If None, uses all items in the domain.
excludeset[str] or None: Items to exclude (e.g. already asked). Applied after candidates.

Returns:

ItemScore: The best item and its EIG score.

Raises:

ValueError: If no candidates are available, or if candidates contains items not in the domain.

Parameters:

posterior (StatePosterior)
candidates (list[str] | None)
exclude (set[str] | None)

Return type:

ItemScore

Notes

The EIG for an item \(q\) is derived from the definition of mutual information between the latent state \(K\) and the response \(R_q\):

\[\begin{split}\mathrm{EIG}(q) &= I(K; R_q) \\ &= H(K) - E_{R_q}\bigl[H(K \mid R_q)\bigr] \\ &= H(K) - \bigl[\,P(R_q{=}1)\,H(K \mid R_q{=}1) + P(R_q{=}0)\,H(K \mid R_q{=}0)\bigr].\end{split}\]

The marginal \(P(R_q{=}1) = \sum_K P(R_q{=}1\mid K)\,P(K)\) comes from the law of total probability; the conditional posteriors \(P(K \mid R_q)\) are obtained by Bayes’ rule and renormalized to sum to 1.
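
The derivation above can be checked numerically. The sketch below assumes scalar beta and eta and a posterior stored as a plain dict; expected_information_gain and entropy_bits are illustrative names, not the library functions.

```python
import math


def entropy_bits(probs):
    """Shannon entropy in bits, skipping zero probabilities."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def expected_information_gain(item, posterior, beta, eta):
    current = entropy_bits(posterior.values())
    expected_after = 0.0
    for response in (True, False):
        joint = []
        for state, p in posterior.items():
            if item in state:
                lik = 1.0 - beta if response else beta
            else:
                lik = eta if response else 1.0 - eta
            joint.append(lik * p)
        marginal = sum(joint)  # P(R_q = response), law of total probability
        if marginal > 0:
            conditional = [j / marginal for j in joint]  # Bayes' rule, renormalized
            expected_after += marginal * entropy_bits(conditional)
    return current - expected_after


# Chain a < b < c with a uniform prior and noise-free responses.
chain = [frozenset(), frozenset("a"), frozenset("ab"), frozenset("abc")]
uniform = {state: 0.25 for state in chain}
scores = {q: expected_information_gain(q, uniform, beta=0.0, eta=0.0) for q in "abc"}
best = max(scores, key=scores.get)
```

In this noise-free example the middle item b splits the four states evenly and gains exactly one bit, while a and c each separate one state from three and gain less.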

References

Cover & Thomas (2006), Elements of Information Theory, §2.

knowledgespaces.assessment.adaptive.is_converged(posterior, threshold=0.85)[source]¶

Check if the assessment has converged.

Convergence occurs when the most likely state has probability above the threshold.

Parameters:

posteriorStatePosterior: Current state distribution.
thresholdfloat: Probability threshold for convergence. Must be in (0, 1]. Default 0.85.

Raises:

ValueError: If threshold is not in (0, 1].

Parameters:

posterior (StatePosterior)
threshold (float)

Return type:

bool

Instance pool: multiple questions per item.

In KST assessment, each item (competency) can be tested through multiple instances (concrete questions). The adaptive engine selects the best instance, maps it to its parent item for the BLIM update, and excludes only that specific instance from future selection.

This module provides the InstancePool data structure and an instance-aware version of select_item_eig.

class knowledgespaces.assessment.instances.Instance(id, item)[source]¶

A concrete question that tests a specific item.

Parameters:

idstr: Unique identifier for this instance.
itemstr: The item (competency) this instance tests.

Parameters:

id (str)
item (str)

class knowledgespaces.assessment.instances.InstancePool(instances)[source]¶

A collection of instances mapped to items.

Parameters:

instanceslist[Instance]: All available instances.

Raises:

ValueError: If instance IDs are not unique, or if an instance references an item not in the provided domain.

Parameters:

instances (list[Instance])

property items: set[str]¶: All unique items covered by this pool.

item_of(instance_id)[source]¶

Get the parent item of an instance.

Parameters:: instance_id (str)
Return type:: str

instances_for(item)[source]¶

Get all instance IDs for a given item.

Parameters:: item (str)
Return type:: list[str]

validate_domain(domain)[source]¶

Verify that pool items match the structure’s domain.

Raises:

ValueError: If pool contains items not in domain, or domain has items with no instances.

Parameters:

domain (frozenset[str])

Return type:

None

classmethod from_dict(mapping)[source]¶

Create from {item: [instance_id, …]}.

Example:

pool = InstancePool.from_dict({
    "addition": ["add_q1", "add_q2", "add_q3"],
    "subtraction": ["sub_q1", "sub_q2"],
})

Parameters:: mapping (dict[str, list[str]])
Return type:: InstancePool

class knowledgespaces.assessment.instances.InstanceScore(instance_id, item, score)[source]¶

Score of an instance under the EIG policy.

Parameters:

instance_id (str)
item (str)
score (float)

knowledgespaces.assessment.instances.select_instance_eig(posterior, pool, asked=None)[source]¶

Select the instance maximizing Expected Information Gain.

This is the instance-aware version of select_item_eig. It computes EIG per item, picks an item with the highest EIG (ties broken at random), then selects a random un-asked instance of that item (since instances of the same item are equivalent from the BLIM perspective).

Parameters:

posteriorStatePosterior: Current state distribution.
poolInstancePool: Available instances.
askedset[str] or None: Instance IDs already asked (excluded from selection).

Returns:

InstanceScore: The best instance, its parent item, and EIG score.

Raises:

ValueError: If no un-asked instances remain, or if pool items don’t match the structure’s domain.

Parameters:

posterior (StatePosterior)
pool (InstancePool)
asked (set[str] | None)

Return type:

InstanceScore
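
The two-stage choice made by select_instance_eig (best item first, then a random un-asked instance) can be sketched as follows. This is a hedged illustration: pick_instance is a hypothetical helper, the per-item EIG values are passed in precomputed, and restricting the choice to items that still have un-asked instances is an assumption of this sketch.

```python
import random


def pick_instance(item_scores, pool, asked=None, rng=None):
    """Return (instance_id, item, score) for the best item with an un-asked instance."""
    asked = asked or set()
    rng = rng or random.Random()
    remaining = {item: [i for i in ids if i not in asked] for item, ids in pool.items()}
    eligible = {item: s for item, s in item_scores.items() if remaining.get(item)}
    if not eligible:
        raise ValueError("no un-asked instances remain")
    top = max(eligible.values())
    # Ties between items are broken at random, as are instances of the chosen item.
    best_item = rng.choice(sorted(q for q, s in eligible.items() if s == top))
    return rng.choice(remaining[best_item]), best_item, top


pool = {"addition": ["add_q1", "add_q2"], "subtraction": ["sub_q1"]}
item_scores = {"addition": 0.9, "subtraction": 0.4}  # hypothetical precomputed EIG values
first = pick_instance(item_scores, pool, asked={"add_q1"}, rng=random.Random(0))
second = pick_instance(item_scores, pool, asked={"add_q1", "add_q2"}, rng=random.Random(0))
```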

knowledgespaces.estimation¶

EM algorithm for BLIM parameter estimation.

Estimates slip (β) and guess (η) parameters from observed response patterns using Expectation-Maximization. Supports both global (homogeneous) and per-item (heterogeneous) parameterization.

The algorithm iterates between:

E-step: compute posterior P(state | response pattern) for each pattern.
M-step: re-estimate β, η, and state prior π from sufficient statistics.
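
One iteration of this scheme can be written out in pure Python. The sketch below is not the library's vectorized implementation: em_step is a hypothetical function, parameters are per-item dicts, the response frequencies are made up, and the clipping bounds follow the [1e-6, 1 - 1e-6] interval mentioned in the estimate_blim notes.

```python
import math


def em_step(states, items, patterns, counts, beta, eta, pi):
    """One EM iteration. Returns (beta, eta, pi, log-likelihood of the inputs)."""
    weight = [0.0] * len(states)         # expected respondent count per state
    mastered = {q: 0.0 for q in items}   # expected mass with q mastered
    slips = {q: 0.0 for q in items}      # ... of which answered incorrectly
    unmastered = {q: 0.0 for q in items} # expected mass with q not mastered
    guesses = {q: 0.0 for q in items}    # ... of which answered correctly
    log_lik = 0.0
    for pattern, n in zip(patterns, counts):
        # E-step: joint P(pattern, state), then posterior P(state | pattern).
        joint = []
        for k, state in enumerate(states):
            p = pi[k]
            for q, r in zip(items, pattern):
                if q in state:
                    p *= (1 - beta[q]) if r else beta[q]
                else:
                    p *= eta[q] if r else (1 - eta[q])
            joint.append(p)
        marginal = sum(joint)
        log_lik += n * math.log(marginal)
        for k, state in enumerate(states):
            w = n * joint[k] / marginal
            weight[k] += w
            for q, r in zip(items, pattern):
                if q in state:
                    mastered[q] += w
                    slips[q] += w * (1 - r)
                else:
                    unmastered[q] += w
                    guesses[q] += w * r

    def clip(x):
        return min(max(x, 1e-6), 1 - 1e-6)

    # M-step: ratios of expected counts, clipped into [1e-6, 1 - 1e-6].
    new_beta = {q: clip(slips[q] / mastered[q]) if mastered[q] > 0 else beta[q] for q in items}
    new_eta = {q: clip(guesses[q] / unmastered[q]) if unmastered[q] > 0 else eta[q] for q in items}
    total = sum(counts)
    return new_beta, new_eta, [w / total for w in weight], log_lik


items = ["a", "b"]
states = [frozenset(), frozenset({"a"}), frozenset({"a", "b"})]
patterns = [(0, 0), (1, 0), (1, 1), (0, 1)]
counts = [30, 30, 35, 5]  # made-up frequencies, for illustration only
beta = {q: 0.1 for q in items}
eta = {q: 0.1 for q in items}
pi = [1 / 3] * 3
trace = []
for _ in range(25):
    beta, eta, pi, log_lik = em_step(states, items, patterns, counts, beta, eta, pi)
    trace.append(log_lik)
```

The recorded log-likelihoods in trace should never decrease from one iteration to the next, which is the monotonicity property EM is known for.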

References:

Doignon, J.-P., & Falmagne, J.-C. (1999). Knowledge Spaces, Chapter 12. Springer-Verlag.

Heller, J., & Wickelmaier, F. (2013). Minimum discrepancy estimation in probabilistic knowledge structures. Electronic Notes in Discrete Mathematics, 42, 49-56.

exception knowledgespaces.estimation.blim_em.ConvergenceWarning[source]¶: Emitted when an iterative estimator fails to meet its convergence criterion.

class knowledgespaces.estimation.blim_em.ResponseMatrix(items, patterns, counts=None)[source]¶

Observed response patterns from a group of respondents.

Parameters:

itemslist[str]: Item labels (columns), must match the structure’s domain.
patternsnp.ndarray: Binary matrix of shape (n_respondents, n_items). patterns[r, q] = 1 if respondent r answered item q correctly.
countsnp.ndarray or None: Optional frequency for each unique pattern. If None, each row in patterns is one respondent (count=1 each).

Parameters:

items (list[str])
patterns (ndarray)
counts (ndarray | None)

property effective_counts: ndarray¶: Counts for each pattern (ones if not provided).

class knowledgespaces.estimation.blim_em.GoodnessOfFit(G2, df, p_value, npar, AIC, BIC, BIC_npatterns)[source]¶

Goodness-of-fit statistics for a BLIM estimate.

Follows the approach of Heller & Wickelmaier (2013) and the R pks package (Heller & Wickelmaier, 2013, J. Stat. Softw.). The primary statistic is the likelihood ratio G2 (deviance), tested against a chi-squared distribution.

Attributes:

G2float: Likelihood ratio statistic: 2 * sum_r N_r ln(N_r / E_r).
dfint: Degrees of freedom: max(min(2^Q - 1, N) - npar, 0). The min(2^Q - 1, N) cap follows the pks convention: when the total sample size N is smaller than the number of possible response patterns (2^Q), the saturated model cannot be fully identified, so N replaces 2^Q - 1.
p_valuefloat: P-value from chi-squared test on G2.
nparint: Number of free parameters: |K| - 1 + 2 * Q.
AICfloat: Akaike Information Criterion: -2*LL + 2*npar.
BICfloat: Bayesian Information Criterion using the total sample size: -2*LL + ln(N)*npar. This is the standard BIC definition of Schwarz (1978, p. 461), where N is the number of independent observations contributing to the likelihood. For BLIM each of the N respondents supplies one i.i.d. draw from the pattern distribution, so the Laplace-approximation derivation of the log(N)·npar penalty applies. This is the recommended primary criterion for model selection.
BIC_npatternsfloat: Variant Bayesian Information Criterion using the number of distinct observed response patterns: -2*LL + ln(n_patterns)*npar. This matches what R pks::blim() returns: pks does not define an explicit BIC method, instead overriding nobs.blim to return the count of distinct patterns and delegating to stats::BIC (see cran/pks/R/blim.R, logLik.blim / nobs.blim). Provided for cross-package replication; not recommended as a primary selection criterion because the count of distinct patterns is bounded above by 2^Q and therefore does not satisfy the asymptotic-consistency conditions of Schwarz (1978).

Parameters:

G2 (float)
df (int)
p_value (float)
npar (int)
AIC (float)
BIC (float)
BIC_npatterns (float)
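
The parameter count, degrees of freedom and information criteria listed above reduce to a few lines of arithmetic. The helper below is a sketch of those formulas with made-up inputs; fit_criteria is a hypothetical name, and G2 and its p-value are left out.

```python
import math


def fit_criteria(n_states, n_items, n_respondents, n_patterns, log_lik):
    """Compute npar, df, AIC and both BIC variants from the documented formulas."""
    npar = n_states - 1 + 2 * n_items
    df = max(min(2 ** n_items - 1, n_respondents) - npar, 0)
    return {
        "npar": npar,
        "df": df,
        "AIC": -2 * log_lik + 2 * npar,
        "BIC": -2 * log_lik + math.log(n_respondents) * npar,
        "BIC_npatterns": -2 * log_lik + math.log(n_patterns) * npar,
    }


fit = fit_criteria(n_states=5, n_items=4, n_respondents=100, n_patterns=12, log_lik=-200.0)
```

With 5 states and 4 items there are 4 + 8 = 12 free parameters, so df = min(15, 100) - 12 = 3. Because n_patterns is smaller than N, BIC_npatterns applies a weaker penalty than BIC.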

class knowledgespaces.estimation.blim_em.BLIMEstimate(beta, eta, pi, log_likelihood, n_iterations, converged, items, states, gof, degenerate_items)[source]¶

Result of BLIM parameter estimation via EM.

Attributes:

betanp.ndarray: Slip parameters, shape (n_items,). beta[q] = P(incorrect | q mastered).
etanp.ndarray: Guess parameters, shape (n_items,). eta[q] = P(correct | q not mastered).
pinp.ndarray: State prior probabilities, shape (n_states,).
log_likelihoodfloat: Final log-likelihood of the data.
n_iterationsint: Number of EM iterations until convergence.
convergedbool: True if converged within max_iter.
itemslist[str]: Item labels corresponding to beta/eta indices.
stateslist[frozenset[str]]: Knowledge states corresponding to pi indices (same order).
gofGoodnessOfFit: Goodness-of-fit statistics (G2, df, p-value, AIC, BIC).
degenerate_itemstuple[str, …]: Items whose final beta[q] + eta[q] >= 1 - 1e-3. Such items are non-informative under the current knowledge structure: a mastering respondent is no more likely to answer correctly than a non-mastering one. The literature (Spoto, Stefanutti & Vidotto, 2013) treats this as a structural diagnostic — typically the item should be removed or the structure revised. Empty tuple when no item is degenerate.

Parameters:

beta (ndarray)
eta (ndarray)
pi (ndarray)
log_likelihood (float)
n_iterations (int)
converged (bool)
items (list[str])
states (list[frozenset[str]])
gof (GoodnessOfFit)
degenerate_items (tuple[str, ...])

beta_for(item)[source]¶

Get beta (slip) for a specific item.

Parameters:: item (str)
Return type:: float

eta_for(item)[source]¶

Get eta (guess) for a specific item.

Parameters:: item (str)
Return type:: float

beta_dict()[source]¶

Return beta as {item: value} dict.

Return type:: dict[str, float]

eta_dict()[source]¶

Return eta as {item: value} dict.

Return type:: dict[str, float]

pi_dict()[source]¶

Return pi as {state: probability} dict.

Return type:: dict[frozenset[str], float]

knowledgespaces.estimation.blim_em.estimate_blim(structure, data, *, max_iter=500, tol=1e-06, beta_init=0.1, eta_init=0.1, max_memory_bytes=8000000000)[source]¶

Estimate BLIM parameters via Expectation-Maximization.

Parameters:

structureKnowledgeStructure: The knowledge structure defining valid states.
dataResponseMatrix: Observed response patterns.
max_iterint: Maximum number of EM iterations. Default 500.
tolfloat: Convergence tolerance on log-likelihood change. Default 1e-6.
beta_initfloat or np.ndarray: Initial slip values. Scalar for homogeneous, array for per-item.
eta_initfloat or np.ndarray: Initial guess values. Scalar for homogeneous, array for per-item.
max_memory_bytesint: Hard cap on the estimated allocation for the posterior matrix (n_patterns × n_states × 8 bytes). If the estimate exceeds this value a MemoryError is raised before any large array is allocated. Default 8 GB.

Returns:

BLIMEstimate: Estimated parameters, log-likelihood, and convergence info.

Raises:

ValueError: If data items don’t match structure domain, or if init parameters are out of range.
MemoryError: If the estimated posterior allocation would exceed max_memory_bytes.

Parameters:

structure (KnowledgeStructure)
data (ResponseMatrix)
max_iter (int)
tol (float)
beta_init (float | ndarray)
eta_init (float | ndarray)
max_memory_bytes (int)

Return type:

BLIMEstimate

Notes

The M-step independently clips beta[q] and eta[q] into [1e-6, 1 - 1e-6], mirroring R pks::blim(). The canonical BLIM parameter space is the open box (0, 1) × (0, 1) per item. The joint condition beta[q] + eta[q] < 1 is the informative item condition (Falmagne & Doignon 2011 §11), not part of the parameter space, so it is not enforced inside the loop: doing so would break EM monotonicity (Dempster, Laird & Rubin 1977).

Items whose final beta[q] + eta[q] >= 1 - 1e-3 are surfaced via BLIMEstimate.degenerate_items and a ConvergenceWarning is emitted. Such items are non-informative under the current knowledge structure and the recommended remedy is structural — drop the item or revise the structure (Spoto, Stefanutti & Vidotto 2013).

References

Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. B, 39(1), 1-38.

Heller, J., & Wickelmaier, F. (2013). Minimum discrepancy estimation in probabilistic knowledge structures. ENDM, 42, 49-56.

Spoto, A., Stefanutti, L., & Vidotto, G. (2013). Assessing the local identifiability of probabilistic knowledge structures. Behavior Research Methods, 45(4), 1197-1211.

knowledgespaces.estimation.blim_em.estimate_blim_restarts(structure, data, *, n_restarts=10, max_iter=500, tol=1e-06, seed=None, max_memory_bytes=8000000000, init_range=(0.01, 0.4), init_strategy='uniform')[source]¶

Estimate BLIM parameters with multiple random restarts.

Runs estimate_blim() n_restarts times with random initial values for beta and eta, and selects the result with the highest log-likelihood. This helps avoid local optima.

The R pks package does not provide this natively — users must loop manually with randinit=TRUE.

Parameters:

structureKnowledgeStructure: The knowledge structure defining valid states.
dataResponseMatrix: Observed response patterns.
n_restartsint: Number of random restarts. Default 10.
max_iterint: Maximum EM iterations per restart.
tolfloat: Convergence tolerance per restart.
seedint or None: Random seed for reproducibility.
max_memory_bytesint: Forwarded to estimate_blim(). Default 8 GB.
init_rangetuple[float, float]: Lower/upper bounds for the random U(low, high) draw of beta_init and eta_init when init_strategy="uniform". Default (0.01, 0.4). Ignored when init_strategy="pks".
init_strategy{“uniform”, “pks”}: Random initialization strategy. "uniform" (default) draws both parameters from U(*init_range) and rescales in place until beta[q] + eta[q] < 0.95 on each item. "pks" mirrors pks::blim(..., randinit=TRUE) (R source: cran/pks/R/blim.R): each parameter is drawn from U(0, 1), then reflected as 1 - x on items where beta[q] + eta[q] >= 1 to restore the informative item condition beta[q] + eta[q] < 1.

Returns:

BLIMEstimate: The best result (highest log-likelihood) across all restarts.

Parameters:

structure (KnowledgeStructure)
data (ResponseMatrix)
n_restarts (int)
max_iter (int)
tol (float)
seed (int | None)
max_memory_bytes (int)
init_range (tuple[float, float])
init_strategy (Literal['uniform', 'pks'])

Return type:

BLIMEstimate

Notes

The default init_range=(0.01, 0.4) is a narrowed basin that avoids near-boundary draws at the identifiability frontier beta + eta = 1, where EM can stall in a degenerate-item attractor (cf. Spoto, Stefanutti & Vidotto 2013). The "pks" strategy is provided for reproducibility with the R pks package: note that pks uses runif(nitems) = U(0, 1) with reflection, not U(0, 0.5) as sometimes reported.
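
The reflection step of the "pks" strategy, as described for init_strategy, can be sketched as follows. This is an illustration of the described rule rather than a port of the R code: pks_style_init is a hypothetical helper and uses Python's random module for the U(0, 1) draws.

```python
import random


def pks_style_init(n_items, rng):
    """Draw beta, eta ~ U(0, 1) per item; reflect both where beta + eta >= 1."""
    beta = [rng.random() for _ in range(n_items)]
    eta = [rng.random() for _ in range(n_items)]
    for q in range(n_items):
        if beta[q] + eta[q] >= 1:
            # Reflection: (1 - b) + (1 - e) = 2 - (b + e) <= 1.
            beta[q], eta[q] = 1 - beta[q], 1 - eta[q]
    return beta, eta


beta0, eta0 = pks_style_init(1000, random.Random(42))
```

Reflection keeps every draw inside the unit interval while moving items with beta + eta >= 1 back below the frontier, so no draw is rejected or resampled.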

References

Heller, J., & Wickelmaier, F. (2013). Minimum discrepancy estimation in probabilistic knowledge structures. ENDM, 42, 49-56.

knowledgespaces.io¶

CSV import/export for KST objects.

Supports three standard CSV formats:

Skill map matrix: rows=items, cols=skills, binary (μ: items→skills).
Prerequisite matrix: rows=labels, cols=labels, binary (surmise relation).
Knowledge structure: state_size, state_id, then binary columns per item.

All CSV files use the first column as row index and the first row as header.

knowledgespaces.io.csv.read_skill_map(path)[source]¶

Read a skill map from CSV.

Expected format:

,skill1,skill2,...
item1,0,1,...
item2,1,0,...

Raises:

ValueError: If rows have wrong column count or non-binary values.

Parameters:

path (str | Path)

Return type:

SkillMap
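
The expected layout can be parsed with the standard csv module. The sketch below reads from a string and returns plain Python containers; parse_skill_map is a hypothetical helper that mimics the documented validation and is not the library's read_skill_map.

```python
import csv
import io


def parse_skill_map(text):
    """Parse the skill map matrix format into (skills, {item: [skills]})."""
    rows = [row for row in csv.reader(io.StringIO(text)) if row]
    skills = rows[0][1:]  # header row; the first cell is the empty index label
    mapping = {}
    for row in rows[1:]:
        item, cells = row[0], row[1:]
        if len(cells) != len(skills):
            raise ValueError(f"row {item!r} has {len(cells)} values, expected {len(skills)}")
        if any(cell not in ("0", "1") for cell in cells):
            raise ValueError(f"row {item!r} contains non-binary values")
        mapping[item] = [s for s, cell in zip(skills, cells) if cell == "1"]
    return skills, mapping


skills, mapping = parse_skill_map(",s_add,s_carry\nq1,1,0\nq2,1,1\n")
```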

knowledgespaces.io.csv.write_skill_map(skill_map, path)[source]¶

Write a skill map to CSV.

Parameters:

skill_map (SkillMap)
path (str | Path)

Return type:

None

knowledgespaces.io.csv.read_relation(path)[source]¶

Read a surmise relation from a CSV prerequisite matrix.

Expected format:

,label1,label2,...
label1,0,1,...
label2,0,0,...

Row labels must match column labels exactly.

Raises:

ValueError: If rows have wrong column count, non-binary values, or row labels don’t match header labels.
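
The three checks above can be sketched as follows. This is a minimal illustration, not the library's code: parse_relation_csv is a hypothetical name, it returns the labels and the set of (row label, column label) pairs marked with 1 instead of a SurmiseRelation, and it assumes that "match exactly" means the same labels in the same order.

```python
import csv
import io


def parse_relation_csv(text):
    """Parse a square binary matrix into (labels, set of (row, col) pairs)."""
    rows = [r for r in csv.reader(io.StringIO(text)) if r]  # skip blank lines
    labels, body = rows[0][1:], rows[1:]
    # Assumption: row labels must equal the header labels, in the same order.
    if [row[0] for row in body] != labels:
        raise ValueError("row labels do not match header labels")
    pairs = set()
    for row in body:
        if len(row) != len(labels) + 1:
            raise ValueError(f"row {row[0]!r}: wrong column count")
        for col, cell in zip(labels, row[1:]):
            if cell not in ("0", "1"):
                raise ValueError(f"row {row[0]!r}: non-binary value {cell!r}")
            if cell == "1":
                pairs.add((row[0], col))
    return labels, pairs


labels, pairs = parse_relation_csv(",a,b\na,0,1\nb,0,0\n")
print(labels, pairs)
# ['a', 'b'] {('a', 'b')}
```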

Parameters:

path (str | Path)

Return type:

SurmiseRelation

knowledgespaces.io.csv.write_relation(relation, path)[source]¶

Write a surmise relation to a CSV prerequisite matrix.

Parameters:

relation (SurmiseRelation)
path (str | Path)

Return type:

None

knowledgespaces.io.csv.read_structure(path)[source]¶

Read a knowledge structure from CSV.

Expected format:

state_size,state_id,item1,item2,...
0,0,0,0,...
1,1,1,0,...

Raises:

ValueError: If rows have wrong column count or non-binary item values.

Parameters:

path (str | Path)

Return type:

KnowledgeStructure

knowledgespaces.io.csv.write_structure(structure, path)[source]¶

Write a knowledge structure to CSV.

Parameters:

structure (KnowledgeStructure)
path (str | Path)

Return type:

None
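
To make the knowledge-structure CSV layout shown under read_structure concrete, here is a small sketch that produces it from plain Python sets. The function structure_to_csv_text is hypothetical, and two details are assumptions inferred from the sample rows rather than documented behavior: state_size is taken to be the number of items in the state, and state_id a running index over states ordered by size.

```python
import csv
import io


def structure_to_csv_text(items, states):
    """Render states as rows: state_size, state_id, then one 0/1 column per item."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["state_size", "state_id", *items])
    # Assumed ordering: by cardinality, then alphabetically by member items.
    ordered = sorted(states, key=lambda s: (len(s), sorted(s)))
    for state_id, state in enumerate(ordered):
        membership = [int(item in state) for item in items]
        writer.writerow([len(state), state_id, *membership])
    return buf.getvalue()


print(structure_to_csv_text(["a", "b"], [set(), {"a"}, {"a", "b"}]))
# state_size,state_id,a,b
# 0,0,0,0
# 1,1,1,0
# 2,2,1,1
```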

JSON serialization for KST objects.

Provides roundtrip-safe serialization for the core KST types: KnowledgeStructure, SurmiseRelation, SkillMap, and SurmiseFunction.

knowledgespaces.io.json.structure_to_dict(structure)[source]¶

Serialize a KnowledgeStructure to a JSON-compatible dict.

Parameters:

structure (KnowledgeStructure)

Return type:

dict[str, Any]

knowledgespaces.io.json.dict_to_structure(data)[source]¶

Deserialize a KnowledgeStructure from a dict.

Raises:

ValueError: If required keys are missing or have wrong types.

Parameters:

data (dict[str, Any])

Return type:

KnowledgeStructure

knowledgespaces.io.json.write_structure_json(structure, path)[source]¶

Write a KnowledgeStructure to a JSON file.

Parameters:

structure (KnowledgeStructure)
path (str | Path)

Return type:

None

knowledgespaces.io.json.read_structure_json(path)[source]¶

Read a KnowledgeStructure from a JSON file.

Parameters:

path (str | Path)

Return type:

KnowledgeStructure

knowledgespaces.io.json.relation_to_dict(relation)[source]¶

Serialize a SurmiseRelation to a JSON-compatible dict.

Parameters:

relation (SurmiseRelation)

Return type:

dict[str, Any]

knowledgespaces.io.json.dict_to_relation(data)[source]¶

Deserialize a SurmiseRelation from a dict.

Raises:

ValueError: If required keys are missing or have wrong types.

Parameters:

data (dict[str, Any])

Return type:

SurmiseRelation