Research/ Multimodal systems/ In preparation

Chalkless: recovering the logical structure of lectures

Chalkless analyzes recordings of board-based mathematics lectures and labels each segment by the role it plays in the lesson: topic, definition, example, key concept, or transition. The system pairs lightweight per-segment inference with a deterministic pass that enforces how lectures tend to be ordered over time. Its aim is an interpretable, label-free way to recover lecture structure, a problem usually handled with models trained on annotated data. Results so far are qualitative and preliminary, drawn from inspecting real lectures rather than a labeled benchmark.

ORIA Research teamMultimodal · Educational AI

2026 · In preparationVenue TBD

§ 01 · Problem

Why local classification is insufficient

A short clip from a math lecture rarely carries its own meaning. The same equation can be the definition an instructor is introducing, a line inside a worked example, or the identity that anchors a key result. What separates those roles is the segment's place in the lesson.

Take the coefficient formula a_n = (1/π) ∫ f(x) cos(nx) dx and the orthogonality fact ∫ sin(mx) sin(nx) dx = 0. Both are integrals of products of trigonometric functions. A classifier that reads one segment at a time sees nearly the same object in each, even though one belongs to a definition and the other is the idea that makes the definition work.

Recovering lecture structure is therefore a sequence problem. The role of a segment depends on the segments around it, so a method that labels each segment on its own cannot reconstruct the arc of the lesson.

The signal that fixes a segment's role often sits in a neighbouring segment, sometimes several away.

Figure 1 · Interactive

One lecture, five roles

A single lecture on Fourier series, shown as five segments in the order they were taught. Step through them. Several segments look almost the same up close, and their place in the sequence is what fixes the role each one plays.

Topic

Segment 1 of 5

f(x + 2π) = f(x)

Opens the lecture · names the subject

Local cue: a title, written and underlined

The lecture opens by naming what it covers, here periodic functions and Fourier series. On its own this looks like any title written on a board. What makes it the topic is mostly its position at the start, which a per-segment label cannot see.

Figure 1. A single Fourier-series lecture, segmented by role. The definition (Segment 2) and the key concept (Segment 4) both center on an integral of a product of trigonometric functions and are difficult to tell apart from one segment alone. Use the step controls above, or focus a tab and use the left and right arrow keys, to move through the arc. Math is rendered with KaTeX; with scripting turned off, all five segments are shown in order with plain-text math.

§ 02 · Approach

A two-stage pipeline

Chalkless runs in two stages. The first makes a rough guess for every segment. The second reconciles those guesses using rules about how lectures tend to be ordered.

01 · Input

Lecture capture

Audio and video of a board-based lecture, taken as-is.

02 · Local inference

Per-segment labels

Speech transcription, a board-activity signal from the video, and a lightweight model propose a role for each short segment.

03 · Stabilization

Temporal rules

Deterministic rules reconcile the noisy per-segment guesses into one coherent sequence.

04 · Output

Structured arc

Timestamped roles across the lecture: topic, definition, example, key concept, transition.

02.1The stabilization rules

The second stage encodes patterns that hold across board-based math lectures. A few examples of the kind of rule it applies:

Lectures usually open with a topic segment.
Examples tend to run in continuous spans rather than alternating with other roles.
A key concept commonly follows the example that motivates it.
Frequent back-and-forth switching between roles is usually inference noise.

These rules are written down by hand instead of learned by a model. Encoding them keeps the system legible and removes the need for labeled lectures. When a label changes, you can read the exact rule that caused it.

We do not claim this beats a trained CRF or a neural sequence model on accuracy. Its value is interpretability and running without annotated data.

§ 03 · Preliminary observations

What we have seen so far

These are early, qualitative observations from inspecting real lectures. They are not measured results, and we report no accuracy numbers at this stage.

Applying the temporal rules appears to produce a more coherent sequence than the per-segment guesses on their own. On inspection, the stabilized labeling reads closer to how the lecture is actually organized.
The key-concept role seems to be the hardest to identify locally. In the cases we looked at, it appears to need the surrounding sequence, the example before it and the transition after, to be recognized at all.

We treat both as directions to test properly. Neither is a settled conclusion yet.

§ 04 · Limitations

What this does not yet show

Scope is narrow. The system is built and tested on board-based mathematics lectures, and we do not expect it to transfer unchanged to other subjects or formats.
The evidence is qualitative. With no labeled benchmark yet, the observations above come from manual inspection rather than measurement.
The rules are hand-built. They encode patterns we have seen in math lectures and may not hold for other teaching styles.
The approach is interpretable but unvalidated against trained baselines. Comparing it to CRFs or neural sequence models on a labeled set is future work.

§ 05 · Paper

Paper and status

Research paper · Multimodal systems

Local Classification is Insufficient: Recovering Lecture Structure Through Temporal Constraint Enforcement

TypeResearch paper

StatusIn preparation · venue TBD

AuthorsORIA Research team

The paper sets out why recovering pedagogical structure from board-based mathematics lectures is a sequence-level problem, and describes a two-stage system that pairs local inference with deterministic temporal rules. It frames the contribution as an interpretable, label-free alternative to trained sequence models, and reports preliminary qualitative observations rather than benchmark numbers.

Lecture analysis Multimodal systems Sequence structure Educational AI

Request the preprint

Colophon

Set in Space Grotesk and IBM Plex. Figure 1 renders math with KaTeX and is driven by its step controls and the arrow keys. The page is static HTML, CSS, and JavaScript.

Status

In preparation, venue to be decided. Preprint available on request via Apply.