Skip to content
Jae Hoon Kim
Projects

GazeStill · drawing set · v0.2

Part I · Cover

U.S. Provisional Patent Application · Drawing set
Docket GS-2026-A
Sheets 5
Rev. B · v0.2
Title · Method and system for structured re-orientation upon resumption of a long-form reading interface, comprising a confidence-banded paragraph range with last-attentive sentences and a next-sentence preview, derived from a webcam-inferred disengagement event
Inventor · J. H. Kim
Internal · GazeStill
Filing target · pending pilot gate (≥80% bookmark accuracy across 3 articles) + re-entry study
Classification (proposed) · G06F 3/01 · G06F 16/957
About this Personal project · v0.2 card-centered Pilot-gate pending
Draft cover sheet and drawings for GazeStill, the anchor of the attention subtree in the portfolio (PageHaptic is its haptic-output descendant). The prototype lives in a companion repo; this page advances toward a full provisional draft once the pilot gate clears — ≥ 80 % of deliberate mid-article attention breaks land the bookmark within ± 2 paragraphs across 3 articles. The disengagement-detection mechanic is squarely anticipated by US20060256083A1 (Rosenberg / Outland, 2006), its issued counterpart US7429108B2, and US8643680B2 (Amazon); the contribution claimed here is downstream of detection — the re-entry card shown on resume (FIG. 5). The patent-draft format forces that distinction to be made on the cover, not buried in the discussion.
Abstract of the disclosure Cover-sheet boilerplate v0.2
A system and method assist a reader in resuming a long-form document at the point of inferred attention collapse by rendering, on resumption, a structured re-orientation card rather than scrolling to a single anchor. A webcam integral to the host device drives an on-device gaze regressor; an off-text-dwell detector, normalized by per-paragraph difficulty, identifies a disengagement event and a back-dated bookmark paragraph — both mechanics being acknowledged building blocks from the prior-art gaze-bookmark literature (Rosenberg 2006; Amazon 2014). The disclosed contribution is the re-entry card dispatched at the next CLOSED→RESUMED transition: it presents (i) the last two sentences read attentively before the break, (ii) the first sentence after the bookmark as a forward preview, (iii) a confidence band visualised across the paragraph range over which attention was decaying, and (iv) an explicit "Resume here" affordance that commits the scroll. Prior gaze-bookmark systems place an anchor and return the reader to it; the disclosed system instead surfaces the context immediately surrounding the break so that re-engagement does not require the reader to mentally rebuild lost context. All inference is performed on the host device; raw video frames are discarded after gaze regression, and no per-user identifier, raw video, or gaze trace is transmitted off-host.
Field
G06F 3/01 · user input
Acknowledged prior art (detection)
  • US20060256083A1 — Rosenberg / Outland · look-away place-marker
  • US7429108B2 — issued; place-marker on look-away
  • US8643680B2 — Amazon · gaze-dwell bookmark
Distinguished from (re-entry UX)
  • Rosenberg / Outland — single-anchor place-marker, no re-entry context
  • Amazon dwell-bookmark — anchor only, no contextual restore
  • Kindle Page-Flip — scroll-position heuristic, no attention model
  • Pocket / Instapaper — explicit save, no resume scaffold
Index of sheets Tap a row to jump 5 sheets

Part II · Drawings

Sheet 1 / 5 FIG. 1 · Reader view · disengagement on ¶4 (input to the re-entry card) 100
112 · reader 110 · gaze ray 100 · host (laptop) 102 · webcam 104 · rendered document ¶1 ¶2 ¶3 ¶4 ¶5 108 · fixation cluster · d(¶) high d ↓ 106 ≈ 50 cm · reading distance 150 · LEGEND fixation gaze ray bookmark d ↓
FIG. 1
Sheet 2 / 5 FIG. 2 · Disengagement detector · off-text dwell vs threshold τ_d 200
5 4 3 2 1 0 off-text dwell tail (s) 0 s 5 10 15 20 25 elapsed reading time (s) · per src/disengagement.ts 204 · τ_d = 3.0 s · base off-text threshold τ_d · 0.6 = 1.8 s (easy · Flesch 90) DIFFICULTY MULTIPLIER τ_d · 1.8 = 5.4 s (hard, above plot) 202 · off-text dwell tail · resets on on-text fixation on-text resets · 208 · bookmark fires first upward crossing of τ_d 206 · precursor back-date bookmarked ¶ 210 · LEGEND 202 · off-text dwell 204 · τ_d (Flesch-normalised) 206 · precursor back-date 208 · bookmark
FIG. 2
Sheet 3 / 5 FIG. 3 · Pipeline · webcam → fixations → disengagement → re-entry card 300
300 · ON-DEVICE BOUNDARY · no frame, no gaze trace, no per-user id transmitted off-host webcam · 302 hw · 30 fps · integral RGB · 720p frames gaze regressor · 304 WebGazer-class · sw (x, y, t) on document dwell window · 306 12 s sliding · Flesch-norm. · sw τ off-text dwell · τ_d ≈ 3.0 s re-entry card · 308 contextual restore · sw RESUME last 2 · next 1 · band store 310 · local ¶ map frames (x, y, t) dwell, τ_d card DISCARD · frames not retained per claim 1(c) · no raw RGB persisted or transmitted LEGEND hardware software signal discarded PER-FRAME BUDGET · Apple Silicon capture ≈ 33 ms regress ≈ 5 ms dwell ≈ 1 ms card on event store on event All stages execute on-device. Per claim 1(c), no frames are retained beyond the gaze regression in 304.
FIG. 3
Sheet 4 / 5 FIG. 4 · Bookmark resume state machine 400
400 · SESSION STATE MACHINE · WRITE event on disengage · RENDER card on resume READING 402 · off-text dwell < τ_d on-text fixations · I-DT DISENGAGED 404 · off-text dwell ≥ τ_d → WRITE event CLOSED 408 · tab / app exit RESUMED 406 · render re-entry card 310 · EVENT STORE ¶ · range · conf · t off-text dwell ≥ τ_d → WRITE DisengagementEvent on-text fixation · re-engage tab close tab close (no disengage · last-good ¶) resume on open → READ event · construct card user focuses WRITE READ
FIG. 4
Sheet 5 / 5 Representative FIG. 5 · Re-entry card anatomy · last attentive · next sentence · confidence band 500
500 · STRUCTURED RE-ORIENTATION CARD · rendered at CLOSED → RESUMED transition ¶3 ¶4 512 ¶5 ¶6 508 · confidence band · 0.62 over range_paragraph_ids Where you left off 502 · CARD 504 · Attention broke somewhere in this range confidence 62 % · 2 paragraphs · ¶4 – ¶5 506 · LAST ATTENTIVE READING lastTwoSentences(¶4) · src/ui.ts:95 510 · NEXT SENTENCE firstSentence(¶5) · src/ui.ts:100 Resume here › 514 514 commit → scrolls to ¶4 CARD FIELDS · DERIVED FROM DisengagementEvent 504 ← e.confidence · e.range_paragraph_ids.length 506 ← lastTwoSentences(¶bookmark) 510 ← firstSentence(¶bookmark+1) 514 → commit scroll
FIG. 5

Part III · Specification

Background of invention Prior-art context

Webcam-derived gaze estimation is well-studied. WebGazer.js (Brown / Papoutsaki et al., IJCAI 2016) and other regression-based estimators achieve roughly 100–150 px error (~4° visual angle) under reasonable lighting, sufficient for paragraph-level but not line-level resolution at typical 15-inch laptop viewing distances. Dedicated eye-tracking hardware (Tobii, EyeTribe) achieves substantially better accuracy at the cost of additional hardware. Both classes are predominantly used for analytics — heatmaps, gaze plots, attention dwell-time — rather than as a control input to user-facing application logic.

The detection mechanic of writing a bookmark on a gaze-derived look-away event is squarely anticipated by prior art: US20060256083A1 (Rosenberg / Outland, 2006) and its issued counterpart US7429108B2 disclose a gaze-responsive interface that determines a look-away event and displays a place-marker at or near the word the reader was viewing; US8643680B2 (Amazon, Selker & Baldwin, 2014) discloses gaze-dwell bookmarking and gaze-derived page-turn detection. The disclosed method does not claim the detection mechanic over these references and explicitly relies on them as enabling building blocks for the contribution that follows.

What the prior art does not address is what is rendered at the next CLOSED → RESUMED transition. Rosenberg / Outland places a single graphical marker (highlight, outline, underline) on the lost word, and Amazon scrolls to or annotates the dwell point. The reader, returning hours or days later, is given a location but no context: the surrounding text immediately preceding the break is no longer easily distinguishable from later text, and the reader must mentally rebuild the lost reading context from cold.

The disclosed contribution — the load-bearing framing of this disclosure — is a structured re-orientation card that the system constructs and renders on resume. The card presents (i) the last two sentences read attentively before the disengagement event, (ii) the first sentence after the bookmark as a forward preview, (iii) a confidence band visualised across a paragraph range over which attention was decaying, and (iv) an explicit commit affordance. The card converts a known-location bookmark into a re-engagement scaffold; it is independent of the specific detection mechanic used upstream.

The detection pipeline disclosed herein — webcam → on-device gaze regressor → off-text dwell over a paragraph hit-test, normalised by per-paragraph Flesch difficulty, with a precursor back-date to the latest paragraph receiving two or more fixations — is one practical realisation that produces the inputs the card requires. Any equivalent disengagement detector from the prior-art family above could be substituted without changing the card's structure or claims.

Summary of the invention per 37 CFR § 1.73

The disclosed system constructs and renders a structured re-orientation card (502) at the next CLOSED → RESUMED transition of a long-form reading interface, in response to a disengagement event recorded earlier in the session. The card comprises (a) a confidence-band note (504) reporting the system's estimated confidence and paragraph range over which attention was decaying; (b) a last-attentive blockquote (506) presenting the last two sentences read attentively prior to the disengagement event, extracted from the bookmarked paragraph; (c) a next-sentence blockquote (510) presenting the first sentence of the following paragraph as a forward preview; and (d) an explicit commit affordance (514) that, when actuated, scrolls the underlying document to the bookmarked paragraph and dismisses the card.

The disengagement event consumed by the card is produced by a webcam-derived pipeline running on the host device. The webcam (102) feeds an on-device gaze regressor (304); a dwell window (306) accumulates an off-text dwell signal whose threshold τ_d is normalised by per-paragraph Flesch readability; on first upward crossing of τ_d, a precursor back-date step selects as the bookmark the most recent paragraph that received at least two fixations within the active sliding window. The detection pipeline is acknowledged to overlap with prior-art gaze-bookmark systems; the card constructor (308) and the card structure itself (FIG. 5) are the disclosed contribution.

Brief description of drawings Sheets 1 – 4

Part IV · Claims

Claims 1 independent · 5 dependent · 1 apparatus Draft v0.2
What is claimed is:

1. A method for restoring a reader's engagement upon resumption of a long-form reading interface following an inferred attention break, comprising:

  1. (a)receiving, at a host computing device, a disengagement event identifying a bookmark paragraph and a confidence-banded paragraph range over which a reader's attention was decaying prior to the event;
  2. (b)upon a CLOSED → RESUMED transition of said reading interface, constructing a re-orientation card (502) comprising at least: (i) a last-attentive blockquote (506) presenting the final one or more sentences of the bookmark paragraph; (ii) a next-sentence blockquote (510) presenting at least the first sentence following the bookmark paragraph; (iii) a confidence-band note (504) reporting the said confidence and range; and (iv) a commit affordance (514);
  3. (c)rendering said re-orientation card above the bookmarked paragraph in the document layout, and visually distinguishing the confidence-banded paragraph range (508) within the rendered document;
  4. (d)upon actuation of the commit affordance, dismissing the re-orientation card and scrolling the underlying document to the bookmark paragraph; and
  5. (e)wherein the re-orientation card surfaces both the reading context immediately preceding the disengagement event and a forward preview, such that re-engagement does not require the reader to rebuild context from a single anchor.

2. The method of claim 1, wherein the disengagement event of (a) is produced by an on-device pipeline comprising a webcam-fed gaze regressor (304), an off-text dwell window (306) whose threshold τ_d is normalised by per-paragraph readability, and a precursor back-date step selecting as the bookmark paragraph the most recent paragraph receiving at least two fixations within the active window.

3. The method of claim 2, wherein per-paragraph readability is computed from a Flesch reading-ease score and the multiplier applied to τ_d is bounded between 0.6 and 1.8.

4. The method of claim 2, wherein each captured video frame is discarded after gaze regression, and no raw video frame, no gaze-location stream, and no per-user identifier is persisted on, or transmitted from, the host device.

5. The method of claim 1, wherein the disengagement event of (a) is received from any disengagement detector, including the place-marker-on-look-away detector taught by US7429108B2 or the gaze-dwell detector taught by US8643680B2; and wherein the contribution of the present claim resides in the construction and rendering of the re-orientation card at (b) and (c).

6. The method of claim 1, wherein the bookmark resolution is paragraph-level and the confidence band of (b)(iii) spans one to four contiguous paragraphs.

7. A host computing device comprising a webcam, a display, one or more processors, and non-transitory memory storing instructions which, when executed, perform the method of any one of claims 1 – 6.

Claims · 7 total · 1 independent · 5 dependent · 1 apparatus

Part V · Appendices

Prior-art rejection forecast Element-by-element novelty argument v0.2

The disclosed claims are drafted on the assumption that an examiner would cite US7429108B2 (Rosenberg / Outland; gaze-responsive place-marker on look-away) and US8643680B2 (Amazon; gaze-dwell bookmark and gaze-derived page turn) as the closest art. Below, claim 1 is read element-by-element against each reference and the surviving limitations are identified.

Claim 1 element US7429108B2 (Rosenberg) US8643680B2 (Amazon) Surviving
(a) receive disengagement event taught — look-away event is the trigger taught — gaze-dwell and absence-of-gaze events no — preamble overlap
(b)(i) last-attentive blockquote · last 1+ sentences of bookmark ¶ not taught — Rosenberg places a graphical marker on or near the lost word; no sentence extraction or quoted display not taught — Amazon highlights or scrolls to the dwell location; no sentence-level context replay yes — primary novelty
(b)(ii) next-sentence blockquote · first sentence after bookmark not taught not taught yes — primary novelty
(b)(iii) confidence-band note · confidence + paragraph-range disclosure not taught — Rosenberg's marker is a single point with no decay-range representation not taught — Amazon offers no confidence/range UI surface yes — novel
(b)(iv) explicit commit affordance not taught — Rosenberg's marker is passive; no commit step partial — Amazon allows gaze gestures (wink, dwell) to commit; the disclosed claim requires an explicit pointer-actuated affordance, not a gaze gesture yes — narrowed against Amazon by requiring non-gaze actuation
(c) visually distinguish range within underlying document not taught — marker is at a point not taught — Amazon scrolls but does not paint a band yes
(d) commit dismisses card and scrolls to bookmark n/a (no card) n/a (no card) yes — depends on card existing

Anticipated 35 U.S.C. § 102 / § 103 posture. No single reference discloses elements (b)(i)–(iv) together; the closest combination would require importing the “card-on-resume” concept from a non-gaze reading-aware system (e.g., an e-reader summary widget) and substituting Rosenberg's look-away anchor for the reader's manual cue. That combination is not motivated by the cited art: Rosenberg's place-marker is itself the disclosed assist (the reader finds the lost word from the marker alone), and Amazon's bookmarks favor gaze-gesture commit (wink, dwell, nod) rather than scaffolded re-orientation rendered at a CLOSED→RESUMED transition. The disclosed claim therefore reads as non-obvious under KSR over the cited combinations available at filing. Confidence in this analysis is bounded by a v0.2 reading of the two references; a search-report-quality novelty study has not been commissioned.

Acknowledged exposure. Amazon describes "highlighting" and "annotation" surfaces around a dwell point in several embodiments, and an examiner could press one of those embodiments against claim element (c). The mitigation is the band-over-range structure of (b)(iii) and (c) — Amazon's annotation is point-anchored at the dwell location; the disclosed band is range-anchored over range_paragraph_ids derived from a per-paragraph decay model. Should the examiner press the point, dependent claim 6 (band spans 1–4 paragraphs) and claim 1(c) ("visually distinguishing" the range, not annotating a point) carry the narrowing.

Prior-art bibliography Selected; not exhaustive
Patents (detection mechanic, acknowledged prior art)
Gaze regression and fixation identification
Reading-attention and mind-wandering signals
Reading-aware UI (non-gaze)

Part VI · Execution

Version history Draft · not filed

Promotion to a full provisional draft remains conditioned on the pilot gate. The card's value — not the detector's accuracy — is the falsifiable claim, and the planned within-subjects study (SCROLL / LOOKAWAY / RE-ENTRY) at experiment-re-entry.md is the next concrete step after the pilot.

/gazestill · v0.2 · card-centered draft
Index