Theta Lake’s new patent targets ASR errors in FinTech

May 7, 2026

Theta Lake has been awarded a United States patent — US 12,045,561, System and Method for Disambiguating Data to Improve Analysis of Electronic Content — for its TranscriptionRN® technology, a proprietary framework designed to automatically generate and rank sound-alike and look-alike terms to sharpen the analysis of electronic communications.

The patent sits at the intersection of a broader intellectual property portfolio that collectively defines Theta Lake’s approach to digital communications oversight. Its existing patents span context-based policy detection across spoken, visual, and shared content in video calls, as well as participant disambiguation and AI-assisted review workflows.

According to Theta Lake, this latest addition strengthens that foundation by addressing a deceptively simple but consequential question: what was actually said?

When transcripts get it wrong

Anyone who has worked with automated speech recognition (ASR) systems at scale will be familiar with the problem. Transcripts routinely mangle similar-sounding words, blur word boundaries, and introduce phonetic errors that bear little resemblance to the original utterance. Chat messages dashed off on platforms like Slack or Microsoft Teams are riddled with typos and shorthand. Optical character recognition (OCR) systems processing shared screens or documents introduce their own character-level distortions. A system might render “litecoin” as “light coin,” “late fees” as “ladies,” or — in a particularly striking example — “interest rate” as “pinterest rite.”

These are not fringe occurrences. They represent the everyday reality of working with communications data at any meaningful scale. For organisations relying on keyword identification to meet regulatory compliance obligations, manage privacy risks, or address cybersecurity threats, such errors are a fundamental problem: the relevant term was spoken or typed, but it may not appear in the data as captured.

What the patent covers

Patent 12,045,561 describes a two-stage framework. In the first stage, compound keywords from a domain-relevant input list are broken down into constituent seed words using both a morphological analyser — which deconstructs words according to linguistic structure — and a phonetic encoding algorithm that identifies how words might be split based on how they sound. “Litecoin,” for instance, might yield “lite” and “coin”; “payable” might become “pay” and “able.” Non-compound words are retained as-is and added to the same seed pool.

In the second stage, the system generates sound-alike and look-alike candidates for each seed word by drawing on three complementary methods: a spelling correction algorithm operating within a defined edit distance, a word formation module producing grammatical inflections and derivations, and a novel Look-Alike Sound-Alike (LASA) generator. The LASA generator — an algorithm invented by Theta Lake — blends consecutive words using grammatical formation rules to produce plausible confusion candidates. Starting from “late fees,” it surfaces terms like “ladies” or “layers,” reflecting how an ASR system might combine phonetic confusion with word boundary shifts.

All generated candidates are then scored and ranked according to phonetic similarity, frequency in real-world spoken language, and grammatical plausibility. This ranking allows downstream systems to treat high-confidence candidates differently from lower-confidence ones, depending on the task at hand.

Domain adaptability and downstream use

One of the framework’s practical strengths is its domain adaptability. TranscriptionRN® can be calibrated to any industry by supplying a word frequency list drawn from conversations in that field. A list built from financial services communications will produce candidates tuned to the vocabulary of finance; one drawn from healthcare or technology will reflect those respective terminologies. Critically, this customisation does not require retraining an underlying ASR model.

The sound-alike and look-alike lists produced by TranscriptionRN® feed directly into Theta Lake’s AI-driven risk classifiers, which analyse communications for regulatory, compliance, privacy, cybersecurity, and HR risks. Those classifiers are designed to incorporate ranked candidate lists as part of their detection logic, enabling them to identify risk-relevant language even when the underlying data is imperfect. The result, according to the company, is broader and more precise risk detection, with the ranking mechanism helping to reduce noise and focus attention where it matters most.

Linguistic expertise at the core

Theta Lake emphasises that the quality of the system’s outputs depends heavily on the quality of its inputs. The keyword sets underpinning its classifiers are informed by regulatory guidance and the practical patterns through which compliance risks surface in everyday workplace conversations. The patented TranscriptionRN® framework takes those carefully constructed inputs and expands them into a far wider set of variants — one that would be impractical to build by hand given the sheer number of ways any term can be distorted across speech, typed text, and OCR output.

The company positions this capability as essential infrastructure for modern communications surveillance, which it argues cannot be reduced to simple keyword matching. Effective risk detection requires understanding context, intent, and the many forms a relevant term might take when spoken in different accents, in noisy environments, or typed hastily into a chat window.

Read the full Theta Lake post here.

Read the daily FinTech news