Skip to main content

Section 9: Predictive Marketing AI Module — Layer 9

OTCM PROTOCOL

Comprehensive Technical Whitepaper    Version 7.0

ST22 Digital Securities Platform  |  March 2026  |  Groovy Company, Inc. dba OTCM Protocol

 

Section 9: Predictive Marketing AI Module — Layer 9

The commercial intelligence engine monitoring approximately 15,000 U.S. OTC companies in real time, scoring issuer distress and tokenization readiness, targeting verified accredited investors, and optimizing ST22 Digital Securities offering timing.

 

9.1  Module Overview and Strategic Purpose

9.1.1  The Commercial Velocity Problem

OTCM Protocol's technical infrastructure — ST22 Digital Securities tokens, 42-control Transfer Hook enforcement, the Global Unified CEDEX Liquidity Pool, and Empire Stock Transfer custody — constitutes a best-in-class architecture for tokenizing illiquid securities. Technical excellence alone does not generate issuer pipeline or trading volume.

The target universe of approximately 11,000 OTC companies with degraded or eliminated market maker eligibility represents approximately $50 billion in trapped shareholder value. These companies do not systematically self-identify as tokenization candidates. Their boards are not monitoring blockchain infrastructure developments. Their shareholders are experiencing liquidity crises in silence. Without a systematic, data-driven mechanism to identify the highest-urgency candidates and route them into the OTCM Gateway, pipeline growth remains dependent on manual outreach — a fundamentally unscalable approach for a platform designed to serve the entire OTC market.

On the investor side, ST22 Digital Securities offerings face an equivalent discovery problem. Unlike exchange-listed securities with Bloomberg terminal coverage and broker-dealer distribution, ST22 offerings are invisible to their highest-probability investors — accredited investors, family offices, and tokenized asset specialists — absent a systematic mechanism to surface opportunities at the precise moment of investor intent. The Predictive Marketing AI Module resolves both sides simultaneously, converting OTCM Protocol's technical infrastructure into active commercial velocity.

 

9.1.2  Module Architecture Philosophy

       Data Primacy — All prospecting and targeting decisions are derived from structured public data — SEC EDGAR filings and OTC Markets Group datasets — rather than purchased contact lists or broad-spectrum advertising. This produces higher conversion rates, lower acquisition costs, and full regulatory defensibility.

       Mathematical Scoring over Human Judgment — The module generates an Issuer Distress and Opportunity Score (IDOS) for every OTC-listed company in the target universe, refreshed continuously. Outreach prioritization is algorithmic, enabling the platform to process hundreds of issuers in parallel without proportional headcount growth.

       Compliance-Constrained Targeting — All investor outreach operates within Rule 506(c) general solicitation parameters — targeting only verified or verifiable accredited investors. The module enforces this programmatically, filtering wallet behavioral profiles through accreditation criteria before any outreach sequence is triggered.

 

9.1.3  Module Position in the Protocol Stack

Layer 9 operates as the commercial intelligence overlay of the nine-layer architecture, consuming oracle data from Layer 6 and feeding qualified issuer and investor records into the OTCM Issuer Gateway (Section 10) and CEDEX trading infrastructure (Section 5). The module does not alter or bypass any Transfer Hook security controls — it operates exclusively at the pre-onboarding discovery and targeting layer.

 

9.2  SEC EDGAR Integration Architecture

9.2.1  EDGAR Data Feed Overview

The SEC's Electronic Data Gathering, Analysis, and Retrieval system (EDGAR) is the authoritative public record for all U.S. registered and reporting companies. For the AI Module, EDGAR functions as a real-time issuer distress signal monitor. All data consumed from EDGAR is publicly available with no data licensing restrictions or privacy implications.

 

EDGAR Endpoint

AI Module Application

Update Mode

efts.sec.gov — Full-Text Search API

Distress language NLP scan across all MD&A filings

Real-time on each new filing

data.sec.gov/submissions/ — Filing History

Form D · 8-K · 10-K indexing per CIK

Event-driven on new submission

data.sec.gov/api/xbrl/ — Structured Financials

Financial health and liquidity ratio scoring

Quarterly on each 10-K/10-Q

EDGAR RSS Feed — New Filing Notifications

8-K trigger alert pipeline

Real-time continuous monitoring

 

9.2.2  Form D Filing Intelligence

Every company raising capital under Regulation D Rule 506(b) or 506(c) must file a Form D with the SEC within 15 days of the first sale. The module monitors Form D filings continuously to identify companies that are actively capital-raising but lack public trading market liquidity — a compound signal that strongly predicts tokenization readiness. A Form D filed 12–24 months prior with no subsequent SEC filings and no reported active trading market produces the highest urgency score in this category, indicating a company that has exhausted its private raise capacity and has no public liquidity alternative.

 

9.2.3  Natural Language Processing — Liquidity Distress Index

Annual reports (Form 10-K) and quarterly reports (Form 10-Q) contain MD&A sections where management is required to disclose material conditions affecting the company's operations and securities. The module applies NLP-based sentiment and keyword extraction to these filings, constructing a Liquidity Distress Index (LDI) for each issuer, normalized to a 0–100 scale and updated on each new filing.

 

Weight

Target Phrases

HIGH (2.0×)

"no established trading market exists" · "shareholders may have difficulty selling"

HIGH (2.0×)

"limited or no market maker activity" · "no assurance that a liquid market will develop"

MEDIUM (1.5×)

"thin trading volume" · "limited trading market" · "no broker-dealer has agreed to make a market"

INDICATOR (1.0×)

"OTC Markets" · "Pink Sheets" · "15c2-11" · "delisted" · "trading was suspended"

 

9.2.4  8-K Trigger Alert System

Form 8-K current reports disclose material events at public companies in near-real time. The module monitors the EDGAR RSS feed and scores each 8-K against a trigger classification matrix. A high-scoring 8-K routes the issuer to an expedited outreach sequence — companies disclosing material distress events typically represent high-conversion windows lasting 30–90 days.

 

Condition

Score

Response

1.03 — Bankruptcy or Receivership Filing

0.95

URGENT — shareholder protection window open; immediate outreach

4.02 — Non-Reliance on Prior Financial Statements

0.90

High priority — financial restatement distress signal

4.01 — Changes in Certifying Accountant

0.85

High priority — leadership / audit uncertainty

2.04 — Triggering Events for Debt Acceleration

0.80

High — debt covenant / liquidity stress event

5.02 — Departure / Appointment of Officers

0.65

Medium — board restructuring signal; monitor for follow-on filings

 

9.2.5  Proxy Statement Shareholder Count Intelligence

Annual proxy statements (Form DEF 14A) and Form 10-K annual reports contain holder-of-record counts, machine-readable through the EDGAR XBRL API. The module extracts and indexes shareholder count data for all target universe companies, enabling prioritization by trapped holder volume — a direct proxy for the social and financial impact of OTCM's intervention.

 

Shareholder Count

Outreach Tier

Response

10,000+ holders of record

TIER 1

Immediate outreach · direct referral pathway if Empire Stock Transfer client

2,500–9,999 holders

TIER 2

Scheduled outreach sequence · standard IDOS queue

500–2,499 holders

TIER 3

Nurture sequence · periodic re-score

< 500 holders

TIER 4

Monitoring only · flag for future re-evaluation

 

9.3  OTC Markets Group Intelligence Integration

9.3.1  Tier Degradation Monitoring

Downward tier movement — from OTCQB to Pink, from Pink Current Information to Pink Limited, or from Pink Limited to No Information or Expert Market — represents a measurable, timestamped signal of issuer deterioration that correlates directly with tokenization urgency. The module monitors tier transitions daily, assigning distress delta scores that feed into the composite IDOS calculation.

 

OTC Market Tier

Distress Weight in IDOS

Outreach Response

Dark / Expert Market (retail trading blocked)

1.00 — Maximum distress

Immediate outreach sequence

Pink No Information (zero disclosure)

0.85 — Critical

High-priority outreach

Pink Limited Information (lapsed disclosure)

0.65 — High

Scheduled outreach

Pink Current Information (meets disclosure)

0.40 — Moderate

Monitor for further degradation

OTCQB / OTCQX (active market tiers)

0.10 — Low

Secondary prospects only

 

9.3.2  Transfer Agent Cross-Reference Protocol

OTC Markets publishes transfer agent identity on each issuer's public profile. OTCM Protocol's COO Patrick Mokros serves as President of Empire Stock Transfer, which manages shareholder accounts for hundreds of public companies. The module maintains a continuously updated cross-reference index mapping OTC Markets transfer agent disclosures against the Empire client roster.

Issuers identified as current Empire Stock Transfer clients are reclassified from cold outreach targets to warm relationship leads, routed to a dedicated internal pathway. This transforms what would otherwise be a multi-month cold sales cycle into a direct executive engagement, leveraging the existing Empire relationship to open the tokenization conversation at the board level. Empire client status applies a 1.5×–2.0× multiplier to the base IDOS score for prioritization purposes.

 

9.3.3  Recovery Probability Score

The module constructs a Recovery Probability Score (RPS) for each issuer using price trajectory, volume trend, and time-since-last-trade metrics. Companies with low RPS — indicating minimal likelihood of organic market revival — score highest for tokenization urgency: their boards have fewer viable alternatives and are more receptive to OTCM's permanent liquidity solution.

 

RPS = w1 × PriceDecay + w2 × VolumeDecay + w3 × DaysInactive

 

PriceDecay   = (current_price / peak_52w_price - 1.0), bounded [0, 1]

VolumeDecay  = 1 - (avg_30d_volume / avg_prior_year_volume), bounded [0, 1]

DaysInactive = min(days_since_last_trade / 730, 1.0)

 

Weights (empirically calibrated): w1=0.35  w2=0.30  w3=0.35

 

RPS > 0.80     Critical distress   — highest tokenization urgency

RPS 0.60–0.80 →  Significant distress — high urgency

RPS 0.40–0.60 →  Moderate distress   — standard outreach

RPS < 0.40     Early stage         — nurture sequence

 

9.4  Composite IDOS Score Architecture

9.4.1  Signal Weights and Refresh Cadence

The Issuer Distress and Opportunity Score (IDOS) synthesizes all EDGAR and OTC Markets signals into a single normalized composite score for each of the approximately 15,000 companies in the target universe, refreshed continuously as new data arrives through real-time feed monitoring.

 

Signal Component

Source

Weight

Refresh Cadence

Liquidity Distress Index (LDI) — NLP scan

EDGAR 10-K / 10-Q

0.22

Per new filing

Tier Degradation Score

OTC Markets

0.18

Daily

Recovery Probability Score (RPS)

OTC Markets

0.18

Daily

Time Since Last Recorded Trade

OTC Markets

0.12

Daily

Shareholder Count (log-normalized)

EDGAR DEF 14A

0.12

Annual / on new filing

8-K Trigger Score

EDGAR RSS Feed

0.10

Real-time

Form D Urgency Score

EDGAR Form D

0.08

Per new filing

 

9.4.2  Outreach Pathway Multipliers

The composite IDOS score is adjusted by an Outreach Pathway Multiplier reflecting the quality of the existing relationship between OTCM Protocol and the issuer's transfer agent:

 

Outreach Classification

Multiplier

Criteria

Direct Referral

2.0×

Empire Stock Transfer client confirmed + 10,000+ shareholders of record

Warm Lead

1.5×

Empire Stock Transfer client confirmed · any shareholder count

Cold Prospect

1.0×

No existing Empire or OTCM relationship on file

 

9.4.3  Dynamic Priority Queue

The IDOS engine maintains a continuously ordered priority queue across all target universe companies. Outreach automation consumes from the head of this queue, dispatching the highest-IDOS issuers into active sequences while maintaining a rolling view of the top 50–100 priority targets at any given time. Companies already in active outreach sequences are excluded from re-dispatch until the current sequence resolves — preventing duplicate contact and ensuring professional engagement cadence.

 

9.5  AI Model Technical Specification

9.5.1  Model Architecture — IDOS Scoring Engine

The IDOS is computed by a gradient boosted decision tree ensemble (XGBoost v2.x). This architecture was selected for three reasons specific to the OTCM use case: interpretability — each scoring decision can be traced to specific feature contributions, supporting regulatory defensibility; performance on sparse tabular data — SEC EDGAR filings produce sparse, irregular time-series data that gradient boosting handles more effectively than sequence models; and low inference latency — sub-10ms per issuer scoring enables real-time refreshes across 15,000 companies.

 

9.5.2  Training Methodology

 

Parameter

Specification

Training data window

2015–2024 SEC EDGAR + OTC Markets historical data

Training sample size

~180,000 issuer-quarter observations

Positive class definition

Issuers that engaged with a tokenization or liquidity solution within 12 months

Negative class definition

Issuers that remained in grey or expert market without engagement

Train / validation / test split

70% / 15% / 15% — time-based split, no future data leakage

Cross-validation

Walk-forward validation — 4 folds, each fold = 2 years

Held-out test period

2023–2024 data (never seen during training)

Primary evaluation metric

AUC-ROC on held-out test set (target: > 0.78)

 

9.5.3  Model Drift Detection and Retraining Policy

 

Metric

Monitoring Frequency

Drift Threshold

Response

Score distribution shift (PSI)

Weekly

PSI > 0.20

Alert + manual review

Prediction accuracy on recent cohort

Monthly

AUC-ROC drops > 0.05 from baseline

Trigger retraining

Feature distribution shift

Weekly

KS-statistic > 0.15 for any feature

Alert + feature audit

Engagement prediction vs. actual

Quarterly

Precision/recall drops > 10%

Full model retraining

 

Scheduled quarterly retraining with 6-month rolling training window expansion. Emergency retraining triggered if drift thresholds are exceeded. Retrained models require validation AUC-ROC ≥ 0.75 before promotion to production.

 

9.5.4  Model Governance Gates

 

Governance Gate

Requirement

Approver

New model promotion

Validation AUC-ROC ≥ 0.75 on held-out test set

CTO — Frank Yglesias

Feature addition

Privacy review + SHAP impact analysis

CLO — Jeff Turner

Training data expansion

Data provenance documentation

CLO — Jeff Turner

Production deployment

A/B shadow mode run ≥ 2 weeks

Engineering Lead

Emergency rollback

Triggered automatically if drift threshold exceeded

Automated

 

9.6  Investor-Side Predictive Intelligence

9.6.1  Wallet Behavioral Profiling

On the investor side, the module operates a parallel intelligence layer targeting accredited investors most likely to participate in ST22 Digital Securities offerings and secondary market trading on CEDEX. Wallet behavioral profiles are constructed from on-chain analytics aggregated across Solana mainnet, supplemented by off-chain identity enrichment where publicly available. All on-chain data is inherently public on the Solana blockchain.

 

On-Chain Behavioral Signal

Investor Profile Implication

Holdings of tokenized equity instruments

Demonstrated preference for regulated tokenized assets

Idle USDC/USDT balance > $25,000

Available capital for new position deployment

Historical participation in regulated offerings

Experience with compliant Digital Securities mechanics

Governance token holdings across protocols

Protocol-engaged sophisticated investor profile

Average position hold time > 90 days

Long-horizon investor — lower churn risk, better fit for Rule 144 holding period

LP provision behavior in compliant pools

Active market participant — yield-seeking behavior compatible with CEDEX economics

 

9.6.2  Rule 506(c) Compliance Enforcement

All investor-side outreach operates under Rule 506(c) of Regulation D, which permits general solicitation and advertising provided that issuers take reasonable steps to verify that all purchasers are accredited investors. The module enforces this constraint programmatically at the targeting layer — no outreach sequence is triggered for any wallet or identity that has not cleared the accreditation proxy filter.

Verified accreditation records from the Empire Stock Transfer onboarding registry are shared with the investor targeting module via the internal compliance data bus, eliminating duplicative verification workflows for investors who have previously been certified as accredited through prior ST22 participation on the platform.

 

9.6.3  Launch Timing Optimization Engine (LTOE)

Post-deployment analysis identified that suboptimal offering timing — competing launches, adverse macroeconomic conditions, RPC network congestion, and insufficient investor pre-warming — created measurable drag on ST22 offering performance during the critical initial distribution window. The Launch Timing Optimization Engine (LTOE) generates a Launch Readiness Score (LRS) for each pending ST22 offering by modeling five environmental factors:

 

       Competing offering schedules — Competing ST22 offerings on CEDEX within the preceding and following 7-day window — investor attention is finite

       Solana network conditions — Mainnet RPC congestion index and historical throughput patterns — high congestion increases transaction latency and cost

       Macroeconomic sentiment — Sentiment index (−1.0 risk-off to +1.0 risk-on) derived from market data feeds — risk-off conditions reduce investor appetite for new positions

       Investor pool depth — Count of Empire-verified accredited investor wallets confirmed as available and ready to participate — a direct demand-side readiness signal

       Community engagement readiness — Wallet pre-registration metrics and investor notification open rates from prior outreach sequences

 

LRS outputs below 0.55 trigger a launch deferral recommendation, with the engine proposing an alternative window within the following 7–14 days. LRS outputs above 0.80 trigger an expedited investor notification sequence to maximize participation in the offering's first 72 hours.

 

9.7  OTCM Security Token Integration

9.7.1  AI Module as Staking Tier Capability

The Predictive Marketing AI Module integrates into the OTCM Security Token staking tier architecture as a gated platform capability. Access to AI module features is unlocked through active token staking. All benefits require active platform engagement — staking alone, without module usage, confers no incremental value.

 

Staking Tier

OTCM Staked

AI Module Access Unlocked

Bronze

1,000 OTCM

IDOS dashboard read-only · top 500 issuers by score

Silver

10,000 OTCM

Full IDOS access + weekly AI-generated prospect report

Gold

50,000 OTCM

Full EDGAR NLP Engine + OTC Markets tier alerts + investor pool analytics

Platinum

100,000 OTCM

Complete suite: real-time feeds · IDOS priority queue · LTOE · wallet profiling · outreach automation

 

9.7.2  Per-Operation Token Burn Mechanics

Individual AI module operations carry OTCM token burn costs, generating deflationary pressure through genuine platform utilization. Burns execute at the smart contract level, are irreversible, and are permanently recorded on-chain.

 

AI Module Operation

OTCM Burn Cost

EDGAR batch query execution (per 500 records)

1,000 OTCM burned

OTC Markets feed refresh subscription (monthly)

50 OTCM burned

Investor wallet behavioral report (per ST22 offering)

250 OTCM burned

Launch Readiness Score analysis (per deployment)

500 OTCM burned

Full IDOS universe refresh cycle

1,000 OTCM burned

Automated outreach sequence launch (per issuer)

750 OTCM burned

 

9.8  Data Moat and Competitive Defensibility

Every campaign executed, every issuer conversion or non-conversion, every investor wallet interaction, and every ST22 offering outcome feeds back into the module's training data. The module improves continuously with platform scale. After 500 ST22 launches, Groovy Company, Inc. dba OTCM Protocol will have accumulated a dataset on Digital Securities investor behavior, issuer conversion patterns, and offering timing correlations that no competitor can replicate without operating at equivalent scale and regulatory depth.

This dataset constitutes a structural moat: the AI's accuracy and efficiency improve as the platform grows, which improves commercial outcomes, which accelerates platform growth — compounding the data advantage with each issuer onboarded. Competitors entering the issuer tokenization market without equivalent historical data will face systematically lower conversion rates and higher acquisition costs for an indefinite period.

 

Self-Reinforcing Competitive Advantage

Each successful issuer conversion generates ST22 trading volume. Volume generates transaction fee revenue. Fee revenue deepens the Global Unified CEDEX Liquidity Pool. Deeper pool improves secondary market quality, making the platform more attractive to the next issuer and the next investor. Improved AI module accuracy drives faster issuer conversion at lower cost. The data advantage is therefore self-funding — requiring no incremental investment beyond normal platform operations to maintain and grow.

 

9.9  Performance Specifications

 

Performance Metric

Specification

IDOS refresh latency — 8-K trigger event

< 60 seconds from EDGAR RSS publication to score update

IDOS full universe refresh cadence

Every 24 hours · continuous partial refresh on real-time events

EDGAR full-text search throughput

1,000 CIKs per batch cycle

OTC Markets tier change detection

Near real-time · 15-minute polling interval

Investor wallet profile refresh

Every 6 hours · event-driven on large on-chain movements

Launch Readiness Score update frequency

Every 4 hours · real-time on Solana congestion events

Priority queue depth

15,000+ issuers · memory-resident with persistent backing store

Empire cross-reference match latency

< 5 seconds on new issuer ingestion

Outreach sequence trigger delay

< 2 minutes from IDOS threshold breach

 

9.10  Regulatory and Privacy Considerations

All data sources consumed by the Predictive Marketing AI Module are publicly available. SEC EDGAR is a public database operated for the express purpose of providing universal investor access to issuer filings. OTC Markets Group publishes tier data, issuer profiles, and trading statistics as a market infrastructure function. No proprietary, non-public, or personally identifiable data is accessed without explicit consent in the issuer prospecting pipeline.

Investor-side wallet behavioral profiling operates on on-chain transaction data, which is inherently public on the Solana blockchain. Off-chain identity enrichment is limited to publicly available professional data sources. All investor outreach is compliant with CAN-SPAM Act requirements, GDPR Article 6(1)(f) legitimate interest basis where applicable, and Rule 506(c) general solicitation standards. The module does not store or process payment card data, social security numbers, or health information. All data handling practices are documented in the OTCM Protocol Privacy Policy and subject to the annual compliance audit framework.

 

Rule 506(c) Compliance Gate

The AI module enforces Rule 506(c) compliance programmatically. No investor outreach sequence is triggered for any wallet or identity that has not cleared the module's accreditation proxy filter. Where the wallet has a verified accreditation record on file with Empire Stock Transfer, that record is used directly. Where it does not, the module routes the prospect through the Empire onboarding flow before any offering-specific communication is triggered. This is not a policy — it is a code-level constraint on the outreach automation pipeline.

 

Groovy Company, Inc. dba OTCM Protocol  |  CIK: 1499275  |  Version 7.0  |  March 2026  |  Confidential