Section 9: Predictive Marketing AI Module — Layer 9
|
OTCM PROTOCOL Comprehensive Technical Whitepaper — Version 7.0 ST22 Digital Securities Platform | March 2026 | Groovy Company, Inc. dba OTCM Protocol |
Section 9: Predictive Marketing AI Module — Layer 9
The commercial intelligence engine monitoring approximately 15,000 U.S. OTC companies in real time, scoring issuer distress and tokenization readiness, targeting verified accredited investors, and optimizing ST22 Digital Securities offering timing.
9.1 Module Overview and Strategic Purpose
9.1.1 The Commercial Velocity Problem
OTCM Protocol's technical infrastructure — ST22 Digital Securities tokens, 42-control Transfer Hook enforcement, the Global Unified CEDEX Liquidity Pool, and Empire Stock Transfer custody — constitutes a best-in-class architecture for tokenizing illiquid securities. Technical excellence alone does not generate issuer pipeline or trading volume.
The target universe of approximately 11,000 OTC companies with degraded or eliminated market maker eligibility represents approximately $50 billion in trapped shareholder value. These companies do not systematically self-identify as tokenization candidates. Their boards are not monitoring blockchain infrastructure developments. Their shareholders are experiencing liquidity crises in silence. Without a systematic, data-driven mechanism to identify the highest-urgency candidates and route them into the OTCM Gateway, pipeline growth remains dependent on manual outreach — a fundamentally unscalable approach for a platform designed to serve the entire OTC market.
On the investor side, ST22 Digital Securities offerings face an equivalent discovery problem. Unlike exchange-listed securities with Bloomberg terminal coverage and broker-dealer distribution, ST22 offerings are invisible to their highest-probability investors — accredited investors, family offices, and tokenized asset specialists — absent a systematic mechanism to surface opportunities at the precise moment of investor intent. The Predictive Marketing AI Module resolves both sides simultaneously, converting OTCM Protocol's technical infrastructure into active commercial velocity.
9.1.2 Module Architecture Philosophy
• Data Primacy — All prospecting and targeting decisions are derived from structured public data — SEC EDGAR filings and OTC Markets Group datasets — rather than purchased contact lists or broad-spectrum advertising. This produces higher conversion rates, lower acquisition costs, and full regulatory defensibility.
• Mathematical Scoring over Human Judgment — The module generates an Issuer Distress and Opportunity Score (IDOS) for every OTC-listed company in the target universe, refreshed continuously. Outreach prioritization is algorithmic, enabling the platform to process hundreds of issuers in parallel without proportional headcount growth.
• Compliance-Constrained Targeting — All investor outreach operates within Rule 506(c) general solicitation parameters — targeting only verified or verifiable accredited investors. The module enforces this programmatically, filtering wallet behavioral profiles through accreditation criteria before any outreach sequence is triggered.
9.1.3 Module Position in the Protocol Stack
Layer 9 operates as the commercial intelligence overlay of the nine-layer architecture, consuming oracle data from Layer 6 and feeding qualified issuer and investor records into the OTCM Issuer Gateway (Section 10) and CEDEX trading infrastructure (Section 5). The module does not alter or bypass any Transfer Hook security controls — it operates exclusively at the pre-onboarding discovery and targeting layer.
9.2 SEC EDGAR Integration Architecture
9.2.1 EDGAR Data Feed Overview
The SEC's Electronic Data Gathering, Analysis, and Retrieval system (EDGAR) is the authoritative public record for all U.S. registered and reporting companies. For the AI Module, EDGAR functions as a real-time issuer distress signal monitor. All data consumed from EDGAR is publicly available with no data licensing restrictions or privacy implications.
|
EDGAR Endpoint |
AI Module Application |
Update Mode |
|
efts.sec.gov — Full-Text Search API |
Distress language NLP scan across all MD&A filings |
Real-time on each new filing |
|
data.sec.gov/submissions/ — Filing History |
Form D · 8-K · 10-K indexing per CIK |
Event-driven on new submission |
|
data.sec.gov/api/xbrl/ — Structured Financials |
Financial health and liquidity ratio scoring |
Quarterly on each 10-K/10-Q |
|
EDGAR RSS Feed — New Filing Notifications |
8-K trigger alert pipeline |
Real-time continuous monitoring |
9.2.2 Form D Filing Intelligence
Every company raising capital under Regulation D Rule 506(b) or 506(c) must file a Form D with the SEC within 15 days of the first sale. The module monitors Form D filings continuously to identify companies that are actively capital-raising but lack public trading market liquidity — a compound signal that strongly predicts tokenization readiness. A Form D filed 12–24 months prior with no subsequent SEC filings and no reported active trading market produces the highest urgency score in this category, indicating a company that has exhausted its private raise capacity and has no public liquidity alternative.
9.2.3 Natural Language Processing — Liquidity Distress Index
Annual reports (Form 10-K) and quarterly reports (Form 10-Q) contain MD&A sections where management is required to disclose material conditions affecting the company's operations and securities. The module applies NLP-based sentiment and keyword extraction to these filings, constructing a Liquidity Distress Index (LDI) for each issuer, normalized to a 0–100 scale and updated on each new filing.
|
Weight |
Target Phrases |
|
HIGH (2.0×) |
"no established trading market exists" · "shareholders may have difficulty selling" |
|
HIGH (2.0×) |
"limited or no market maker activity" · "no assurance that a liquid market will develop" |
|
MEDIUM (1.5×) |
"thin trading volume" · "limited trading market" · "no broker-dealer has agreed to make a market" |
|
INDICATOR (1.0×) |
"OTC Markets" · "Pink Sheets" · "15c2-11" · "delisted" · "trading was suspended" |
9.2.4 8-K Trigger Alert System
Form 8-K current reports disclose material events at public companies in near-real time. The module monitors the EDGAR RSS feed and scores each 8-K against a trigger classification matrix. A high-scoring 8-K routes the issuer to an expedited outreach sequence — companies disclosing material distress events typically represent high-conversion windows lasting 30–90 days.
|
Condition |
Score |
Response |
|
1.03 — Bankruptcy or Receivership Filing |
0.95 |
URGENT — shareholder protection window open; immediate outreach |
|
4.02 — Non-Reliance on Prior Financial Statements |
0.90 |
High priority — financial restatement distress signal |
|
4.01 — Changes in Certifying Accountant |
0.85 |
High priority — leadership / audit uncertainty |
|
2.04 — Triggering Events for Debt Acceleration |
0.80 |
High — debt covenant / liquidity stress event |
|
5.02 — Departure / Appointment of Officers |
0.65 |
Medium — board restructuring signal; monitor for follow-on filings |
9.2.5 Proxy Statement Shareholder Count Intelligence
Annual proxy statements (Form DEF 14A) and Form 10-K annual reports contain holder-of-record counts, machine-readable through the EDGAR XBRL API. The module extracts and indexes shareholder count data for all target universe companies, enabling prioritization by trapped holder volume — a direct proxy for the social and financial impact of OTCM's intervention.
|
Shareholder Count |
Outreach Tier |
Response |
|
10,000+ holders of record |
TIER 1 |
Immediate outreach · direct referral pathway if Empire Stock Transfer client |
|
2,500–9,999 holders |
TIER 2 |
Scheduled outreach sequence · standard IDOS queue |
|
500–2,499 holders |
TIER 3 |
Nurture sequence · periodic re-score |
|
< 500 holders |
TIER 4 |
Monitoring only · flag for future re-evaluation |
9.3 OTC Markets Group Intelligence Integration
9.3.1 Tier Degradation Monitoring
Downward tier movement — from OTCQB to Pink, from Pink Current Information to Pink Limited, or from Pink Limited to No Information or Expert Market — represents a measurable, timestamped signal of issuer deterioration that correlates directly with tokenization urgency. The module monitors tier transitions daily, assigning distress delta scores that feed into the composite IDOS calculation.
|
OTC Market Tier |
Distress Weight in IDOS |
Outreach Response |
|
Dark / Expert Market (retail trading blocked) |
1.00 — Maximum distress |
Immediate outreach sequence |
|
Pink No Information (zero disclosure) |
0.85 — Critical |
High-priority outreach |
|
Pink Limited Information (lapsed disclosure) |
0.65 — High |
Scheduled outreach |
|
Pink Current Information (meets disclosure) |
0.40 — Moderate |
Monitor for further degradation |
|
OTCQB / OTCQX (active market tiers) |
0.10 — Low |
Secondary prospects only |
9.3.2 Transfer Agent Cross-Reference Protocol
OTC Markets publishes transfer agent identity on each issuer's public profile. OTCM Protocol's COO Patrick Mokros serves as President of Empire Stock Transfer, which manages shareholder accounts for hundreds of public companies. The module maintains a continuously updated cross-reference index mapping OTC Markets transfer agent disclosures against the Empire client roster.
Issuers identified as current Empire Stock Transfer clients are reclassified from cold outreach targets to warm relationship leads, routed to a dedicated internal pathway. This transforms what would otherwise be a multi-month cold sales cycle into a direct executive engagement, leveraging the existing Empire relationship to open the tokenization conversation at the board level. Empire client status applies a 1.5×–2.0× multiplier to the base IDOS score for prioritization purposes.
9.3.3 Recovery Probability Score
The module constructs a Recovery Probability Score (RPS) for each issuer using price trajectory, volume trend, and time-since-last-trade metrics. Companies with low RPS — indicating minimal likelihood of organic market revival — score highest for tokenization urgency: their boards have fewer viable alternatives and are more receptive to OTCM's permanent liquidity solution.
|
RPS = w1 × PriceDecay + w2 × VolumeDecay + w3 × DaysInactive
PriceDecay = (current_price / peak_52w_price - 1.0), bounded [0, 1] VolumeDecay = 1 - (avg_30d_volume / avg_prior_year_volume), bounded [0, 1] DaysInactive = min(days_since_last_trade / 730, 1.0)
Weights (empirically calibrated): w1=0.35 w2=0.30 w3=0.35
RPS > 0.80 → Critical distress — highest tokenization urgency RPS 0.60–0.80 → Significant distress — high urgency RPS 0.40–0.60 → Moderate distress — standard outreach RPS < 0.40 → Early stage — nurture sequence |
9.4 Composite IDOS Score Architecture
9.4.1 Signal Weights and Refresh Cadence
The Issuer Distress and Opportunity Score (IDOS) synthesizes all EDGAR and OTC Markets signals into a single normalized composite score for each of the approximately 15,000 companies in the target universe, refreshed continuously as new data arrives through real-time feed monitoring.
|
Signal Component |
Source |
Weight |
Refresh Cadence |
|
Liquidity Distress Index (LDI) — NLP scan |
EDGAR 10-K / 10-Q |
0.22 |
Per new filing |
|
Tier Degradation Score |
OTC Markets |
0.18 |
Daily |
|
Recovery Probability Score (RPS) |
OTC Markets |
0.18 |
Daily |
|
Time Since Last Recorded Trade |
OTC Markets |
0.12 |
Daily |
|
Shareholder Count (log-normalized) |
EDGAR DEF 14A |
0.12 |
Annual / on new filing |
|
8-K Trigger Score |
EDGAR RSS Feed |
0.10 |
Real-time |
|
Form D Urgency Score |
EDGAR Form D |
0.08 |
Per new filing |
9.4.2 Outreach Pathway Multipliers
The composite IDOS score is adjusted by an Outreach Pathway Multiplier reflecting the quality of the existing relationship between OTCM Protocol and the issuer's transfer agent:
|
Outreach Classification |
Multiplier |
Criteria |
|
Direct Referral |
2.0× |
Empire Stock Transfer client confirmed + 10,000+ shareholders of record |
|
Warm Lead |
1.5× |
Empire Stock Transfer client confirmed · any shareholder count |
|
Cold Prospect |
1.0× |
No existing Empire or OTCM relationship on file |
9.4.3 Dynamic Priority Queue
The IDOS engine maintains a continuously ordered priority queue across all target universe companies. Outreach automation consumes from the head of this queue, dispatching the highest-IDOS issuers into active sequences while maintaining a rolling view of the top 50–100 priority targets at any given time. Companies already in active outreach sequences are excluded from re-dispatch until the current sequence resolves — preventing duplicate contact and ensuring professional engagement cadence.
9.5 AI Model Technical Specification
9.5.1 Model Architecture — IDOS Scoring Engine
The IDOS is computed by a gradient boosted decision tree ensemble (XGBoost v2.x). This architecture was selected for three reasons specific to the OTCM use case: interpretability — each scoring decision can be traced to specific feature contributions, supporting regulatory defensibility; performance on sparse tabular data — SEC EDGAR filings produce sparse, irregular time-series data that gradient boosting handles more effectively than sequence models; and low inference latency — sub-10ms per issuer scoring enables real-time refreshes across 15,000 companies.
9.5.2 Training Methodology
|
Parameter |
Specification |
|
Training data window |
2015–2024 SEC EDGAR + OTC Markets historical data |
|
Training sample size |
~180,000 issuer-quarter observations |
|
Positive class definition |
Issuers that engaged with a tokenization or liquidity solution within 12 months |
|
Negative class definition |
Issuers that remained in grey or expert market without engagement |
|
Train / validation / test split |
70% / 15% / 15% — time-based split, no future data leakage |
|
Cross-validation |
Walk-forward validation — 4 folds, each fold = 2 years |
|
Held-out test period |
2023–2024 data (never seen during training) |
|
Primary evaluation metric |
AUC-ROC on held-out test set (target: > 0.78) |
9.5.3 Model Drift Detection and Retraining Policy
|
Metric |
Monitoring Frequency |
Drift Threshold |
Response |
|
Score distribution shift (PSI) |
Weekly |
PSI > 0.20 |
Alert + manual review |
|
Prediction accuracy on recent cohort |
Monthly |
AUC-ROC drops > 0.05 from baseline |
Trigger retraining |
|
Feature distribution shift |
Weekly |
KS-statistic > 0.15 for any feature |
Alert + feature audit |
|
Engagement prediction vs. actual |
Quarterly |
Precision/recall drops > 10% |
Full model retraining |
Scheduled quarterly retraining with 6-month rolling training window expansion. Emergency retraining triggered if drift thresholds are exceeded. Retrained models require validation AUC-ROC ≥ 0.75 before promotion to production.
9.5.4 Model Governance Gates
|
Governance Gate |
Requirement |
Approver |
|
New model promotion |
Validation AUC-ROC ≥ 0.75 on held-out test set |
CTO — Frank Yglesias |
|
Feature addition |
Privacy review + SHAP impact analysis |
CLO — Jeff Turner |
|
Training data expansion |
Data provenance documentation |
CLO — Jeff Turner |
|
Production deployment |
A/B shadow mode run ≥ 2 weeks |
Engineering Lead |
|
Emergency rollback |
Triggered automatically if drift threshold exceeded |
Automated |
9.6 Investor-Side Predictive Intelligence
9.6.1 Wallet Behavioral Profiling
On the investor side, the module operates a parallel intelligence layer targeting accredited investors most likely to participate in ST22 Digital Securities offerings and secondary market trading on CEDEX. Wallet behavioral profiles are constructed from on-chain analytics aggregated across Solana mainnet, supplemented by off-chain identity enrichment where publicly available. All on-chain data is inherently public on the Solana blockchain.
|
On-Chain Behavioral Signal |
Investor Profile Implication |
|
Holdings of tokenized equity instruments |
Demonstrated preference for regulated tokenized assets |
|
Idle USDC/USDT balance > $25,000 |
Available capital for new position deployment |
|
Historical participation in regulated offerings |
Experience with compliant Digital Securities mechanics |
|
Governance token holdings across protocols |
Protocol-engaged sophisticated investor profile |
|
Average position hold time > 90 days |
Long-horizon investor — lower churn risk, better fit for Rule 144 holding period |
|
LP provision behavior in compliant pools |
Active market participant — yield-seeking behavior compatible with CEDEX economics |
9.6.2 Rule 506(c) Compliance Enforcement
All investor-side outreach operates under Rule 506(c) of Regulation D, which permits general solicitation and advertising provided that issuers take reasonable steps to verify that all purchasers are accredited investors. The module enforces this constraint programmatically at the targeting layer — no outreach sequence is triggered for any wallet or identity that has not cleared the accreditation proxy filter.
Verified accreditation records from the Empire Stock Transfer onboarding registry are shared with the investor targeting module via the internal compliance data bus, eliminating duplicative verification workflows for investors who have previously been certified as accredited through prior ST22 participation on the platform.
9.6.3 Launch Timing Optimization Engine (LTOE)
Post-deployment analysis identified that suboptimal offering timing — competing launches, adverse macroeconomic conditions, RPC network congestion, and insufficient investor pre-warming — created measurable drag on ST22 offering performance during the critical initial distribution window. The Launch Timing Optimization Engine (LTOE) generates a Launch Readiness Score (LRS) for each pending ST22 offering by modeling five environmental factors:
• Competing offering schedules — Competing ST22 offerings on CEDEX within the preceding and following 7-day window — investor attention is finite
• Solana network conditions — Mainnet RPC congestion index and historical throughput patterns — high congestion increases transaction latency and cost
• Macroeconomic sentiment — Sentiment index (−1.0 risk-off to +1.0 risk-on) derived from market data feeds — risk-off conditions reduce investor appetite for new positions
• Investor pool depth — Count of Empire-verified accredited investor wallets confirmed as available and ready to participate — a direct demand-side readiness signal
• Community engagement readiness — Wallet pre-registration metrics and investor notification open rates from prior outreach sequences
LRS outputs below 0.55 trigger a launch deferral recommendation, with the engine proposing an alternative window within the following 7–14 days. LRS outputs above 0.80 trigger an expedited investor notification sequence to maximize participation in the offering's first 72 hours.
9.7 OTCM Security Token Integration
9.7.1 AI Module as Staking Tier Capability
The Predictive Marketing AI Module integrates into the OTCM Security Token staking tier architecture as a gated platform capability. Access to AI module features is unlocked through active token staking. All benefits require active platform engagement — staking alone, without module usage, confers no incremental value.
|
Staking Tier |
OTCM Staked |
AI Module Access Unlocked |
|
Bronze |
1,000 OTCM |
IDOS dashboard read-only · top 500 issuers by score |
|
Silver |
10,000 OTCM |
Full IDOS access + weekly AI-generated prospect report |
|
Gold |
50,000 OTCM |
Full EDGAR NLP Engine + OTC Markets tier alerts + investor pool analytics |
|
Platinum |
100,000 OTCM |
Complete suite: real-time feeds · IDOS priority queue · LTOE · wallet profiling · outreach automation |
9.7.2 Per-Operation Token Burn Mechanics
Individual AI module operations carry OTCM token burn costs, generating deflationary pressure through genuine platform utilization. Burns execute at the smart contract level, are irreversible, and are permanently recorded on-chain.
|
AI Module Operation |
OTCM Burn Cost |
|
EDGAR batch query execution (per 500 records) |
1,000 OTCM burned |
|
OTC Markets feed refresh subscription (monthly) |
50 OTCM burned |
|
Investor wallet behavioral report (per ST22 offering) |
250 OTCM burned |
|
Launch Readiness Score analysis (per deployment) |
500 OTCM burned |
|
Full IDOS universe refresh cycle |
1,000 OTCM burned |
|
Automated outreach sequence launch (per issuer) |
750 OTCM burned |
9.8 Data Moat and Competitive Defensibility
Every campaign executed, every issuer conversion or non-conversion, every investor wallet interaction, and every ST22 offering outcome feeds back into the module's training data. The module improves continuously with platform scale. After 500 ST22 launches, Groovy Company, Inc. dba OTCM Protocol will have accumulated a dataset on Digital Securities investor behavior, issuer conversion patterns, and offering timing correlations that no competitor can replicate without operating at equivalent scale and regulatory depth.
This dataset constitutes a structural moat: the AI's accuracy and efficiency improve as the platform grows, which improves commercial outcomes, which accelerates platform growth — compounding the data advantage with each issuer onboarded. Competitors entering the issuer tokenization market without equivalent historical data will face systematically lower conversion rates and higher acquisition costs for an indefinite period.
|
Self-Reinforcing Competitive Advantage Each successful issuer conversion generates ST22 trading volume. Volume generates transaction fee revenue. Fee revenue deepens the Global Unified CEDEX Liquidity Pool. Deeper pool improves secondary market quality, making the platform more attractive to the next issuer and the next investor. Improved AI module accuracy drives faster issuer conversion at lower cost. The data advantage is therefore self-funding — requiring no incremental investment beyond normal platform operations to maintain and grow. |
9.9 Performance Specifications
|
Performance Metric |
Specification |
|
IDOS refresh latency — 8-K trigger event |
< 60 seconds from EDGAR RSS publication to score update |
|
IDOS full universe refresh cadence |
Every 24 hours · continuous partial refresh on real-time events |
|
EDGAR full-text search throughput |
1,000 CIKs per batch cycle |
|
OTC Markets tier change detection |
Near real-time · 15-minute polling interval |
|
Investor wallet profile refresh |
Every 6 hours · event-driven on large on-chain movements |
|
Launch Readiness Score update frequency |
Every 4 hours · real-time on Solana congestion events |
|
Priority queue depth |
15,000+ issuers · memory-resident with persistent backing store |
|
Empire cross-reference match latency |
< 5 seconds on new issuer ingestion |
|
Outreach sequence trigger delay |
< 2 minutes from IDOS threshold breach |
9.10 Regulatory and Privacy Considerations
All data sources consumed by the Predictive Marketing AI Module are publicly available. SEC EDGAR is a public database operated for the express purpose of providing universal investor access to issuer filings. OTC Markets Group publishes tier data, issuer profiles, and trading statistics as a market infrastructure function. No proprietary, non-public, or personally identifiable data is accessed without explicit consent in the issuer prospecting pipeline.
Investor-side wallet behavioral profiling operates on on-chain transaction data, which is inherently public on the Solana blockchain. Off-chain identity enrichment is limited to publicly available professional data sources. All investor outreach is compliant with CAN-SPAM Act requirements, GDPR Article 6(1)(f) legitimate interest basis where applicable, and Rule 506(c) general solicitation standards. The module does not store or process payment card data, social security numbers, or health information. All data handling practices are documented in the OTCM Protocol Privacy Policy and subject to the annual compliance audit framework.
|
Rule 506(c) Compliance Gate The AI module enforces Rule 506(c) compliance programmatically. No investor outreach sequence is triggered for any wallet or identity that has not cleared the module's accreditation proxy filter. Where the wallet has a verified accreditation record on file with Empire Stock Transfer, that record is used directly. Where it does not, the module routes the prospect through the Empire onboarding flow before any offering-specific communication is triggered. This is not a policy — it is a code-level constraint on the outreach automation pipeline. |
|
Groovy Company, Inc. dba OTCM Protocol | CIK: 1499275 | Version 7.0 | March 2026 | Confidential |