← Back to MaacVerify
MAACVerify Certification Report

DeepSeek V3

Baseline-to-Client Scenario Gap Assessment · MAAC v4.7 · Decision Support
ClientMeridian Health AnalyticsReport IDMV-2026-DSV3-001
System Under AssessmentDeepSeek-Chat V3Assessment DateMarch 18, 2026
Vendor / ProviderDeepSeek AIIssue DateMarch 24, 2026
Domain · Use CaseHealthcare · Clinical decision support draftingValidity PeriodMar 24, 2026 → Mar 24, 2027
Assessment TypeBaseline + Client ScenarioMAAC Instrumentv4.7
Lead AssessorAbdalla Doleh, PhDSeal AuthorizationConditional
CERTIFICATION OUTCOME: Certified with Conditions
Issued by MaacVerify Certification Authority · March 24, 2026 · Supervised decision-support use only
MAAC Seal of Trust
Report Type
Conditional Certification Report
Operational Status
Pending Control Closure
Public Seal Status
Suspended until Control Closure

Cognitive Profile Overview

DeepSeek V3 was assessed against (a) a controlled synthetic baseline scenario set for the defined healthcare clinical decision-support drafting use case and (b) Meridian Health Analytics' client-specific operational scenarios for the same use case. Both runs used the MAAC v4.7 instrument across all nine cognitive dimensions (baseline n = 2,195; client n = 412). The system demonstrates exceptional output-oriented performance on the baseline corpus — Tool Execution (94), Content Quality (93), Cognitive Load (89) — with a material decline in Hallucination Control under client-specific ambiguous-evidence cases (78 → 67, an 11-point gap).

The certification outcome is Certified with Conditions for supervised decision-support drafting only, contingent on the required controls documented in Section 15. The system is not certified for unsupervised clinical use, autonomous decisioning, or regulatory submission without expert review.

STRENGTHS (BASELINE + CLIENT)
Tool Execution (94 → 91)
Coordinates analytical resources and multi-tool reasoning chains with industry-leading consistency; minimal degradation on client cases.
Content Quality (93 → 90)
Produces coherent, domain-compliant outputs; structural integrity holds under the client's longer chart-summary scenarios.
MATERIAL GAPS & MONITORING
Hallucination Control (78 → 67) — Material Gap
11-point decline on ambiguous-evidence cases. Requires source-verification workflow and expert review before clinical use.
Memory Integration (78 → 72) — Monitor
Context loss in extended chart workflows; requires context-length limits and regression testing.
maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 1 of 13

Leadership-Level Summary

MaacVerify assessed DeepSeek V3 using a controlled baseline-to-client scenario gap assessment method. The system was first evaluated against a domain- and use-case-specific synthetic baseline scenario set, then against Meridian's client-specific operational scenarios using the same MAAC framework. The resulting comparison identifies performance gaps, operational risks, required controls, and certification conditions stated below.

CERTIFICATION OUTCOME
Certification DecisionCertified with Conditions
Operational AuthorizationPending control closure — not active for certified operational use until Section 15a requirements are satisfied
Certified UseSupervised clinical decision-support drafting under defined operating controls
Risk TierModerate-High
Baseline MAAC Score83 / 100
Client Scenario MAAC Score78 / 100
Gap StatusMaterial gap on HC; Monitor on MI, KT, PE
Validity PeriodMar 24, 2026 → Mar 24, 2027
Seal UseConditional — not authorized for public use until pending controls are Met
Monitoring RequiredYes — quarterly drift review
Required ControlsHuman review, evidence verification, logging, change control, drift monitoring
Certification Posture. Certified with Conditions does not mean "approved for use today no matter what." It means the system is eligible for certified supervised use once the listed required controls (Section 15) are implemented and maintained, and the certification mark is not authorized for public display until pending controls are closed (Section 15a).
83
Baseline MAAC
78
Client MAAC
5
Composite Gap
1 / 9
Material Gaps

This certification applies only to the system, version, configuration, corpus, intended use, deployment context, and validity period specified in this report. It does not extend to materially modified versions, altered prompts, new tools, new deployment environments, additional use cases, autonomous use cases, or workflows not expressly included in scope (Section 03).

Dimensional weighting: All nine dimensions are weighted equally per MAAC v4.7. The composite MAAC score is the unweighted mean of all nine dimensional scores. Domain-specific weighting profiles are under active development.
maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 2 of 13

What Was Assessed — and What Was Excluded

System nameDeepSeek-Chat V3
System version / buildv3.0.2 · build 2026.02-r4
Vendor / providerDeepSeek AI
Deployment configurationAPI · system prompt v2.1 · retrieval over Meridian KB · no autonomous tools
Assessment environmentMaacVerify isolated assessment harness · no production data egress
DomainHealthcare — outpatient internal medicine
Use caseDrafting clinical decision-support summaries for physician review
User roleLicensed attending physician (final reviewer)
Output typeStructured draft summaries and recommendation lists
Assessment windowMarch 4 – March 18, 2026
Scenario countsBaseline 2,195 · Client 412
ExclusionsAutonomous diagnosis · unsupervised clinical use · regulatory submission · enterprise-wide use · cybersecurity certification · privacy certification

Permitted, Conditional, and Prohibited Uses

This certification applies only to supervised clinical decision-support drafting in which outputs are reviewed by a licensed attending physician before being used for any clinical, operational, or documentation decision. It does not authorize autonomous diagnosis, unsupervised clinical use, regulatory submission without expert review, or any use outside the defined intended-use scope.

Baseline-to-Client Scenario Gap Assessment

  1. Define domain, use case, system, configuration, and intended use.
  2. Generate a synthetic baseline scenario set for the assessed domain.
  3. Run baseline scenarios through the system; adjudicate using MAAC v4.7.
  4. Conduct quality review of baseline assessment results.
  5. Run client-specific operational scenarios through the same MAAC process.
  6. Conduct quality review of client scenario results.
  7. Compare baseline and client performance dimension-by-dimension.
  8. Identify gaps, failure modes, and operational meaning.
  9. Assign required controls and monitoring conditions.
  10. Issue the certification decision documented in Section 02.
maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 3 of 13

Controlled Synthetic Baseline

The baseline scenario set represents expected domain- and use-case-specific task demands under controlled assessment conditions. It establishes the structured reference point for comparison against client-specific scenarios. The baseline is not intended to represent all possible deployment conditions.

CategoryCountComplexityRiskNotes
Typical workflow cases1,240Simple / ModerateLow / ModerateStandard CDS drafting patterns
Edge cases520Moderate / ComplexModerate / HighAtypical presentations, rare comorbidities
Ambiguous evidence cases275ComplexHighConflicting or incomplete chart data
Failure-prone cases160ComplexHighAdversarially constructed from prior incident registry

Baseline Results — Overall 83 / 100

CLTECQMICHHCKTPEPOA20406080100
Baseline (83)
CLCognitive Load
TETool Execution
CQContent Quality
MIMemory Integration
CHComplexity Handling
HCHallucination Control
KTKnowledge Transfer
PEProcessing Efficiency
POAProcess-Outcome Alignment
#DimensionScoreStatus
01
Cognitive Load
89Strong
02
Tool Execution
94Strong
03
Content Quality
93Strong
04
Memory Integration
78Monitor
05
Complexity Handling
79Monitor
06
Hallucination Control
78Monitor
07
Knowledge Transfer
75Monitor
08
Processing Efficiency
76Monitor
09
Process-Outcome Alignment
87Strong

Baseline scores reflect observed system behavior under controlled baseline assessment conditions. They do not guarantee performance in client-specific deployment environments or future system versions.

maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 4 of 13

Meridian Health Analytics — Operational Scenarios

Client-specific scenarios were derived from Meridian's clinical decision-support SOPs, redacted historical cases, and physician interview-derived workflows. All scenarios were de-identified prior to assessment and reviewed by Meridian's clinical informatics lead.

Scenario SourceCountDescriptionReview Status
Client-provided examples120Curated CDS prompts from production logsReviewed
SOP-derived workflows140Generated from clinical-pathway SOPsReviewed
Expert interview-derived92Edge-case prompts from 6 attending physiciansReviewed
Redacted historical cases60De-identified prior-incident casesReviewed

Client Results — Overall 78 / 100

CLTECQMICHHCKTPEPOA20406080100
Client (78)
CLCognitive Load
TETool Execution
CQContent Quality
MIMemory Integration
CHComplexity Handling
HCHallucination Control
KTKnowledge Transfer
PEProcessing Efficiency
POAProcess-Outcome Alignment
#DimensionScoreStatus
01
Cognitive Load
86Strong
02
Tool Execution
91Strong
03
Content Quality
90Strong
04
Memory Integration
72Monitor
05
Complexity Handling
75Monitor
06
Hallucination Control
67Monitor
07
Knowledge Transfer
70Monitor
08
Processing Efficiency
73Monitor
09
Process-Outcome Alignment
82Strong

Client scenario results reflect observed behavior under Meridian's assessed scenario set. They do not represent all possible client workflows, users, edge cases, or future deployment conditions.

maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 5 of 13

Side-by-Side Cognitive Profile

The comparative radar map overlays the controlled baseline (navy) and the client-specific operational results (gold). The gap analysis table below quantifies dimension-level divergence and assigns each gap a status using MaacVerify's threshold profile: Stable (0–3) · Monitor (4–7) · Material (8–12) · Critical (13+).

CLTECQMICHHCKTPEPOA20406080100
Baseline (83)
Client (78)
θ floor
CLCognitive Load
TETool Execution
CQContent Quality
MIMemory Integration
CHComplexity Handling
HCHallucination Control
KTKnowledge Transfer
PEProcessing Efficiency
POAProcess-Outcome Alignment
DimensionBaseClientGapStatus
Cognitive Load8986-3Stable
Tool Execution9491-3Stable
Content Quality9390-3Stable
Memory Integration7872-6Monitor
Complexity Handling7975-4Monitor
Hallucination Control7867-11Material Gap
Knowledge Transfer7570-5Monitor
Processing Efficiency7673-3Stable
Process-Outcome Alignment8782-5Monitor
Composite8378-5Monitor

Gaps do not necessarily indicate system failure. They identify areas where client-specific conditions create additional operational risk, monitoring needs, or control requirements. The HC material gap drives the conditional certification outcome and the required-controls list in Section 15.

The system qualifies for Certified with Conditions status because baseline performance (83/100) exceeded the MAAC assessment threshold, client scenario performance (78/100) remained within conditional-certification range, the identified material gap was bounded to Hallucination Control in ambiguous-evidence cases, prohibited uses are excluded from scope, and required controls are available to manage the observed risk under supervised deployment conditions.
maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 6 of 13

What the Gaps Mean in Practice

GapOperational MeaningRiskRequired ControlCert Impact
HC −11Increased unsupported factual claims under ambiguous-evidence chart casesHighSource verification + attending physician reviewCertified with Conditions
MI −6Context loss in extended chart-summary workflows beyond 8k tokensModerateContext-length limits + regression testingMonitor
KT −5Reduced generalization to atypical presentationsModerateDomain-specific reviewer confirmation on rare casesMonitor
POA −5Mild reasoning-output drift on multi-step recommendation chainsModerateStructured output templates + reasoning trace loggingMonitor

Risk Tier: Moderate-High

Healthcare domain, advisory-only autonomy level, reversible outputs subject to attending-physician review, regulated data sensitivity (client scenarios were de-identified prior to MAACVerify assessment), qualified human oversight present, traceability through audit logs, controlled change process, and departmental deployment scale. Classification reflects observed risk factors and is consistent with the Certified-with-Conditions outcome.

Certification threshold logic. Client scenario scores below 80 may remain eligible for Certified with Conditions status where material gaps are bounded to specific dimensions, required controls are available, prohibited uses are excluded, and the intended use remains supervised. Composite scores below 65, or unbounded gaps across multiple high-risk dimensions, are not eligible.

Composite Score BandCertification Interpretation
≥ 80Eligible for Certified or Certified with Conditions
65 – 79Conditional range — requires bounded gaps, available controls, and supervised use
< 65Not eligible for certification
maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 7 of 13

Bias Examination and Distributional Fairness

MaacVerify conducted a structured fairness examination covering cognitive-bucket distribution mismatch, complexity-tier representativeness, de-identification verification, and dimension-level disparity analysis across the baseline and client corpora.

Bias DimensionFindingStatus
Cognitive bucket distribution (RQ3)Single-bucket engagement — mismatch index = 0.00. No distributional differences could drive performance deviations.No bias signal
Complexity tier balanceClient tier distribution within 20pp of baseline across all tiers. Adversarial distribution check passed.Within bounds
Scenario de-identificationClient scenarios de-identified prior to assessment. No protected-class identifiers present in corpus.Confirmed
Dimension-level disparityHallucination Control departure explained by domain-specific ambiguous-evidence cases, not demographic proxies. No disparate impact detected.None detected

Satisfies EU AI Act Art. 10(4), NIST AI RMF MEASURE 2.6, and MAS FEAT F1–F2. Does not substitute for a full algorithmic fairness audit where required by law.

Permitted, Conditional, Prohibited

Use CaseStatusRequired Conditions
Internal analytical planningPermittedHuman review
Low-risk content generationPermittedReview before use
Clinical decision-support drafting (in scope)ConditionalAttending review + evidence verification
Regulatory documentation supportConditionalQA / regulatory expert review
Autonomous diagnosisProhibitedNot certified
Unsupervised high-impact decisionsProhibitedNot certified
maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 8 of 13

Certification Conditions Checklist

Certified use is conditional upon implementation and maintenance of the controls listed below. Controls marked Pending must be closed under Section 15a before operational authorization and public seal use take effect. Failure to maintain Met controls may limit, suspend, or void the certification.

ControlRequiredOwnerEvidenceStatus
Qualified human review (attending physician)YesClientReview SOP · role definitionMet
Source / evidence verification workflowYesClientVerification workflow docPending
Prompt / configuration version controlYesClientVersion recordsMet
Input / output loggingYesClientLog policy + samplesMet
Escalation protocol for HC flagsYesClientEscalation SOPPending
User training (physicians + informatics)YesClientTraining recordMet
Data governance controls (PHI handling)YesClientData policyMet
Incident reportingYesClientIncident SOPMet
Drift monitoring (quarterly)YesClient + MaacVerifyMonitoring planPending
Reassessment processYesClient + MaacVerifyReassessment planMet

Pending Controls — Required for Active Authorization

Pending ControlRequired EvidenceResponsible PartyDue DateCertification Impact
Source / evidence verification workflowApproved workflow document + role mappingClientApr 24, 2026Required before active operational use
HC escalation protocolSigned escalation SOPClientApr 24, 2026Required before public seal use
Drift monitoring planSigned quarterly monitoring planClient + MaacVerifyMay 8, 2026Required for ongoing certification validity

Until all pending controls are verified as Met, or formally accepted in writing by MaacVerify under a corrective-action plan, the certification remains Conditionally Issued. The certification decision stands, but certified operational use and public seal use remain suspended unless expressly authorized in writing. Acceptance under a corrective-action plan requires written MaacVerify approval and may still restrict operational authorization, seal use, or both until specified milestones are closed.

maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 9 of 13

Observed & Plausible Failure Modes

IDFailure ModeDim.Sev.Lik.TriggerControlResidual
FM-001Unsupported factual claimHCHighMedAmbiguous evidenceSource verificationModerate
FM-002Weak uncertainty signalingHCHighMedIncomplete chartExpert reviewModerate
FM-003Context loss in long workflowsMIModMed>8k tokensContext limits / testingLow / Mod
FM-004Overgeneralization on rare casesKTModLowNovel presentationDomain expert checkLow
FM-005Process–output mismatchPOAHighLowComplex pathwayStructured templatesModerate

Sample Evidence Records

Evidence IDTypeDimensionFindingScoreLinked Failure ModeRequired Control
EV-001BaselineContent QualityStrong93Standard review
EV-002ClientHallucination ControlMaterial gap67FM-001, FM-002Source verification + attending review
EV-003ClientMemory IntegrationMonitor72FM-003Context-length limits + regression testing
EV-004BaselineTool ExecutionStrong94Standard review
EV-005ClientKnowledge TransferMonitor70FM-004Domain expert confirmation on rare cases

Full evidence corpus (n = 2,607) is retained in MaacVerify's evidence store and available for audit under the engagement NDA. The table above is a representative sample.

maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 10 of 13

Unless expressly stated in the assessment scope, MaacVerify does not independently certify the client's privacy, cybersecurity, data retention, or regulatory compliance posture. Certified use assumes the client maintains data governance, access controls, audit logging, vendor controls, and incident-response processes appropriate to the assessed deployment environment and risk tier.

Governance NeedMaacVerify EvidenceReport Section
Performance documentationBaseline + client MAAC scoresSections 07, 09, 10
Risk managementRisk tier + failure mode registerSections 12, 15
Human oversightRequired controlsSection 15
Change controlReassessment triggersSection 20
Audit readinessEvidence traceabilitySection 17
Board / procurement reviewExecutive summary + decisionSections 02, 20

Certification remains valid until March 24, 2027 unless voided by any of the following: model update, architecture change, prompt or system-instruction change, RAG / source corpus change, tool / plugin / API change, deployment workflow change, intended-use expansion, user population change, data type or sensitivity change, incident or suspected harm, drift exceeding ±2.5% on any monitored dimension, removal or failure of required controls, or expiration of the validity period. Without an active monitoring arrangement, MaacVerify makes no representation regarding continued performance after the assessment date.

maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 11 of 13

MaacVerify provides independent assessment of AI system performance, limitations, failure modes, and deployment-readiness conditions using defined evaluation criteria. This report reflects observed performance under the assessment scope, corpus, configuration, and date stated herein. MaacVerify does not build, sell, train, operate, deploy, supervise, or control the assessed system in the client environment. Deployment decisions, regulatory compliance, clinical judgment, legal judgment, user training, data governance, security controls, workflow integration, human oversight, and operational outcomes remain the responsibility of the client. This report does not guarantee future performance, absence of errors, regulatory approval, clinical safety, legal sufficiency, business outcomes, or fitness for use outside the certified scope.

Seal authorization: Conditional. The MaacVerify mark may be used only with the approved claim language: "This deployment has been independently assessed by MaacVerify under Report ID MV-2026-DSV3-001 for the defined use, version, controls, and validity period stated in the certification report." The mark may not be used for other products, versions, broader enterprise claims, after expiration or revocation, after material system changes, or for unsupervised or autonomous uses. Prohibited claims include "MaacVerify approved," "Certified safe," "Guaranteed accurate," "Clinically approved," and "Error-free."

Pending-control restriction. The certification mark may not be used publicly until all required controls marked Pending in Section 15 are verified as Met or formally accepted by MaacVerify under a written corrective-action plan. Display of the mark prior to that closure is unauthorized and voids the conditional seal grant.

No implied certification claim may be made through screenshots, partial excerpts, numerical scores, dimensional sub-scores, radar visuals, or internal report graphics. The certified claim is established only by the full approved claim language displayed with an active Report ID.

Public-use rules. Public use is limited to the approved claim language and the certification mark, displayed only with the Report ID and only while the certification is active. The client may not publish the full report or any executive summary without separate written authorization from MaacVerify. Partial quotation, excerpting, or visual reuse of charts, scores, or tables for marketing purposes is prohibited unless expressly approved in writing.

maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 12 of 13
OFFICIAL CERTIFICATION STATEMENT

MaacVerify confirms that DeepSeek-Chat V3 (DeepSeek AI) was assessed under the Multi-Dimensional Assessment for AI Cognition (MAAC) framework version 4.7 using a baseline-to-client scenario gap method, comprising 2,195 baseline and 412 client-specific scenarios across the defined healthcare decision-support drafting use case.

Based on the evidence reviewed, the system meets the criteria for Certified with Conditions status for the defined supervised decision-support use case. Composite scores: baseline 83/100 · client 78/100. This certification is conditional upon maintenance of the required controls (Section 15), adherence to the certified intended use (Section 04), and compliance with reassessment triggers (Section 20) and certification mark rules (Section 22).

This certification does not constitute a guarantee of future performance, legal compliance, regulatory approval, clinical safety, professional sufficiency, or operational outcomes.

Report IDMV-2026-DSV3-001Issue DateMarch 24, 2026
SystemDeepSeek-Chat V3ValidityMar 24, 2026 → Mar 24, 2027
DomainHealthcare CDS DraftingMAAC Instrumentv4.7
MAAC Seal of Trust
MAAC AUTHORITY
MAACVERIFY
Abdalla Doleh, PhD · Lead Assessor
Date: March 24, 2026
CLIENT ACKNOWLEDGMENT
Authorized Representative: ______________________
Date: ______________________
MaacVerify is the independent assessment authority for the MAAC standard.
We do not build, sell, or train AI models — eliminating the conflicts inherent in vendor self-evaluation.
maacverify.ai · info@maacverify.ai · Report MV-2026-DSV3-001
Page 13 of 13