DeckAnalyst — Scoring Methodology

How DeckAnalyst Scores a Pitch Deck

The complete, transparent methodology behind DeckAnalyst’s AI-powered evaluation engine. 8 dimensions, stage-aware weighting, peer benchmarking against 6,586 companies, and weekly calibration by a senior Techstars analyst.

Published by Unbiased Ventures · Foxsmart Systems GmbH, Zurich · June 2026 · Version 1.0 · CC BY 4.0

Why AI-Powered Deck Evaluation Matters

Venture capital has a consistency problem. When the same pitch deck is shown to five experienced investors, the resulting assessments routinely diverge by 20–30 points on a 100-point scale. This isn’t because any individual analyst is wrong — it’s because human evaluation is sensitive to presentation order, fatigue, anchoring bias, pattern-matching against recent wins, and the cognitive load of a document that touches market sizing, unit economics, competitive positioning, regulatory risk, and capital efficiency simultaneously.

A senior analyst doing a thorough evaluation of a single pitch deck spends 7–8 hours. They review the deck itself, cross-reference market data, check competitive benchmarks, assess financials, and synthesize across dimensions. The result is high-quality but expensive, slow, and not reproducible. The same analyst scoring the same deck a month later may arrive at a different number, because their mental benchmark pool has shifted.

DeckAnalyst was built to solve this specific problem: deliver a score that is structurally equivalent to what a senior analyst produces in a full deep dive, but do it in minutes, reproducibly, and across the same dimensions every time. The system doesn’t replace human judgment — it provides a consistent, transparent baseline that investors can use to triage, compare, and prioritize.

Core Design Principles

Score the deck, not the company. DeckAnalyst evaluates what founders actually present. If critical information is missing from the deck, that’s reflected in the score. Omissions are themselves a signal.

Stage-aware expectations. A Pre-Seed deck without revenue isn’t penalized the same way a Series A deck without revenue would be. Thresholds, weights, and gating rules shift based on stage detection.

Conservative on missing data. When a deck omits a key metric, the system doesn’t redistribute weight to other dimensions. The relevant sub-score is capped, and if the missing metric is critical, a hard evidence cap is applied to the total score. This prevents decks from achieving high scores by omitting weaknesses.

Transparent scoring. Every score comes with a per-dimension breakdown, evidence citations from the deck, and a clear indication of what data was present versus missing. There are no hidden weights or black-box adjustments.

Team evaluation is a separate discipline. DeckAnalyst deliberately excludes team and founder assessment from the deck score. Evaluating founders requires dedicated methodologies — structured interviews, psychological trait assessments, and behavioral analysis — that cannot be meaningfully performed by reading a pitch deck. The team dimension carries a placeholder score until purpose-built assessment tools provide real evidence.

The 8 Scoring Dimensions

DeckAnalyst evaluates pitch decks across 8 dimensions. Seven dimensions are fully scored based on deck evidence. The eighth — Team — carries a conservative placeholder score until dedicated founder assessment tools provide independent evidence.

Each dimension is scored 0–5 and multiplied by its weight. The total score ranges from 0 to 100. The formula is: Total Score = Σ (Sub-score × Weight). Default weights are calibrated for Seed-stage decks and shift for Pre-Seed and Series A.

#DimensionWeightWhat It Evaluates
1Market Attractiveness15%Market size, growth rate, problem urgency, ICP clarity
2Traction & Growth15%ARR/MRR, growth rate, retention, churn, pipeline
3Unit Economics15%CAC, LTV, LTV/CAC ratio, payback period, gross margin
4Go-to-Market Efficiency10%Repeatability, channel fit, sales cycle, PLG levers
5Product/Tech, USP & Defensibility15%Technical edge, competitive differentiation, data moats, IP, switching costs
6Capital Efficiency & Runway10%Burn multiple, runway months, milestone plan per €
7Regulatory/Operational Risk10%Compliance paths, platform dependencies, certifications
8Team (placeholder)5%Dummy score — pending dedicated founder assessment

1. Market Attractiveness — 15%

Market Attractiveness15%

Why this dimension exists: A startup solving a real problem in a large, growing market has structural tailwinds. Market attractiveness is weighted at 15% because it’s the single largest external factor determining whether a business model can scale. It also interacts with nearly every other dimension: a large, growing market makes unit economics easier, improves capital efficiency, and increases defensibility.

Scores Well
  • TAM €1–2B+ with a clearly defined SAM
  • Market CAGR 20%+ with identifiable structural drivers
  • Acute, well-articulated pain points — prospects actively seek solutions
  • Crisp ICP: who buys, why they buy, what the entry point looks like
Scores Poorly
  • “The market is still being created” without analogous demand evidence
  • Vague ICP: “enterprises” or “SMBs” without segmentation
  • Fragmented markets without a clear beachhead strategy

2. Traction & Growth — 15%

Traction & Growth15%

Why this dimension exists: Traction is the single strongest evidence that the market hypothesis is correct and the product works. At Seed and beyond, traction separates narrative from reality.

Stage nuance: Pre-Seed decks are not expected to show ARR. At Pre-Seed, traction signals shift to waitlists, LOIs, pilot commitments, or validated demand from interviews. The absence of ARR at Pre-Seed is treated as neutral (3/5), not penalized.

Scores Well
  • ARR €500K+ at Seed
  • MoM growth 10%+ or QoQ growth 30%+
  • Net dollar retention (NDR) 110%+
  • Logo churn 3% per month or less
  • Resilient pipeline with committed deals
Scores Poorly
  • Flat or negative growth trajectories
  • High churn requiring constant new-customer acquisition
  • Pilot-driven traction without conversion to paying customers

3. Unit Economics — 15%

Unit Economics15%

Why this dimension exists: Revenue growth without viable unit economics is a path to larger losses. This dimension determines whether each customer relationship creates or destroys value. Note: LTV/CAC below 1.0 triggers a hard cap of 30 on the total score (see Gating Rules).

Scores Well
  • LTV/CAC ratio 3× or higher
  • Payback period 12 months or less (SaaS)
  • Gross margin 70%+ (software-typical)
  • Low onboarding costs relative to contract value
Scores Poorly
  • LTV/CAC below 1.5×
  • Payback period exceeding 18–24 months
  • Gross margin below 50% (structural cost issue)

4. Go-to-Market Efficiency — 10%

Go-to-Market Efficiency10%

Why this dimension exists: A great product in a great market still fails if it can’t be sold repeatably. GTM efficiency measures whether the startup has found a sales motion that works and can scale.

Scores Well
  • Repeatable, documented, transferable sales process
  • Clear channel mix with evidence of what works
  • Sales cycle ≤90 days (SMB) or ≤180 days (mid-market)
  • Active customer references for new prospects
Scores Poorly
  • Purely opportunistic deal flow; every deal is custom
  • Only founder-led sales with no plan to build a sales motion
  • No evidence of ICP/pricing fit

5. Product/Tech, USP & Defensibility — 15%

Product/Tech, USP & Defensibility15%

Why this dimension exists: Without defensibility, any initial traction advantage can be competed away. This dimension evaluates whether the startup has built durable structural advantages and how its value proposition stacks up against companies validated by top accelerators (Techstars, Y Combinator, Alchemist, Plug & Play, de Vigier Prize, Swiss Top 100).

Scores Well
  • Proprietary data assets that improve with usage
  • Meaningful switching costs for customers
  • Patents, trade secrets, or non-trivial technical barriers
  • Technical architecture that compounds over time (data flywheel, ML performance)
  • Clear differentiation versus validated accelerator cohort peers
Scores Poorly
  • Commodity tech stack; product rebuilt by a competent team in months
  • Critical dependence on a single vendor or API with no proprietary layer
  • Below-benchmark differentiation versus comparable validated competitors

Competitive benchmarking sub-score (0–5)

ScoreSignal
0–1Below benchmark — similar solution with weaker metrics than comparable winners
2–3Parity or niche edge — competitive but not clearly differentiated
4Clear edge in at least two dimensions (traction, technology, or moat) versus current winner cohorts
5Top-decile differentiation profile — would stand out even among recent winners

6. Capital Efficiency & Runway — 10%

Capital Efficiency & Runway10%

Why this dimension exists: Capital efficiency determines how much progress a startup makes per euro raised. Burn rate without proportional growth is a structural problem, not a growth strategy. Note: burn multiple >3× with <9 months runway triggers a hard cap of 40 (see Gating Rules).

Scores Well
  • Burn multiple 1.5× or lower (net burn / net new ARR)
  • Runway 12–18 months
  • Clear milestone plan tying capital deployment to measurable outcomes
Scores Poorly
  • Burn multiple exceeding 2.5×
  • Runway under 9 months without strong pipeline or imminent close
  • Vague “use of proceeds” not connected to specific milestones

7. Regulatory/Operational Risk — 10%

Regulatory/Operational Risk10%

Why this dimension exists: Some markets carry regulatory requirements that can add years and millions to a startup’s timeline. This dimension is scored inversely: 5 = low risk, 0 = severe unaddressed risk.

Low Risk (Scores Well)
  • Clear regulatory approval path with identified milestones
  • Existing certifications or pre-approvals
  • Regulation used as a competitive moat — compliance as barrier to entry
High Risk (Scores Poorly)
  • Medical or financial products without a clear approval path
  • Critical platform dependencies (single distributor controls the business)
  • Uncalculated compliance costs that could materially affect unit economics
  • No mention of regulatory landscape in a clearly regulated domain

8. Team — 5% (Placeholder)

Team5%

DeckAnalyst assigns a conservative placeholder score for the Team dimension. This is a deliberate architectural decision, not an omission.

Why a placeholder? Evaluating founders and teams from a pitch deck is unreliable. A deck can state credentials and bios, but it cannot reveal psychological resilience, decision-making under pressure, interpersonal dynamics, dark-side personality traits, or founder-market fit at the depth required for investment decisions.

How team evaluation actually happens: Unbiased Ventures has purpose-built assessment tools that provide rigorous, independent team and founder scoring. These produce quantitative scores that replace the placeholder in the master scoring layer. See The Full Evaluation Stack for details.

Stage-Aware Weighting

The default weights above are calibrated for Seed stage. The system detects stage from the deck and adjusts expectations:

Pre-Seed

  • Traction and unit economics weights decrease; technology, product, and problem weights increase.
  • Missing revenue metrics are treated as neutral (3/5), not penalized.
  • Gating rules around LTV/CAC and payback are relaxed — these metrics typically don’t exist yet.

Series A

  • Traction, unit economics, and GTM efficiency weights increase; stricter gating thresholds apply.
  • Founder-led sales alone is a warning sign at Series A — the expectation for a repeatable motion is much higher.
  • Missing financial metrics are penalized more heavily.

Stage detection is automated based on signals within the deck: stated raise amount, team size, revenue figures, product maturity indicators, and explicit stage labels. When stage is ambiguous, the system defaults to Seed.

Gating Rules: Hard Caps That Override Weights

Regardless of how well a deck performs across other dimensions, certain conditions trigger hard score caps. Weighted scoring alone can mask fatal flaws — gating rules encode investment realities that no weighting scheme can capture through averaging alone.

ConditionMaximum Total Score
LTV/CAC < 1.030
Payback > 24 months (SaaS)40
Burn multiple > 3× with < 9 months runway40
No ICP + no paying customer at Seed35 (DeepTech exception possible)
Legal showstopper risk without a planReject until clarified

DeepTech exception: Pre-revenue DeepTech companies at Seed may lack paying customers by the nature of their development cycle (hardware, biotech, deep R&D). The “no ICP + no paying customer” gate can be relaxed when the deck demonstrates clear technical milestones, grant funding, LOIs, or institutional partnerships that validate demand.

Missing Data Handling & the Evidence Coverage Score

Founders sometimes omit metrics from their deck — sometimes because the metrics don’t exist yet (legitimate at Pre-Seed), sometimes because the numbers aren’t flattering. The scoring system handles both cases without rewarding omission.

Design Principles

  • No re-weighting: Missing data does not shift weight to other dimensions. The dimension is scored at a cap, and the weight stays.
  • Conservative caps: Critical missing fields trigger per-criterion score caps.
  • Evidence caps on total: When gating-level evidence is missing, the total score is capped regardless of per-dimension performance.
  • Stage-awareness: Pre-Seed decks are not penalized for metrics that don’t typically exist at that stage.

Per-Criterion Caps (Seed Defaults)

DimensionMissing EvidenceScore Cap
Traction & Growth (15%)Missing ARR and growth dataMax 2/5
Traction & Growth (15%)Missing one of ARR or growthMax 3/5
Traction & Growth (15%)Pre-Seed with no revenueNeutral 3/5
Unit Economics (15%)Missing both churn & CACMax 2/5
Unit Economics (15%)Missing one of churn or CACMax 3/5
Unit Economics (15%)Pre-SeedNeutral 3/5
Capital Efficiency (10%)Missing both runway & burn multipleMax 2/5
Capital Efficiency (10%)Missing one of runway or burnMax 3/5
GTM, Product/USPMissing core evidenceMax 3/5
Regulatory Risk (10%)Regulated domain, no clear path2/5
Regulatory Risk (10%)Unregulated domainNeutral 3/5

Evidence Cap on Total Score

Missing EvidenceTotal Score Cap
Any one of: LTV/CAC, Payback, Burn Multiple/Runway≤ 75
Any two of the above≤ 70
All three missing≤ 65

The Evidence Coverage Score (ECS)

Every DeckAnalyst report includes an Evidence Coverage Score (ECS) alongside the main score. The ECS is a 0–100 measure of what share of required evidence fields (weighted by importance) the deck actually provides.

A report might read: “Overall 72 with ECS 66” — meaning the deck scored 72 overall, but only 66% of required evidence fields were present. This gives investors immediate visibility into how much of the score rests on actual evidence versus missing-data handling defaults.

The ECS is accompanied by a “what’s missing” checklist naming the specific fields the deck should have included, making the score actionable.

Peer Benchmarking Against 6,586 Companies

DeckAnalyst doesn’t score decks in isolation. Every evaluation includes a peer benchmarking step that compares the startup against a curated database of 6,586 companies, including Y Combinator alumni and graduates of major accelerator programs.

How Peer Matching Works

The system uses sector and sub-sector classification to identify the most relevant peer group. Matching considers vertical (e.g., FinTech, HealthTech, PropTech), business model (B2B SaaS, marketplace, hardware), and stage. The result is a peer cohort of companies operating in comparable spaces at comparable stages.

What Peer Benchmarking Adds

  • Relative scoring context: A 72/100 in enterprise cybersecurity means something different than a 72/100 in consumer social. Benchmarking shows where the startup sits relative to companies that investors have already funded and validated.
  • Dimension-level comparison: The report shows not just an overall peer percentile, but per-dimension comparisons — exposing which areas are above-peer and which lag.
  • Competitive landscape signal: The peer set tells a story about the space. A startup in an underserved vertical with few quality peers may have a clearer path than one in a crowded, well-tracted space.

Known Limitations of Peer Matching

Peer matching works best when the startup’s sector has sufficient representation in the database. In very niche or emerging verticals, the peer set may be small or only loosely analogous. The system flags when peer density is low so investors know to interpret relative metrics with appropriate context.

The Full Evaluation Stack: Beyond the Deck

DeckAnalyst is one layer in a multi-layer investment evaluation methodology. The pitch deck score answers: “How strong is the investment case as presented in this document?” But a complete investment decision requires evaluating the founders and team independently, with tools purpose-built for that task.

📊

DeckAnalyst Score

AI-powered evaluation of the pitch deck across 8 dimensions. Deterministic, reproducible, evidence-backed. Scores the document, not the founders.

🎙️

Structured Founder Interview (Jaffar et al.)

A standardized, evidence-based interview protocol evaluating founders on execution orientation, vision clarity, adaptability, and domain mastery. Scored quantitatively, independent of the deck.

🧠

Comprehensive Psychological Assessment

Multi-instrument battery drawing on the Hogan Personality Inventory, Dark Tetrad (narcissism, Machiavellianism, psychopathy, sadism), and Big Five personality traits. Surfaces founder profiles invisible in a pitch deck.

The Master Score

The complete Unbiased Ventures evaluation produces a Master Score combining the DeckAnalyst pitch-deck score, the structured interview score, and the psychological assessment score. This master score reflects both the strength of the business case and the quality of the team behind it — each evaluated by the methodology best suited to that domain.

No aspect of the investment decision is left to informal impression. The deck is scored by DeckAnalyst. The founders are scored by dedicated interview and psychometric tools. The combined result is a comprehensive, transparent, and reproducible investment evaluation.

Validation: How We Know It Works

DeckAnalyst’s scoring methodology was developed over 12+ months of iterative research and calibration, involving parallel development tracks each lasting a minimum of six months.

Human Analyst in the Loop

Validation was conducted by a senior investment analyst who works for Techstars as senior analyst and lead evangelist of the Techstars methodology, and who also works with the Founder Institute by The Decile Group — a practitioner who evaluates pitch decks professionally, at scale, using established institutional frameworks.

The Validation Protocol

  • Seven sample decks from seven different industries were selected to cover the range of sectors, stages, and quality levels DeckAnalyst encounters.
  • Each deck was independently scored by the human analyst using a full deep-dive process (7–8 hours per deck).
  • DeckAnalyst scored the same decks.
  • Scores were compared on a weekly basis throughout the development cycle.
  • Discrepancies were analyzed at the dimension level to identify where the system’s logic needed refinement.
  • The process iterated: when human and machine scores diverged on a dimension, the underlying scoring logic, thresholds, or evidence extraction was adjusted and re-tested.

What “Alignment” Means

The target was not perfect agreement — even experienced human analysts don’t agree with each other perfectly. The target was that DeckAnalyst’s scores fall within the range of professional analyst variance. In practice, the system’s score on a given deck is as close to the human analyst’s score as two human analysts would typically be to each other.

Before & After: What Better Decks Look Like

The following examples show how DeckAnalyst distinguishes aspirational language from deterministic evidence across four dimensions.

Example 1: Market Attractiveness

Before — Score: 2/5
“We’re building the future of work. The market is massive — remote work is everywhere. Everyone needs our product.”

No TAM figure, no SAM, no CAGR, no ICP, no specific pain point. Vague and unsupported.

After — Score: 4/5
“The European SMB HR-tech market is €4.2B (Statista 2024), growing at 22% CAGR driven by regulatory complexity post-EU AI Act. Our ICP is 50–200 employee companies in DACH with distributed teams. Their #1 pain: compliance across 3+ jurisdictions costs €180K/year in legal fees.”

TAM sourced and sized, CAGR cited with structural driver, ICP crisp, pain point quantified, beachhead named.

Example 2: Unit Economics

Before — Score: 1/5
“We’re growing fast and costs will come down at scale.”

No CAC, no LTV, no margin, no payback. Asserts future efficiency without evidence.

After — Score: 4/5
“CAC is €2,400 (blended inbound/outbound), LTV is €14,400 (€400/mo × 82% gross margin × 36-month avg. lifetime), LTV/CAC is 6.0×, payback is 7.3 months. Inbound CAC trending down 12% QoQ as content flywheel matures.”

Every metric defined, LTV formula transparent, trends shown, channel mix visible.

Example 3: Traction & Growth

Before — Score: 2/5
“We have strong interest from several Fortune 500 companies and a growing pipeline.”

No revenue figures, no growth rate, no retention, no conversion from pipeline.

After — Score: 5/5
“€620K ARR, 14% MoM growth over last 6 months. NDR 118%. Logo churn 1.8%/month. Pipeline: €340K committed (signed LOIs), €210K best-case. 3 Fortune 500 pilots converting to annual contracts in Q1 (€45K ACV avg).”

Concrete numbers, growth quantified, retention demonstrated, pipeline staged by confidence level.

Example 4: Regulatory Risk

Before — Score: 1/5
“We operate in healthcare. Regulations exist but we’ll figure them out.”

Regulated domain acknowledged but no path, no timeline, no cost estimate.

After — Score: 4/5
“Class IIa medical device under EU MDR. Pre-submission meeting with notified body (BSI) completed March 2024. CER in progress — expected Q3 2024. Regulatory budget: €280K allocated. CE marking target: Q1 2025. Regulatory moat: competitors are 18–24 months behind on MDR compliance.”

Clear approval path, milestones dated, costs budgeted, compliance positioned as competitive advantage.

Output Scale & Interpretation

Score RangeInterpretation
80–100Top-Tier — Investable. The deck presents a compelling case across all major dimensions. Proceed to founder assessment and deeper due diligence.
65–79Track / Build a Case. Strong in several dimensions but gaps remain. Consider smaller initial tickets, milestone-based tranches, or further diligence on weak areas.
50–64High Uncertainty. The deck shows promise in some areas but critical dimensions are underdeveloped. Wait for specific milestones before committing.
Below 50Not Investable for Now. Fundamental gaps in multiple dimensions. The startup may need to iterate on product, market, or business model before seeking this stage of investment.

These ranges are calibrated against the human analyst’s scoring distribution. A score of 80+ corresponds to decks the analyst would flag as top-tier candidates for immediate deeper diligence.

Robustness & Guardrails

Hallucination Control

DeckAnalyst scores are derived exclusively from evidence present in the submitted deck. The system uses a retrieval-augmented generation (RAG) architecture where every claim must be traceable to specific text or data in the source document. An independent evidence auditor layer validates each dimension’s score against the extracted evidence.

When section-specific evidence extraction returns empty, the system falls back to full-text search across the entire document before defaulting to missing-data caps. This prevents legitimate evidence from being missed due to non-standard deck structures.

Determinism & Reproducibility

Scoring uses controlled generation parameters to minimize run-to-run variance. The target is that the same deck, scored multiple times, produces scores within a narrow band — tight enough that the variance is smaller than the margin of error between two human analysts scoring the same deck.

Prompt Injection Resistance

DeckAnalyst processes untrusted documents submitted by founders. The system includes guardrails against prompt injection attempts embedded in deck content. The scoring pipeline treats deck text as data to be evaluated, not as instructions to be followed. The specifics of these guardrails are not published to avoid providing a roadmap for circumvention.

Drift Prevention

The scoring methodology is versioned and locked. Changes to dimension definitions, weight allocations, gating thresholds, or evidence handling require explicit versioning and re-validation against the reference deck set. This prevents gradual drift in scoring behavior over time.

Limitations

DeckAnalyst scores the deck, not the company. The system can only evaluate what’s presented. A mediocre deck from a strong company will score lower than the company deserves. A polished deck from a weak company may score higher. The score measures how well the investment case is presented, not the company’s future.

Peer benchmarking has a ceiling in niche sectors. When the peer database has limited representation in a specific vertical, relative scoring is less informative. The system flags low peer density but cannot generate comparables that don’t exist.

Stage detection is heuristic. The system infers stage from deck signals. Occasionally a deck may be ambiguous (e.g., a Pre-Seed company raising a Seed-sized round). In ambiguous cases, the system defaults to Seed, which may under- or over-penalize certain metrics.

Language support covers major European languages. DeckAnalyst reliably scores pitch decks in English, German, Spanish, French, and Italian. Decks in other languages may produce less reliable results.

The deck score is one input, not the complete picture. DeckAnalyst produces the deck-analysis layer of the investment evaluation. The complete Unbiased Ventures methodology combines the deck score with independent founder interview scores (Jaffar et al.) and comprehensive psychological assessments (Hogan, Dark Tetrad, Big Five) to produce a master score.

Frequently Asked Questions

How does DeckAnalyst differ from a human analyst reviewing a pitch deck?

A senior analyst typically spends 7–8 hours on a comprehensive deck evaluation, reviewing market data, checking competitive benchmarks, assessing financials, and synthesizing across dimensions. DeckAnalyst performs a structurally equivalent analysis in minutes, across the same dimensions, with the same gating rules and stage-aware thresholds.

The key difference is consistency: the system produces the same score for the same deck regardless of time of day, order in the review queue, or analyst fatigue. The tradeoff is that a human analyst can incorporate context not in the deck — industry relationships, unpublished market intelligence — that an automated system cannot. For founder and team quality, DeckAnalyst defers to dedicated assessment tools rather than attempting to infer psychology from a “Team” slide.

Why 8 dimensions? Why these specific ones?

The 7 fully-scored dimensions were selected through iterative research to cover the complete set of factors that professional investors evaluate when assessing an early-stage company’s business case from its pitch deck. Each dimension was validated independently: removing any single dimension produced scores that diverged from human analyst assessments in predictable, explainable ways — confirming that each dimension carries unique information.

The eighth dimension — Team — is held as a placeholder because team evaluation requires methodologies fundamentally different from document analysis. Dedicated psychological and interview-based tools handle that domain.

Can founders “game” the scoring system?

Publishing the methodology transparently means founders know what’s being evaluated. But “gaming” the system means presenting strong evidence across all dimensions — which is functionally equivalent to building a strong investment case. A deck that scores well on DeckAnalyst presents clear market sizing, demonstrated traction, viable unit economics, a defensible product, and responsible capital management. If a founder “games” the system by adding all of that, they’ve simply written a better deck.

The system’s anti-gaming defense is that it scores evidence, not claims. Stating “our LTV/CAC is 5×” without supporting data scores differently than presenting the underlying metrics that demonstrate it. Gating rules and the Evidence Coverage Score further limit score inflation through narrative alone.

How often is the peer benchmark database updated?

The peer database is maintained and expanded on an ongoing basis. As of the current version, it contains 6,586 companies sourced from Y Combinator, major accelerator programs, and publicly available competition results. New cohorts are added as they become available. The benchmarking methodology and sector classification are versioned alongside the scoring system.

What if my deck doesn’t fit standard formats?

DeckAnalyst uses full-text fallback when section-specific extraction returns empty. Non-standard deck structures — decks without dedicated market slides, or with information distributed across narrative sections rather than labeled slides — are still processed. The system searches the entire document for evidence relevant to each dimension. However, decks with clear, well-labeled sections tend to score more accurately because the evidence extraction is more precise.

What languages does DeckAnalyst support?

DeckAnalyst reliably scores pitch decks in English, German, Spanish, French, and Italian. These cover the major European startup ecosystems. Support for additional languages may be added based on demand.

How does the team evaluation work if it’s not in the deck score?

DeckAnalyst assigns a 5% placeholder for the Team dimension. The real team score comes from dedicated assessment tools: a structured interview protocol (Jaffar et al.) and a comprehensive psychological assessment battery (Hogan, Dark Tetrad, Big Five). These produce independent, quantitative scores that replace the placeholder in the master scoring layer. This separation ensures that team quality is evaluated with the rigor it deserves, using the right tools for the job.

Formulas & Definitions

MetricFormula
LTV (Simplified)Average monthly revenue × Gross margin × (1 / Monthly churn rate)
LTV/CACLTV ÷ Customer acquisition cost
Payback Period (months)CAC ÷ Monthly contribution margin per customer
Burn MultipleNet burn ÷ Net new ARR (quarterly or 12-month rolling)
Net Dollar Retention (NDR)(ARR of cohort at t₁ / ARR of same cohort at t₀) × 100
Evidence Coverage Score (ECS)(Weighted sum of evidenced required fields / Total weighted required fields) × 100
Total ScoreΣ (Sub-score₁₋ × Weight₁₋) — subject to gating caps

Citation & License

This methodology guide is published by Unbiased Ventures and is made available under the Creative Commons Attribution 4.0 International License (CC BY 4.0). You are free to share and adapt this material for any purpose, including commercial use, provided appropriate credit is given.

Suggested citation:

Unbiased Ventures. “How DeckAnalyst Scores a Pitch Deck: The Complete Methodology.” Published at unbiasedventures.ch, June 2026.

For questions about the methodology, commercial API access, or partnership inquiries, contact deckanalyst@unbiasedventures.ch or visit unbiasedventures.ch.

Ready to score your pitch deck?

Upload your deck and get a full 8-dimension AI audit benchmarked against 6,586 companies. Results in under 3 minutes.

Upload a Deck →
Swiss made · Deterministic Investment Evaluation