$50 OFF · First 100 buyers

Scoring System

IdeaTwister uses a granular 0–10 scoring system across five commercial dimensions to help you objectively evaluate business variations.

Every variation produced by IdeaTwister passes through a validation pass that assigns scores based on real-time market research and strategic heuristics. The final Confidence Score is a weighted composite of these five dimensions, rounded to one decimal place, and is guaranteed to match the Composite row in the per-variation score table.


How the Composite is computed

The headline Confidence Score is the weighted average:

Composite = 0.25 × Urgency + 0.25 × RevenueSpeed + 0.25 × Defensibility
          + 0.15 × MarketProof + 0.10 × DistributionEdge

The three biggest weights - Urgency, Revenue Speed, and Defensibility - are the dimensions most predictive of whether a solo founder can both reach the first dollar fast and keep it once they do. Market Proof and Distribution Edge contribute, but matter less than the core three.


Score distribution

Scores spread roughly across the 4.0 to 9.0 band on a typical run. Genuinely weak variations land in the 4–5 range; strong, high-conviction variations land at 8+. The headline Confidence Score is now an honest discriminator across a run instead of compressing everything near 7.

Score rangeBucketWhat it means
7.0 – 10.0High ConvictionPromoted to your "Top Picks" list.
5.0 – 6.9RiskyPotentially viable but requires careful validation of specific assumptions.
0.0 – 4.9RejectSignificant flaws in one or more commercial dimensions.

1. Buyer Urgency & Market Timing

The question: How badly does the buyer need this right now, and is the moment right?

Combines the intensity of the pain with whether the market is actually ready to buy. A fierce pain in a market that won't open its wallet for another two years scores low here.

ScoreSignalExample
9–10Critical pain plus an active buying window. Regulation, fines, revenue loss, or a fresh trigger."New compliance rule lands in 90 days; firms are budgeting for it now."
7–8High pain plus clear demand signals. Budgets are shifting, competitor exits, enabling tech just matured."10 hours/week wasted; ops leads are actively shopping."
5–6Moderate pain or uncertain timing. Problem is real, trigger is not sharp."Annoying enough to complain about, not enough to switch."
3–4Low urgency or premature market. Buyers see the issue but won't pay yet."Cool idea; we'll revisit next year."
0–2No urgency, wrong timing, or both."Aspirational tool with no buying pressure."

2. Market Proof

The question: Is there existing proof that customers spend money on this?

Validates that the market is already educated and willing to pay for solutions.

ScoreSignalExample
9–10Direct competitors with high pricing/funding/growth."Competitor X just raised $20M and charges $500/mo."
7–8Indirect proof. Customers pay for workarounds or related tools."They hire agencies at $5k/mo for this manually."
5–6Evidence of intent. Active search volume, communities, complaints."3 subreddits with 50k members constantly ask for this."
3–4Theoretical proof. Similar models work in different industries."Works for doctors, should probably work for lawyers too."
0–2Speculative. No evidence of existing spend or urgent search."I have a gut feeling people will pay for this."

3. Distribution Edge

The question: Do you have a credible, unfair channel to reach buyers faster or cheaper than competitors?

Measures whether the founder has a real path to customers - an existing audience, partner access, niche community standing, or a content angle that compounds. This replaced the older "Solo Executable" dimension, which is now a simple yes/no flag in the variation header rather than a weighted score.

ScoreSignalExample
9–10Pre-existing audience or partner relationship that maps directly onto the buyer."I already run a 5,000-person newsletter for these exact buyers."
7–8Defined low-cost channel: niche communities, vertical events, content angle, partner referrals."Active on the two Slack groups every buyer hangs out in."
5–6Standard channels (cold outreach, paid ads, generic content) - workable but no edge."Cold email plus LinkedIn. Same as everyone else."
3–4Crowded, expensive channels with high CAC."Paid Google ads against well-funded incumbents."
0–2No realistic path to reach buyers."Hope they find us on Twitter eventually."

4. Revenue Speed

The question: How fast can the founder generate the first dollar?

Prioritizes ideas with short sales cycles and immediate monetization potential.

ScoreSignalExample
9–10First dollar in <14 days. Instant value."Pre-sell a template or run a manual service immediately."
7–8First dollar in 14–30 days. Short sales cycle."Build MVP, cold email 20 leads, close first $50 sale."
5–6First dollar in 30–60 days. Requires some trust."Needs a pilot program or longer demonstration period."
3–4First dollar in 60–90 days. Enterprise cycles."Requires budget approvals or complex integrations."
0–2Long lead time (90+ days). Heavy R&D needed."Needs 6 months of content before monetization."

5. Defensibility

The question: How hard is this to replicate?

Looks for moats that prevent others from easily cloning your business.

ScoreSignalExample
9–10High moat. Proprietary data or deep workflow lock-in."Exclusive access to industry API + deep integration."
7–8Moderate moat. Brand or specific niche focus."First mover in a very specific, underserved niche."
5–6Low moat. Easily copied but requires domain knowledge."Anyone can build it, but they need to understand the niche."
3–4Very low moat. Feature-level idea."Asana could add this as a toggle next month."
0–2No moat. Pure wrapper play."A basic GPT wrapper with no custom data."

Worked example

A variation scoring across the five dimensions and the resulting Composite:

DimensionWeightScoreContribution
Buyer Urgency & Market Timing0.258.02.00
Revenue Speed0.257.01.75
Defensibility0.256.01.50
Market Proof0.159.01.35
Distribution Edge0.108.00.80
Composite--7.4

The Composite (7.4) is the headline Confidence Score shown on the variation card.


Important notes

Solo executability is now a flag, not a score

Whether a variation is realistically buildable and operable by one person is shown as a yes/no flag in the variation header. It no longer rolls into the weighted Composite, so the headline score reflects commercial promise, not just operational fit.

Relative scoring

Scores are relative to your specific profile (skills, capital, risk tolerance). A "10" for a solo founder is different from a "10" for a venture-backed team.

The "Red Flag" rule

Even if an idea has a high average score, a dimension scoring below 4.0 is a "Red Flag" that signals a fundamental flaw you must address first.

Honest Weaknesses

Every variation also includes a short Weaknesses section that names two specific places the idea is genuinely fragile. Read it before you commit time - high scores don't mean no risk.