Scoring System

IdeaTwister uses a granular 0–10 scoring system across five commercial dimensions to help you objectively evaluate business variations.

Every variation produced by IdeaTwister passes through a validation pass that assigns scores based on real-time market research and strategic heuristics. The final Confidence Score is a weighted composite of these five dimensions, rounded to one decimal place, and is guaranteed to match the Composite row in the per-variation score table.

How the Composite is computed

The headline Confidence Score is the weighted average:

Composite = 0.25 × Urgency + 0.25 × RevenueSpeed + 0.25 × Defensibility
          + 0.15 × MarketProof + 0.10 × DistributionEdge

The three biggest weights - Urgency, Revenue Speed, and Defensibility - are the dimensions most predictive of whether a solo founder can both reach the first dollar fast and keep it once they do. Market Proof and Distribution Edge contribute, but matter less than the core three.

Score distribution

Scores spread roughly across the 4.0 to 9.0 band on a typical run. Genuinely weak variations land in the 4–5 range; strong, high-conviction variations land at 8+. The headline Confidence Score is now an honest discriminator across a run instead of compressing everything near 7.

Score range	Bucket	What it means
7.0 – 10.0	High Conviction	Promoted to your "Top Picks" list.
5.0 – 6.9	Risky	Potentially viable but requires careful validation of specific assumptions.
0.0 – 4.9	Reject	Significant flaws in one or more commercial dimensions.

1. Buyer Urgency & Market Timing

The question: How badly does the buyer need this right now, and is the moment right?

Combines the intensity of the pain with whether the market is actually ready to buy. A fierce pain in a market that won't open its wallet for another two years scores low here.

Score	Signal	Example
9–10	Critical pain plus an active buying window. Regulation, fines, revenue loss, or a fresh trigger.	"New compliance rule lands in 90 days; firms are budgeting for it now."
7–8	High pain plus clear demand signals. Budgets are shifting, competitor exits, enabling tech just matured.	"10 hours/week wasted; ops leads are actively shopping."
5–6	Moderate pain or uncertain timing. Problem is real, trigger is not sharp.	"Annoying enough to complain about, not enough to switch."
3–4	Low urgency or premature market. Buyers see the issue but won't pay yet.	"Cool idea; we'll revisit next year."
0–2	No urgency, wrong timing, or both.	"Aspirational tool with no buying pressure."

2. Market Proof

The question: Is there existing proof that customers spend money on this?

Validates that the market is already educated and willing to pay for solutions.

Score	Signal	Example
9–10	Direct competitors with high pricing/funding/growth.	"Competitor X just raised $20M and charges $500/mo."
7–8	Indirect proof. Customers pay for workarounds or related tools.	"They hire agencies at $5k/mo for this manually."
5–6	Evidence of intent. Active search volume, communities, complaints.	"3 subreddits with 50k members constantly ask for this."
3–4	Theoretical proof. Similar models work in different industries.	"Works for doctors, should probably work for lawyers too."
0–2	Speculative. No evidence of existing spend or urgent search.	"I have a gut feeling people will pay for this."

3. Distribution Edge

The question: Do you have a credible, unfair channel to reach buyers faster or cheaper than competitors?

Measures whether the founder has a real path to customers - an existing audience, partner access, niche community standing, or a content angle that compounds. This replaced the older "Solo Executable" dimension, which is now a simple yes/no flag in the variation header rather than a weighted score.

Score	Signal	Example
9–10	Pre-existing audience or partner relationship that maps directly onto the buyer.	"I already run a 5,000-person newsletter for these exact buyers."
7–8	Defined low-cost channel: niche communities, vertical events, content angle, partner referrals.	"Active on the two Slack groups every buyer hangs out in."
5–6	Standard channels (cold outreach, paid ads, generic content) - workable but no edge.	"Cold email plus LinkedIn. Same as everyone else."
3–4	Crowded, expensive channels with high CAC.	"Paid Google ads against well-funded incumbents."
0–2	No realistic path to reach buyers.	"Hope they find us on Twitter eventually."

4. Revenue Speed

The question: How fast can the founder generate the first dollar?

Prioritizes ideas with short sales cycles and immediate monetization potential.

Score	Signal	Example
9–10	First dollar in <14 days. Instant value.	"Pre-sell a template or run a manual service immediately."
7–8	First dollar in 14–30 days. Short sales cycle.	"Build MVP, cold email 20 leads, close first $50 sale."
5–6	First dollar in 30–60 days. Requires some trust.	"Needs a pilot program or longer demonstration period."
3–4	First dollar in 60–90 days. Enterprise cycles.	"Requires budget approvals or complex integrations."
0–2	Long lead time (90+ days). Heavy R&D needed.	"Needs 6 months of content before monetization."

5. Defensibility

The question: How hard is this to replicate?

Looks for moats that prevent others from easily cloning your business.

Score	Signal	Example
9–10	High moat. Proprietary data or deep workflow lock-in.	"Exclusive access to industry API + deep integration."
7–8	Moderate moat. Brand or specific niche focus.	"First mover in a very specific, underserved niche."
5–6	Low moat. Easily copied but requires domain knowledge.	"Anyone can build it, but they need to understand the niche."
3–4	Very low moat. Feature-level idea.	"Asana could add this as a toggle next month."
0–2	No moat. Pure wrapper play.	"A basic GPT wrapper with no custom data."

Worked example

A variation scoring across the five dimensions and the resulting Composite:

Dimension	Weight	Score	Contribution
Buyer Urgency & Market Timing	0.25	8.0	2.00
Revenue Speed	0.25	7.0	1.75
Defensibility	0.25	6.0	1.50
Market Proof	0.15	9.0	1.35
Distribution Edge	0.10	8.0	0.80
Composite	-	-	7.4

The Composite (7.4) is the headline Confidence Score shown on the variation card.

Important notes

Solo executability is now a flag, not a score

Whether a variation is realistically buildable and operable by one person is shown as a yes/no flag in the variation header. It no longer rolls into the weighted Composite, so the headline score reflects commercial promise, not just operational fit.

Relative scoring

Scores are relative to your specific profile (skills, capital, risk tolerance). A "10" for a solo founder is different from a "10" for a venture-backed team.

The "Red Flag" rule

Even if an idea has a high average score, a dimension scoring below 4.0 is a "Red Flag" that signals a fundamental flaw you must address first.

Honest Weaknesses

Every variation also includes a short Weaknesses section that names two specific places the idea is genuinely fragile. Read it before you commit time - high scores don't mean no risk.