Data Source
Company data pulled from YC's public Algolia API (ycombinator.com/companies). Founder bios scraped from individual company pages. Batches covered: Winter 2025, Spring 2025, Summer 2025, Fall 2025, Winter 2026.
Methodology
AI classification uses keyword matching on tags and descriptions. NLP clustering uses TF-IDF with K-Means. Competitive overlap is measured by cosine similarity. Founder backgrounds are extracted from bio text. Partner preferences are computed as the delta between each partner's rate and the base rate. Wrapper vs. deep-tech classification is based on signal scoring across descriptions, tags, and founder credentials.
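As a rough illustration of three of the steps above (keyword classification, TF-IDF with cosine similarity, and the delta against a base rate), here is a minimal standard-library Python sketch. It is not the code behind this analysis: the keyword list, the function names, and the smoothed IDF formula are all assumptions chosen for the example, and the K-Means clustering step is left out.

```python
import math
import re
from collections import Counter

# Hypothetical keyword list; the actual terms used are not published.
AI_KEYWORDS = {"ai", "llm", "machine learning", "agents"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def is_ai_company(tags, description):
    """Flag a company if any keyword appears as whole words in tags or description."""
    tokens = tokenize(" ".join(tags) + " " + description)
    padded = " " + " ".join(tokens) + " "
    return any(f" {kw} " in padded for kw in AI_KEYWORDS)


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (term -> weight dict) per document.

    Uses a smoothed IDF, log((1 + n) / (1 + df)) + 1 (an assumed variant).
    """
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    df = Counter(term for toks in tokenized for term in set(toks))
    vectors = []
    for toks in tokenized:
        tf = Counter(toks)
        vectors.append({
            term: (count / len(toks)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def preference_delta(partner_hits, partner_total, all_hits, all_total):
    """Partner's share of a category minus the share across all companies."""
    return partner_hits / partner_total - all_hits / all_total
```

For example, a hypothetical partner with 6 AI companies out of 10, measured against 40 out of 100 overall, gets a delta of roughly +0.2, meaning that partner leans toward AI companies relative to the base rate. Two descriptions that share no terms score a cosine similarity of 0.0, while near-duplicates score close to 1.0.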
Disclaimer
This is an independent analysis and is not affiliated with, endorsed by, or connected to Y Combinator. All data is sourced from publicly available information. Classifications (AI/wrapper/deep-tech) are approximations based on automated text analysis and may not perfectly reflect each company's actual technology stack. Use for informational purposes only.