Data Formulator

AI Developer Tools · Verified · 90% conf

AI-powered data exploration and visualization tool with a semantic chart engine

data-formulator.ai

📍 Redmond, Washington, USA

Verified Data

💰 Est. Revenue: <$100K ARR

Open-source research project funded by Microsoft with no commercial revenue model; purely an R&D initiative

🚀 Funding: Microsoft Research internal funding

N/A (Microsoft Research Project)

👥 Users: 15,500+ GitHub stars, 1,400+ forks
🔗 github.com
🧑‍💻 Team Size: 5-10 researchers

~5-10 core researchers and contributors (e.g., Chenglong Wang, Steven Drucker, Bongshin Lee)

📈 Growth: Star count increased from ~3k in 2024 to 15.5k in 2026

🏷️ Stage: Microsoft Research internal funding
📅 Founded: 2023

Company Profile

Model: Open Source
Vertical: Data Analytics, Research, Education
Clients: Global open-source community, Academic institutions, Microsoft data teams
Buyers: Data analysts, data scientists, students, non-technical business users aged 20-50
Pricing: Free / Open Source (MIT License)

Strategic Analysis

Strategy

A strategic Microsoft Research R&D initiative to advance AI-powered data analysis tools within the broader Microsoft AI ecosystem. The open-source approach drives community adoption and gathers feedback for future commercial applications. The focus is on reducing friction in data visualization through natural language interfaces.

Tactics

Open-sourced under the MIT license to maximize community adoption and contributions. Maintains a strong presence at academic conferences such as CHI to establish research credibility. Uses GitHub as the primary distribution channel, with active community engagement through issues and pull requests.

Competitive Positioning

Differentiates itself from traditional BI tools such as Tableau and Power BI by offering natural language data transformation. Positioned as a research-first tool that bridges the gap between technical and non-technical users, in contrast to code-heavy alternatives such as matplotlib and to commercial no-code tools.

Marketing Approach

Builds thought leadership through academic publication at conferences such as CHI 2025. Distributes GitHub-first, leveraging Microsoft's developer ecosystem. Grows through open-source contributions and recognition in industry roundups of top Python libraries.

Notable

Top trending repository on GitHub (2024-2025); recognized as one of the top Python libraries of 2025.

Tech Stack

TypeScript, React, Python, FastAPI/Flask, OpenAI GPT-4, Vite, D3.js

Discovery Sources

Signals

growth rate: Star count increased from ~3k in 2024 to 15.5k in 2026

team size: 5-10 researchers

~5-10 core researchers and contributors (e.g., Chenglong Wang, Steven Drucker, Bongshin Lee)

user count: 15,500+ GitHub stars, 1,400+ forks
funding raised: Microsoft Research internal funding

N/A (Microsoft Research Project)

trend indicator: Microsoft
trend indicator: Data Visualization
trend indicator: AI

Evidence

github.com

30 chart types with a new semantic chart engine

github.com

15,500+ GitHub stars, 1,400+ forks