Data Formulator
AI-powered data exploration and visualization tool with semantic chart engine
data-formulator.ai ↗📍 Redmond, Washington, USA
Verified Data
“Open-source research project funded by Microsoft with no commercial revenue model - purely R&D initiative”
“N/A (Microsoft Research Project)”
“~5-10 core researchers and contributors (e.g., Chenglong Wang, Steven Drucker, Bongshin Lee)”
“Star count increased from ~3k in 2024 to 15.5k in 2026”
Company Profile
Contact
Strategic Analysis
Strategy
Microsoft Research's strategic R&D initiative to advance AI-powered data analysis tools within the broader Microsoft AI ecosystem. Open-source approach to drive community adoption and gather feedback for future commercial applications. Focus on reducing friction in data visualization through natural language interfaces.
Tactics
Open-sourced under MIT license to maximize community adoption and contributions. Heavy presence in academic conferences like CHI to establish research credibility. Leveraging GitHub as primary distribution channel with active community engagement through issues and pull requests.
Competitive Positioning
Differentiates from traditional BI tools like Tableau and Power BI by offering natural language data transformation capabilities. Positioned as a research-first tool that bridges the gap between technical and non-technical users, unlike code-heavy alternatives like matplotlib or commercial no-code tools.
Marketing Approach
Academic publication strategy through conferences like CHI 2025 to establish thought leadership. GitHub-first distribution leveraging Microsoft's developer ecosystem. Community-driven growth through open-source contributions and recognition in industry roundups of top Python libraries.
Notable
Top trending repository on GitHub (2024-2025), recognized as one of the top Python libraries of 2025
🔗 Source ↗Tech Stack
Recent News
Related AI Developer Tools Companies
Discovery Sources
Signals
“Star count increased from ~3k in 2024 to 15.5k in 2026”
“~5-10 core researchers and contributors (e.g., Chenglong Wang, Steven Drucker, Bongshin Lee)”
“N/A (Microsoft Research Project)”
Evidence
30 chart types with a new semantic chart engine
15,500+ GitHub stars, 1,400+ forks