Prerequisites: Before you begin, create an account and sign in.
Get started
After signing in, you’ll see the onboarding screen with three options. Click Use demo data to load a pre-configured dataset that mirrors real-world data challenges.
- HubSpot Contacts - Marketing leads, prospects, and customer information
- Product Users - Signups, logins, and feature engagement data
- Stripe Customers - Subscriptions, revenue, and payment status
Generate your data layer
After selecting the demo data, AstroBee begins building your data layer, a map of how your business concepts connect to your raw data and to each other. This is what lets you ask questions in plain language without knowing where the data lives.
Building a data layer generally takes 10-45 minutes depending on data complexity. For the demo data, expect a maximum of 10 minutes. Feel free to leave and come back later. AstroBee will continue working in the background.
Pattern Extraction Agent
Here’s where the magic happens. AstroBee uses a specialized sub-agent called the Pattern Extraction Agent to automatically discover how your tables connect. Watch it analyzing your data in real-time:- Phase 0: Initialize Workspace - Sets up the analysis environment
- Phase 1: Source Discovery - Scans all source tables and columns
- Pattern Discovery - Finds potential matches across systems using email, phone, and ID matching
- Build Entities - Creates unified business concepts from raw tables
Discovered patterns and validation
As the agent works, you’ll see patterns discovered between your systems. The Discovered Joins tab shows the connections AstroBee found automatically:- Product Users ↔ Stripe Customers via email
- HubSpot Contacts ↔ Product Users via email
- Product Events ↔ Product Users via user ID
- HubSpot Contacts ↔ Stripe Customers via phone
- Coverage Check - All 4 source tables are represented
- Connectivity Check - No orphan entities; everything is connected
- SUCCESS! - Integrated Data Layer Generated Successfully
What just happened? In a few minutes, AstroBee automatically solved a problem that typically requires weeks of data engineering: connecting disparate systems with inconsistent identifiers. The Pattern Extraction Agent found email matches, phone matches, and ID relationships across HubSpot, your product database, and Stripe, without you writing a single line of code.Behind the scenes, AstroBee applied entity resolution and normalization to match records even when formats differ (like
[email protected] vs [email protected]).Explore your data layer
Once building completes, you can explore the entities AstroBee created. Click any entity to see its data:- Data - Preview actual rows (200 rows shown with CSV download option)
- Description - AI-generated explanation of the entity
- Properties & Relationships - Schema details and connections to other entities
- SQL - The underlying query defining this entity
Edit your data layer
AstroBee’s automatic pattern discovery is just the starting point. You can enhance your data layer using natural language. Toggle to Edit mode by clicking the pencil icon, then describe what you want to create:Ask cross-system questions
Time to see what was impossible before. Previously, answering “How much revenue is at risk?” would require:- Exporting data from all three systems
- Matching emails manually in spreadsheets
- Cross-referencing with payment status
- Hours of data wrangling and potential errors
Chat response
AstroBee analyzes the data, understands your question, and generates a response that includes not just the answer, but business insights and strategic recommendations. AstroBee responds:I’ll help you visualize at-risk revenue by risk tier to identify where intervention would have the greatest financial impact.Risk tier breakdown:
| Risk Tier | MRR | % of At-Risk Revenue | Customers | Avg MRR/Customer |
|---|---|---|---|---|
| At Risk | $41,805 | 68.5% | 172 | $243 |
| High Risk | $10,307 | 16.9% | 51 | $202 |
| Critical | $8,915 | 14.6% | 650 | $14 |
Focus retention resources on the “At Risk” tier first - it represents nearly 70% of recoverable revenue with customers still showing engagement signals. The average customer value ($243 MRR) also makes personalized intervention cost-effective.
This is the “aha” moment. You asked one question, but AstroBee gave you:
- The exact revenue numbers you requested
- A breakdown by risk tier with customer counts
- Strategic recommendations for where to focus resources
- Analysis that required data from three completely separate systems
View the answer report
Click the Analysis Result card to view the full results panel:- Results
- Sources & logic
- SQL
- Pipeline
The Results tab shows the answer as interactive charts and tables. For this question, AstroBee generated a bar chart showing MRR by risk tier and an aggregated data table.

Try more questions
Now explore more cross-system queries. Each conversation is saved in the sidebar:- SaaS industry demonstrates 25% higher engagement than second-place Media industry
- Total of 1,849 product events were captured across 10 industries
- There’s a significant drop-off after the top 9 industries
Next steps
Ready to use AstroBee with your own data? Choose your path:Connect data sources
Connect warehouses like BigQuery or sources like HubSpot, PostHog, and Salesforce
Connect to MCP
Query your data layers directly from Claude Code, Claude Desktop, or VS Code




