| name | rdc-snowflake-navigator |
| description | Comprehensive guide to RDC's Snowflake data warehouse for marketing and consumer analytics. Use when querying clickstream data, lead attribution, marketing spend analysis, campaign performance, or revenue metrics. Triggers include queries about EFR, leads, spend, campaigns, clickstream, user journeys, SEM, app installs, paid social, or any Snowflake table exploration. |
RDC Snowflake Navigator Skill
This skill provides comprehensive knowledge of RDC's Snowflake data warehouse, including table schemas, business logic, attribution rules, and query patterns for marketing and consumer analytics.
Quick Reference
Core Tables
| Table | Location | Grain | Primary Use |
|---|---|---|---|
| clickstream_detail | RDC_ANALYTICS.CLICKSTREAM |
Event (hit) | User behavior, journeys, conversions |
| marketing_conversion_detail | RDC_ANALYTICS.REVENUE |
Lead | Lead attribution, EFR, delivery flags |
| spend | RDC_MARKETING.AGG_REPORTING |
Partner × Account × Day | Cross-channel spend consolidation, budget tracking |
| sem_summary | RDC_MARKETING.AGG_REPORTING |
Campaign × Day | SEM spend, leads, EFR |
| app_summary | RDC_MARKETING.AGG_REPORTING |
Campaign × Day | App install campaigns |
| psocial_summary | RDC_MARKETING.AGG_REPORTING |
Ad Set × Day | Facebook, Criteo spend |
Client Fulfillment Tables
| Table | Location | Primary Use |
|---|---|---|
| lead_inquiry | FIVETRAN_REFERRAL.PG_PUBLIC |
Lead detail with rdc_inquiry_guid (links to submitted_lead_id) |
| lead_inquiry_allocation | FIVETRAN_REFERRAL.PG_PUBLIC |
Lead-to-asset allocation records (paid/followup) |
| dim_asset_current | RDC_ENTSYS.SFDC |
SFDC asset/contract definitions |
| dim_account_current | RDC_ENTSYS.ACCOUNT |
Client/account information |
Key Identity Fields
- User ID:
adjusted_uu_id(canonical across tables) - Visit ID:
visit_id(clickstream) - Lead ID:
submitted_lead_idorinquiry_lead_id - GCLID:
google_click_id(paid search attribution) - Asset ID:
parent_asset_idorfulfill_to_asset_id(client fulfillment)
Key Date Fields
- Clickstream:
event_date_mst(Mountain Time)
Workflow
When a query involves these tables:
Identify the question type:
- User behavior → clickstream_detail
- Lead/revenue metrics → marketing_conversion_detail
- Campaign performance → sem_summary, app_summary, psocial_summary
- Cross-channel spend analysis → spend
- Client fulfillment analysis → lead_inquiry_allocation + dim_asset_current
Load relevant reference docs:
- Schema details → references/snowflake_core_tables.md
- Business rules → references/business_logic_reference.md
- Clickstream specifics → references/clickstream_view_annotated.md
- EFR calculations → references/marketing_conversion_detail_annotated.md
- Spend consolidation → references/spend_view_annotated.md
- Lead/property/delivery/fulfillment paths → references/lead_property_delivery_paths.md
Apply correct filters:
- Always filter on date first (these are large tables)
- Use the appropriate date field for each table
- Apply business-specific exclusions as documented
Use proper metrics:
- EFR for revenue attribution
- IEFR for incrementality-adjusted revenue
- Snapshot vs Cohort for app metrics
Critical Business Logic
Incrementality Multipliers (IEFR)
Different channels have different incrementality adjustments:
| Channel | Condition | Multiplier |
|---|---|---|
| SEM | B2C + Buy + Web | 0.70 |
| App - Apple Search | iOS | 0.46 |
| App - Google | iOS | 0.46 |
| App - Google | Android | 0.56 |
| App - Other Media | Any | 0.70 |
| PSocial - Criteo | B2C + Buy + Web | 0.65 |
| PSocial - FB Veterans/Brand | - | 0.70 |
| PSocial - FB Retargeting | - | 0.55 |
Product Verticals (Clickstream) - Field name: product_vertical
for_sale → Buy intent pages
for_rent → Rental pages
not_for_sale_ldp → Off-market/recently sold
owner_seller → Seller marketplace, My Home
sub_finance → Mortgage/finance pages
realtorpage → Agent pages
Lead Verticals (Conversions) - Field name: target_vertical
for_sale → Buy leads
for_rent → Rental leads
Seller → Sell leads (or DELIVERED_TO_UPNEST/SELLER_PARTNER flags)
FAR → Find a Realtor leads
EFR Components
Total EFR is calculated from multiple revenue sources:
| Component | Source | Description |
|---|---|---|
buy_efr_referral |
RCC referral model | 7-day rolling avg by price tier |
sell_efr_referral |
RCC referral model | 7-day rolling avg by price tier |
cplus_contract_revenue |
lead_sales_revenue | Connections Plus contracts |
mvip_revenue |
mvip_revenue | MVIP contracts |
adpro_revenue |
Zip code avg from C+ | Advantage Pro |
veterans_revenue |
Fixed rate schedule | Veterans United |
sales_builder_revenue |
nc_sales_builder_revenue | New Construction |
sell_realchoice_efr |
realchoice_revenue | UpNest sell |
rent_efr |
rentals_revenue | Rental leads |
Delivery Type Hierarchy
When classifying leads by client delivery product, use this mutually exclusive priority:
CASE
WHEN delivered_to_connections_plus_flag = 1 THEN 'Connections Plus'
WHEN delivered_to_mvip_flag = 1 OR delivered_to_mvip_package_flag = 1 THEN 'MVIP'
WHEN delivered_to_readyconnect_concierge_flag = 1 THEN 'RCC (Non-MVIP)'
ELSE 'No Premium Delivery'
END as delivery_type
Source: wbs2512_pcv_optimization test definition
Listing Type Classification
For property type segmentation, key low-value categories:
land → Vacant land/lots
farms/ranches → Farm properties
mobile home → Mobile/manufactured
Use: LOWER(listing_type) IN ('land', 'farms/ranches') for lot/land analysis.
Lead to Client Fulfillment Join Path
To connect leads to specific client fulfillment assets:
submitted_lead_id → lead_inquiry.rdc_inquiry_guid → lead_inquiry_allocation → dim_asset_current → dim_account_current
Key fields:
allocation_type:paid(monetized) orfollowup(non-monetized)parent_asset_id: Market-level contractfulfill_to_asset_id: Zip-level allocation
See references/lead_property_delivery_paths.md Section 5 for full documentation.
Common Query Patterns
Daily Traffic by Platform
SELECT
event_date_mst,
experience_type,
platform,
COUNT(DISTINCT adjusted_uu_id) as users,
COUNT(DISTINCT visit_id) as sessions,
SUM(is_pageview) as pageviews
FROM rdc_analytics.clickstream.clickstream_detail
WHERE event_date_mst >= DATEADD('day', -7, CURRENT_DATE())
GROUP BY 1, 2, 3
ORDER BY 1, 4 DESC;
Lead Volume by Channel
SELECT
event_date,
last_touch_marketing_channel,
submitted_lead_vertical,
COUNT(*) as leads,
SUM(estimated_future_revenue) as efr,
MEDIAN(lead_listing_price) as median_price
FROM rdc_analytics.revenue.marketing_conversion_detail
WHERE event_date >= DATEADD('day', -30, CURRENT_DATE())
GROUP BY 1, 2, 3
ORDER BY 1, 4 DESC;
SEM Campaign Performance
SELECT
campaign_name,
target_vertical,
SUM(spend) as spend,
SUM(leads) as leads,
SUM(efr) as efr,
SUM(iefr) as iefr,
SUM(spend) / NULLIF(SUM(leads), 0) as cpl,
SUM(iefr) / NULLIF(SUM(spend), 0) as iroas
FROM rdc_marketing.agg_reporting.sem_summary
WHERE event_date >= DATEADD('day', -30, CURRENT_DATE())
GROUP BY 1, 2
HAVING SUM(spend) > 1000
ORDER BY spend DESC;
App Install Efficiency
SELECT
media_source,
platform,
SUM(spend) as spend,
SUM(ssot_installs) as installs,
SUM(cohort_efr_for_sale) as cohort_efr,
SUM(snapshot_iefr_for_sale) as iefr,
SUM(spend) / NULLIF(SUM(ssot_installs), 0) as cpi
FROM rdc_marketing.agg_reporting.app_summary
WHERE date >= DATEADD('day', -30, CURRENT_DATE())
AND is_app = 1
GROUP BY 1, 2
HAVING SUM(spend) > 100
ORDER BY spend DESC;
Cross-Channel Spend Summary
SELECT
channel,
partner,
target_vertical,
SUM(spend) AS spend,
SUM(impressions) AS impressions,
SUM(clicks) AS clicks
FROM rdc_marketing.agg_reporting.spend
WHERE event_date >= DATEADD('day', -30, CURRENT_DATE())
AND target_customer = 'b2c'
GROUP BY 1, 2, 3
HAVING SUM(spend) > 1000
ORDER BY spend DESC;
User Journey to Lead
WITH lead_sessions AS (
SELECT DISTINCT visit_id, inquiry_lead_id
FROM rdc_analytics.clickstream.clickstream_detail
WHERE event_date_mst >= DATEADD('day', -7, CURRENT_DATE())
AND inquiry_lead_id IS NOT NULL
)
SELECT
c.product_vertical,
c.page_type_group,
COUNT(DISTINCT c.visit_id) as sessions,
COUNT(DISTINCT l.inquiry_lead_id) as leads,
COUNT(DISTINCT l.inquiry_lead_id)::FLOAT / NULLIF(COUNT(DISTINCT c.visit_id), 0) as conversion_rate
FROM rdc_analytics.clickstream.clickstream_detail c
LEFT JOIN lead_sessions l ON c.visit_id = l.visit_id
WHERE c.event_date_mst >= DATEADD('day', -7, CURRENT_DATE())
AND c.product_vertical IN ('for_sale', 'for_rent')
GROUP BY 1, 2
ORDER BY 1, 3 DESC;
Lead to Client Fulfillment
WITH lead_allocations AS (
SELECT DISTINCT
li.rdc_inquiry_guid as submitted_lead_id,
lia.parent_asset_id,
lia.allocation_type
FROM fivetran_referral.pg_public.lead_inquiry_allocation lia
JOIN fivetran_referral.pg_public.lead_inquiry li ON li.id = lia.lead_inquiry_id
WHERE lia.deleted_at IS NULL
)
SELECT
mcd.last_touch_marketing_channel,
la.allocation_type,
COUNT(DISTINCT mcd.submitted_lead_id) as leads,
SUM(mcd.estimated_future_revenue) as efr
FROM rdc_analytics.revenue.marketing_conversion_detail mcd
LEFT JOIN lead_allocations la ON mcd.submitted_lead_id = la.submitted_lead_id
WHERE mcd.event_date >= DATEADD('day', -30, CURRENT_DATE())
GROUP BY 1, 2
ORDER BY 3 DESC;
Data Caveats
RCC Market Mapping Coverage
The zip-to-RCC-market mapping only covers ~39% of leads. Join path:
postal_code → lead_zone_zipcode.zipcode → lead_zone.zone_id → market.id
Tables: fivetran_referral.pg_public.{lead_zone_zipcode, lead_zone, market}
Recommendation: Use state or dma_description fields from marketing_conversion_detail for complete geographic coverage (99.9%). Reserve RCC market mapping for RCC-specific inventory analysis only.
Lead Allocation Coverage
Not all leads have allocation records in lead_inquiry_allocation. The allocation tables only contain leads that were routed through the allocation system (primarily premium products like MVIP).
Historical Data Considerations
- SEM tracking codes changed multiple times (pre-Oct 2021, Oct-Jan 2022 issues, Jan 2022+)
- App data before June 2021 comes from separate historical table
- PSocial FB tracking broke in 2024 (special handling in view)
SKAN Privacy (iOS)
- iOS 14.5+ requires SKAdNetwork for attribution
ssot_installs= SKAN + non-SKAN combinedskan_multiplieradjusts for attribution gaps
Snapshot vs Cohort (App)
- Snapshot: Measured on event date (use for daily reporting)
- Cohort: Attributed to install date (use for LTV analysis)
Spend Table Considerations
- Linear TV (OceanMedia) has no impression/click data
- Recent TV data (last 7 days) may use pacing instead of actuals
- TikTok spend (
bytedanceglobal_int) is zeroed after 2024-01-01 - App accounts are excluded from platform aggregates (Google, Facebook, Bing, LinkedIn) to prevent double-counting
EFR Calculation Notes
- EFR is lagged 30 days - Recent leads have no referral EFR (rolling avg uses days 30-36 back)
- High price tier ($30M+) gets $0 referral EFR (intentional exclusion)
- NY excluded from Veterans United per VU agreement
- Seller EFR capped at $25K to prevent outlier skew
Reference Documents
- snowflake_core_tables.md - Complete schema documentation for all tables
- business_logic_reference.md - Incrementality, attribution rules, metric definitions
- clickstream_view_annotated.md - Detailed clickstream field derivation logic
- marketing_conversion_detail_annotated.md - EFR calculation methodology and revenue sources
- spend_view_annotated.md - Unified spend aggregation across all marketing channels
- lead_property_delivery_paths.md - Lead to property type, client delivery, geographic market, and client fulfillment paths
Tips for Effective Queries
- Always filter on date first - These tables have billions of rows
- Use
adjusted_uu_idfor cross-table user joins - Check data freshness - Some tables are T-1 or T-2
- Use IEFR for incrementality - Raw EFR overstates impact
- Prefer MEDIAN over AVG for price metrics (skewed distributions)
- Use CTEs for complex multi-step queries
- Handle NULLs with NULLIF when dividing
- Use
rdc_inquiry_guidto join leads to allocation records