
Point In Time Correctness

Learn Point In Time Correctness for free with explanations, exercises, and a quick test (for Machine Learning Engineers).

Published: January 1, 2026 | Updated: January 1, 2026

Why this matters

Point-in-time (PIT) correctness prevents label leakage in machine learning. When you compute features for training or serving, you must use only information that was available at the prediction moment. This is critical for:

  • Credit risk approvals: using only data known before the decision time.
  • Fraud detection: computing features at swipe time, not after chargebacks arrive.
  • Recommendations/ads: joining user metrics as-of impression time, not after engagement.
  • Demand forecasting: using inventory and price snapshots valid at forecast cut-off.

If you skip PIT correctness, offline metrics look great but models fail in production. Getting this right is a core responsibility of a Machine Learning Engineer using feature stores.

Concept explained simply

At any prediction time t, your model can only see features that were both measured before t and actually available to the system by t. Two timestamps matter for each feature row:

  • event_time (when the fact happened or is effective)
  • available_at (when the fact became available to the ML system after processing/ETL)

PIT correctness rule: for a prediction at time t, include a feature row only if event_time ≤ t and available_at ≤ t.
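The rule fits in a two-line predicate. A minimal sketch in Python (the function name and signature are illustrative, not from any particular feature store API):

```python
from datetime import datetime

def is_visible(event_time: datetime, available_at: datetime,
               prediction_time: datetime) -> bool:
    """A feature row may be used at prediction_time only if the fact
    both happened and reached the system by then."""
    return event_time <= prediction_time and available_at <= prediction_time

# A row measured at 23:00 but only loaded at 03:00 the next morning
# is visible to a 10:00 prediction, but not to a 02:00 prediction.
ev = datetime(2023, 4, 9, 23, 0)
av = datetime(2023, 4, 10, 3, 0)
print(is_visible(ev, av, datetime(2023, 4, 10, 10, 0)))  # True
print(is_visible(ev, av, datetime(2023, 4, 10, 2, 0)))   # False
```

Note that both conditions are needed: checking only event_time would let the 02:00 prediction leak a value that had not yet been loaded.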

Mental model

Imagine you’re watching a live game. You can comment on what you’ve already seen, not on the next play. event_time is when the play happened. available_at is when the TV broadcast reaches you after a small delay. Your comment at minute t can only reference plays that occurred and were broadcast to you by t.

Key definitions

  • Label leakage: Using future information when creating features, causing overly optimistic offline performance.
  • As-of join: A time-aware join that picks the latest feature snapshot where event_time ≤ t and available_at ≤ t.
  • Late-arriving data: Facts that occurred earlier but arrived in storage later. They must be guarded by available_at to avoid leakage.
  • Training-serving skew: Differences between how features are generated offline vs online. PIT correctness reduces this.

Worked examples

Example 1 — Credit decision

Decision time: 2023-04-10 10:00.

  • Feature last_30d_missed_payments: event_time=2023-04-09 23:00, available_at=2023-04-10 03:00 → Allowed (both ≤ 10:00).
  • Feature delinquency_flag (from bureau): event_time=2023-04-08 00:00, available_at=2023-04-10 14:00 → Not allowed (available_at is after the decision time).
  • Feature age_from_profile: event_time=2015-06-01, available_at=2015-06-01 → Allowed.

Only use the first and third features.

Example 2 — Ads CTR features

Impression time: 2023-03-05 21:00.

  • User rolling_clicks_7d: event_time=2023-03-05 20:55, available_at=2023-03-05 21:00 → Allowed (available exactly at the impression time, and the rule is ≤).
  • User purchases_30d: event_time=2023-03-05 20:50, available_at=2023-03-05 22:00 → Not allowed (arrives after the impression).

Example 3 — Fraud model with daily merchant score

merchant_score is recomputed nightly at 00:10, covering the previous day.

  • Transaction at 2023-06-07 12:00 can use merchant_score with event_time ≤ 2023-06-07 and available_at ≤ 2023-06-07 12:00.
  • The score for 2023-06-07 that becomes available at 00:10 on 2023-06-08 cannot be used for the 12:00 transaction on 2023-06-07.
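The nightly schedule can be made explicit with a small helper. This is a sketch under the stated assumption that the job for a given day completes at 00:10 the next day; the function name is hypothetical:

```python
from datetime import datetime, date, time, timedelta

def score_available_at(score_day: date) -> datetime:
    """The nightly job covering score_day finishes at 00:10 on the next day."""
    return datetime.combine(score_day + timedelta(days=1), time(0, 10))

txn_time = datetime(2023, 6, 7, 12, 0)
# The 2023-06-06 score became available at 2023-06-07 00:10, so it is usable:
print(score_available_at(date(2023, 6, 6)) <= txn_time)  # True
# The 2023-06-07 score only arrives at 2023-06-08 00:10, so it is not:
print(score_available_at(date(2023, 6, 7)) <= txn_time)  # False
```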

How to implement PIT correctness (practical steps)

  1. Store timestamps per feature row
    • event_time: when the feature is effective.
    • available_at (or write_time): when the feature row became queryable.
  2. Build as-of joins for training
    • For each training example with prediction_time t, select the latest feature row where event_time ≤ t AND available_at ≤ t.
  3. Serving consistency
    • Online store updates should maintain the same available_at semantics.
    • Optionally cache a read snapshot time per request to keep multi-feature consistency.
  4. Test for leakage
    • Unit test: No joined feature row has available_at > prediction_time.
    • Data audit: Sample joined datasets and validate monotonicity constraints.

Mini reference: As-of join pattern (SQL-like)

-- For each training row r (id, entity_id, prediction_time),
-- keep the latest feature row visible at prediction_time.
SELECT r.id, f.value
FROM training r
LEFT JOIN feature_store f
  ON f.entity_id = r.entity_id
 AND f.event_time <= r.prediction_time
 AND f.available_at <= r.prediction_time
QUALIFY ROW_NUMBER() OVER (
  PARTITION BY r.id
  ORDER BY f.event_time DESC, f.available_at DESC
) = 1;
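The same pattern can be sketched in plain Python, independent of any feature store library: filter to rows visible at t, then keep the latest one. Field names here are illustrative.

```python
from datetime import datetime

def as_of_join(feature_rows, entity_id, prediction_time):
    """Return the value of the latest feature row for entity_id where
    event_time <= prediction_time AND available_at <= prediction_time,
    or None if no row qualifies."""
    candidates = [
        r for r in feature_rows
        if r["entity_id"] == entity_id
        and r["event_time"] <= prediction_time
        and r["available_at"] <= prediction_time
    ]
    if not candidates:
        return None
    # Latest event wins; ties broken by the most recently available row.
    best = max(candidates, key=lambda r: (r["event_time"], r["available_at"]))
    return best["value"]

rows = [
    {"entity_id": 42, "value": 10,
     "event_time": datetime(2023, 3, 5, 9, 0),
     "available_at": datetime(2023, 3, 5, 9, 5)},
    {"entity_id": 42, "value": 12,
     "event_time": datetime(2023, 3, 6, 8, 0),
     "available_at": datetime(2023, 3, 6, 10, 0)},
]
print(as_of_join(rows, 42, datetime(2023, 3, 6, 9, 0)))   # 10: value 12 not yet available
print(as_of_join(rows, 42, datetime(2023, 3, 6, 11, 0)))  # 12
```

A linear scan like this is fine for illustration; at scale the same logic is expressed with a time-aware join in the warehouse, as in the SQL pattern above.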

Exercises

These mirror the graded tasks below. Try here first, then open the Quick Test afterwards. Note: the Quick Test is available to everyone; only logged-in users get saved progress.

Exercise 1 — Spot leakage at prediction time

Prediction time t = 2023-07-10 12:00 for user U1. Which features are allowed?

Rows:
1) last_login_count: event_time=2023-07-09 22:55, available_at=2023-07-09 23:05
2) support_ticket_count: event_time=2023-07-10 11:55, available_at=2023-07-10 12:05
3) payment_failure_flag: event_time=2023-07-08 10:00, available_at=2023-07-12 09:00 (late-arriving)
4) profile_created_at: event_time=2021-02-01 00:00, available_at=2021-02-01 00:00
  • [ ] Row 1 allowed
  • [ ] Row 2 allowed
  • [ ] Row 3 allowed
  • [ ] Row 4 allowed
Solution

Allowed: Rows 1 and 4. Reasoning: For t=12:00, Row 2 has available_at=12:05 (future) and Row 3 has available_at on 2023-07-12 (future). Rule requires both event_time and available_at ≤ t.

Exercise 2 — Do an as-of join

Training events:

A: user_id=42, t=2023-03-05 10:15
B: user_id=42, t=2023-03-06 09:00
C: user_id=7,  t=2023-03-05 22:00

User_metrics:

user=42: value=10, event_time=2023-03-05 09:00, available_at=2023-03-05 09:05
user=42: value=12, event_time=2023-03-06 08:00, available_at=2023-03-06 10:00
user=7:  value=3,  event_time=2023-03-05 20:00, available_at=2023-03-05 20:10

For each event A,B,C, select the correct value as-of t (latest where event_time ≤ t and available_at ≤ t).

Solution

A → 10. For A (t=2023-03-05 10:15), the row with value=12 has event_time=2023-03-06 08:00 and available_at=2023-03-06 10:00, both after t, so only value=10 qualifies.
B → 10. For B (t=2023-03-06 09:00), value=12 satisfies event_time ≤ t but has available_at=10:00 > t, so it is excluded.
C → 3. The only row for user 7 satisfies both conditions.

Checklist before you move on:
  • You compare both event_time and available_at to the prediction time.
  • You pick the latest qualifying row when multiple rows match.
  • You avoid filling gaps with future data.

Common mistakes and self-check

  • Mistake: Using only event_time in joins.
    Self-check: Count how many joined rows have available_at > prediction_time. This must be zero.
  • Mistake: Recomputing historical features with today’s logic but tagging them as if known back then.
    Self-check: Preserve available_at reflecting when the recomputed value would have been visible (often the batch completion time), not the original event_time.
  • Mistake: Mixing sources with different latencies without a unified available_at.
    Self-check: For each source, document latency and set available_at accordingly; test the intersection constraint.
  • Mistake: Timezone drift and daylight saving errors.
    Self-check: Normalize to UTC for event_time, available_at, and prediction_time; store offsets if needed.
  • Mistake: Training-serving skew due to different aggregations.
    Self-check: Snapshot an offline feature row and compare with an online read at the same cutoff; values should match within tolerance.

Practical projects

  • Implement an as-of joiner: Given training events and a features table with event_time and available_at, build a utility that returns the latest valid row per entity and time. Include unit tests for late-arriving data.
  • Backfill with availability: Recompute a 7-day rolling metric historically, and write available_at equal to each day’s batch completion time. Validate no leakage.
  • PIT validator: Write a small checker that scans a training set and reports violations where available_at > prediction_time, with counts by feature source.
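The PIT validator project can start from a sketch like this; the column names (available_at, prediction_time, source) are assumptions about your joined dataset's schema:

```python
from datetime import datetime

def pit_violations(joined_rows):
    """Count leakage violations per feature source: any joined row whose
    feature only became available after the prediction time."""
    counts = {}
    for row in joined_rows:
        if row["available_at"] > row["prediction_time"]:
            counts[row["source"]] = counts.get(row["source"], 0) + 1
    return counts

sample = [
    {"source": "bureau",
     "prediction_time": datetime(2023, 4, 10, 10, 0),
     "available_at": datetime(2023, 4, 10, 14, 0)},   # violation: arrived late
    {"source": "payments",
     "prediction_time": datetime(2023, 4, 10, 10, 0),
     "available_at": datetime(2023, 4, 10, 3, 0)},    # fine
]
print(pit_violations(sample))  # {'bureau': 1}
```

Reporting counts by source (rather than a single total) points you straight at the pipeline whose available_at semantics are wrong.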

Learning path

  • Foundations: Timestamps, timezones, and windowed aggregations.
  • Feature store concepts: entities, feature views, event_time, available_at/write_time.
  • PIT joins: offline training data creation and time-travel semantics.
  • Online serving: freshness SLAs, TTL, and consistency checkpoints.
  • Monitoring: data audits, latency tracking, and leakage guards.

Who this is for

  • Machine Learning Engineers building training datasets and online features.
  • Data Engineers implementing feature pipelines.
  • Applied scientists validating offline vs online performance.

Prerequisites

  • Basic SQL or data frame joins.
  • Understanding of entity keys and timestamps.
  • Familiarity with batch vs streaming ingestion.

Mini challenge

You maintain a daily customer_spend_30d feature recomputed at 02:00 UTC. Write acceptance criteria for PIT correctness when joining to transactions:

  • State the rule using event_time and available_at.
  • Describe a unit test that must fail if a transaction at t uses the 02:00 update from the next day.
  • Describe a metric you’d monitor in production to catch violations.
Example answer
  • Rule: For transaction at t, use the latest row where event_time ≤ t and available_at ≤ t; available_at is the daily job completion time.
  • Unit test: Create a transaction at 2023-05-07 01:30 and assert the join does not select the row with available_at=2023-05-07 02:00.
  • Metric: Percentage of joined rows with available_at - prediction_time > 0 seconds; target 0% with alert at >0.01%.
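The unit test from the example answer might look like the following sketch. The helper and row layout are hypothetical; available_at models the 02:00 UTC job completion time:

```python
from datetime import datetime

# Daily customer_spend_30d rows; available_at is the 02:00 UTC job completion.
rows = [
    {"value": 120.0, "event_time": datetime(2023, 5, 5, 23, 59),
     "available_at": datetime(2023, 5, 6, 2, 0)},
    {"value": 150.0, "event_time": datetime(2023, 5, 6, 23, 59),
     "available_at": datetime(2023, 5, 7, 2, 0)},
]

def latest_visible_value(rows, t):
    """Latest row with event_time <= t and available_at <= t, else None."""
    visible = [r for r in rows
               if r["event_time"] <= t and r["available_at"] <= t]
    return max(visible, key=lambda r: r["event_time"])["value"] if visible else None

def test_no_next_day_leak():
    # A 01:30 transaction must not see the row published at 02:00 that day.
    t = datetime(2023, 5, 7, 1, 30)
    assert latest_visible_value(rows, t) == 120.0

test_no_next_day_leak()
```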

Next steps

  • Finish the exercises above, then take the Quick Test to confirm understanding.
  • Apply the as-of join to one of your datasets and run the PIT validator.
  • Document event_time and available_at for every feature you own.

