TORCH Integration

TORCH is a service for extracting patient data from clinical systems. Aether connects to TORCH to download data based on your query.

How It Works

You provide a CRTDL query file (defines which patients/data you want)
Aether sends it to TORCH
TORCH extracts the matching data
Aether downloads the results

Configuration

Add TORCH credentials to your aether.yaml:

yaml

services:
  torch:
    base_url: "https://your-torch-server.org"
    username: "your-username"
    password: "your-password"

pipeline:
  enabled_steps:
    - torch
    - dimp

Running a TORCH Query

bash

aether pipeline start aether.yaml your-crtdl.json

Aether will show progress as it:

Submits your query
Waits for extraction
Downloads the data
Continues to DIMP (if enabled)

Advanced Options

Timeout Settings

For large queries that take longer:

yaml

services:
  torch:
    base_url: "https://your-torch-server.org"
    username: "your-username"
    password: "your-password"
    extraction_timeout: PT1H     # Default is PT30M
    polling_interval: PT10S      # Default is PT5S

For extractions that may take several days (e.g., large patient cohorts), set extraction_timeout_minutes accordingly:

yaml

services:
  torch:
    extraction_timeout_minutes: 4320  # 3 days

Polling Resilience

During status polling, transient HTTP errors (timeouts, connection resets) are treated as recoverable. If TORCH is temporarily unable to respond to status requests — for example, because it is saturated with CPU-intensive FHIR operations — aether logs a warning and continues polling with exponential backoff rather than failing the entire extraction. The extraction_timeout_minutes setting acts as the safety net: polling only stops when this overall timeout is exceeded.

Direct TORCH URL Import

If you already have a TORCH extraction or result URL, you can pass it directly to skip the CRTDL submission step:

bash

aether pipeline start aether.yaml crtdl.json "https://torch.example.com/fhir/extraction/result-123"

Aether auto-detects TORCH URLs by looking for /fhir/extraction/ or /fhir/result/ in the URL (case-sensitive). When a TORCH URL is provided, Aether:

Skips extraction submission — does not send a CRTDL query
Polls the URL — sends GET requests with exponential backoff until the extraction is complete (HTTP 200) or times out
Downloads all result files — fetches multiple NDJSON files from the extraction result

This is useful when:

Reusing results from a previous extraction
Sharing extraction URLs between team members
Resuming a download from a known TORCH endpoint

URL patterns

URLs must contain one of these path segments to be recognized as TORCH URLs:

/fhir/extraction/ — e.g., https://torch.example.com/fhir/extraction/result-123
/fhir/result/ — e.g., https://torch.example.com/fhir/result/abc-xyz

All other HTTP(S) URLs are treated as plain HTTP imports (single-file download, no polling).

Configuration

TORCH URL imports still require TORCH configuration for authentication:

yaml

services:
  torch:
    base_url: "https://your-torch-server.org"
    username: "your-username"
    password: "your-password"

pipeline:
  enabled_steps:
    - torch
    - dimp

The extraction_timeout and polling interval settings also apply.

Comparison: CRTDL vs TORCH URL vs HTTP

	CRTDL	TORCH URL	HTTP URL
Input example	`crtdl.json`	`https://torch/fhir/result/123`	`https://example.com/data.ndjson`
Submits extraction	Yes	No	No
Polls for completion	Yes	Yes	No
Downloads multiple files	Yes	Yes	No (single file)
Requires TORCH auth	Yes	Yes	No
First pipeline step	`torch`	`torch`	`http_import`

TORCH Integration ​

How It Works ​

Configuration ​

Running a TORCH Query ​

Advanced Options ​

Timeout Settings ​

Polling Resilience ​

Direct TORCH URL Import ​

URL patterns ​

Configuration ​

Comparison: CRTDL vs TORCH URL vs HTTP ​

TORCH Integration

How It Works

Configuration

Running a TORCH Query

Advanced Options

Timeout Settings

Polling Resilience

Direct TORCH URL Import

URL patterns

Configuration

Comparison: CRTDL vs TORCH URL vs HTTP