Augmentations¶
RF-DETR supports custom data augmentations via Albumentations, with automatic bounding box and mask handling for geometric transforms. Albumentations 1.4.24+ and 2.x are supported.
Quick Start¶
Pass aug_config to your training call. Import one of the built-in presets:
from rfdetr import RFDETRSmall
from rfdetr.datasets.aug_config import AUG_CONSERVATIVE, AUG_AGGRESSIVE, AUG_AERIAL, AUG_INDUSTRIAL
model = RFDETRSmall()
model.train(dataset_dir="path/to/dataset", epochs=100, aug_config=AUG_CONSERVATIVE)
Or pass a custom dict directly — keys are Albumentations transform names:
model.train(
dataset_dir="path/to/dataset",
epochs=100,
aug_config={
"HorizontalFlip": {"p": 0.5},
"Rotate": {"limit": 15, "p": 0.3},
"GaussianBlur": {"p": 0.2},
},
)
To disable augmentations: aug_config={}. Omitting it uses the default (horizontal flip at 50%).
Built-in Presets¶
| Preset | Best for |
|---|---|
AUG_CONSERVATIVE |
Small datasets (under 500 images) |
AUG_AGGRESSIVE |
Large datasets (2000+ images) |
AUG_AERIAL |
Satellite / overhead imagery |
AUG_INDUSTRIAL |
Manufacturing / inspection data |
All presets are plain dicts — inspect or extend them before passing:
from rfdetr.datasets.aug_config import AUG_AGGRESSIVE
my_config = {**AUG_AGGRESSIVE, "VerticalFlip": {"p": 0.1}}
model.train(dataset_dir="...", aug_config=my_config)
Recommendations by Dataset Size¶
| Dataset Size | Recommended preset |
|---|---|
| Under 500 images | AUG_CONSERVATIVE — flip + mild brightness/contrast |
| 500–2000 images | Default or AUG_CONSERVATIVE with a few extra transforms added |
| 2000+ images | AUG_AGGRESSIVE — rotations, affine, color jitter |
Nested Transforms¶
RF-DETR supports OneOf, SomeOf, and Sequential container transforms from Albumentations. The most common pattern is OneOf, which randomly picks one transform from a group:
aug_config = {
"HorizontalFlip": {"p": 0.5},
"OneOf": {
"transforms": [
{"Rotate": {"limit": 45, "p": 1.0}},
{"Affine": {"scale": (0.8, 1.2), "p": 1.0}},
],
},
"GaussianBlur": {"p": 0.2},
}
Each child's p controls its relative selection weight. The container itself always fires.
If you need the same transform twice, or want explicit ordering, pass a list instead of a dict:
aug_config = [
{"HorizontalFlip": {"p": 0.5}},
{"Rotate": {"limit": 45, "p": 0.3}},
{"Rotate": {"limit": 5, "p": 0.5}}, # second Rotate — only possible with list format
]
Bounding boxes are updated automatically when a container holds any geometric transform — no extra configuration needed.
Geometric vs. Pixel-Level Transforms¶
RF-DETR automatically handles bounding boxes for geometric transforms (flips, rotations, crops, affine, perspective). Pixel-level transforms (blur, noise, color) preserve coordinates unchanged. You don't need to handle this distinction — it's automatic based on the transform name.
Best Practices¶
Start Conservative
Begin with simple augmentations (horizontal flip, small brightness changes) and gradually add more as needed.
Geometric Transforms
Be careful with aggressive rotations and crops on datasets where object orientation matters (e.g., text detection, oriented objects).
- CPU-bound: Augmentations run on CPU during data loading — more transforms means slower loading
- Use
num_workers: Parallelize augmentation across data loader workers - Monitor training mAP vs validation mAP: With strong augmentations it's normal for training mAP to be lower — validation uses original images while training uses augmented (harder) ones
Troubleshooting¶
Training is slow — reduce the number of transforms or increase num_workers.
Boxes disappear after augmentation — aggressive rotations or crops can push boxes outside the image boundary. Reduce rotation angles or avoid large crops.
Model not improving — augmentations may be too aggressive. Start with AUG_CONSERVATIVE and add transforms gradually. Try removing geometric transforms first to isolate the cause.
Validation mAP is much higher than training mAP — this is expected with strong augmentations and not a bug. See the monitoring tip above.
Upgrading albumentations to 2.x with existing RandomSizedCrop configs? RF-DETR automatically adapts height/width kwargs to the size=(height, width) format required by albumentations 2.x. No config changes needed.
Advanced: Custom Transforms¶
Any Albumentations transform works by name. If your custom transform is geometric, register it in rfdetr/datasets/transforms.py so boxes are updated automatically:
Then use it like any other transform:
model.train(
dataset_dir="...",
aug_config={
"HorizontalFlip": {"p": 0.5},
"YourCustomTransform": {"param": 1, "p": 0.3},
},
)
Reference¶
Next Steps¶
- Monitor training with TensorBoard
- Use early stopping to prevent overfitting
- Export your trained model for deployment