Add Random X-axis Shifts

step_measure_augment_shift() creates a specification of a recipe step that applies random shifts along the x-axis, so that models can be trained to be invariant to such shifts.

Usage

step_measure_augment_shift(
  recipe,
  max_shift = 1,
  measures = NULL,
  role = NA,
  trained = FALSE,
  skip = TRUE,
  id = recipes::rand_id("measure_augment_shift")
)

Arguments

recipe: A recipe object.
max_shift: Maximum shift amount in location units. The actual shift is uniformly sampled from [-max_shift, max_shift].
measures: An optional character vector of measure column names.
role: Not used.
trained: Logical indicating if the step has been trained.
skip: Logical. Should the step be skipped when the recipe is baked on new data? Default is TRUE.
id: Unique step identifier.

Value

An updated recipe with the new step added.

Details

This step adds random x-axis shifts to help models become invariant to small retention time or wavelength shifts. This is particularly useful for chromatographic data where peak positions may vary slightly.

The spectrum is linearly interpolated at the shifted positions. Shifted positions that fall outside the original range take the boundary values.
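The interpolation described above can be illustrated with a minimal base R sketch. It is not the package's internal code: shift_spectrum() is a hypothetical helper, the toy data are made up, and the sign convention (evaluating at location + shift) is an assumption. It relies only on stats::approx(), where rule = 2 reuses the value at the closest boundary for positions outside the original range.

```r
# Hypothetical helper (not part of the package): shift one spectrum along
# its x-axis and resample it by linear interpolation.
shift_spectrum <- function(location, value, max_shift = 1) {
  # Draw a single shift uniformly from [-max_shift, max_shift]
  shift <- runif(1, min = -max_shift, max = max_shift)

  # Interpolate linearly at the shifted positions (assumed sign convention).
  # rule = 2 uses the nearest boundary value outside the original range.
  shifted <- approx(
    x = location,
    y = value,
    xout = location + shift,
    method = "linear",
    rule = 2
  )$y

  list(shift = shift, value = shifted)
}

# Toy spectrum with a single peak (illustrative data only)
set.seed(123)
location <- 1:10
value <- c(0, 0, 1, 4, 9, 4, 1, 0, 0, 0)

res <- shift_spectrum(location, value, max_shift = 2)
res$shift   # the sampled shift, somewhere in [-2, 2]
res$value   # the spectrum evaluated at the shifted positions
```

Because the shift is drawn symmetrically around zero, the choice of sign does not change the distribution of augmented spectra. Setting max_shift = 0 returns the input unchanged.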

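Default behavior (skip = TRUE): The shift is applied only when the recipe is prepped on the training data. When the recipe is baked on new data, for example at prediction time, the step is skipped. Because bake(rec, new_data = NULL) returns the processed training set, its output includes the shifts.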

Examples

library(recipes)
library(measure)

rec <- recipe(water + fat + protein ~ ., data = meats_long) |>
  update_role(id, new_role = "id") |>
  step_measure_input_long(transmittance, location = vars(channel)) |>
  step_measure_augment_shift(max_shift = 2) |>
  prep()

bake(rec, new_data = NULL)
#> # A tibble: 215 × 6
#>       id water   fat protein .measures channel    
#>    <int> <dbl> <dbl>   <dbl>    <meas> <list>     
#>  1     1  60.5  22.5    16.7 [100 × 2] <int [100]>
#>  2     2  46    40.1    13.5 [100 × 2] <int [100]>
#>  3     3  71     8.4    20.5 [100 × 2] <int [100]>
#>  4     4  72.8   5.9    20.7 [100 × 2] <int [100]>
#>  5     5  58.3  25.5    15.5 [100 × 2] <int [100]>
#>  6     6  44    42.7    13.7 [100 × 2] <int [100]>
#>  7     7  44    42.7    13.7 [100 × 2] <int [100]>
#>  8     8  69.3  10.6    19.3 [100 × 2] <int [100]>
#>  9     9  61.4  19.9    17.7 [100 × 2] <int [100]>
#> 10    10  61.4  19.9    17.7 [100 × 2] <int [100]>
#> # ℹ 205 more rows
