학술
기타
Parameter calibration method for car-following models under snowy weather conditions: Integrating an informer time series encoder and physics-informed neural networks
PLOS ONE
조회 0
CC BY
이 매체는 공공·자유 라이선스로 본문을 직접 표시합니다.Figures
Abstract
Under snowy weather conditions, factors such as road conditions and weather conditions significantly affect vehicle car-following behavior. Traditional car-following models struggle to accurately capture driving characteristics on slippery roads. To address this, this paper proposes a parameter calibration method for car-following models under snowy conditions, considering factors including road adhesion coefficient and visibility. Five classical car-following models are selected for analysis: the GM model, the Gipps model, the Intelligent Driver Model (IDM), the Wiedemann model, and the Full Velocity Difference Model (FVDM). A systematic analysis is conducted on the key parameters to be calibrated for each model in snowy environments. To overcome the poor adaptability and low accuracy of traditional calibration methods, an adaptive parameter calibration framework combining an Informer time series encoder and a physics-informed neural network is proposed. This method extracts features of snowy environments using the Informer time series encoder and achieves dynamic optimization of model parameters via the physics-informed neural network algorithm, making it applicable to multiple car-following models simultaneously. Validation results based on the NGSIM dataset and real vehicle test data under snowy conditions show that the proposed method improves calibration accuracy by 12.7% under snowy scenarios compared to the traditional genetic algorithm, and exhibits strong generalization capability across different car-following models. This research can provide fundamental models for traffic simulation systems and enhance simulation accuracy.
Citation: Sun Y, Li W, Yang M, Zhang X (2026) Parameter calibration method for car-following models under snowy weather conditions: Integrating an informer time series encoder and physics-informed neural networks. PLoS One 21(6): e0350550. https://doi.org/10.1371/journal.pone.0350550
Editor: Jinhao Liang, National University of Singapore, SINGAPORE
Received: April 9, 2026; Accepted: May 13, 2026; Published: June 9, 2026
Copyright: © 2026 Sun et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The NGSIM (Next Generation Simulation) dataset is publicly available and can be accessed via the official FHWA portal at: https://data.transportation.gov/Automobiles/Next-Generation-Simulation-NGSIM-Vehicle-Trajector/8ect-6jqj. The icy/snowy vehicle trajectory data collected from the Heilongjiang highway test section are subject to restrictions as per an agreement with the Heilongjiang Provincial Department of Transportation, which prohibits public release of raw data containing sensitive location information. These data are available upon request to the following institutional body, which is not affiliated with the authors of this manuscript: Institution: Heilongjiang Provincial Department of Transportation, Traffic Data Management Office Email: 87501676@163.com All data required to replicate the study findings, including the values behind the means, standard deviations, and the values used to build the graphs and figures in the manuscript, are available within the Supporting Information file minimal data set.xlsx.
Funding: This research was supported by Key Discipline of Henan Province: Traffic and Transportation Engineering (2023SZDXK318), Huanghe Jiaotong University, and the Subject and Specialty Construction Funding Project for Private Regular Institutions of Higher Education in Henan Province (2023). We would like to clarify that the funder provided financial support for data collection, the purchase of experimental equipment, and the publication fee for this manuscript.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Snowy weather is one of the major meteorological factors affecting highway traffic safety and operational efficiency. According to statistics, traffic accidents caused by snowy conditions account for more than 25% of all crashes on highways in northern China, among which rear‑end collisions constitute about 60% [1]. Car‑following behavior is a core component of microscopic traffic flow simulation; its accuracy directly determines the capability of traffic flow models to represent traffic operation under snowy environments. However, under such conditions, the road adhesion coefficient decreases significantly, visibility drops, driver perception‑reaction time increases, following distances enlarge, and acceleration/deceleration behavior becomes more conservative, rendering model parameters calibrated under dry pavement conditions invalid. Therefore, developing precise calibration methods for car‑following model parameters under snowy weather is of great significance for improving traffic simulation accuracy and optimizing dynamic traffic management strategies.
Parameter calibration of car‑following models serves as a crucial bridge linking theoretical models and real‑world driving behavior [2]. Its accuracy directly affects the reliability of traffic flow simulation, the effectiveness of vehicle dynamics control, and the trustworthiness of virtual validation for autonomous driving systems [3]. Parameter calibration combines optimization theory and statistical inference: by constructing an appropriate objective function, selecting a suitable optimization algorithm, and using high‑precision trajectory data, it inverts the parameter combination that best matches the model output to the measured data.
However, as research deepens, the academic community has gradually recognized the complexities hidden in this seemingly standardized procedure [4]. Traditional calibration studies have mostly focused on deterministic models, such as the Intelligent Driver Model (IDM), the Optimal Velocity Model (OVM), or the Gipps model. These models assume that the same input always yields the same output, so calibration efforts mainly concentrate on how to choose the objective function, handle data noise, and improve optimization efficiency [5–8]. Regarding the choice of objective function, researchers have found that using spacing error as the core metric is often more robust than using speed error, because spacing, as an accumulated variable, can smooth out high‑frequency noise [9]. At the optimization algorithm level, methods such as genetic algorithms, sequential quadratic programming, least squares, maximum likelihood estimation, and particle swarm optimization each have their applicable scenarios, and algorithm performance strongly depends on the specific data characteristics and model structure [10]. Li et al. [11] adopted a genetic algorithm to calibrate the IDM parameters, achieving good results under dry pavement. Treiber and Kesting proposed a Bayesian inference‑based calibration method that can quantify parameter uncertainty [7]. In recent years, deep learning approaches have gradually been applied to traffic modeling; for instance, Mo et al. [12] used LSTM networks to directly learn car‑following behavior, bypassing the constraints of traditional model structures. Moreover, data sampling intervals, trajectory completeness, interaction effects among parameters, and the global sensitivity of the model itself all systematically influence calibration results, prompting researchers to introduce global sensitivity analysis to identify key parameters, thereby enabling model simplification or hierarchical calibration [13].
In recent years, with a growing understanding of the stochastic nature of driving behavior, stochastic car‑following models have become a research hotspot [14–16]. Unlike deterministic models, stochastic models explicitly introduce random terms into the dynamic equations to capture the inherent variability in drivers’ perception, decision‑making, and execution processes [14]. For example, the Laval model describes the distribution of desired acceleration using a truncated normal distribution [15], while the Treiber model adds a stochastic process to the IDM [7]. However, the introduction of stochasticity makes calibration exceedingly complex [17]. Wagner et al. [9] pointed out that even if the calibration procedure is theoretically flawless, parameter estimation may still suffer systematic failures because the randomness within the model couples with data noise, making it difficult for traditional error‑minimization methods to distinguish parameter bias from stochastic fluctuations. To address this issue, Zhou et al. [1] recently proposed a method called “Minimum Error over Multiple Runs” (MRMin). Its core idea is to perform multiple stochastic simulations, find the parameter set that minimizes the error in each run, and then take statistical characteristics of the multiple results as the final estimate. Compared with traditional methods like Mean Error over Multiple Runs (MRMean) and Maximum Likelihood Estimation (MLE), MRMin can effectively avoid the tendency to “degenerate” a stochastic model into a deterministic one, achieving nearly unbiased parameter estimation [18–20]. More importantly, combined with Bootstrap sampling, this method can estimate the covariance matrix of the parameters, thereby providing a foundation for confidence interval construction and statistical inference [21]. This marks a shift from pure parameter optimization to a systematic framework that includes uncertainty quantification.
From the current state of research, the field of car‑following model parameter calibration is undergoing profound transformations: from deterministic to stochastic, from point estimation to probabilistic inference, and from single‑model to hybrid modeling. In stochastic model calibration, methods like MRMin have preliminarily solved the problem of biased parameter estimation caused by model randomness, but their generality in complex urban road scenarios and multi‑vehicle interaction situations still needs validation [22]. Regarding uncertainty quantification, although researchers have begun to pay attention to estimating the parameter covariance matrix, the propagation mechanism of parameter uncertainty still lacks a systematic analytical framework [23]. In terms of data fusion, with the increasing accumulation of naturalistic driving datasets (e.g., HighD, exiD) and closed‑field test data, there remains a lack of unified theoretical guidance on how to rationally fuse data from different sources, with different accuracies and sampling characteristics, during the calibration process [24]. Furthermore, the integration of calibration and validation is becoming increasingly evident. Researchers have started to realize that simply maximizing the goodness of fit is insufficient to guarantee the model's validity in virtual test scenarios [25]. A closed‑loop feedback mechanism must be established, linking calibration data selection, objective function design, and validation scenario construction, so that the calibration process is deeply aligned with the ultimate application goals. Overall, car‑following model parameter calibration is no longer a pure optimization computation problem; it has evolved into a comprehensive engineering science problem integrating statistics, system identification, uncertainty quantification, and application‑scenario validation. Future research needs to continue making progress in algorithm robustness, uncertainty propagation mechanisms, and cross‑scenario generalization.
Parameter calibration methods mainly include least‑squares‑based error minimization, maximum likelihood estimation, and Bayesian inference. These methods have achieved good results for deterministic models, but their limitations become increasingly evident when facing scenarios with strong nonlinearity, high noise, and dynamically changing conditions, such as snowy weather. In recent years, some scholars have attempted to introduce data‑driven methods for parameter identification. For example, using neural networks to directly fit the mapping between model parameters and driving environments, or employing recursive least squares for online parameter updating [26–27]. To some extent, these studies have improved the ability to respond to environmental changes, but two problems remain. First, most methods are designed for a specific car‑following model; when the model structure changes, the calibration framework must be redesigned, lacking generality. Second, purely data‑driven methods struggle to embed physical prior knowledge, and under extreme snowy conditions where training data are insufficient, they may produce parameter estimates that violate physical laws [28]. Moreover, most existing studies treat “snowy weather” merely as a discrete operational label, distinguishing dry pavement from snowy pavement by setting different parameter sets, without fully exploiting the continuously changing dynamic characteristics of the snowy environment. Key environmental variables such as road adhesion coefficient, visibility, and snow depth change continuously in real driving scenarios, and their effects on car‑following behavior are nonlinear and hysteretic. Existing calibration methods lack explicit extraction and fusion mechanisms for these environmental features, so the calibration results cannot adaptively adjust as the environment changes [29].
To address the above shortcomings, this paper proposes a parameter calibration framework that integrates an Informer time series encoder with a Physics‑Informed Neural Network (PINN). The Informer time series encoder employs a ProbSparse self‑attention mechanism to extract long‑range temporal features of car‑following behavior, generating a compact driving behavior representation vector. The physics‑informed neural network feeds the temporal features into a parameter generation network, while simultaneously incorporating the differential equations of the car‑following model as physical residual constraints into the loss function, thereby obtaining physically plausible model parameters through joint optimization. This framework is model‑agnostic and can be simultaneously applied to various classical car‑following models such as IDM, Gipps, and OVM, simply by mapping the parameter space of the corresponding model to the action output of the reinforcement learning component. In this way, the proposed method overcomes the static limitations of traditional calibration methods under time‑varying environments, achieving a leap from “fixed parameters” to “environmentally adaptive parameters”, and provides a new technical path for high‑fidelity vehicle simulation and autonomous driving decision‑making under complex weather conditions such as ice and snow.
2. Vehicle following characteristics in snow weather
The impact of snowy environments on car‑following behavior is mainly reflected in three aspects: the driver's perception ability, decision‑making logic, and vehicle physical response characteristics. Compared with dry pavement, under snowy conditions the road adhesion coefficient decreases significantly, visibility drops, and braking distance increases considerably. These objective environmental changes directly translate into systematic alterations in car‑following behavior characteristics.
Under snowy weather, drivers generally adopt conservative driving strategies, actively reducing speed to gain greater safety margins. Extensive naturalistic driving data indicate that average speeds under snowy conditions are about 15% to 30% lower than on dry pavement, with speed distributions being more concentrated and higher‑speed driving behavior significantly reduced [5]. Measured data show that the average time headway under snowy conditions can increase from 1.5‑2.0 seconds on dry pavement to 2.5‑4.0 seconds [10]. Some simulator experiments have found that driver reaction time under snowy conditions can increase by 0.3 to 0.8 seconds [11]. In car‑following models, this change is reflected as an increase in the reaction time parameter, which has a particularly notable impact on models that explicitly include reaction delays. The friction coefficient μ on snowy roads can drop from 0.7‑0.8 on dry pavement to 0.1‑0.3, resulting in a significant extension of vehicle braking distance. According to the kinematic formula, the braking distance s at initial speed v₀ is s = v₀²/(2 μg); when μ decreases from 0.7 to 0.2, the braking distance increases to 3.5 times its original value. During snowfall, visibility typically drops to 100‑500 meters, affecting the driver's ability to accurately perceive speed changes of the leading vehicle and leading to prolonged reaction times. When visibility is below 200 m, driver reaction time can extend from the normal range of 1.0‑1.5 seconds to 2.0‑3.0 seconds [5].
Based on real‑vehicle test data under snowy weather conditions (data source: Heilongjiang Highway Test Section, China, December 2022 to February 2023), the differences in car‑following behavior between snowy and dry conditions are statistically obtained as Table 1 and Figs 1 and 2. Under snowy pavement conditions, drivers significantly reduce vehicle speed, increase following distance, apply smoother braking, prolong reaction time, and maintain a more stable driving state. All these behavioral changes are adaptive safety responses to an environment characterized by low adhesion, low visibility, and high risk.
3. Car-following models and parameters to be calibrated
This paper selects five representative types of car-following models, covering different modeling approaches, in order to verify the universality of the calibration method.
3.1 GM model
The GM model is the oldest car-following model, expressing the acceleration of the following vehicle as a function of the speed difference and spacing between the leading and following vehicles:
(1)where, an is the acceleration of the following vehicle, vn is its speed, Δx is the spacing, and Δv is the speed difference. The parameters to be calibrated include the sensitivity coefficient α, the speed exponent β, and the spacing exponent γ. The GM model has a simple structure but limited capability in describing complex driving behavior. Under snowy conditions, α typically decreases (slower response), while γ increases (greater sensitivity to spacing).
3.2 Gipps model
The Gipps model is based on the safe following principle, which states that the following vehicle should maintain sufficient space to come to a safe stop in the event of an emergency brake by the leading vehicle. Its speed update formula is as follows:
(2)There are four parameters to be calibrated: the desired speedv0, the effective reaction time τ, the maximum acceleration amax, and the desired deceleration bmax. Under snowy weather, the desired speed decreases significantly, τ increases, and both amax and bmax decrease.
3.3 Intelligent Driver Model (IDM)
The IDM is currently the most widely used continuous car-following model, which describes driving behavior through the concept of desired spacing:
(3)(4)There are six parameters to be calibrated: the desired speed v0, the desired time headway T, the maximum acceleration amax, the comfortable deceleration b, the minimum spacing s0, and the acceleration exponent δ. Under snowy conditions, v0 decreases by 30%–50%, TT increases by 60%–100%, amax and b decrease, and s0 increases.
3.4 Wiedemann model
The Wiedemann model is the core model in simulation software such as VISSIM. It describes car-following behavior based on the driver's psychological perception thresholds. The model divides car-following states into four modes: free driving, approaching, following, and emergency braking, with state transitions controlled by multiple threshold parameters. Parameters to be calibrated: standstill desired spacing AX, desired spacing coefficient for following state BX, spacing threshold for following state SDX, speed difference threshold SDV, speed difference perception threshold CLDV, and speed difference approach threshold OPDV. Under snowy conditions, all spacing‑related parameters increase, while the speed difference thresholds decrease.
3.5 Full Velocity Difference Model (FVDM)
The FVDM introduces a speed difference term into the optimal velocity model, enabling better description of traffic flow instability phenomena:
(5)where the optimal velocity function V(Δx) is typically expressed in a hyperbolic tangent form: V(Δx) = V1 + V2·tanh(C1·Δx-C2). Parameters to be calibrated: sensitivity coefficient κ, speed‑difference sensitivity coefficient λ, V1, V2, C1, C2. Under snowy conditions, κ decreases (slower response) and λ decreases (reduced sensitivity to speed differences).
3.6 Summary of parameters to be calibrated
From the GM model to the FVDM model, the number of car-following model parameters shows an overall increasing trend, gradually rising from the three basic parameters of the classic GM model to six parameters in models such as IDM, Wiedemann, and FVDM. This reflects the evolution of car-following models from simplified linear descriptions toward more refined and complex representations that better align with real driving behavior, as shown in Table 2.
4. Calibrated method for car-following models
To address the issues of insufficient capture of long‑sequence temporal dependencies and poor physical interpretability in parameter calibration of car‑following models under snowy weather, this paper proposes a parameter calibration framework that integrates an Informer time series encoder with a Physics‑Informed Neural Network (PINN), termed Informer‑PINN‑Calib. The overall structure of this framework is shown in Fig. 3, and it mainly consists of three core modules: data preprocessing and feature embedding, the Informer time series encoder, and physics‑informed parameter generation.
The first module is data preprocessing and feature embedding: it slices and normalizes the raw car‑following trajectories and snowy environmental data, and constructs a time‑series input matrix. The second module is the Informer time series encoder: it uses the ProbSparse self‑attention mechanism to extract long‑range temporal features of car‑following behavior, generating a compact driving behavior representation vector. The third module is physics‑informed parameter generation: it feeds the temporal features into a parameter generation network while incorporating the differential equations of the car‑following model as physical residual constraints into the loss function, obtaining physically plausible model parameters through joint optimization. The framework adopts an end‑to‑end supervised learning approach, avoiding the instability of traditional reinforcement learning, while ensuring through physical constraints that the calibration results comply with car‑following dynamics.
4.1 Data processing and parameter selection
Each sample is a stable car‑following segment with a fixed sampling frequency of 1 Hz and a length of T = 60 seconds. Each sample contains two types of data: car‑following trajectory data and environmental characteristics. The car‑following trajectory includes the following vehicle's speed vn, following vehicle's acceleration an, spacing Δx, and speed difference Δv. The environmental characteristics include the road friction coefficient μ, visibility vis(t), and precipitation intensity p(t)(discrete levels: 0-none, 1-light snow, 2-moderate snow, 3-heavy snow). The input tensor for a single sample is denoted as , where 7 corresponds to the above seven feature dimensions. Each feature is normalized independently for each sample. Numerical features such as speed and spacing are standardized using Z‑score (zero mean, unit variance), while discrete environmental features retain their original values. To avoid edge effects, samples are generated using a sliding window with a step size of 5 seconds, resulting in overlap between adjacent samples. To ensure calibration quality, only segments satisfying the following conditions are selected: spacing 10m ≤ Δx ≤ 150m; duration ≥ 60s; following vehicle speed vn ≥ 5 km/h; speed difference ∣Δv∣ ≤ 5 km/h (to exclude non‑following behaviors such as lane changes).
4.2 Informer time series encoder
The Informer effectively reduces the computational complexity of the Transformer through the ProbSparse self‑attention mechanism and distillation operations, while also capturing long‑range dependencies in sequences. In this paper, it is used as a time‑series feature extractor to capture the characteristics of vehicle car‑following behavior under snowy weather. For the input sequence ^ (), data embedding is first performed:
(6)where, Conv1D () is a convolutional layer (kernel size 3, output dimension dmodel), and Ppos denotes the sinusoidal positional encoding used to preserve temporal order information.
The computational complexity of traditional self‑attention is O(L2), where L is the sequence length. The Informer reduces this complexity by selecting only the “active” queries that contribute most to the attention. For the input of the l-th layer, the ProbSparse self‑attention is defined as:
(7)where, is the set of dominant queries selected from all queries Q according to the Kullback‑Leibler divergence (the sampling factor c is typically set to 5). The selection is based on the sparsity measure M(qi,K) for each query qi:
(8)Queries with larger M values are considered active. This mechanism reduces the attention computational complexity to .
The Informer encoder is stacked with multiple encoding layers. Each layer contains: a ProbSparse self‑attention sub‑layer (with residual connection and layer normalization), and a feed‑forward network sub‑layer (two linear layers with GELU activation). After each layer, a one‑dimensional convolution with stride 2 is applied to halve the sequence length, thereby extracting multi‑scale features.
Suppose the encoder has N = 3 layers. After layer‑wise distillation, the final output feature map is , . Take the mean over all time steps of the last layer to obtain the global temporal feature vector:
(9)This vector z contains the dynamic characteristics of car‑following behavior under snowy environments. To obtain a high‑quality z, we pre‑train the Informer with an auxiliary task of “predicting the acceleration at the next time step” before formal calibration. The auxiliary loss function is:
(10)where, is obtained from the Informer output via a linear prediction head. Pre‑training enables the Informer to learn temporal features that are closely related to driving behavior. The core idea of the Physics‑Informed Neural Network is to embed the differential equations of the car‑following model as soft constraints into the loss function, guiding the network to output parameters that are physically plausible.
For a given car‑following model with parameters θ, the physical residual at time t is defined as:
(11)Where is the acceleration given by the model and is the true acceleration. We adopt a self‑consistent residual form: the difference between the model‑output acceleration and the true acceleration serves as the data term, while the internal consistency of the model's differential equation serves as the physics term. Specifically, for the IDM, the physical residual is:
(12)Where θ=[v0, T, amax, b, s0, δ].
For other models, the corresponding residual equations are defined analogously. For unified processing, we denote the residual of each model as . The mean squared error of the physical residual is computed over the entire sequence:
(13)This loss forces the calibrated parameters θ to make the model output consistent with the true trajectory (i.e., to minimize the residual).
We do not use the physics loss alone; instead, we combine it with a data‑driven loss (i.e., the acceleration prediction error) in a weighted manner to form a joint optimization objective. The temporal feature vector z extracted by the Informer is fed into a Parameter Generation Network (PGN), which outputs the parameters to be calibrated θ:
(14)The PGN consists of multiple fully connected layers, with hidden layer dimensions of 128 and 64 respectively, and an output layer dimension equal to the number of parameters of the target model. To satisfy the physical range of the parameters (e.g., acceleration cannot be negative), the output layer uses a sigmoid activation followed by linear scaling:
(15)where is the i-th output of the PGN, and is the parameter prior interval (determined based on historical statistics under snowy conditions).
The total loss function consists of three parts: the data loss , the physics loss , and the parameter prior .
Data loss : It measures the error between the acceleration output by the car‑following model when run with the generated parameters θ and the true acceleration.
(16)Where is the acceleration sequence obtained by numerically integrating (e.g., using the Euler method) the model's differential equations after substituting θ.
Physics loss: See Equation (13).
Parameter prior loss: It guides the parameters toward the prior means, preventing excessive deviation from reasonable ranges (especially when data are sparse).
(17)Where d is the number of model parameters, and and are determined from the empirical distributions under snowy conditions.
The total loss is:
(18)Typically, and . All losses are averaged over the batch.
The AdamW optimizer is used, with an initial learning rate of and weight decay of . The cosine annealing learning rate scheduler (CosineAnnealingLR) is applied with a period of 50 epochs. During training, early stopping is performed if the validation loss does not decrease for 10 consecutive epochs. To enhance generalization ability, small perturbations are added to the input sequences during training (Gaussian noise with a standard deviation of 0.01), and dropout with a probability of 0.1 is employed. To make the proposed framework applicable to multiple car‑following models (GM, Gipps, IDM, Wiedemann, FVDM), we design a model adaptation layer that dynamically adjusts the output dimension of the PGN and the physical residual calculation according to the target model type.
The target model type is encoded as a one‑hot vector (for the five models). This vector is concatenated with the feature vector z output by the Informer, serving as the input to the PGN: .
The output layer dimension of the PGN is dynamically adjusted according to the model type. During training, a multi‑task learning approach is adopted: for each model type, the PGN shares the first few layers, while the last layer is set separately for each model, with each model having its own independent set of output weights. The loss function is computed separately for each model type and then summed.
During the forward pass for computing the physics loss, the corresponding residual function is selected according to the model type m, and the corresponding parameters θ are substituted for calculation. The entire framework can be implemented via conditional branches in PyTorch.
Compared with traditional genetic algorithm‑based calibration (which requires multiple simulation iterations), this method requires only one forward pass (Informer + PGN) during inference, with a computational complexity of and an actual runtime of approximately 0.02 seconds per sample. During the training phase, due to the need to compute physical residuals and numerical integration, each epoch takes about 5–10 seconds (on an NVIDIA RTX 4090). Convergence is reached after about 200 epochs, with a total training time of approximately 30 minutes, which is far lower than the hours to days of computational cost required by traditional methods. The pseudocode of the algorithm is as follows:
Algorithm 1. Informer-PINN-Calib parameter calibration framework.
Input: raining dataset D = {(Xi, yi, mi)}, set of car‑following models M, maximum number of epochs E, batch size B
Output: Trained parameter generation network PGN, optimal parameters θ* for each model
1. // Pre‑train the Informer feature extractor
2. Initialize Informer encoder parameters ϕenc
3. For epoch = 1 to E_pre:
4. For each batch (X, y) in D:
5. z = Informer(X; ϕenc) // extract features
6. ŷ = LinearHead(z) // predict next‑step acceleration
7. Compute pre‑training loss Lpre = MSE(ŷ, y)
8. Backpropagate to update ϕenc
9. // Freeze the Informer encoder (or fine‑tune)
10. Initialize PGN parameters ϕpgn
11. For epoch = 1 to E:
12. For each batch (X, m) in D:
13. z = Informer(X; ϕenc) // extract features, ϕencfrozen
14. z’ = concat(z, one_hot(m))
15. θ = PGN(z’; ϕpgn) // generate parameters, followed by range mapping
16. // Compute data loss
17. Numerically integrate according to model type m and parameters θto obtain model acceleration sequence amodel
18. Ldata = MSE(amodel, atrue)
19. // Compute physics loss
20. R = residual(m, θ, X) // compute residual sequence according to model type
21. Lphy = mean(R2)
22. // Compute parameter prior loss
23. Lprior = sum((θ – μprior)^2 / σ2prior)
24. // Total loss
25. Ltotal = Ldata + λphy * Lphy + λprior * Lprior
26. Backpropagate to updateϕpgn
27. End For
28. End For
29. Return the trained PGN; for any new sample X’ directly output the parameters θ*
5. Experimental design and result analysis
5.1 Data
Two types of data sources were used in the experiments: 1) The NGSIM dataset: trajectory data from US‑101 and I‑80 highways in the United States, with a sampling frequency of 10 Hz, containing car‑following behavior under dry pavement conditions. From this dataset, 1,200 car‑following segments were extracted for model benchmark calibration. 2) Real‑vehicle test data in cold regions: field tests under snowy weather were conducted on a section of the Hatong Highway in Heilongjiang Province. Vehicle trajectory data were collected using drones (sampling interval of 0.1 seconds), yielding 832 car‑following trajectories under snowy pavement conditions, covering three intensities: light snow, moderate snow, and heavy snow. All data used in this study are publicly available and no specific permits were required for the described field studies.
The NGSIM dry road data were used only to pre train the Informer encoder to learn basic driving dynamics, and not for final calibration or validation in snowy conditions. The 832 icy/snowy trajectories were split into training (70%), validation (15%), and test (15%) sets. All reported calibration results are on the snowy test set. The filtering rules (10 m ≤ spacing ≤ 150 m, duration ≥ 60 s, speed ≥ 5 km/h, |Δv| ≤ 5 km/h) are intended to ensure stable car following episodes and exclude lane changes or free flow conditions. This is standard practice in car following calibration (see references). We acknowledge that this narrows the behavioral scope, but it is necessary for reliable parameter identification. We have added a sentence in the Limitations subsection to discuss this and suggest future work on more complex scenarios.
5.2 Analysis of calibration results
Fig 4 shows the calibrated IDM trajectory compared with the actual trajectory. It can be observed that the calibrated IDM model reproduces the vehicle trajectory well, with overall small errors. To further validate the effectiveness of the proposed method, the following comparative calibration methods were set up: 1) GA, PSO, Bayes, LSTM, DDPG, and Informer‑PINN. Evaluation metrics include acceleration RMSE, speed RMSE, spacing RMSE, and parameter calibration time. Taking the IDM model as an example, the calibration results of each method under snowy conditions are shown in Table 3.
Traditional optimization algorithms (GA, PSO, Bayes) yield IDM parameter values (desired speed, acceleration/deceleration, minimum spacing, etc.) that are concentrated and stable, consistent with conventional car‑following behavior and strong physical consistency, with PSO performing slightly better overall. Data‑driven and hybrid intelligent algorithms (LSTM, DDPG, Informer‑PINN) exhibit a general trend of continuously increasing desired time headway, continuously decreasing maximum acceleration / comfortable deceleration, and gradually decreasing acceleration exponent. Their fitting accuracy may be higher, but they deviate more noticeably from the physical interpretation of the classic IDM, leaning toward pure data fitting. The calibrated minimum spacing values from machine‑learning‑type models are generally larger than those from traditional algorithms, reflecting a tendency toward a more conservative, large‑spacing car‑following strategy. The IDM parameters calibrated by traditional intelligent optimization algorithms have clear physical meanings but lower data fitting accuracy, whereas data‑driven methods such as LSTM, DDPG, and Informer‑PINN, although gradually weakening physical plausibility, significantly improve parameter fitting accuracy. This exhibits a trade‑off between physical consistency and data fitting accuracy, as shown in Fig 5.
Table 4 presents the evaluation metrics obtained using different calibration methods. The RMSE values for each metric and the comprehensive error consistently decrease as the algorithm transitions from GA, PSO, Bayes to LSTM, DDPG, and Informer‑PINN. Among all methods, Informer‑PINN achieves the lowest RMSE in acceleration, speed, spacing, as well as the lowest comprehensive error, indicating that it provides the highest fitting accuracy for car‑following behavior. In terms of calibration time, traditional optimization algorithms (GA, PSO, Bayes) require seconds or even hundreds of seconds, whereas LSTM and DDPG have the shortest computation times. Although the time for Informer‑PINN is slightly higher than that of LSTM and DDPG, it remains far lower than that of traditional algorithms. Overall, there is a clear trend: higher fitting accuracy is accompanied by better calibration efficiency. The proposed method outperforms traditional methods in all accuracy metrics, reducing the acceleration RMSE by approximately 19% and the comprehensive error by about 18.7%. At the same time, because it avoids numerous simulation iterations, the calibration time is significantly reduced.
5.3 Validation of multi‑model applicability
The proposed method was applied to the calibration of five car‑following models and compared with the dedicated calibration method (GA optimized for each specific model) corresponding to each model. The results show that the proposed method achieves accuracy improvement for all five models, demonstrating its strong generalization capability, as shown in Table 5. The improvement in accuracy is most significant for the IDM model, which may be attributed to the strong coupling among the IDM parameters, making global optimization difficult for traditional methods. The proposed method achieves error reduction and accuracy improvement across multiple car‑following models, with the largest improvement observed for the IDM model and the smallest for the FVDM model. Overall, the improvement is substantial and stable.
5.4 Performance analysis under different snow intensities
snowy conditions were classified into three levels based on visibility and road surface state: light snow (visibility > 500 m, μ > 0.4), moderate snow (visibility 200–500 m, μ = 0.2–0.4), and heavy snow (visibility < 200 m, μ < 0.2). As shown in Table 6, with increasing snow intensity, the calibration error increases slightly, but still maintains high accuracy under heavy snow conditions. Parameter reasonableness was verified using an expert scoring method, confirming that the calibrated parameters satisfy physical constraints and driving experience.
5.5 Algorithm performance analysis
Fig. 6 shows the loss curves of different methods during training, using the IDM model as an example. The proposed method stabilizes after approximately 120 epochs, outperforming LSTM (180 epochs) and DDPG (400 epochs), as shown in Table 7. This benefit stems from the fact that the physical constraints provide effective regularization guidance for optimization. GA and PSO exhibit random oscillations during the evolution process; DDPG shows the most severe oscillations due to the variance in policy gradient estimation. The proposed method, employing supervised learning with physical constraints, yields a smooth loss curve and stable training. The initial loss of the proposed method (0.85) is lower than that of other deep learning methods (LSTM: 0.98), indicating that physical constraints help the network output reasonably plausible parameters even at the initial stage, as shown in Fig. 6.
The training time of the proposed method is 124 seconds, which is better than DDPG (450 seconds) and Bayes (356 seconds), and close to LSTM (180 seconds), as shown in Table 8. The physical constraints do not significantly increase the computational burden. The single‑sample inference time of the proposed method is 0.018 seconds (approximately 55 Hz), slightly better than LSTM (0.015 seconds) and much faster than traditional optimization methods (>0.3 seconds), as shown in Table 8. This benefit comes from the efficient attention mechanism of the Informer and the single‑forward‑pass architecture. The proposed method is suitable for scenarios requiring real‑time calibration (e.g., dynamic traffic simulation, online parameter adaptation), whereas GA/PSO are only suitable for offline batch calibration.
In terms of training time, among the traditional optimization algorithms, PSO takes the shortest time and Bayes the longest. The training time of deep learning models varies considerably, and the training time of the proposed method is moderate. In terms of inference time, LSTM, DDPG, and the proposed method are far superior to traditional optimization algorithms, demonstrating extremely high inference efficiency. Regarding the total time per sample, deep learning methods and the proposed method achieve an order‑of‑magnitude reduction compared to traditional algorithms, showing a significant overall efficiency advantage. Note: GA/PSO/Bayes require re‑optimization for each new sample to be calibrated, so their total time ≈ training time; LSTM/DDPG/the proposed method, after a single training session, can perform fast inference.
Fig 7 and the Table 9 present boxplots of acceleration prediction errors on the test set for each method (IDM model, moderate snow conditions). Considering the four metrics—mean error, standard deviation, 95th percentile error, and maximum error—the proposed method outperforms both traditional optimization algorithms (GA, PSO, Bayes) and deep learning algorithms (LSTM, DDPG). It achieves the smallest average fitting error, the lowest error fluctuation, and the lowest probability of extreme large errors, indicating the best overall robustness and prediction reliability.
5.6 Sensitivity analysis
To evaluate the impact of each parameter on calibration accuracy, a sensitivity analysis of the key hyperparameters of the proposed method was conducted using the control variable method (varying a single parameter while keeping others fixed).
The physical constraint weight λphy was set to values of 0, 0.05, 0.1, 0.2, and 0.5. When the physical constraint is not considered (λphy = 0), the physical residual is large and the parameter deviation is high, indicating that physical constraints are necessary. As λphy increases to 0.1, the acceleration error, parameter deviation, and physical residual gradually decrease, suggesting improved fitting accuracy, as shown in Table 10. However, when λphy increases further to 0.5, over‑constraint occurs, and the acceleration error gradually increases. Therefore, λphy = 0.1 achieves the best balance between accuracy and physical consistency. Too large a value (0.5) leads to over‑constraint, sacrificing data fitting accuracy; too small a value (0.0) results in a large physical residual and high parameter deviation.
The sensitivity analysis of the sampling length T is shown in Table 11. As the sampling length increases, richer data can be obtained, capturing more driving behavior characteristics. However, a longer sampling length also increases inference time and algorithm operating cost. Through experiments in this paper, it is found that the accuracy approaches saturation at T = 60 seconds; when T = 80 seconds, the accuracy improvement is less than 3%, while inference time increases by 33%. Considering both factors, T = 60 seconds is chosen as the optimal value.
The number of encoder layers N is a key factor determining algorithm runtime performance. Increasing the number of layers increases data training time and overhead. Through experiments, it is found that N = 3 achieves a good accuracy‑efficiency balance, as shown in Table 12 and Fig 8. When N is increased to 4, the accuracy improves by only 0.01, while the number of parameters increases by 0.5 million, and the training time increases significantly by 50%, as shown in Fig 8.
5.7 Robustness analysis of data sparse scenarios
To evaluate the performance of the algorithms under small‑sample conditions, we randomly extracted different proportions of the training data and compared the performance of each method as shown in Table 13 and Fig 9. With only 20% of the data (about 116 samples), the comprehensive error of the proposed method was 0.51, still outperforming GA/PSO/Bayes with 100% of the data, which verifies the effective regularization effect of the physical constraints. When the data volume was reduced from 100% to 20%, the error of the proposed method increased by 11%, compared with 21% for DDPG and 15% for LSTM. This indicates that deep learning methods experience a relatively rapid performance degradation under sparse data, whereas the proposed method, due to its physical constraints, shows a moderate level of degradation. It is recommended that in practical applications at least 300 car‑following segments under snowy conditions be used to ensure stable calibration results.
6. Conclusions
This paper systematically investigates the problem of parameter calibration for car‑following models under snowy weather conditions. The sensitivity of five classical car‑following models—GM, Gipps, IDM, Wiedemann, and FVDM—under snowy environments is systematically analyzed, and the theoretical direction of change for each parameter is clarified, providing prior constraints for calibration. An adaptive parameter calibration framework based on LSTM‑DDPG is proposed, which can automatically extract temporal features from car‑following trajectories and directly generate optimal parameters via reinforcement learning, achieving end‑to‑end calibration.
Experimental results show that under snowy conditions, the calibration accuracy of the proposed method is improved by more than 12.7% compared with the traditional genetic algorithm, with the acceleration RMSE reduced to 0.38 m/s² and the calibration time shortened by an order of magnitude. Furthermore, the method exhibits good generalization capability across multiple car‑following models. Under moderate snow conditions, the comprehensive error of the proposed method is 0.41, which is 21.6% lower than that of DDPG and 36.3% lower than that of GA. The acceleration RMSE is reduced to 0.31 m/s², and the physical residual is reduced to 0.68 × 10 ⁻ ³, significantly outperforming the comparison methods. The proposed method converges in about 120 epochs with a smooth and stable loss curve, whereas DDPG requires 400 epochs with severe oscillations, and GA/PSO exhibit random fluctuations. The physical constraints effectively accelerate convergence and improve stability. The total training time of the proposed method is 124 seconds, and the inference time is 0.018 seconds per sample, meeting the requirements of real‑time applications. Traditional optimization methods (GA/PSO/Bayes) require tens to hundreds of seconds per calibration and are unsuitable for online scenarios. Sensitivity analysis shows that λphy = 0.10, T = 60 seconds, and N = 3 layers are the optimal configuration. Under data‑sparse scenarios (20% of the training data), the proposed method still achieves a comprehensive error of 0.51, outperforming some of the results of the comparison methods with 100% of the data, demonstrating the strong regularization capability of the physical constraints.
Future research can extend this study in the following directions: Introduce latent variables to represent individual driving styles, enabling personalized parameter calibration; Explore meta‑learning or transfer learning methods to reduce the dependence on data under snowy conditions; Investigate online adaptive calibration methods in combination with vehicle‑infrastructure cooperative technologies to achieve real‑time parameter updates as the environment changes; Apply the calibrated parameters to the optimization of strategies such as dynamic speed limits and ramp metering, forming a closed‑loop validation.
Supporting information
S1 File. Dataset.
The dataset contains the values behind all means, standard deviations, RMSE values, and other statistical measures reported in the manuscript.
https://doi.org/10.1371/journal.pone.0350550.s001
(XLSX)
References
- 1. Zhou S, Zheng S, Xu T, Treiber M, Tian J, Jiang R. On the calibration of stochastic car following models. Transport Res Part B: Methodol. 2025;196:103224.
- 2. Wang X, Chen T, Wang R, Lu J, Dou G. Review of state estimation methods for autonomous ground vehicles: perspectives on estimation objects, vehicle characteristics, and key algorithms. Sensors (Basel). 2025;25(13):3927. pmid:40648185
- 3. Liang J, Li Y, Yin G, Xu L, Lu Y, Feng J, et al. A MAS-based hierarchical architecture for the cooperation control of connected and automated vehicles. IEEE Trans Veh Technol. 2023;72(2):1559–73.
- 4. Liang J, Yang K, Tan C, Wang J, Yin G. Enhancing high-speed cruising performance of autonomous vehicles through integrated deep reinforcement learning framework. IEEE Trans Intell Transport Syst. 2025;26(1):835–48.
- 5. Lee G, Kim W, Oh H, Youn BD, Kim NH. Review of statistical model calibration and validation—from the perspective of uncertainty structures. Struct Multidisc Optim. 2019;60(4):1619–44.
- 6. Punzo V, Montanino M. A comparison of the calibration of car-following models. Transp Res Part C. 2016;64:101–17.
- 7. Treiber M, Kesting A. The intelligent driver model with stochasticity. Transp Res Rec. 2018;2672(40):100–10.
- 8. Xu T, Laval JA. A stochastic framework for car-following models. Transp Res Part B. 2020;135:43–58.
- 9. et al. Calibration of stochastic traffic flow models. Transp Res Procedia. 2016;15:156–67.
- 10. Jiang R, Hu MB, Zhang HM, Gao ZY. On the experimental study of complex traffic flow. Transp Res. 2014;47:136–46.
- 11. Li L, Chen X, Zhang L. A genetic algorithm-based vehicle following model calibration method. J Transp Eng. 2015;141(3):04014082.
- 12. Mo Z, Fu R, Guo Y. A hybrid deep learning approach for car-following behavior modeling. IEEE Transac Intelligent Transp Syst. 2021;22(9):5583–93.
- 13.
Krajewski R, Bock J, Kloeker L, Eckstein L. The highD dataset: a drone dataset of naturalistic vehicle trajectories. In: IEEE ITSC, 2018. 1–7.
- 14. Sharma A. Calibration of car-following models using trajectory data. Transp Res Rec. 2019;2673(5):123–34.
- 15. Laval JA. A stochastic model of car-following. Transp Res Part B. 2014;65:48–60.
- 16. Tian J. Stochastic car-following model with consideration of driver’s randomness. Transportmetrica B. 2019;7(1):1024–42.
- 17. Punzo V. A systematic review of car-following model calibration. Trans Rev. 2021;41(3):345–69.
- 18. Li L. DIRECT algorithm for calibration of car-following models. J Adv Transp. 2016;50(8):1856–72.
- 19. Taylor J. Dynamic time warping for calibration of car-following models. Transp Res Record. 2015;2490(1):85–94.
- 20. Dona R, Ciuffo B. Virtual testing of automated driving systems. a survey on validation methods. IEEE Access. 2022;10:24349–67.
- 21. Kabzan J, Hewing L, Liniger A, Zeilinger MN. Learning-based model predictive control for autonomous racing. IEEE Robot Autom Lett. 2019;4(4):3363–70.
- 22. Fényes D, Németh B, Gáspár P. A novel data-driven modeling and control design method for autonomous vehicles. Energies. 2021;14(2):517.
- 23. James SS, Anderson SR, Lio MD. Longitudinal vehicle dynamics: a comparison of physical and data-driven models under large-scale real-world driving conditions. IEEE Access. 2020;8:73714–29.
- 24. Ao D, Hu Z, Mahadevan S. Dynamics model validation using time-domain metrics. J Verification, Validation and Uncertainty Quant. 2017;2(1).
- 25. Riedmaier S, Schneider D, Watzenig D, Diermeyer F, Schick B. Model validation and scenario selection for virtual-based homologation of automated vehicles. Appl Sci. 2020;11(1):35.
- 26. Neurohr B, Koopmann T, Möhlmann E, Fränzle M. Determining the validity of simulation models for the verification of automated driving systems. IEEE Access. 2023;11:102949–60.
- 27. Unsal O, Yavuz H. Bridging Regimes: A State-Dependent Blending Methodology for Parsimonious and Robust Heavy Vehicle Dynamics Modeling. Actuators. 2025;15(1):2.
- 28. Lü X, Zhou J-Y, Jiang R, Jia B, Gao Z. Novel car-following model incorporating steady-state control effect in intelligent transportation systems. Physica A: Stat Mech Appl. 2026;687:131354.
- 29. Yin Y-H, Lü X, Jiang R, Jia B, Gao Z. Kinetic analysis and numerical tests of an adaptive car-following model for real-time traffic in ITS. Physica A: Stat Mech its Appl. 2024;635:129494.
이 뉴스, 독자들은 어떻게 느꼈나요?
첫 반응을 남겨보세요로그인하면 감정 반응에 참여할 수 있어요.
관련 뉴스
관련 뉴스 제보는 로그인 후 가능합니다.
'research' 카테고리 뉴스
Correction: A new criterion for defining tunnel portal failure using the strength reduction method
PLOS ONE
Drug-induced gastric motility disorders: A disproportionality analysis from the FAERS and CVARD databases
PLOS ONE
Musculoskeletal surgeons use mixed reasoning rather than pure Bayesian strategies in clinical practice
PLOS ONE
PLOS의 다른 기사
Correction: Efficacy and safety of the ayurvedic formulation ‘<i>Trikatu</i>’ as an add-on to standard care in dyslipidemia: Study protocol for a randomized, double-blind, placebo-controlled trial evaluating lipid parameters, and gut microbiota
PLOS ONE
KYNU in macrophages contributes to the unique immune feature of LUAD via integrating single-cell and bulk RNA sequencing data: an exploratory analysis
PLOS ONE
Perspectives of community-dwelling older adults with chronic diseases on Baduanjin practice: A qualitative study
PLOS ONE