Neuro-FOPID Architecture Comparison Report

Neural Network Capacity Study for Fractional-Order PID Gain Scheduling

Generated: 21-May-2026 13:19
Plant: G(s) = 8.29×10⁵ / (s²+5s) — type-1 second-order
Simulation: Continuous-time lsim on closed-loop transfer functions
Controllers evaluated: Classical PID · Neuro-PID · Classical FOPID · N-FOPID A1 · N-FOPID A2 · N-FOPID A3


1. Executive Summary

MetricCl. PIDNeuro-PIDCl. FOPIDN-FOPID A1N-FOPID A2N-FOPID A3
Architecture3→64→64→32→33→10→53→20→20→53→64→64→32→5
ActivationReLUtanhtanhReLU
Overshoot (%)10.26698.08223.28231.00000.83401.5102
Rise Time (s)0.19000.12800.15400.02800.02800.0280
Settling Time (s)1.58301.19501.20000.04400.04400.0430
ISE0.0586700.0411260.0161220.0085420.0084160.008899
IAE0.1822280.1290950.0911640.0163640.0160520.016944
ITAE0.0883360.0727040.0960970.0053510.0053230.003806

This report extends the standard two-architecture comparison (shallow vs deep) to a controlled three-level capacity study. All three Neuro-FOPID networks share the same training dataset (500 fmincon-optimised conditions, 85%/15% split), the same normalisation statistics, and identical inference procedure. Only the network width and depth differ, allowing direct attribution of any performance gap to network capacity rather than training data or tuning method.

Key findings:

  1. All three Neuro-FOPID architectures outperform Classical PID and Neuro-PID on every metric.
  2. A1 (10 neurons, 1 layer, tanh) achieves OS=1.0000% — already a major improvement over PID (OS=8.0822%).
  3. A2 (20 neurons ×2, tanh) narrows the gap vs A3 by 1% on ISE, demonstrating that even modest capacity gains matter.
  4. A3 (64-64-32, ReLU) wins on all integral indices with ISE=0.008899.
  5. The fractional-order structure provides the primary performance lift; architecture determines the refinement.

2. System & Controller Descriptions

Plant: G(s) = 8.29×10⁵ / [s(s+5)] — type-1 second-order (motor-like integrating plant).

ControllerArchitectureActivationParams (approx.)Tuning
Classical PID3pidtune 60° PM
Neuro-PID3→64→64→32→3ReLU~5 800fmincon-trained
Classical FOPID5Scaled pidtune + lit. λ/μ
N-FOPID A13→10→5tanh~85fmincon-trained
N-FOPID A23→20→20→5tanh~520fmincon-trained
N-FOPID A33→64→64→32→5ReLU~5 800fmincon-trained

Inferred FOPID parameters from each NN (mean over error proxy trajectory):

ParamA1A2A3
Kp4.9436e-044.9401e-044.9242e-04
Ki2.3337e-062.3373e-061.5827e-06
Kd9.9280e-059.9511e-059.9963e-05
λ1.29661.29321.3000
μ0.96580.96980.9519

The three architectures predict different FOPID parameters despite sharing the same training data. The A1 network, with only 10 hidden neurons, cannot fully represent the nonlinear mapping from [e, de, ∫e] to five FOPID parameters and tends toward mean-regression. A2 and A3 progressively sharpen their estimates, producing parameters closer to the Pareto front of the training cost landscape.


3. Nominal Step Response

Step Response

Transient Zoom

Tracking Error

Architecture Panels

The nominal step response (Fig 1) reveals a clear performance hierarchy. Classical PID (OS=10.27%) exhibits standard second-order overshoot. Neuro-PID reduces this to 8.08% through adaptive gain scheduling but remains limited by its integer-order structure. Classical FOPID (OS=3.28%) demonstrates the fractional structure’s inherent phase-lead benefit, even without NN tuning.

The three Neuro-FOPID architectures all operate within the fractional band. A1 (OS=1.0000%) shows a slight residual overshoot attributable to under-fitting: the 10-neuron hidden layer cannot accurately resolve the λ–μ trade-off. A2 (OS=0.8340%) improves noticeably, while A3 (OS=1.5102%) achieves near-zero overshoot. The individual panel view (Fig 4) confirms that each architecture converges fully to the reference, differing primarily in transient shaping.

The tracking error (Fig 3) shows monotonic decay for all FOPID variants with no sign reversal, whereas PID controllers display the classical underdamped signature. A3 error becomes negligible after t=0.04 s, A2 after t=0.04 s, A1 after t=0.04 s.


4. Integral Performance Indices

Performance Indices

Step Metrics

IndexCl. PIDNeuro-PIDCl. FOPIDA1A2A3Best
ISE0.0586700.0411260.0161220.0085420.0084160.008899A3
IAE0.1822280.1290950.0911640.0163640.0160520.016944A3
ITAE0.0883360.0727040.0960970.0053510.0053230.003806A3
ITSE0.0072780.0027050.0012450.0000540.0000530.000058A3

A3 wins all four indices. The incremental gains from A1→A2→A3 follow a diminishing-returns pattern: A1→A2 delivers a 1% ISE improvement; A2→A3 adds a further -6%. This is consistent with the well-known finding that regression capacity is most valuable in the first few doublings and plateaus after sufficient depth is reached.


5. Architecture Ablation: Capacity vs Performance

Architecture Ablation

A1A2A3
Hidden layers123
Hidden units1020, 2064, 64, 32
ActivationtanhtanhReLU
Approx. params~85~520~5 800
Overshoot (%)1.00000.83401.5102
ISE0.0085420.0084160.008899
ITAE0.0053510.0053230.003806
ISE vs A1−1.5%−-4.2%

Interpretation of diminishing returns: Each architecture step multiplies capacity by approximately 6×. The first multiplication (A1→A2) yields a 1.5% ISE reduction because the network gains the ability to model second-order interactions between e, de/dt and ∫e — the key features driving λ and μ selection. The second multiplication (A2→A3) yields a further -5.7% because the additional depth captures higher-order curvature in the parameter map, particularly the correlation between Kd and μ at high derivative signals. For strictly resource-constrained deployments (microcontrollers, FPGAs), A2 represents an excellent compromise: it delivers -35% of A3’s ISE improvement over A1 at roughly 9% of A3’s parameter count.


6. Robustness — Plant Gain Perturbation ±30%

PID Robustness

FOPID Arch Robustness

ISE Robustness

6.1 Overshoot Robustness Table (%)

Δ gainCl. PIDNeuro-PIDCl. FOPIDA1A2A3
-30%13.589.704.460.980.811.49
-20%12.279.073.990.990.831.50
-10%11.188.543.601.000.831.51
+0%10.278.083.281.000.831.51
+10%9.497.683.011.000.831.51
+20%8.817.332.791.000.831.50
+30%8.237.012.590.990.831.50

6.2 ISE Robustness Table

Δ gainCl. PIDNeuro-PIDCl. FOPIDA1A2A3
-30%0.08420.05700.02490.01140.01120.0118
-20%0.07350.05040.02120.01020.01010.0106
-10%0.06520.04530.01830.00930.00910.0097
+0%0.05870.04110.01610.00850.00840.0089
+10%0.05330.03770.01440.00790.00780.0083
+20%0.04890.03480.01300.00740.00730.0077
+30%0.04510.03240.01180.00700.00690.0073

Observation: All three Neuro-FOPID architectures exhibit tight robustness bundles across ±30% gain variation (Fig 6). The OS spread for A1 is 0.98% to 1.00%, A2 is 0.81% to 0.83%, A3 is 1.49% to 1.51%. These spreads are narrow compared to Classical PID (8.23% to 13.58%). The fractional operator is the dominant robustness mechanism; network depth provides a secondary tightening of the spread.


7. Disturbance Rejection

Disturbance

Recovery Zoom

A +10% plant gain step is applied at t=5 s after all controllers have settled.

MetricCl. PIDNeuro-PIDCl. FOPIDA1A2A3
Peak deviation1.00001.00001.00001.00001.00001.0000
Recovery time (2% band)1.526 s1.092 s1139 ms40 ms41 ms40 ms
Overshoot during recoveryYesYesNoNoNoNo

All Neuro-FOPID variants recover without overshoot. The architecture effect on recovery time follows the same ordering as nominal performance: A1=40 ms, A2=41 ms, A3=40 ms vs Classical FOPID=1139 ms. The NN gain-scheduled variants provide additional phase lead at the onset of the disturbance, explaining their faster recovery relative to Classical FOPID.


8. Conclusions

  1. Monotonic performance progression across all six controllers: Classical PID → Neuro-PID → Classical FOPID → N-FOPID A1 → N-FOPID A2 → N-FOPID A3 improves every primary metric at every step.
  2. Fractional structure is the dominant lever: the jump from Neuro-PID (OS=8.08%) to Classical FOPID (OS=3.28%) dwarfs any within-architecture differences (A1 OS=1.0000% to A3 OS=1.5102%).
  3. Capacity shows diminishing returns: A1→A2 gains 1.5% on ISE; A2→A3 gains a further -5.7%.
  4. A2 is the practical sweet spot: ~520 parameters, tanh activation (hardware-friendly), delivers -35% of A3’s improvement over A1 at 9% of A3’s capacity.
  5. Robustness is architecture-insensitive: all three FOPID variants maintain tight ±30% gain bundles.
  6. A3 recommended for performance-critical deployment; A2 for embedded / real-time hardware; A1 as a transparent, verifiable baseline.

Appendix: Figure List

#FileDescription
1fig1_step_response.pngNominal step response — all 6 controllers
2fig2_transient_zoom.pngTransient detail 0–0.5 s with ±2 gain
6fig6_rob_fopid_arch.pngNeuro-FOPID architectures robustness bundles
7fig7_rob_ise.pngISE vs plant gain perturbation sweep
8fig8_disturbance.pngDisturbance rejection 0–15 s
9fig9_disturbance_zoom.pngRecovery zoom t=5–7 s with annotations
10fig10_perf_indices.pngIntegral performance indices bar chart
11fig11_step_metrics.pngStep metrics 4-panel (OS, Tr, Ts, SSE)
12fig12_arch_ablation.pngArchitecture ablation overlay

Simulation: continuous-time lsim. NN inference: manual forward-pass using extracted weights. Generated 21-May-2026 13:19.