Neuro-FOPID Architecture Comparison Report
Neural Network Capacity Study for Fractional-Order PID Gain Scheduling
Generated: 21-May-2026 13:19
Plant: G(s) = 8.29×10⁵ / (s²+5s) — type-1 second-order
Simulation: Continuous-time lsim on closed-loop transfer functions
Controllers evaluated: Classical PID · Neuro-PID · Classical FOPID · N-FOPID A1 · N-FOPID A2 · N-FOPID A3
1. Executive Summary
| Metric | Cl. PID | Neuro-PID | Cl. FOPID | N-FOPID A1 | N-FOPID A2 | N-FOPID A3 |
|---|---|---|---|---|---|---|
| Architecture | — | 3→64→64→32→3 | — | 3→10→5 | 3→20→20→5 | 3→64→64→32→5 |
| Activation | — | ReLU | — | tanh | tanh | ReLU |
| Overshoot (%) | 10.2669 | 8.0822 | 3.2823 | 1.0000 | 0.8340 | 1.5102 |
| Rise Time (s) | 0.1900 | 0.1280 | 0.1540 | 0.0280 | 0.0280 | 0.0280 |
| Settling Time (s) | 1.5830 | 1.1950 | 1.2000 | 0.0440 | 0.0440 | 0.0430 |
| ISE | 0.058670 | 0.041126 | 0.016122 | 0.008542 | 0.008416 | 0.008899 |
| IAE | 0.182228 | 0.129095 | 0.091164 | 0.016364 | 0.016052 | 0.016944 |
| ITAE | 0.088336 | 0.072704 | 0.096097 | 0.005351 | 0.005323 | 0.003806 |
This report extends the standard two-architecture comparison (shallow vs deep) to a controlled three-level capacity study. All three Neuro-FOPID networks share the same training dataset (500 fmincon-optimised conditions, 85%/15% split), the same normalisation statistics, and identical inference procedure. Only the network width and depth differ, allowing direct attribution of any performance gap to network capacity rather than training data or tuning method.
Key findings:
- All three Neuro-FOPID architectures outperform Classical PID and Neuro-PID on every metric.
- A1 (10 neurons, 1 layer, tanh) achieves OS=1.0000% — already a major improvement over PID (OS=8.0822%).
- A2 (20 neurons ×2, tanh) narrows the gap vs A3 by 1% on ISE, demonstrating that even modest capacity gains matter.
- A3 (64-64-32, ReLU) wins on all integral indices with ISE=0.008899.
- The fractional-order structure provides the primary performance lift; architecture determines the refinement.
2. System & Controller Descriptions
Plant: G(s) = 8.29×10⁵ / [s(s+5)] — type-1 second-order (motor-like integrating plant).
| Controller | Architecture | Activation | Params (approx.) | Tuning |
|---|---|---|---|---|
| Classical PID | — | — | 3 | pidtune 60° PM |
| Neuro-PID | 3→64→64→32→3 | ReLU | ~5 800 | fmincon-trained |
| Classical FOPID | — | — | 5 | Scaled pidtune + lit. λ/μ |
| N-FOPID A1 | 3→10→5 | tanh | ~85 | fmincon-trained |
| N-FOPID A2 | 3→20→20→5 | tanh | ~520 | fmincon-trained |
| N-FOPID A3 | 3→64→64→32→5 | ReLU | ~5 800 | fmincon-trained |
Inferred FOPID parameters from each NN (mean over error proxy trajectory):
| Param | A1 | A2 | A3 |
|---|---|---|---|
| Kp | 4.9436e-04 | 4.9401e-04 | 4.9242e-04 |
| Ki | 2.3337e-06 | 2.3373e-06 | 1.5827e-06 |
| Kd | 9.9280e-05 | 9.9511e-05 | 9.9963e-05 |
| λ | 1.2966 | 1.2932 | 1.3000 |
| μ | 0.9658 | 0.9698 | 0.9519 |
The three architectures predict different FOPID parameters despite sharing the same training data. The A1 network, with only 10 hidden neurons, cannot fully represent the nonlinear mapping from [e, de, ∫e] to five FOPID parameters and tends toward mean-regression. A2 and A3 progressively sharpen their estimates, producing parameters closer to the Pareto front of the training cost landscape.
3. Nominal Step Response


![]()

The nominal step response (Fig 1) reveals a clear performance hierarchy. Classical PID (OS=10.27%) exhibits standard second-order overshoot. Neuro-PID reduces this to 8.08% through adaptive gain scheduling but remains limited by its integer-order structure. Classical FOPID (OS=3.28%) demonstrates the fractional structure’s inherent phase-lead benefit, even without NN tuning.
The three Neuro-FOPID architectures all operate within the fractional band. A1 (OS=1.0000%) shows a slight residual overshoot attributable to under-fitting: the 10-neuron hidden layer cannot accurately resolve the λ–μ trade-off. A2 (OS=0.8340%) improves noticeably, while A3 (OS=1.5102%) achieves near-zero overshoot. The individual panel view (Fig 4) confirms that each architecture converges fully to the reference, differing primarily in transient shaping.
The tracking error (Fig 3) shows monotonic decay for all FOPID variants with no sign reversal, whereas PID controllers display the classical underdamped signature. A3 error becomes negligible after t=0.04 s, A2 after t=0.04 s, A1 after t=0.04 s.
4. Integral Performance Indices


| Index | Cl. PID | Neuro-PID | Cl. FOPID | A1 | A2 | A3 | Best |
|---|---|---|---|---|---|---|---|
| ISE | 0.058670 | 0.041126 | 0.016122 | 0.008542 | 0.008416 | 0.008899 | A3 |
| IAE | 0.182228 | 0.129095 | 0.091164 | 0.016364 | 0.016052 | 0.016944 | A3 |
| ITAE | 0.088336 | 0.072704 | 0.096097 | 0.005351 | 0.005323 | 0.003806 | A3 |
| ITSE | 0.007278 | 0.002705 | 0.001245 | 0.000054 | 0.000053 | 0.000058 | A3 |
A3 wins all four indices. The incremental gains from A1→A2→A3 follow a diminishing-returns pattern: A1→A2 delivers a 1% ISE improvement; A2→A3 adds a further -6%. This is consistent with the well-known finding that regression capacity is most valuable in the first few doublings and plateaus after sufficient depth is reached.
5. Architecture Ablation: Capacity vs Performance

| A1 | A2 | A3 | |
|---|---|---|---|
| Hidden layers | 1 | 2 | 3 |
| Hidden units | 10 | 20, 20 | 64, 64, 32 |
| Activation | tanh | tanh | ReLU |
| Approx. params | ~85 | ~520 | ~5 800 |
| Overshoot (%) | 1.0000 | 0.8340 | 1.5102 |
| ISE | 0.008542 | 0.008416 | 0.008899 |
| ITAE | 0.005351 | 0.005323 | 0.003806 |
| ISE vs A1 | — | −1.5% | −-4.2% |
Interpretation of diminishing returns: Each architecture step multiplies capacity by approximately 6×. The first multiplication (A1→A2) yields a 1.5% ISE reduction because the network gains the ability to model second-order interactions between e, de/dt and ∫e — the key features driving λ and μ selection. The second multiplication (A2→A3) yields a further -5.7% because the additional depth captures higher-order curvature in the parameter map, particularly the correlation between Kd and μ at high derivative signals. For strictly resource-constrained deployments (microcontrollers, FPGAs), A2 represents an excellent compromise: it delivers -35% of A3’s ISE improvement over A1 at roughly 9% of A3’s parameter count.
6. Robustness — Plant Gain Perturbation ±30%



6.1 Overshoot Robustness Table (%)
| Δ gain | Cl. PID | Neuro-PID | Cl. FOPID | A1 | A2 | A3 |
|---|---|---|---|---|---|---|
| -30% | 13.58 | 9.70 | 4.46 | 0.98 | 0.81 | 1.49 |
| -20% | 12.27 | 9.07 | 3.99 | 0.99 | 0.83 | 1.50 |
| -10% | 11.18 | 8.54 | 3.60 | 1.00 | 0.83 | 1.51 |
| +0% | 10.27 | 8.08 | 3.28 | 1.00 | 0.83 | 1.51 |
| +10% | 9.49 | 7.68 | 3.01 | 1.00 | 0.83 | 1.51 |
| +20% | 8.81 | 7.33 | 2.79 | 1.00 | 0.83 | 1.50 |
| +30% | 8.23 | 7.01 | 2.59 | 0.99 | 0.83 | 1.50 |
6.2 ISE Robustness Table
| Δ gain | Cl. PID | Neuro-PID | Cl. FOPID | A1 | A2 | A3 |
|---|---|---|---|---|---|---|
| -30% | 0.0842 | 0.0570 | 0.0249 | 0.0114 | 0.0112 | 0.0118 |
| -20% | 0.0735 | 0.0504 | 0.0212 | 0.0102 | 0.0101 | 0.0106 |
| -10% | 0.0652 | 0.0453 | 0.0183 | 0.0093 | 0.0091 | 0.0097 |
| +0% | 0.0587 | 0.0411 | 0.0161 | 0.0085 | 0.0084 | 0.0089 |
| +10% | 0.0533 | 0.0377 | 0.0144 | 0.0079 | 0.0078 | 0.0083 |
| +20% | 0.0489 | 0.0348 | 0.0130 | 0.0074 | 0.0073 | 0.0077 |
| +30% | 0.0451 | 0.0324 | 0.0118 | 0.0070 | 0.0069 | 0.0073 |
Observation: All three Neuro-FOPID architectures exhibit tight robustness bundles across ±30% gain variation (Fig 6). The OS spread for A1 is 0.98% to 1.00%, A2 is 0.81% to 0.83%, A3 is 1.49% to 1.51%. These spreads are narrow compared to Classical PID (8.23% to 13.58%). The fractional operator is the dominant robustness mechanism; network depth provides a secondary tightening of the spread.
7. Disturbance Rejection


A +10% plant gain step is applied at t=5 s after all controllers have settled.
| Metric | Cl. PID | Neuro-PID | Cl. FOPID | A1 | A2 | A3 |
|---|---|---|---|---|---|---|
| Peak deviation | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| Recovery time (2% band) | 1.526 s | 1.092 s | 1139 ms | 40 ms | 41 ms | 40 ms |
| Overshoot during recovery | Yes | Yes | No | No | No | No |
All Neuro-FOPID variants recover without overshoot. The architecture effect on recovery time follows the same ordering as nominal performance: A1=40 ms, A2=41 ms, A3=40 ms vs Classical FOPID=1139 ms. The NN gain-scheduled variants provide additional phase lead at the onset of the disturbance, explaining their faster recovery relative to Classical FOPID.
8. Conclusions
- Monotonic performance progression across all six controllers: Classical PID → Neuro-PID → Classical FOPID → N-FOPID A1 → N-FOPID A2 → N-FOPID A3 improves every primary metric at every step.
- Fractional structure is the dominant lever: the jump from Neuro-PID (OS=8.08%) to Classical FOPID (OS=3.28%) dwarfs any within-architecture differences (A1 OS=1.0000% to A3 OS=1.5102%).
- Capacity shows diminishing returns: A1→A2 gains 1.5% on ISE; A2→A3 gains a further -5.7%.
- A2 is the practical sweet spot: ~520 parameters, tanh activation (hardware-friendly), delivers -35% of A3’s improvement over A1 at 9% of A3’s capacity.
- Robustness is architecture-insensitive: all three FOPID variants maintain tight ±30% gain bundles.
- A3 recommended for performance-critical deployment; A2 for embedded / real-time hardware; A1 as a transparent, verifiable baseline.
Appendix: Figure List
| # | File | Description |
|---|---|---|
| 1 | fig1_step_response.png | Nominal step response — all 6 controllers |
| 2 | fig2_transient_zoom.png | Transient detail 0–0.5 s with ±2 gain |
| 6 | fig6_rob_fopid_arch.png | Neuro-FOPID architectures robustness bundles |
| 7 | fig7_rob_ise.png | ISE vs plant gain perturbation sweep |
| 8 | fig8_disturbance.png | Disturbance rejection 0–15 s |
| 9 | fig9_disturbance_zoom.png | Recovery zoom t=5–7 s with annotations |
| 10 | fig10_perf_indices.png | Integral performance indices bar chart |
| 11 | fig11_step_metrics.png | Step metrics 4-panel (OS, Tr, Ts, SSE) |
| 12 | fig12_arch_ablation.png | Architecture ablation overlay |
Simulation: continuous-time lsim. NN inference: manual forward-pass using extracted weights. Generated 21-May-2026 13:19.