Update docs with latest architecture and results
This commit is contained in:
@@ -11,3 +11,6 @@
|
||||
|
||||
## Two-stage training with curriculum
|
||||
- Hypothesis: train diffusion on residuals only after temporal GRU converges to low error.
|
||||
|
||||
## Discrete calibration
|
||||
- Hypothesis: post-hoc calibration on discrete marginals can reduce JSD without harming KS.
|
||||
|
||||
Reference in New Issue
Block a user