Kronos

mirror of https://github.com/shiyu-coder/Kronos.git synced 2026-06-20 16:16:04 +08:00

Files

Pengxiao Song e027051b38 fix: add torch.cuda.empty_cache() during autoregressive inference

Without releasing cached GPU memory, usage keeps growing during autoregressive prediction and can end in an out-of-memory (OOM) error. Calling torch.cuda.empty_cache() during inference prevents this accumulation.

2025-09-02 10:26:27 +08:00
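
The pattern described in the commit message above can be sketched roughly as follows. This is not the code from kronos.py: the names autoregressive_generate, step_fn, toy_step, and clear_cache are hypothetical, and the toy step function stands in for a real model forward pass. The sketch only shows where a torch.cuda.empty_cache() call might sit inside a decoding loop, guarded so that it also runs on CPU-only machines.

```python
import torch


def autoregressive_generate(step_fn, context, num_steps, clear_cache=True):
    """Hypothetical decoding loop (an assumption, not the Kronos implementation).

    step_fn maps a (batch, seq) tensor to the next token, shape (batch, 1).
    """
    tokens = context
    with torch.no_grad():  # inference only, so no autograd graph is kept
        for _ in range(num_steps):
            next_token = step_fn(tokens)
            tokens = torch.cat([tokens, next_token], dim=1)
            if clear_cache and torch.cuda.is_available():
                # Release unoccupied cached blocks held by the caching
                # allocator; memory owned by live tensors is not freed.
                torch.cuda.empty_cache()
    return tokens


def toy_step(tokens):
    # Stand-in for a model forward pass: next token = last token + 1.
    return tokens[:, -1:] + 1


out = autoregressive_generate(toy_step, torch.tensor([[1, 2, 3]]), num_steps=4)
print(out.tolist())
```

Note that torch.cuda.empty_cache() only returns cached blocks that no tensor is using, so it does not help if references to old tensors are still being held across steps. Calling it every step also adds overhead, which is why the sketch makes it optional through the hypothetical clear_cache flag.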

__init__.py    initial    2025-07-01 10:57:41 +08:00
kronos.py      fix: add torch.cuda.empty_cache() during autoregressive inference    2025-09-02 10:26:27 +08:00
module.py      initial    2025-07-01 10:57:41 +08:00