Forecasting with Transformer¶

Feng Li¶

Guanghua School of Management¶

Peking University¶

feng.li@gsm.pku.edu.cn¶

Course home page: https://feng.li/forecasting-with-ai¶

In [1]:
pip install torch torchvision --break-system-packages
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple
Requirement already satisfied: torch in /home/fli/.virtualenvs/python3.12/lib/python3.12/site-packages (2.7.1)
Requirement already satisfied: torchvision in /home/fli/.virtualenvs/python3.12/lib/python3.12/site-packages (0.22.1)
Note: you may need to restart the kernel to use updated packages.

Generate Time Series Data¶

  • The sine wave provides the periodic pattern (e.g., seasonality)
  • The noise stands for real-world random fluctuations (e.g., market volatility)
  • Together they play the role of the "historical sales" or "stock prices" we want to forecast
In [2]:
import torch
import torch.nn as nn
import numpy as np
import matplotlib.pyplot as plt

# 1️⃣ Generate synthetic data
T = 500
t = np.arange(0, T)
x = np.sin(0.02 * t) + 0.1 * np.random.randn(T)

plt.plot(t, x)
plt.title("Synthetic Time Series")
plt.show()
[Figure: Synthetic Time Series]

Convert to a Supervised Learning Dataset¶

  • We turn the time series into "feature-target" pairs.

  • Each window of L=20 past observations is the input (X);

  • the single point right after it is the output (Y).

  • This recasts the forecasting problem as a supervised learning task that machine learning methods can handle.

In [3]:
# 2️⃣ Create supervised learning dataset
def create_dataset(series, L=20):
    X, Y = [], []
    for i in range(len(series) - L):
        X.append(series[i:i+L])
        Y.append(series[i+L])
    return np.array(X), np.array(Y)

L = 20
X, Y = create_dataset(x, L)
X = torch.tensor(X).float().unsqueeze(-1)  # (N, L, 1)
Y = torch.tensor(Y).float().unsqueeze(-1)  # (N, 1)
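
As a quick sanity check (arithmetic from above: with T=500 and L=20 there are 500 - 20 = 480 windows), the shapes should be:

In [ ]:
print(X.shape, Y.shape)  # expected: torch.Size([480, 20, 1]) torch.Size([480, 1])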

Split into Training and Test Sets¶

  • We train the model on the first 80% of the data (letting it "learn the pattern"),

  • and test it on the last 20% (checking whether it can "predict the future").

In [4]:
# 3️⃣ Split train/test
train_size = int(0.8 * len(X))
X_train, X_test = X[:train_size], X[train_size:]
Y_train, Y_test = Y[:train_size], Y[train_size:]
print(f"Train size: {len(X_train)}, Test size: {len(X_test)}")
Train size: 384, Test size: 96

Define the Transformer Model¶

The architecture has three parts:

  • Input projection (input_proj): maps the scalar input (a single number) into a vector representation;

  • Transformer Encoder: captures dependencies between different time points in the sequence;

  • Decoder layer (decoder): outputs the one-step-ahead prediction.

In [5]:
# 4️⃣ Define Transformer model
class TimeSeriesTransformer(nn.Module):
    def __init__(self, input_size=1, d_model=64, nhead=4, num_layers=2):
        super().__init__()
        self.input_proj = nn.Linear(input_size, d_model)
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=nhead, dim_feedforward=128, dropout=0.1
        )
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=num_layers)
        self.decoder = nn.Linear(d_model, 1)

    def forward(self, src):
        src = self.input_proj(src)        # (batch, L, d_model)
        src = src.permute(1, 0, 2)        # (L, batch, d_model)
        memory = self.encoder(src)        # (L, batch, d_model)
        out = self.decoder(memory[-1])    # last token
        return out

Parameter Explanations¶

input_size = 1: the input dimension at each time point¶

  • Each time step carries only one feature (e.g., sales, price, temperature).
  • With several features (say price, volume, and ad spend), input_size equals the number of features, e.g., 3 or 5.

The model sees just one number per step, as if it were reading only the single "sales" indicator.

Imagine watching only daily revenue move, with no extra signals such as advertising, price, or weather.
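
As a sketch, with hypothetical extra features (price, volume, ad spend), the same model handles a multivariate series once input_size matches the feature count:

In [ ]:
# Hypothetical multivariate input: 3 features per time step
multivariate_model = TimeSeriesTransformer(input_size=3)
fake_batch = torch.randn(32, 20, 3)          # (batch, L, input_size)
print(multivariate_model(fake_batch).shape)  # torch.Size([32, 1])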

d_model = 64: the dimension of the internal representation (embedding dimension)¶

  • The Transformer does not process the raw numbers directly; it maps each input into a 64-dimensional vector space;
  • this "feature embedding" lets the model capture more complex patterns;
  • a higher dimension gives more expressive power, at a higher computational cost.

🧠 Intuition:

Each number is turned into a 64-dimensional "semantic vector", and the model looks for patterns in that space. It is like expanding the single "sales" figure into a much richer "sales report" with 64 feature dimensions covering periodicity, volatility trends, and more.
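
A minimal sketch of that projection step in isolation (random weights, shapes only):

In [ ]:
proj = nn.Linear(1, 64)      # input_size=1 -> d_model=64
window = torch.randn(20, 1)  # one window of L=20 scalar observations
print(proj(window).shape)    # torch.Size([20, 64]): one 64-dim vector per time point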

nhead = 4: the number of attention heads (multi-head attention)¶

  • The core of the Transformer is the multi-head attention mechanism;

  • each head can attend to a different aspect of the series, for example:

    • head 1 to short-term fluctuations;
    • head 2 to the long-term trend;
    • head 3 to abnormal jumps;
    • head 4 to periodicity.
  • The multi-head design lets the model capture several kinds of dependencies in parallel.

🧠 Intuition:

The model has 4 pairs of "eyes" that watch different features of the series at the same time. Think of a company's 4 analysts: one tracks monthly trends, one seasonal swings, one holiday effects, and one competitor moves.
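
A shapes-only sketch of the multi-head split: PyTorch divides d_model=64 evenly across the 4 heads, so each head works in a 64 / 4 = 16-dimensional subspace and produces its own L x L attention map:

In [ ]:
mha = nn.MultiheadAttention(embed_dim=64, num_heads=4)  # d_model=64, nhead=4
seq = torch.randn(20, 1, 64)                            # (L, batch, d_model)
out, weights = mha(seq, seq, seq, average_attn_weights=False)
print(out.shape)      # torch.Size([20, 1, 64]): heads concatenated back to d_model
print(weights.shape)  # torch.Size([1, 4, 20, 20]): one 20x20 map per head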

num_layers = 2: the number of Transformer encoder layers¶

  • Each layer contains an attention mechanism and a feed-forward network;
  • more layers make the model "deeper" and able to capture more complex patterns;
  • for small datasets like this example, 2 layers are enough.

🧠 Intuition:

The model has two "thinking units": the first learns simple patterns, and the second abstracts higher-level regularities from them. Like a company's reporting chain: departmental analysis (layer one) feeds an executive summary (layer two), and the information gains insight with each round of processing.
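
A small sketch showing that the encoder really is a stack: nn.TransformerEncoder clones the layer num_layers times and feeds each layer's output into the next:

In [ ]:
layer = nn.TransformerEncoderLayer(d_model=64, nhead=4, dim_feedforward=128)
stack = nn.TransformerEncoder(layer, num_layers=2)
print(len(stack.layers))  # 2: the sequence passes through both layers in order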

Where are Q (Query), K (Key), and V (Value)?¶

In our code (TimeSeriesTransformer) we never write the Q, K, V matrices out by hand, but they are created automatically inside PyTorch's TransformerEncoderLayer.

  • Our src (the input sequence features) is linearly projected into the Q, K, and V spaces;
  • MultiheadAttention computes several heads in parallel (we set nhead=4);
  • each head produces its own attention weights independently;
  • the outputs of all heads are then concatenated to form the final attention output (see the hand-rolled sketch below).
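
For intuition, here is one attention head hand-rolled with random stand-ins for the learned projections (a simplified sketch, not PyTorch's exact internals, which add biases and an output projection):

In [ ]:
import torch.nn.functional as F

d_model, n_head = 64, 4
d_head = d_model // n_head              # 16 dims per head
seq = torch.randn(20, d_model)          # one embedded sequence of length L=20

# Random stand-ins for the learned projections into the Q, K, V spaces
Wq, Wk, Wv = (torch.randn(d_model, d_head) for _ in range(3))
Q, K, V = seq @ Wq, seq @ Wk, seq @ Wv  # each (L, d_head)

scores = Q @ K.T / d_head ** 0.5        # (L, L) similarities between time points
weights = F.softmax(scores, dim=-1)     # each row sums to 1
out = weights @ V                       # (L, d_head) weighted mix of the values
print(weights.shape, out.shape)         # torch.Size([20, 20]) torch.Size([20, 16])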

Define the Model, Loss Function, and Optimizer¶

  • Create a Transformer model object;

    • the model contains the structure defined above (input layer → attention layers → decoder layer);
    • it can "learn patterns" from the historical series and output predictions of the future.
  • Define the loss function, the yardstick for the model's prediction error;

    • MSELoss is the mean squared error, MSE = (1/N) Σ (ŷᵢ − yᵢ)²;
    • the smaller the loss, the more accurate the predictions.
  • Define the optimizer, which controls how the model learns and updates its parameters;

    • Adam is an adaptive algorithm that tunes the step size of each parameter automatically;
    • lr=1e-3 sets the learning rate to 0.001, which controls the update speed.
In [6]:
model = TimeSeriesTransformer()
criterion = nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
/home/fli/.virtualenvs/python3.12/lib/python3.12/site-packages/torch/nn/modules/transformer.py:382: UserWarning: enable_nested_tensor is True, but self.use_nested_tensor is False because encoder_layer.self_attn.batch_first was not True(use batch_first for better inference performance)
  warnings.warn(

Model Training¶

We repeatedly optimize the model parameters on the training data so that the prediction error (MSE) is minimized.

  • loss measures the gap between predictions and the true values;

  • optimizer updates the model via gradient descent;

  • every 20 epochs we print the training and test errors.

In [7]:
# 5️⃣ Training loop
for epoch in range(100):
    model.train()
    optimizer.zero_grad()
    output = model(X_train)
    loss = criterion(output, Y_train)
    loss.backward()
    optimizer.step()

    if (epoch+1) % 20 == 0:
        model.eval()
        with torch.no_grad():
            val_pred = model(X_test)
            val_loss = criterion(val_pred, Y_test)
        print(f"Epoch {epoch+1}: train loss={loss.item():.4f}, test loss={val_loss.item():.4f}")
Epoch 20: train loss=0.0328, test loss=0.0415
Epoch 40: train loss=0.0274, test loss=0.0225
Epoch 60: train loss=0.0220, test loss=0.0249
Epoch 80: train loss=0.0231, test loss=0.0236
Epoch 100: train loss=0.0214, test loss=0.0230

Predict on the Test Set¶

  • We switch off gradient tracking (no_grad)
  • and predict on data the model has never seen
  • The model takes the test inputs (historical windows) and outputs the corresponding one-step-ahead predictions
In [8]:
# 6️⃣ One-step ahead predictions on test set
model.eval()
with torch.no_grad():
    preds_test = model(X_test).squeeze().numpy()

Visualize the Results¶

  • Blue line: the true time series;

  • Red line: predictions over the test period;

  • Gray line: the train/test boundary.

In [9]:
# 7️⃣ Plot true vs predicted on test portion
plt.figure(figsize=(10,5))
plt.plot(range(len(x)), x, label="True Series", alpha=0.6)
plt.plot(range(train_size+L, T), preds_test, label="Predicted (test)", color="red")
plt.axvline(train_size+L, color="gray", linestyle="--", label="Train/Test split")
plt.legend()
plt.title("Transformer One-Step Forecasting")
plt.show()
[Figure: Transformer One-Step Forecasting, true series vs. test-period predictions with the train/test split line]