少女祈祷中...

Home

Archives

About

Friend

Archives

博客搭建

Automata

Codeforces

Compilers

文艺作品补完计划

Algorithms

misc

RL

ML

hexo

ControlSystem

obsidian

Ubuntu

Complier

stable-baselines3

PyTorch

Conda

poems

RLHF

docker

env

conda

Lutris

fcitx

mujoco python

WorkLog

Note

LearnLog

Eassy

2024

12-17
[Automata] Ch9 Petri网络 PN
12-16
[Automata] Ch9 变迁系统 TS
12-14
[Automata] Ch8 图灵机 TM
12-14
[ControlSystem] Ch13 数字控制系统简介
12-14
[ControlSystem] Ch11 状态变量反馈控制系统设计
12-13
[ControlSystem] Ch3 状态空间模型
12-13
[ControlSystem] Ch7 根轨迹法
12-13
[ControlSystem] Ch6 控制系统的稳定性
12-05
[RLHF] OpenRLHF食用指南 (并非指南)
12-05
[docker] nvidia-docker使用教程
12-03
[RL] TRPO 和 PPO
12-02
[misc] 24-12 那些我看到的
11-25
[Automata] Ch7 上下文无关语言 CFL
11-18
[misc] 24-11 那些我看到的
11-14
[ControlSystem] Ch5 控制系统的性能
11-13
[misc] 先装ubuntu, 再装win11双系统
11-13
[Automata] Ch6 下推自动机 PDA
11-13
[Automata] Ch3 正则表达式与正则语言: Regular Expressions and Languages
11-13
[Automata] Ch5 上下文无关语法 CFG
11-11
[RL] PyTorch实现RL框架算法及 DQN
11-10
[misc] 11-10 折腾的一些杂项
11-08
[PyTorch] 关于自动求导机制以及优化器的工作原理
11-07
[RL] stable-baselines3实现DQN, double DQN, Rainbow, DDPG, TD3, SAC, TRPO, PPO
11-07
[misc] 11-06 折腾的一些杂项
11-05
[misc] 11-05 折腾的一些杂项
11-05
当妖精们舞动翅膀
11-05
[RL] 第八讲: 深度策略梯度
11-05
[RL] 第七讲: 深度强化学习
11-05
[RL] 第六讲: 价值和策略近似逼近方法
11-04
笔记本续航省电攻略 Ubuntu22.04

12 3 Next »

RIKKA421

Posts

75

Categories

4

Tags

25

Home

Archives

About

Friend

Categories

Eassy
LearnLog
Note
WorkLog

Tags

Algorithms
Automata
Codeforces
Compilers
Complier
Conda
ControlSystem
Lutris
ML
PyTorch
RL
RLHF
Ubuntu
conda
docker
env
fcitx
hexo
misc
mujoco python
obsidian
poems
stable-baselines3
博客搭建
文艺作品补完计划

Recent Posts

[misc] 25-03 那些我看到的
[misc] 25-03 那些我看到的
[Automata] Ch9 Petri网络 PN
[Automata] Ch9 变迁系统 TS
[Automata] Ch8 图灵机 TM

2020-2025 RIKKA421

Powered by Hexo Theme.Reimu

142k | 08:58

Number of visits | Number of visitors

RIKKA421

Posts

75

Categories

4

Tags

25

Home

Archives

About

Friend